Skip to content

Computer use SDK for building agents that learn from human screen recordings. Cross-platform (Windows/macOS/Linux), deterministic, and ready for L5 desktop automation.

License

Notifications You must be signed in to change notification settings

mediar-ai/terminator

Repository files navigation

terminator πŸ€–

paint051225.mp4

Join us on Discord docs YouTube @mediar_ai

Crates.io - terminator-rs Crates.io - workflow recorder

Computer use SDK for building agents that learn from human screen recordings. Cross-platform (Windows/macOS/Linux), deterministic, and ready for L5 desktop automation.

⚑ TL;DR β€” Hello World Example

Skip the boilerplate. This is the fastest way to feel the magic.

🐍 Python

pip install terminator.py
import terminator
desktop = terminator.Desktop()
desktop.open_application('calc')
seven = desktop.locator('name:Seven')
seven.click()

🟦 TypeScript / Node.js

bun i terminator.js # or npm, pnpm, yarn
const { Desktop } = require('terminator.js');
const desktop = new Desktop();
await desktop.openApplication('notepad')
await desktop.locator('name:Edit').typeText('hello world')

🧠 What is Terminator?

Terminator is an AI-first Playwright-style SDK for automating operating systems.

  • πŸͺŸ Built for Windows, with partial support on Linux and macOS
  • πŸ€– Learns deterministically from screen recordings of real workflows
  • 🧠 Designed for AI agentsβ€”not humans
  • ⚑ Uses OS-level accessibility APIs, with OCR/Vision as fallback
  • 🧩 Supports TypeScript, Python, MCP, and Rust
  • πŸ“ˆ Scans the UI in ~80msβ€”up to 10,000x faster and cheaper than a human

Terminator runs β€œheadless” by default. It doesn’t require a visible screen, relying instead on accessibility layers (like UI Automation on Windows) to interact with apps.

Feature Support

While Terminator aims for full cross-platform support, current capabilities vary by OS. Windows is the primary development target and has the most complete feature set.

Feature Windows macOS Linux Notes
Core Automation
Element Locators βœ… 🟑 🟑 Find elements by name, role, window, etc.
UI Actions (click, type) βœ… 🟑 🟑 Core interactions with UI elements.
Application Management βœ… 🟑 🟑 Launch, list, and manage applications.
Window Management βœ… 🟑 🟑 Get active window, list windows.
Advanced Features
Workflow Recording βœ… ❌ ❌ Record human workflows for deterministic automation.
Monitor Management βœ… 🟑 🟑 Multi-display support.
Screen & Element Capture βœ… βœ… 🟑 Take screenshots of displays or elements.
Language Bindings
Python (terminator.py) βœ… βœ… βœ… pip install terminator.py
TypeScript (terminator.js) βœ… βœ… βœ… npm i terminator.js
MCP (terminator-mcp-agent) βœ… βœ… βœ… npx -y terminator-mcp-agent --add-to-app [app]
Rust (terminator-rs) βœ… βœ… βœ… cargo add terminator-rs

Legend:

  • βœ…: Supported - The feature is stable and well-tested.
  • 🟑: Partial / Experimental - The feature is in development and may have limitations.
  • ❌: Not Supported - The feature is not yet available on this platform.

Documentation

For detailed information on features, installation, usage, and the API, please visit the Official Documentation.

Here's a section you can add under your README.md to document tools for inspecting accessibility elements across Windows, macOS, and Linux β€” tailored to Terminator users trying to find correct selectors:


πŸ•΅οΈ How to Inspect Accessibility Elements (like name:Seven)

To create reliable selectors (e.g. name:Seven, role:Button, window:Calculator), you need to inspect the Accessibility Tree of your OS. Here's how to explore UI elements on each platform:

πŸͺŸ Windows

  • Tool: Accessibility Insights for Windows
  • Alt: Inspect.exe (comes with Windows SDK)
  • Usage: Open the app you want to inspect β†’ launch Accessibility Insights β†’ hover or use keyboard navigation to explore the UI tree (Name, Role, ControlType, AutomationId).

These tools show you the Name, Role, ControlType, and other metadata used in Terminator selectors.


🍎 macOS

  • Tool: Accessibility Inspector
  • Usage: Comes with Xcode β†’ Open Xcode > Open Developer Tool > Accessibility Inspector β†’ Use the target icon to explore UI elements on screen.

🐧 Linux

  • Tool: Accerciser

  • Install:

    sudo apt install accerciser
  • Usage: Launch Accerciser β†’ Select the window/app β†’ Browse the accessible widget tree.


πŸ’‘ Tip

Once you identify the structure of your UI:

# Sample pattern
desktop.locator('window:Calculator')
       .locator('role:Button')
       .locator('name:Seven')

You can build and debug selector paths incrementally using .locator() chaining.

Explore Further

contributing

contributions are welcome! please feel free to submit issues and pull requests. many parts are experimental, and help is appreciated. join our discord to discuss.

businesses

if you want desktop automation at scale for your business, let's talk

About

Computer use SDK for building agents that learn from human screen recordings. Cross-platform (Windows/macOS/Linux), deterministic, and ready for L5 desktop automation.

Topics

Resources

License

Stars

Watchers

Forks

Contributors 7