Skip to content

cmaliwal/Claude-GUI

Repository files navigation

Claude-GUI

About Mini app where Claude moves the mouse to interact with an HTML page, and uses that interaction to trigger or reflect something in a Flask backend.

To install and set up the project locally, follow these steps:

Note: Python version >= 3.11 is required.

  1. Create and activate the virtual environment
  # On macOS/Linux
  python3.11 -m venv .venv
  source .venv/bin/activate

  # On Windows (PowerShell)
  python3.11 -m venv .venv
  .\.venv\Scripts\Activate.ps1
  1. Install dependencies using pip:

    pip install -r requirements.txt
  2. Create your .env file using env.example:

        cp example.env .env

Run the App

Open terminal:

  1. Terminal: Start the mouse interaction script
    python mouse_demo.py

Claude will:

  • Start the Flask app in the background.
  • Take screenshots to understand what's on screen.
  • Launch (or attempt to launch) Firefox. -Try to interact with the GUI using the tools.

Current Status

  • Screenshot functionality works.
  • Claude can visually recognize the interface and describe UI components.
  • Firefox can be launched using the bash tool.
  • Mouse clicks via the computer tool are being invoked but do not yet trigger actual interactions.
Screencast.from.04-21-2025.06_46_17.PM.webm

Future Scope of Improvements

Here’s what we plan to improve or implement next:

  • Make mouse click actions actually affect the browser (functional GUI interaction)
  • Detect and interact with GUI elements based on visual position or text (e.g., click "Yes" button by name)

About

About Mini app where Claude moves the mouse to interact with an HTML page, and uses that interaction to trigger or reflect something in a Flask backend.

Topics

Resources

Stars

Watchers

Forks