Skip to content

Auto WebCursor is a browser-extension AI agent that uses LLMs to automate web browsing tasks. By interacting with the HTML elements, WebCursor can automatically navigate pages, and perform actions like click or input with simple text prompts.

License

Notifications You must be signed in to change notification settings

Richard28277/Auto-WebCursor

Repository files navigation

Auto WebCursor

Auto WebCursor is a Chrome extension that automates web browsing tasks using AI. It allows users to control the cursor and perform actions on web pages through simple text or voice inputs. By interacting with the HTML elements, WebCursor can automatically navigate pages, and perform actions like click or input with simple text prompts.

Demo

Demo Video


Features

  • LLM-based Computer Use: Control your and automate web tasks using text or voice commands, allowing the LLM to click, read, and input.
  • Accessibility First: Designed to make web navigation easier for individuals with disabilities.
  • Voice and Text Input: Interact with the web using natural language.
  • Customizable Workflows: Create personalized workflows for common tasks.

How to Download and Load the Extension

Option 1: Clone the Repository

  1. Clone this repository to your local machine:
    git clone https://github.com/Richard28277/auto-webcursor.git
  2. Navigate to the project folder:
    cd auto-webcursor

Option 2: Download and Extract the ZIP File

  1. Download the ZIP file from the GitHub repository:
    • Go to the repository page.
    • Click the Code button and select Download ZIP.
  2. Extract the ZIP file to a folder on your computer.

Load the Extension in Chrome

  1. Open Google Chrome and go to chrome://extensions/.
  2. Enable Developer Mode by toggling the switch in the top-right corner.
  3. Click Load unpacked.
  4. Select the folder where you cloned the repository or extracted the ZIP file (e.g., auto-webcursor).
  5. The extension will now appear in your list of installed extensions.

How to Use Auto WebCursor

  1. Open the Extension:

    • Click the Auto WebCursor icon in the Chrome toolbar to open the side panel.
  2. Enter Commands:

    • Type or speak your command in the input field (e.g., "Click the login button").
    • The AI will process your command and perform the action on the web page.
  3. Customize API:

    • Use the settings menu to modify the API provider and model type.

License

This project is licensed under the MIT License. See the LICENSE file for details.

About

Auto WebCursor is a browser-extension AI agent that uses LLMs to automate web browsing tasks. By interacting with the HTML elements, WebCursor can automatically navigate pages, and perform actions like click or input with simple text prompts.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published