A GUI Agent application based on UI-TARS(Vision-Lanuage Model) that allows you to control your computer using natural language.
-
Updated
Feb 20, 2025 - TypeScript
A GUI Agent application based on UI-TARS(Vision-Lanuage Model) that allows you to control your computer using natural language.
✨ Use natural language to control your browser, powered by LLM and playwright
Mark web pages for use with vision-language models
This is the crud backend for our QA test application
AI-powered computer control for automated testing. FactifAI uses vision models (Claude, GPT-4o, Gemini) to interact with applications naturally - clicking, typing, and verifying results just like a human would.
A web playground for a secure and open source computer use. Powered by E2B.
🤖 LLM-powered computer control through local and Docker environments. Features VNC integration, automated interactions, and a chat interface for natural language system control.
Build your own AI operators like OpenAI
Anthropic's Computer use implementation in Nodejs
Anthropic's Computer Use tools within VSCode
Add a description, image, and links to the computer-use topic page so that developers can more easily learn about it.
To associate your repository with the computer-use topic, visit your repo's landing page and select "manage topics."