gongliming7

[Open Source] xAgent CLI - The FIRST AI assistant that can actually control your desktop

I've been building something unique - an AI CLI tool that doesn't just read/write files, but can actually control your mouse and keyboard. It's called xAgent CLI.

Why is this a big deal?

Most AI coding tools (Claude Code, Cursor, Copilot) can only:

  • Read and write files

  • Execute shell commands

  • Search codebases

But they can't:

  • Click buttons on screen

  • Fill out web forms

  • Navigate websites

  • Control desktop apps

xAgent CLI can do ALL of this.

Key Features:

  1. 🖱️ True GUI Automation

    • Precise mouse coordinate control

    • Keyboard input simulation

    • Browser automation

    • Control ANY application on your PC

  2. 🧠 Access to Frontier Models

    • MiniMax M2.1 (High-performance reasoning and coding)

    • GLM-4.7 (From Zhipu AI)

    • Kimi K2 (MoE model from Moonshot AI)

    • Qwen3 Coder (Alibaba's coding model)

    • ALL FREE, no API keys needed

  3. 💻 Developer Tools

    • Code analysis and refactoring

    • Bug detection and fixing

    • Project architecture analysis

    • Context compression for large repos

  4. 🏠 Life Automation

    • "Organize my desktop"

    • "Download all PDFs from this page"

    • "Set up daily backups"

    • "Find and remove duplicate files"

  5. 🔒 Security Modes

    • 5 modes from YOLO (full access) to DEFAULT (approval required)
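
To illustrate the idea behind the security modes, here is a minimal Python sketch of an approval gate. It is not xAgent's code: only the YOLO and DEFAULT mode names come from the list above, while `Mode`, `run_action`, and the `approve` callback are hypothetical names made up for this example, and the other three modes are left out because they are not described here.

```python
from enum import Enum
from typing import Callable, Optional


class Mode(Enum):
    """Two of the five modes named in the post; the rest are not modeled."""
    YOLO = "yolo"        # full access: actions run without asking
    DEFAULT = "default"  # approval required before each action


def run_action(
    description: str,
    action: Callable[[], str],
    mode: Mode,
    approve: Callable[[str], bool],
) -> Optional[str]:
    """Run `action`, asking `approve` first when the mode requires it.

    Returns the action's result, or None if the user declined.
    """
    if mode is Mode.DEFAULT and not approve(description):
        return None  # declined: the action is never executed
    return action()


if __name__ == "__main__":
    # Hypothetical usage: prompt on the terminal before a risky step.
    def ask(description: str) -> bool:
        return input(f"Allow '{description}'? [y/N] ").strip().lower() == "y"

    result = run_action("remove duplicate files", lambda: "done", Mode.DEFAULT, ask)
    print(result)
```

Passing the approval step in as a callback keeps the gate testable without a terminal, and a stricter mode would only need another branch in `run_action`.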

Example Usage:

xagent gui --url https://example.com
> Click the login button at coordinates (500, 300)
> Type "myemail@example.com" in the username field
> Type "mypassword" in the password field
> Click the submit button
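
For readers wondering what such steps might look like once they reach the automation layer, here is a toy Python sketch that turns the lines above into structured actions. This is an assumption-heavy illustration, not how xAgent works: in the real tool the model presumably interprets the instruction, whereas this sketch uses plain regular expressions, and `Click`, `TypeText`, and `parse_step` are hypothetical names.

```python
import re
from dataclasses import dataclass
from typing import Optional, Union


@dataclass
class Click:
    x: int
    y: int


@dataclass
class TypeText:
    text: str
    target: str


# Matches e.g. 'Click the login button at coordinates (500, 300)'
CLICK_RE = re.compile(r"click .*?\((\d+),\s*(\d+)\)", re.IGNORECASE)
# Matches e.g. 'Type "hello" in the username field'
TYPE_RE = re.compile(r'type "([^"]*)" in the (.+?) field', re.IGNORECASE)


def parse_step(line: str) -> Optional[Union[Click, TypeText]]:
    """Parse one prompt line into an action, or None if it is not recognized."""
    line = line.lstrip("> ").strip()  # drop the leading '> ' prompt marker
    match = CLICK_RE.search(line)
    if match:
        return Click(int(match.group(1)), int(match.group(2)))
    match = TYPE_RE.search(line)
    if match:
        return TypeText(match.group(1), match.group(2))
    return None  # e.g. a click with no coordinates needs screen understanding


if __name__ == "__main__":
    for step in [
        "> Click the login button at coordinates (500, 300)",
        '> Type "myemail@example.com" in the username field',
        "> Click the submit button",
    ]:
        print(parse_step(step))
```

The last step returns None because it carries no coordinates; resolving "the submit button" to a screen position is the part that needs a model looking at the screen.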

Quick Start:

npm i -g @xagent-ai/cli
xagent start

Cross-platform: Windows, macOS, Linux.

Would love your feedback!

Repo: https://github.com/xAgent-AI/xagent
