MCP server to let LLM take full control on your device by providing screen automation toolkit for controlling and interacting with graphical user interfaces. Good for automation, education and having fun. - 📷 Screen capture and analysis - 🖱️ Mouse control (clicking, positioning) - ⌨️ Keyboard input (typing, key presses, hotkeys) 0. Install python 3.12 1. Clone the repository: 2. create virtiual
Add this skill
npx mdskills install Mtehabsim/screenpilotProvides complete UI automation for agents but lacks detailed tool API documentation and usage examples
MCP server to let LLM take full control on your device by providing screen automation toolkit for controlling and interacting with graphical user interfaces. Good for automation, education and having fun.
https://github.com/user-attachments/assets/c18380c0-b3dd-4b7c-925d-28ef205ca11f
git clone https://github.com/Mtehabsim/ScreenPilot.git
python -m venv venv
venv\Scripts\activate
pip install -r requirements.txt
{
"mcpServers": {
"device-controll": {
"command": "pathToEnv\\venv\\Scripts\\python.exe",
"args": [
"pathToProject\\ScreenPilot\\main.py"
]
}
}
}
Replace "pathToEnv\venv\Scripts\python.exe" → with the full path to your python.exe "pathToProject\ScreenPilot\main.py" → with the full path to your main.py file
Save the config file.
Open Claude AI Desktop.
Go to File → Exit
You can now open Claude AI Desktop and enjoy ScreenPilot.
Contributions are welcome! Please feel free to submit a Pull Request.
Install via CLI
npx mdskills install Mtehabsim/screenpilotScreenPilot is a free, open-source AI agent skill. MCP server to let LLM take full control on your device by providing screen automation toolkit for controlling and interacting with graphical user interfaces. Good for automation, education and having fun. - 📷 Screen capture and analysis - 🖱️ Mouse control (clicking, positioning) - ⌨️ Keyboard input (typing, key presses, hotkeys) 0. Install python 3.12 1. Clone the repository: 2. create virtiual
Install ScreenPilot with a single command:
npx mdskills install Mtehabsim/screenpilotThis downloads the skill files into your project and your AI agent picks them up automatically.
ScreenPilot works with Claude Code, Claude Desktop, Cursor, Vscode Copilot, Windsurf, Continue Dev, Codex, Gemini Cli, Amp, Roo Code, Goose, Opencode, Trae, Qodo, Command Code. Skills use the open SKILL.md format which is compatible with any AI coding agent that reads markdown instructions.