← All tags
Multimodal AI Agent Skills
Browse AI agent skills tagged "Multimodal". Find and install skills, MCP servers, and plugins for your AI coding assistant.
2 listings
UI Tars Desktop
A fast, lightweight Model Context Protocol (MCP) server that empowers LLMs with browser automation via Puppeteer’s structured accessibility data, featuring optional vision mode for complex visual understanding and flexible, cross-platform configuration. - ⚡ Fast & lightweight. Utilizes Puppeteer's label index, not pixel-based input and accessibility DOM tree. - 👁️ Vision Mode Support. Optional vi
8.01 weeklybytedance/UI-TARS-desktop
Brand Style
PluginScreenpipe brand style guide. Reference this when designing UI components, writing copy, or making visual decisions.
8.0mediar-ai/screenpipe