Audio & Video
Transcription, TTS, podcasts, video editing, streaming, music
16 listings
Youtube Summarizer
Extract transcripts from YouTube videos and generate comprehensive, detailed summaries using intelligent analysis frameworks
Voice AI Engine Development
Build real-time conversational AI voice engines using async worker pipelines, streaming transcription, LLM agents, and TTS synthesis with interrupt handling and multi-provider support
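A minimal sketch of the async worker pipeline this listing describes, assuming three stages connected by queues and a shared interrupt flag. All names here (`transcriber`, `agent`, `synthesizer`, `run_pipeline`) are hypothetical stand-ins, not the skill's API; no real speech or LLM provider is called.

```python
import asyncio


async def transcriber(utterances, text_q):
    # Stand-in for streaming transcription: each input is already text.
    for utterance in utterances:
        await text_q.put(utterance)
    await text_q.put(None)  # sentinel: end of input


async def agent(text_q, reply_q):
    # Stand-in for an LLM agent: produces a canned reply per utterance.
    while (utterance := await text_q.get()) is not None:
        await reply_q.put(f"reply to {utterance}")
    await reply_q.put(None)  # propagate end of input downstream


async def synthesizer(reply_q, spoken, interrupt):
    # Stand-in for TTS: "speaks" word by word and abandons the current
    # reply as soon as the interrupt flag is set.
    while (reply := await reply_q.get()) is not None:
        for word in reply.split():
            if interrupt.is_set():
                interrupt.clear()  # interrupt consumed; next reply plays normally
                break
            spoken.append(word)
            await asyncio.sleep(0)  # yield so other workers can run


async def run_pipeline(utterances, interrupted=False):
    text_q, reply_q = asyncio.Queue(), asyncio.Queue()
    interrupt = asyncio.Event()
    if interrupted:
        interrupt.set()  # simulate the user talking over the first reply
    spoken = []
    await asyncio.gather(
        transcriber(utterances, text_q),
        agent(text_q, reply_q),
        synthesizer(reply_q, spoken, interrupt),
    )
    return spoken


if __name__ == "__main__":
    print(asyncio.run(run_pipeline(["hello", "world"])))
    print(asyncio.run(run_pipeline(["hello", "world"], interrupted=True)))
```

Each stage runs as its own coroutine, so a slow stage does not block the others, and the interrupt flag lets playback stop without tearing down the pipeline.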
Barevalue MCP
MCP Server: MCP (Model Context Protocol) server for the Barevalue AI podcast editing API. Allows Claude Code and other MCP-compatible tools to submit and manage podcast editing orders programmatically. - Upload audio files directly from your local machine - Submit orders for AI-powered podcast editing - Check order status and download completed files - Manage webhooks for automated notifications - Pre-validat
Azure AI Voicelive Py
Build real-time voice AI applications using Azure AI Voice Live SDK (azure-ai-voicelive). Use this skill when creating Python applications that need real-time bidirectional audio communication with Azure AI, including voice assistants, voice-enabled chatbots, real-time speech-to-speech translation, voice-driven avatars, or any WebSocket-based audio streaming with AI models. Supports Server VAD (Voice Activity Detection), turn-based conversation, function calling, MCP tools, avatar integration, a
atsurae
MCP Server for AI-powered video editing. Let Claude, GPT, or any AI agent edit videos through natural language. Add to your Claude Desktop config (claude_desktop_config.json): Then restart Claude Desktop. You can now edit videos through conversation. Layer Compositing Model: Output: 1920x1080, 30fps, H.264 + AAC, MP4 atsurae.ai also exposes a REST API that any AI agent can call directly, without MCP
Manim MCP Server
MCP Server: This is an MCP (Model Context Protocol) server that executes Manim animation code and returns the generated video. It allows users to send Manim scripts and receive the rendered animation. - Executes Manim Python scripts. - Saves animation output in a visible media folder. - Allows users to clean up temporary files after execution. - Portable and configurable via environment variables. Ensure you
Torrentclaw MCP
MCP Server: Model Context Protocol server for TorrentClaw, giving AI assistants the ability to search movies and TV shows, find torrents with magnet links, check streaming availability, and explore cast/crew metadata. torrentclaw-mcp is developed by TorrentClaw as part of its open-source ecosystem. It wraps the TorrentClaw API into the MCP standard so that any compatible AI assistant (Claude, GPT, etc.) can
Video Edit MCP Server
MCP Server: A powerful Model Context Protocol (MCP) server designed for advanced video and audio editing operations. This server enables MCP clients, such as Claude Desktop, Cursor, and others, to perform comprehensive multimedia editing tasks through a standardized and unified interface. - Basic Editing: Trim, merge, resize, crop, rotate videos - Effects: Speed control, fade in/out, grayscale, mirror - Overlay
TypeScript SDK for S2
This repo contains the official TypeScript SDK for S2, a serverless data store for streams, built on the service's REST API. S2 is a managed service that provides unlimited, durable streams. Streams can be appended to, with all new records added to the tail of the stream. You can read from any portion of a stream (indexing by record sequence number or timestamp) and follow
Video Podcast Maker
Use when the user provides a topic and wants an automated video podcast created, or when the user wants to learn or analyze video design patterns from reference videos. Handles research, script writing, TTS audio synthesis, Remotion video creation, and final MP4 output with background music. Also supports design learning from reference videos (learn command), style profile management, and a design reference library.
AI Video Remix
MCP Server: Generate styled video compositions from your local video footage library using natural language. ShotAI handles shot-level indexing and semantic search; this tool handles the planning, music, and rendering. This repo ships a ready-to-install Claude Agent Skill in the skill/ directory. Install in Claude Code: Or point Claude Code settings to the local skill/ folder. Once installed, just describe wh
Claude Code TTS Plugin
Plugin: A Text-to-Speech MCP server plugin for Claude Code that converts text to speech using OpenAI's TTS API. Get audio feedback from Claude as you work! - Deterministic Auto-Speak: Every Claude response is automatically spoken (via Stop hook) - 6 High-Quality Voices: alloy, echo, fable, onyx, nova, shimmer - Worker Pool Architecture: Non-blocking queue with concurrent processing - Mutex-Protected Playb
Game Audio
Game audio principles. Sound design, music integration, adaptive audio systems.
Sonos TypeScript MCP Server
MCP Server: Your comprehensive Sonos control companion powered by the Model Context Protocol (MCP). This intelligent server provides seamless access to Sonos audio devices over your local network using UPnP/SOAP protocols. Whether you're controlling playback, managing zones, browsing your music library, or setting up alarms, this MCP server delivers complete device control directly to your AI assistant, enabl
Azure Speech To Text REST Py
Audio Transcriber
Transform audio recordings into professional Markdown documentation with intelligent summaries using LLM integration