mdskills
Audio & Video

Transcription, TTS, podcasts, video editing, streaming, music

16 listings

YouTube Summarizer

Extract transcripts from YouTube videos and generate comprehensive, detailed summaries using intelligent analysis frameworks

6.0 · 12 weekly · sickn33/antigravity-awesome-skills

Voice AI Engine Development

Build real-time conversational AI voice engines using async worker pipelines, streaming transcription, LLM agents, and TTS synthesis with interrupt handling and multi-provider support

9.0 · 1 weekly · sickn33/antigravity-awesome-skills

Barevalue MCP

MCP Server

MCP (Model Context Protocol) server for the Barevalue AI podcast editing API. Allows Claude Code and other MCP-compatible tools to submit and manage podcast editing orders programmatically. - Upload audio files directly from your local machine - Submit orders for AI-powered podcast editing - Check order status and download completed files - Manage webhooks for automated notifications - Pre-validat

8.0 · quietnotion/barevalue-mcp

Azure AI Voicelive Py

Build real-time voice AI applications using Azure AI Voice Live SDK (azure-ai-voicelive). Use this skill when creating Python applications that need real-time bidirectional audio communication with Azure AI, including voice assistants, voice-enabled chatbots, real-time speech-to-speech translation, voice-driven avatars, or any WebSocket-based audio streaming with AI models. Supports Server VAD (Voice Activity Detection), turn-based conversation, function calling, MCP tools, avatar integration, a

7.0 · sickn33/antigravity-awesome-skills

atsurae

MCP Server for AI-powered video editing. Let Claude, GPT, or any AI agent edit videos through natural language. Add to your Claude Desktop config (claude_desktop_config.json): Then restart Claude Desktop. You can now edit videos through conversation. Layer Compositing Model: Output: 1920x1080, 30fps, H.264 + AAC, MP4 atsurae.ai also exposes a REST API that any AI agent can call directly, without MCP

8.0 · 1000ri-jp/atsurae

Manim MCP Server

MCP Server

This is an MCP (Model Context Protocol) server that executes Manim animation code and returns the generated video. It allows users to send Manim scripts and receive the rendered animation. - Executes Manim Python scripts. - Saves animation output in a visible media folder. - Allows users to clean up temporary files after execution. - Portable and configurable via environment variables. Ensure you

6.0 · abhiemj/manim-mcp-server

Torrentclaw MCP

MCP Server

Model Context Protocol server for TorrentClaw, giving AI assistants the ability to search movies and TV shows, find torrents with magnet links, check streaming availability, and explore cast/crew metadata. torrentclaw-mcp is developed by TorrentClaw as part of its open-source ecosystem. It wraps the TorrentClaw API into the MCP standard so that any compatible AI assistant (Claude, GPT, etc.) can

7.0 · torrentclaw/torrentclaw-mcp

Video Edit MCP Server 🎬

MCP Server

A powerful Model Context Protocol (MCP) server designed for advanced video and audio editing operations. This server enables MCP clients, such as Claude Desktop, Cursor, and others, to perform comprehensive multimedia editing tasks through a standardized and unified interface. - Basic Editing: Trim, merge, resize, crop, rotate videos - Effects: Speed control, fade in/out, grayscale, mirror - Overlay

8.0 · Aditya2755/video-edit-mcp

TypeScript SDK for S2

TypeScript SDK for S2 This repo contains the official TypeScript SDK for S2, a serverless data store for streams, built on the service's REST API. S2 is a managed service that provides unlimited, durable streams. Streams can be appended to, with all new records added to the tail of the stream. You can read from any portion of a stream – indexing by record sequence number, or timestamp – and follow

7.0 · s2-streamstore/s2-sdk-typescript

Video Podcast Maker

Use when user provides a topic and wants an automated video podcast created, OR when user wants to learn/analyze video design patterns from reference videos. Handles research, script writing, TTS audio synthesis, Remotion video creation, and final MP4 output with background music. Also supports design learning from reference videos (learn command), style profile management, and design reference library.

8.3 · Agents365-ai/video-podcast-maker

AI Video Remix

MCP Server

Generate styled video compositions from your local video footage library using natural language. ShotAI handles shot-level indexing and semantic search; this tool handles the planning, music, and rendering. This repo ships a ready-to-install Claude Agent Skill in the skill/ directory. Install in Claude Code: Or point Claude Code settings to the local skill/ folder. Once installed, just describe wh

7.8 · abu-ShotAI/ai-video-remix

Claude Code TTS Plugin

Plugin

A Text-to-Speech MCP server plugin for Claude Code that converts text to speech using OpenAI's TTS API. Get audio feedback from Claude as you work! - Deterministic Auto-Speak: Every Claude response is automatically spoken (via Stop hook) - 6 High-Quality Voices: alloy, echo, fable, onyx, nova, shimmer - Worker Pool Architecture: Non-blocking queue with concurrent processing - Mutex-Protected Playb

9.0 · ybouhjira/claude-code-tts

Game Audio

Game audio principles. Sound design, music integration, adaptive audio systems.

6.0 · sickn33/antigravity-awesome-skills

Sonos TypeScript MCP Server

MCP Server

Your comprehensive Sonos control companion powered by the Model Context Protocol (MCP). This intelligent server provides seamless access to Sonos audio devices over your local network using UPnP/SOAP protocols. Whether you're controlling playback, managing zones, browsing your music library, or setting up alarms, this MCP server delivers complete device control directly to your AI assistant, enabl

9.0 · Tommertom/sonos-ts-mcp

Azure Speech To Text REST Py

8.0 · sickn33/antigravity-awesome-skills

Audio Transcriber

Transform audio recordings into professional Markdown documentation with intelligent summaries using LLM integration

7.0 · sickn33/antigravity-awesome-skills