SKILL.md files package domain expertise into something any AI agent can use. Drop one into your project and your agent learns how to process PDFs, design interfaces, write tests, or whatever the skill teaches.
9 skills
Extract transcripts from YouTube videos and generate comprehensive, detailed summaries using intelligent analysis frameworks
Build real-time conversational AI voice engines using async worker pipelines, streaming transcription, LLM agents, and TTS synthesis with interrupt handling and multi-provider support
MCP Server for AI-powered video editing. Let Claude, GPT, or any AI agent edit videos through natural language. Add to your Claude Desktop config (claudedesktopconfig.json): Then restart Claude Desktop. You can now edit videos through conversation. Layer Compositing Model: Output: 1920x1080, 30fps, H.264 + AAC, MP4 atsurae.ai also exposes a REST API that any AI agent can call directly, without MCP
TypeScript SDK for S2 This repo contains the official TypeScript SDK for S2, a serverless data store for streams, built on the service's REST API. S2 is a managed service that provides unlimited, durable streams. Streams can be appended to, with all new records added to the tail of the stream. You can read from any portion of a stream – indexing by record sequence number, or timestamp – and follow
A Text-to-Speech MCP server plugin for Claude Code that converts text to speech using OpenAI's TTS API. Get audio feedback from Claude as you work! - Deterministic Auto-Speak: Every Claude response is automatically spoken (via Stop hook) - 6 High-Quality Voices: alloy, echo, fable, onyx, nova, shimmer - Worker Pool Architecture: Non-blocking queue with concurrent processing - Mutex-Protected Playb
Game audio principles. Sound design, music integration, adaptive audio systems.
Transform audio recordings into professional Markdown documentation with intelligent summaries using LLM integration
|
Build real-time voice AI applications using Azure AI Voice Live SDK (azure-ai-voicelive). Use this skill when creating Python applications that need real-time bidirectional audio communication with Azure AI, including voice assistants, voice-enabled chatbots, real-time speech-to-speech translation, voice-driven avatars, or any WebSocket-based audio streaming with AI models. Supports Server VAD (Voice Activity Detection), turn-based conversation, function calling, MCP tools, avatar integration, a