How do I install Video Podcast Maker?

Install Video Podcast Maker with a single command: npx mdskills install Agents365-ai/video-podcast-maker. This downloads the skill files into your project and your AI agent picks them up automatically.

What platforms support Video Podcast Maker?

Video Podcast Maker works with Claude Code, Claude Desktop, Cursor, VS Code Copilot, Windsurf, Continue Dev, Codex, Gemini CLI, Amp, Roo Code, Goose, OpenCode, Trae, Qodo, and Command Code. Skills use the open SKILL.md format, which is compatible with any AI coding agent that reads markdown instructions.

Video Podcast Maker

Name: Video Podcast Maker: AI Agent Skill
Brand: Agents365-ai
Availability: InStock
Rating: 8.3 (1 review)
Author: Agents365-ai

Verified

Video & Podcast · Intermediate

Use when user provides a topic and wants an automated video podcast created, OR when user wants to learn/analyze video design patterns from reference videos — handles research, script writing, TTS audio synthesis, Remotion video creation, and final MP4 output with background music. Also supports design learning from reference videos (learn command), style profile management, and design reference library.

by @Agents365-ai 8 downloads 330 Updated 4/4/2026

Add this skill

npx mdskills install Agents365-ai/video-podcast-maker


Skill Advisor: 8.3

Comprehensive video podcast automation pipeline with strong platform support, TTS options, and visual design learning

+ Provides a complete 14-step workflow with auto/interactive modes and state tracking
+ Supports multiple platforms (Bilibili, YouTube, etc.) with proper localization and multiple TTS engines
+ Includes a design learning system to extract and apply visual patterns from reference videos
- Lacks concrete validation criteria and error handling patterns for render failures
- Dependencies on external services (Azure TTS, Remotion) create potential failure points

SKILL.md


---
name: video-podcast-maker
description: Use when user provides a topic and wants an automated video podcast created, OR when user wants to learn/analyze video design patterns from reference videos; handles research, script writing, TTS audio synthesis, Remotion video creation, and final MP4 output with background music. Also supports design learning from reference videos (learn command), style profile management, and design reference library. Supports Bilibili, YouTube, Xiaohongshu, Douyin, and WeChat Channels platforms with independent language configuration (zh-CN, en-US).
argument-hint: "[topic]"
effort: high
allowed-tools: Bash, Read, Write, Edit, Glob, Grep, WebFetch, WebSearch, Agent
# --- Claude Code fields above, OpenClaw/SkillsMP fields below ---
author: Agents365-ai
category: Content Creation
version: 2.0.0
created: 2025-01-27
updated: 2026-04-03
bilibili: https://space.bilibili.com/441831884
github: https://github.com/Agents365-ai/video-podcast-maker
dependencies:
  - remotion-best-practices
metadata:
  openclaw:
    requires:
      bins:
        - python3
        - ffmpeg
        - node
        - npx
    primaryEnv: AZURE_SPEECH_KEY
    emoji: "🎬"
    homepage: https://github.com/Agents365-ai/video-podcast-maker
    os: ["macos", "linux"]
    install:
      - kind: brew
        formula: ffmpeg
        bins: [ffmpeg]
      - kind: uv
        package: edge-tts
        bins: [edge-tts]
---

> **REQUIRED: Load Remotion Best Practices First**
>
> This skill depends on `remotion-best-practices`. **You MUST invoke it before proceeding:**
> ```
> Skill tool: skill="remotion-best-practices"
> ```

# Video Podcast Maker

## Quick Start

Open Claude Code and say: **"Make a video podcast about $ARGUMENTS"**

Or invoke directly: `/video-podcast-maker AI Agent tutorial`

---

## Design Learning

Extract visual design patterns from reference videos or images, store them in a searchable library, and apply them to new video compositions.

### Commands

```bash
# Learn from images (Claude Vision analyzes design patterns)
python3 learn_design.py ./screenshot1.png ./screenshot2.png

# Learn from a local video (ffmpeg extracts frames automatically)
python3 learn_design.py ./reference.mp4

# Learn from a URL (Playwright captures screenshots; experimental)
python3 learn_design.py https://www.bilibili.com/video/BV1xx411c7mD

# Save with a named profile and tags
python3 learn_design.py ./reference.mp4 --profile "tech-minimal" --tags "tech,minimal,dark"
```

### Reference Library Commands

```
references list          # List all stored references (auto-cleans orphaned entries)
references show <id>     # Show full design report for a reference
references delete <id>   # Delete a reference and its files
```

### Style Profile Commands

```
profiles list            # List all saved style profiles
profiles show <name>     # Show profile props_override
profiles delete <name>   # Delete a style profile
profiles create <name>   # Create a new style profile interactively
```

### Pre-Workflow Usage

When the user provides a reference video or image alongside a video creation request, extract design patterns before Step 1 and apply them as session overrides. See `references/workflow-steps.md` → Pre-workflow section for the full extraction flow.

### Step 9 Integration

Before choosing visual design in Step 9, check for matching style profiles or reference library entries. Apply the best match as a starting point for Remotion composition props. See `references/workflow-steps.md` → Step 9 Style Profile Integration for the priority chain.

---

## Auto Update Check

**Agent behavior:** Check for updates at most once per day (throttled by a timestamp file):

```bash
STAMP="${CLAUDE_SKILL_DIR}/.last_update_check"
NOW=$(date +%s)
LAST=$(cat "$STAMP" 2>/dev/null || echo 0)
# ${LAST:-0} guards against an empty stamp file
if [ $((NOW - ${LAST:-0})) -gt 86400 ]; then
  # Quote the skill directory so paths containing spaces still work
  timeout 5 git -C "${CLAUDE_SKILL_DIR}" fetch --quiet 2>/dev/null || true
  LOCAL=$(git -C "${CLAUDE_SKILL_DIR}" rev-parse HEAD 2>/dev/null)
  REMOTE=$(git -C "${CLAUDE_SKILL_DIR}" rev-parse origin/main 2>/dev/null)
  echo "$NOW" > "$STAMP"
  if [ -n "$LOCAL" ] && [ -n "$REMOTE" ] && [ "$LOCAL" != "$REMOTE" ]; then
    echo "UPDATE_AVAILABLE"
  else
    echo "UP_TO_DATE"
  fi
else
  echo "SKIPPED_RECENT_CHECK"
fi
```

- **Update available**: Ask user via AskUserQuestion. Yes → `git -C "${CLAUDE_SKILL_DIR}" pull`. No → continue.
- **Up to date / Skipped**: Continue silently.

---

## Prerequisites Check

!`( missing=""; node -v >/dev/null 2>&1 || missing="$missing node"; python3 --version >/dev/null 2>&1 || missing="$missing python3"; ffmpeg -version >/dev/null 2>&1 || missing="$missing ffmpeg"; [ -n "$AZURE_SPEECH_KEY" ] || missing="$missing AZURE_SPEECH_KEY"; if [ -n "$missing" ]; then echo "MISSING:$missing"; else echo "ALL_OK"; fi )`

**If MISSING reported above**, see README.md for full setup instructions (install commands, API key setup, Remotion project init).
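The same check can be written in Python if the one-liner becomes hard to extend. This is a minimal sketch, not part of the skill: `missing_prerequisites` and `report` are hypothetical helpers, and the lookup function and environment are injected so the logic can be exercised without touching the real system. Unlike the shell version, it only checks that each binary is on the PATH and does not run it.

```python
import os
import shutil
from typing import Callable, Mapping, Optional

REQUIRED_BINS = ("node", "python3", "ffmpeg")  # the tools the shell check probes
REQUIRED_ENV = ("AZURE_SPEECH_KEY",)


def missing_prerequisites(
    which: Callable[[str], Optional[str]] = shutil.which,
    env: Mapping[str, str] = os.environ,
) -> list[str]:
    """Return the names of missing binaries and unset or empty env variables."""
    missing = [name for name in REQUIRED_BINS if which(name) is None]
    missing += [name for name in REQUIRED_ENV if not env.get(name)]
    return missing


def report(missing: list[str]) -> str:
    """Format the result in the same shape the shell check prints."""
    return "MISSING: " + " ".join(missing) if missing else "ALL_OK"


if __name__ == "__main__":
    print(report(missing_prerequisites()))
```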

---

## Overview

Automated pipeline that turns a topic into a professional **Bilibili horizontal knowledge video**.

> **Target: Bilibili horizontal video (16:9)**
> - Resolution: 3840×2160 (4K) or 1920×1080 (1080p)
> - Style: Clean white (default)

**Tech stack:** Claude + Azure TTS + Remotion + FFmpeg

### Output Specs

| Parameter | Horizontal (16:9) | Vertical (9:16) |
|-----------|-------------------|-----------------|
| **Resolution** | 3840×2160 (4K) | 2160×3840 (4K) |
| **Frame rate** | 30 fps | 30 fps |
| **Encoding** | H.264, 16 Mbps | H.264, 16 Mbps |
| **Audio** | AAC, 192 kbps | AAC, 192 kbps |
| **Duration** | 1-15 min | 60-90 s (highlight) |

---

## Execution Modes

**Agent behavior:** Detect user intent at workflow start:

- "Make a video about..." / no special instructions → **Auto Mode**
- "I want to control each step" / mentions interactive → **Interactive Mode**
- Default: **Auto Mode**

### Auto Mode (Default)

Full pipeline with sensible defaults. **Only 1 mandatory stop:**

1. **Step 10**: After the 720p preview render, the user reviews and confirms before the 4K render

| Step | Decision | Auto Default |
|------|----------|-------------|
| 3 | Title position | top-center |
| 5 | Media assets | Skip (text-only animations) |
| 7 | Thumbnail method | Remotion-generated (16:9 + 4:3) |
| 9 | Outro animation | Pre-made MP4 (white/black by theme) |
| 9 | Preview method | Preview render (720p, self-validates) |
| 12 | Subtitles | Skip |
| 14 | Cleanup | Auto-clean temp files |

Users can override any default in their initial request:
- "make a video about AI, burn subtitles" → auto + subtitles on
- "use dark theme, AI thumbnails" → auto + dark + imagen
- "need screenshots" → auto + media collection enabled

### Interactive Mode

Prompts at each decision point. Activated by:
- "interactive mode" / "I want to choose each option"
- User explicitly requests control

---

## Workflow State & Resume

> **Planned feature (not yet implemented).** Currently, workflow progress is tracked via Claude's conversation context. If a session is interrupted, re-invoke the skill and Claude will check existing files in `videos/{name}/` to determine where to resume.
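The file-based resume check described above could look roughly like the following. This is a sketch under assumptions: the file-to-step mapping is taken from the Directory Structure section of this document, `ARTIFACT_STEPS` and `last_completed_step` are hypothetical names, and the skill itself leaves this decision to Claude rather than to a script.

```python
from fnmatch import fnmatch
from typing import Iterable

# (filename pattern, workflow step that produces it), per the directory layout
ARTIFACT_STEPS = [
    ("topic_definition.md", 1),
    ("topic_research.md", 2),
    ("podcast.txt", 4),
    ("thumbnail_*.png", 7),
    ("timing.json", 8),
    ("output.mp4", 10),
    ("video_with_bgm.mp4", 11),
    ("final_video.mp4", 12),
]


def last_completed_step(filenames: Iterable[str]) -> int:
    """Return the highest step whose output file is present, or 0 if none are."""
    names = list(filenames)
    done = [
        step
        for pattern, step in ARTIFACT_STEPS
        if any(fnmatch(name, pattern) for name in names)
    ]
    return max(done, default=0)


# Example: os.listdir("videos/my-video") would supply the filenames in practice
resume_after = last_completed_step(["topic_definition.md", "topic_research.md", "podcast.txt"])
```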

---

## Technical Rules

Hard constraints for video production (visual design is left to Claude's creative freedom):

| Rule | Requirement |
|------|-------------|
| **Single Project** | All videos under `videos/{name}/` in user's Remotion project. NEVER create a new project per video. |
| **4K Output** | 3840×2160, use `scale(2)` wrapper over 1920×1080 design space |
| **Content Width** | ≥85% of screen width |
| **Bottom Safe Zone** | Bottom 100px reserved for subtitles |
| **Audio Sync** | All animations driven by `timing.json` timestamps |
| **Thumbnail** | MUST generate 16:9 (1920×1080) AND 4:3 (1200×900). Title ≥80px bold, high contrast. |
| **Font** | PingFang SC / Noto Sans SC for Chinese text |

---

## Additional Resources

Claude loads these files on demand. **Do NOT load all at once**:

- **[references/workflow-steps.md](references/workflow-steps.md)**: Detailed step-by-step instructions (Steps 1-14). Load at workflow start.
- **[references/design-guide.md](references/design-guide.md)**: Visual minimums, typography, layout patterns, checklists. **MUST load before Step 9.**
- **[references/troubleshooting.md](references/troubleshooting.md)**: Error fixes, BGM options, preference commands, preference learning. Load on error or user request.
- **[examples/](examples/)**: Real production video projects. Claude may reference these for composition structure and timing.json format.

---

## Directory Structure

```
project-root/                           # Remotion project root
├── src/remotion/                       # Remotion source
│   ├── compositions/                   # Video composition definitions
│   ├── Root.tsx                        # Remotion entry
│   └── index.ts                        # Exports
│
├── public/                             # Remotion default (unused; use --public-dir videos/{name}/)
│
├── videos/{video-name}/                # Video project assets
│   ├── workflow_state.json             # Workflow progress
│   ├── topic_definition.md             # Step 1
│   ├── topic_research.md               # Step 2
│   ├── podcast.txt                     # Step 4: narration script
│   ├── podcast_audio.wav               # Step 8: TTS audio
│   ├── podcast_audio.srt               # Step 8: subtitles
│   ├── timing.json                     # Step 8: timeline
│   ├── thumbnail_*.png                 # Step 7
│   ├── output.mp4                      # Step 10
│   ├── video_with_bgm.mp4              # Step 11
│   ├── final_video.mp4                 # Step 12: final output
│   └── bgm.mp3                         # Background music
│
└── remotion.config.ts
```

> **Important**: Always use `--public-dir` and full output path for Remotion render:
> ```bash
> npx remotion render src/remotion/index.ts CompositionId videos/{name}/output.mp4 --public-dir videos/{name}/
> ```

### Naming Rules

**Video name `{video-name}`**: lowercase English, hyphen-separated (e.g., `reference-manager-comparison`)

**Section name `{section}`**: lowercase English, underscore-separated, matches `[SECTION:xxx]`

**Thumbnail naming** (16:9 AND 4:3 both required):

| Type | 16:9 | 4:3 |
|------|------|-----|
| Remotion | `thumbnail_remotion_16x9.png` | `thumbnail_remotion_4x3.png` |
| AI | `thumbnail_ai_16x9.png` | `thumbnail_ai_4x3.png` |
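The naming rules can be checked mechanically before any files are written. The following is a minimal sketch with hypothetical helper names; allowing digits in names is an assumption, since the rules above only say "lowercase English".

```python
import re
from typing import Iterable

# Assumption: digits are allowed alongside lowercase letters
VIDEO_NAME_RE = re.compile(r"[a-z0-9]+(?:-[a-z0-9]+)*")    # hyphen-separated
SECTION_NAME_RE = re.compile(r"[a-z0-9]+(?:_[a-z0-9]+)*")  # underscore-separated


def is_valid_video_name(name: str) -> bool:
    """True for names like 'reference-manager-comparison'."""
    return VIDEO_NAME_RE.fullmatch(name) is not None


def is_valid_section_name(name: str) -> bool:
    """True for names like 'feature_overview'."""
    return SECTION_NAME_RE.fullmatch(name) is not None


def missing_thumbnails(filenames: Iterable[str], kind: str = "remotion") -> list[str]:
    """Return the required thumbnail files (16:9 and 4:3) that are absent."""
    required = {f"thumbnail_{kind}_{ratio}.png" for ratio in ("16x9", "4x3")}
    return sorted(required - set(filenames))
```

For example, `missing_thumbnails(["thumbnail_remotion_16x9.png"])` reports that the 4:3 variant still needs to be generated.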

### Public Directory

Use `--public-dir videos/{name}/` for all Remotion commands. Each video's assets (timing.json, podcast_audio.wav, bgm.mp3) stay in its own directory, so no copying to `public/` is needed. This enables parallel renders of different videos.

```bash
# All render/studio/still commands use --public-dir
npx remotion studio src/remotion/index.ts --public-dir videos/{name}/
npx remotion render src/remotion/index.ts CompositionId videos/{name}/output.mp4 --public-dir videos/{name}/ --video-bitrate 16M
# Output filename follows the thumbnail naming rules
npx remotion still src/remotion/index.ts Thumbnail16x9 videos/{name}/thumbnail_remotion_16x9.png --public-dir videos/{name}/
```

---

## Workflow

### Progress Tracking

At Step 1 start:
1. Create `videos/{name}/workflow_state.json`
2. Use `TaskCreate` to create tasks per step. Mark `in_progress` on start, `completed` on finish.
3. Each step updates BOTH `workflow_state.json` AND TaskUpdate.

```
 1. Define topic direction → topic_definition.md
 2. Research topic → topic_research.md
 3. Design video sections (5-7 chapters)
 4. Write narration script → podcast.txt
 5. Collect media assets → media_manifest.json
 6. Generate publish info (Part 1) → publish_info.md
 7. Generate thumbnails (16:9 + 4:3) → thumbnail_*.png
 8. Generate TTS audio → podcast_audio.wav, timing.json
 9. Create Remotion composition + Studio preview
10. Render 4K video → output.mp4
11. Mix background music → video_with_bgm.mp4
12. Add subtitles (optional) → final_video.mp4
13. Complete publish info (Part 2) → chapter timestamps
14. Verify output & cleanup
15. Generate vertical shorts (optional) → shorts/
```

### Validation Checkpoints

**After Step 8 (TTS)**:
- [ ] `podcast_audio.wav` exists and plays correctly
- [ ] `timing.json` has all sections with correct timestamps
- [ ] `podcast_audio.srt` encoding is UTF-8

**After Step 10 (Render)**:
- [ ] `output.mp4` resolution is 3840×2160
- [ ] Audio-video sync verified
- [ ] No black frames
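Two of the checkboxes above can be automated. The sketch below is one possible approach, not part of the skill: it assumes `ffprobe` (shipped with ffmpeg) is on the PATH, and the helper names are hypothetical. It does not cover audio-video sync or black-frame detection, and it does not inspect `timing.json` because that file's schema is not shown here.

```python
import subprocess
from pathlib import Path


def parse_resolution(ffprobe_output: str) -> tuple[int, int]:
    """Parse ffprobe's 'WIDTHxHEIGHT' csv output, e.g. '3840x2160'."""
    width, height = ffprobe_output.strip().split("x")
    return int(width), int(height)


def probe_resolution(video: Path) -> tuple[int, int]:
    """Ask ffprobe for the width and height of the first video stream."""
    result = subprocess.run(
        ["ffprobe", "-v", "error", "-select_streams", "v:0",
         "-show_entries", "stream=width,height",
         "-of", "csv=s=x:p=0", str(video)],
        capture_output=True, text=True, check=True,
    )
    return parse_resolution(result.stdout)


def is_utf8(data: bytes) -> bool:
    """True if the bytes decode cleanly as UTF-8 (for the .srt encoding check)."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Intended use (requires real files, so not executed here):
#   probe_resolution(Path("videos/my-video/output.mp4")) == (3840, 2160)
#   is_utf8(Path("videos/my-video/podcast_audio.srt").read_bytes())
```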

---

## Key Commands Reference

See **CLAUDE.md** for the full command reference (TTS, Remotion, FFmpeg, shorts generation).

---

## User Preference System

Skill learns and applies preferences automatically. See [references/troubleshooting.md](references/troubleshooting.md) for commands and learning details.

### Storage Files

| File | Purpose |
|------|---------|
| `user_prefs.json` | Learned preferences (auto-created from template) |
| `user_prefs.template.json` | Default values |
| `prefs_schema.json` | JSON schema definition |

### Priority

```
Final = merge(Root.tsx defaults < global < topic_patterns[type] < current instructions)
```
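The priority chain can be read as a layered merge in which later layers win. The sketch below illustrates that reading; the recursive merge of nested objects is an assumption (the source does not say whether the merge is deep or shallow), and `merge_layers` and all keys and values shown are hypothetical.

```python
from typing import Any, Mapping


def merge_layers(*layers: Mapping[str, Any]) -> dict[str, Any]:
    """Merge dicts left to right; later layers win, nested dicts merge recursively."""
    merged: dict[str, Any] = {}
    for layer in layers:
        for key, value in layer.items():
            if isinstance(value, Mapping) and isinstance(merged.get(key), dict):
                merged[key] = merge_layers(merged[key], value)
            else:
                merged[key] = value
    return merged


# Hypothetical values, lowest priority first
root_defaults = {"theme": "light", "subtitles": False, "fonts": {"title": 80}}
global_prefs = {"fonts": {"title": 96}}
topic_pattern = {"theme": "dark"}
current_instructions = {"subtitles": True}

final = merge_layers(root_defaults, global_prefs, topic_pattern, current_instructions)
```

Here `final` takes the theme from the topic pattern, the subtitle setting from the current instructions, and the title size from the global preferences, while `root_defaults` itself is left unmodified.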

### User Commands

| Command | Effect |
|---------|--------|
| "show preferences" | Show current preferences |
| "reset preferences" | Reset to defaults |
| "save as X default" | Save to topic_patterns |

---

## Troubleshooting & Preferences

> **Full reference:** Read [references/troubleshooting.md](references/troubleshooting.md) on errors, preference questions, or BGM options.
