---
name: batch-translate
description: Batch process books through the complete pipeline - generate cropped images for split pages, OCR all pages, then translate with context. Use when asked to process, OCR, translate, or batch process one or more books.
---

# Batch Book Translation Workflow

Process books through the complete pipeline: Crop → OCR → Translate

## Roadmap Reference

See `.claude/ROADMAP.md` for the translation priority list.

**Priority 1 = UNTRANSLATED** - These are highest priority for processing:
- Kircher encyclopedias (Oedipus, Musurgia, Ars Magna Lucis)
- Fludd: Utriusque Cosmi Historia
- Theatrum Chemicum, Musaeum Hermeticum
- Cardano: De Subtilitate
- Della Porta: Magia Naturalis
- Lomazzo, Poliziano, Landino

```bash
# Get roadmap with priorities
curl -s "https://sourcelibrary.org/api/books/roadmap" | jq '.books[] | select(.priority == 1) | {title, notes}'
```

Roadmap source: `src/app/api/books/roadmap/route.ts`

## Overview

This workflow handles the full processing pipeline for historical book scans:
1. **Generate Cropped Images** - For split two-page spreads, extract individual pages
2. **OCR** - Extract text from page images using Gemini vision
3. **Translate** - Translate OCR'd text with prior page context for continuity

## API Endpoints

| Endpoint | Purpose |
|----------|---------|
| `GET /api/books` | List all books |
| `GET /api/books/BOOK_ID` | Get book with all pages |
| `POST /api/jobs/queue-books` | Queue pages for Lambda worker processing (primary path) |
| `GET /api/jobs` | List processing jobs |
| `POST /api/jobs/JOB_ID/retry` | Retry failed pages in a job |
| `POST /api/jobs/JOB_ID/cancel` | Cancel a running job |
| `POST /api/books/BOOK_ID/batch-ocr-async` | Submit Gemini Batch API OCR job (50% cheaper, ~24h) |
| `POST /api/books/BOOK_ID/batch-translate-async` | Submit Gemini Batch API translation job |

## Processing Options

### Option 1: Lambda Workers via Job System (Primary Path)

The primary processing path uses AWS Lambda workers via SQS queues. Each page is processed independently with automatic job tracking.

```bash
# Queue OCR for a book's pages
curl -s -X POST "https://sourcelibrary.org/api/jobs/queue-books" \
  -H "Content-Type: application/json" \
  -d '{"bookIds": ["BOOK_ID"], "action": "ocr"}'

# Queue translation
curl -s -X POST "https://sourcelibrary.org/api/jobs/queue-books" \
  -H "Content-Type: application/json" \
  -d '{"bookIds": ["BOOK_ID"], "action": "translation"}'

# Queue image extraction
curl -s -X POST "https://sourcelibrary.org/api/jobs/queue-books" \
  -H "Content-Type: application/json" \
  -d '{"bookIds": ["BOOK_ID"], "action": "image_extraction"}'
```

**IMPORTANT: Always use `gemini-3-flash-preview` for all OCR and translation tasks. Do NOT use `gemini-2.5-flash`.**

### Option 2: Gemini Batch API (50% Cheaper, Automated Pipeline)

The post-import-pipeline cron uses the Gemini Batch API for automated processing of newly imported books.
Results arrive in ~24 hours at 50% cost.

| Job Type | API | Model | Cost |
|----------|-----|-------|------|
| Single page | Realtime (Lambda) | gemini-3-flash-preview | Full price |
| batch_ocr | Batch API | gemini-3-flash-preview | **50% off** |
| batch_translate | Batch API | gemini-3-flash-preview | **50% off** |

## OCR Output Format

OCR uses **Markdown output** with semantic tags:

### Markdown Formatting
- `# ## ###` for headings (bigger text = bigger heading)
- `**bold**`, `*italic*` for emphasis
- `->centered text<-` for centered lines (NOT for headings)
- `> blockquotes` for quotes/prayers
- `---` for dividers
- Tables only for actual tabular data

### Metadata Tags (hidden from readers)
| Tag | Purpose |
|-----|---------|
| `<lang>X</lang>` | Detected language |
| `<page-num>N</page-num>` | Page/folio number |
| `<header>X</header>` | Running headers |
| `<sig>X</sig>` | Printer's marks (A2, B1) |
| `<meta>X</meta>` | Hidden metadata |
| `<warning>X</warning>` | Quality issues |
| `<vocab>X</vocab>` | Key terms for indexing |

### Inline Annotations (visible to readers)
| Tag | Purpose |
|-----|---------|
| `<margin>X</margin>` | Marginal notes (before paragraph) |
| `<gloss>X</gloss>` | Interlinear annotations |
| `<insert>X</insert>` | Boxed text, additions |
| `<unclear>X</unclear>` | Illegible readings |
| `<note>X</note>` | Interpretive notes |
| `<term>X</term>` | Technical vocabulary |
| `<image-desc>X</image-desc>` | Describe illustrations |

### Critical OCR Rules
1. Preserve original spelling, capitalization, punctuation
2. Page numbers/headers/signatures go in metadata tags only
3. IGNORE partial text at edges (from facing page in spread)
4. Describe images/diagrams with `<image-desc>`, never tables
5. End with `<vocab>key terms, names, concepts</vocab>`

## Step 1: Analyze Book Status

First, check what work is needed for a book:

```bash
# Get book and analyze page status
curl -s "https://sourcelibrary.org/api/books/BOOK_ID" > /tmp/book.json

# Count pages by status (IMPORTANT: check length > 0, not just existence - empty strings are truthy!)
jq '{
  title: .title,
  total_pages: (.pages | length),
  split_pages: [.pages[] | select(.crop)] | length,
  needs_crop: [.pages[] | select(.crop) | select(.cropped_photo | not)] | length,
  has_ocr: [.pages[] | select((.ocr.data // "") | length > 0)] | length,
  needs_ocr: [.pages[] | select((.ocr.data // "") | length == 0)] | length,
  has_translation: [.pages[] | select((.translation.data // "") | length > 0)] | length,
  needs_translation: [.pages[] | select((.ocr.data // "") | length > 0) | select((.translation.data // "") | length == 0)] | length
}' /tmp/book.json
```

### Detecting Bad OCR

Pages that were OCR'd before cropped images were generated have incorrect OCR (it contains both pages of the spread).
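The telltale sign is OCR text that describes the spread itself. A quick stub check of that heuristic (the page records below are invented for illustration; requires only `jq`):

```shell
# Two stub pages: one contaminated by the facing page, one clean.
echo '[{"crop": {"xStart": 0}, "ocr": {"data": "Left page of a two-page spread: Caput I..."}},
       {"crop": {"xStart": 0}, "ocr": {"data": "Caput primum. De natura rerum."}}]' \
  | jq '[.[] | select(.ocr.data | test("two-page|spread"; "i"))] | length'
# prints 1 (only the contaminated page matches)
```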
Detect these:

```bash
# Find pages with crop data + OCR but missing cropped_photo at OCR time
# These often contain "two-page" or "spread" in the OCR text
jq '[.pages[] | select(.crop) | select(.ocr.data) |
  select(.ocr.data | test("two-page|spread"; "i"))] | length' /tmp/book.json
```

## Step 2: Generate Cropped Images

For books with split two-page spreads, generate individual page images:

```bash
# Get page IDs needing crops
CROP_IDS=$(jq '[.pages[] | select(.crop) | select(.cropped_photo | not) | .id]' /tmp/book.json)

# Create crop job
curl -s -X POST "https://sourcelibrary.org/api/jobs" \
  -H "Content-Type: application/json" \
  -d "{
    \"type\": \"generate_cropped_images\",
    \"book_id\": \"BOOK_ID\",
    \"book_title\": \"BOOK_TITLE\",
    \"page_ids\": $CROP_IDS
  }"
```

Process the job:

```bash
# Trigger processing (40 pages per request, auto-continues)
curl -s -X POST "https://sourcelibrary.org/api/jobs/JOB_ID/process"
```

## Step 3: OCR Pages

### Option A: Using Job System (for large batches)

```bash
# Get page IDs needing OCR (check for empty strings, not just null)
OCR_IDS=$(jq '[.pages[] | select((.ocr.data // "") | length == 0) | .id]' /tmp/book.json)

# Create OCR job
curl -s -X POST "https://sourcelibrary.org/api/jobs" \
  -H "Content-Type: application/json" \
  -d "{
    \"type\": \"batch_ocr\",
    \"book_id\": \"BOOK_ID\",
    \"book_title\": \"BOOK_TITLE\",
    \"model\": \"gemini-3-flash-preview\",
    \"language\": \"Latin\",
    \"page_ids\": $OCR_IDS
  }"
```

### Option B: Using Lambda Workers with Page IDs

```bash
# OCR specific pages (including overwrite)
curl -s -X POST "https://sourcelibrary.org/api/jobs/queue-books" \
  -H "Content-Type: application/json" \
  -d '{
    "bookIds": ["BOOK_ID"],
    "action": "ocr",
    "pageIds": ["PAGE_ID_1", "PAGE_ID_2"],
    "overwrite": true
  }'
```

Lambda workers automatically use `cropped_photo` when available.

## Step 4: Translate Pages

### Option A: Using Job System

```bash
# Get page IDs needing translation (must have OCR content, check for empty strings)
TRANS_IDS=$(jq '[.pages[] | select((.ocr.data // "") | length > 0) | select((.translation.data // "") | length == 0) | .id]' /tmp/book.json)

# Create translation job
curl -s -X POST "https://sourcelibrary.org/api/jobs" \
  -H "Content-Type: application/json" \
  -d "{
    \"type\": \"batch_translate\",
    \"book_id\": \"BOOK_ID\",
    \"book_title\": \"BOOK_TITLE\",
    \"model\": \"gemini-3-flash-preview\",
    \"language\": \"Latin\",
    \"page_ids\": $TRANS_IDS
  }"
```

### Option B: Using Lambda Workers (Recommended)

The Lambda FIFO queue automatically provides previous-page context for translation continuity:

```bash
# Queue translation for pages that have OCR but no translation
curl -s -X POST "https://sourcelibrary.org/api/jobs/queue-books" \
  -H "Content-Type: application/json" \
  -d '{"bookIds": ["BOOK_ID"], "action": "translation"}'
```

The translation Lambda worker processes pages sequentially via the FIFO queue and fetches the previous page's translation for context.

## Complete Book Processing Script

Process a single book through the full pipeline using Lambda workers:

```bash
#!/bin/bash
BOOK_ID="YOUR_BOOK_ID"
BASE_URL="https://sourcelibrary.org"

# 1. Fetch book data
echo "Fetching book..."
BOOK=$(curl -s "$BASE_URL/api/books/$BOOK_ID")
TITLE=$(echo "$BOOK" | jq -r '.title[0:40]')
echo "Processing: $TITLE"

# 2. Queue OCR (Lambda workers handle all pages automatically)
NEEDS_OCR=$(echo "$BOOK" | jq '[.pages[] | select((.ocr.data // "") | length == 0)] | length')
if [ "$NEEDS_OCR" != "0" ]; then
  echo "Queueing OCR for $NEEDS_OCR pages..."
  curl -s -X POST "$BASE_URL/api/jobs/queue-books" \
    -H "Content-Type: application/json" \
    -d "{\"bookIds\": [\"$BOOK_ID\"], \"action\": \"ocr\"}"
  echo "OCR job queued!"
fi

# 3. Queue translation (after OCR completes — check /jobs page)
NEEDS_TRANS=$(echo "$BOOK" | jq '[.pages[] | select((.ocr.data // "") | length > 0) | select((.translation.data // "") | length == 0)] | length')
if [ "$NEEDS_TRANS" != "0" ]; then
  echo "Queueing translation for $NEEDS_TRANS pages..."
  curl -s -X POST "$BASE_URL/api/jobs/queue-books" \
    -H "Content-Type: application/json" \
    -d "{\"bookIds\": [\"$BOOK_ID\"], \"action\": \"translation\"}"
  echo "Translation job queued!"
fi

echo "Jobs queued! Monitor progress at $BASE_URL/jobs"
```

## Fixing Bad OCR

When pages were OCR'd before cropped images existed, they contain text from both pages. Fix with:

```bash
# 1. Generate cropped images first (Step 2 above)

# 2. Find pages with bad OCR
BAD_OCR_IDS=$(jq '[.pages[] | select(.crop) | select(.ocr.data) |
  select(.ocr.data | test("two-page|spread"; "i")) | .id]' /tmp/book.json)

# 3. Re-OCR with overwrite via Lambda workers
curl -s -X POST "https://sourcelibrary.org/api/jobs/queue-books" \
  -H "Content-Type: application/json" \
  -d "{\"bookIds\": [\"BOOK_ID\"], \"action\": \"ocr\", \"pageIds\": $BAD_OCR_IDS, \"overwrite\": true}"
```

## Processing All Books

Use the Lambda worker job system for bulk processing:

```bash
#!/bin/bash
BASE_URL="https://sourcelibrary.org"

# Get all book IDs
BOOK_IDS=$(curl -s "$BASE_URL/api/books" | jq '[.[].id]')

# Queue OCR for all books (Lambda workers handle parallelism and rate limiting)
curl -s -X POST "$BASE_URL/api/jobs/queue-books" \
  -H "Content-Type: application/json" \
  -d "{\"bookIds\": $BOOK_IDS, \"action\": \"ocr\"}"

# After OCR completes, queue translation
curl -s -X POST "$BASE_URL/api/jobs/queue-books" \
  -H "Content-Type: application/json" \
  -d "{\"bookIds\": $BOOK_IDS, \"action\": \"translation\"}"
```

Monitor progress at https://sourcelibrary.org/jobs

## Monitoring Progress

Check overall library status:

```bash
curl -s "https://sourcelibrary.org/api/books" | jq '[.[] | {
  title: .title[0:30],
  pages: .pages_count,
  ocr: .ocr_count,
  translated: .translation_count
}] | sort_by(-.pages)'
```

## Troubleshooting

### Empty Strings vs Null (CRITICAL)
In jq, empty strings `""` are truthy!
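The pitfall is easy to demonstrate with a stub page list (invented records; requires only `jq`):

```shell
# Three stub pages: empty-string OCR, real OCR, and no OCR record at all.
PAGES='[{"ocr": {"data": ""}}, {"ocr": {"data": "Caput I"}}, {"ocr": null}]'

# Naive truthiness check: the empty string slips through.
echo "$PAGES" | jq '[.[] | select(.ocr.data)] | length'                       # prints 2

# Length check: counts only pages with real OCR content.
echo "$PAGES" | jq '[.[] | select((.ocr.data // "") | length > 0)] | length'  # prints 1
```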
This means:
- `select(.ocr.data)` matches pages with `""` (WRONG)
- `select(.ocr.data | not)` does NOT match pages with `""` (WRONG)
- Use `select((.ocr.data // "") | length == 0)` to find missing/empty OCR
- Use `select((.ocr.data // "") | length > 0)` to find pages WITH OCR content

### Rate Limits (429 errors)

#### Gemini API Tiers
| Tier | RPM | How to Qualify |
|------|-----|----------------|
| Free | 15 | Default |
| Tier 1 | 300 | Enable billing + $50 spend |
| Tier 2 | 1000 | $250 spend |
| Tier 3 | 2000 | $1000 spend |

#### Optimal Sleep Times by Tier
| Tier | Max RPM | Safe Sleep Time | Effective Rate |
|------|---------|-----------------|----------------|
| Free | 15 | 4.0s | ~15/min |
| Tier 1 | 300 | 0.4s | ~150/min |
| Tier 2 | 1000 | 0.12s | ~500/min |
| Tier 3 | 2000 | 0.06s | ~1000/min |

**Note:** Use ~50% of the max rate to leave headroom for bursts.

#### API Key Rotation
The system supports multiple API keys for higher throughput:
- Set `GEMINI_API_KEY` (primary)
- Set `GEMINI_API_KEY_2`, `GEMINI_API_KEY_3`, ... up to `GEMINI_API_KEY_10`
- Keys rotate automatically with a 60s cooldown after a rate limit

With N keys at Tier 1 you get N × 300 max RPM, or roughly N × 150 safe requests/min at the 50% headroom rate.

### Function Timeouts
- Jobs have `maxDuration=300s` on Vercel Pro
- If hitting timeouts, reduce `CROP_CHUNK_SIZE` in job processing

### Missing Cropped Photos
- Check if the crop job completed successfully
- Verify the page has `crop` data with `xStart` and `xEnd`
- Re-run crop generation for specific pages

### Bad OCR Detection
Look for these patterns in OCR text indicating the wrong image was used:
- "two-page spread"
- "left page" / "right page" descriptions
- Duplicate text blocks
- References to facing pages
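These patterns can be folded into one case-insensitive scan. A self-contained sketch on a stub book (invented records; a real run would point at the `/tmp/book.json` saved in Step 1):

```shell
# Stub book: one page contaminated by the facing page, one clean page.
cat > /tmp/demo_book.json <<'EOF'
{"pages": [
  {"id": "p1", "ocr": {"data": "The left page shows a woodcut; the right page begins Caput I."}},
  {"id": "p2", "ocr": {"data": "Caput primum. De natura rerum."}}
]}
EOF

# Emit the IDs of pages whose OCR matches any bad-OCR pattern.
jq -r '[.pages[]
        | select((.ocr.data // "") | test("two-page spread|left page|right page|facing page"; "i"))
        | .id] | .[]' /tmp/demo_book.json
# prints p1
```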