# xCOMET MCP Server

[npm](https://www.npmjs.com/package/xcomet-mcp-server) · [CI](https://github.com/shuji-bonji/xcomet-mcp-server/actions/workflows/ci.yml) · [MCP](https://modelcontextprotocol.io) · [MIT License](https://opensource.org/licenses/MIT)

**[日本語版 README はこちら](README.ja.md)**

> ⚠️ This is an unofficial community project, not affiliated with Unbabel.

Translation quality evaluation MCP Server powered by [xCOMET](https://github.com/Unbabel/COMET) (eXplainable COMET).

## 🎯 Overview

xCOMET MCP Server provides AI agents with the ability to evaluate machine translation quality. It integrates with the xCOMET model from Unbabel to provide:

- **Quality Scoring**: Scores from 0 to 1 indicating translation quality
- **Error Detection**: Identifies error spans with severity levels (minor/major/critical)
- **Batch Processing**: Evaluates multiple translation pairs efficiently (optimized single model load)
- **GPU Support**: Optional GPU acceleration for faster inference

```mermaid
graph LR
    A[AI Agent] --> B[Node.js MCP Server]
    B --> C[Python FastAPI Server]
    C --> D[xCOMET Model<br/>Persistent in Memory]
    D --> C
    C --> B
    B --> A

    style D fill:#9f9
```

## 🔧 Prerequisites

### Python Environment

xCOMET requires Python with the following packages:

```bash
pip install "unbabel-comet>=2.2.0" fastapi uvicorn
```

### Model Download

The first run will download the xCOMET model (~14GB for XL, ~42GB for XXL):

```bash
# Test model availability
python -c "from comet import download_model; download_model('Unbabel/XCOMET-XL')"
```

### Node.js

- Node.js >= 18.0.0
- npm or yarn

## 📦 Installation

```bash
# Clone the repository
git clone https://github.com/shuji-bonji/xcomet-mcp-server.git
cd xcomet-mcp-server

# Install dependencies
npm install

# Build
npm run build
```

## 🚀 Usage

### With Claude Desktop (npx)

Add to your Claude Desktop configuration (`claude_desktop_config.json`):

```json
{
  "mcpServers": {
    "xcomet": {
      "command": "npx",
      "args": ["-y", "xcomet-mcp-server"]
    }
  }
}
```

### With Claude Code

```bash
claude mcp add xcomet -- npx -y xcomet-mcp-server
```

### Local Installation

If you prefer a local installation:

```bash
npm install -g xcomet-mcp-server
```

Then configure:

```json
{
  "mcpServers": {
    "xcomet": {
      "command": "xcomet-mcp-server"
    }
  }
}
```

### HTTP Mode (Remote Access)

```bash
TRANSPORT=http PORT=3000 npm start
```

Then connect to `http://localhost:3000/mcp`.

## 🛠️ Available Tools

### `xcomet_evaluate`

Evaluates translation quality for a single source-translation pair.

**Parameters:**

| Name | Type | Required | Description |
|------|------|----------|-------------|
| `source` | string | ✅ | Original source text |
| `translation` | string | ✅ | Translated text to evaluate |
| `reference` | string | ❌ | Reference translation |
| `source_lang` | string | ❌ | Source language code (ISO 639-1) |
| `target_lang` | string | ❌ | Target language code (ISO 639-1) |
| `response_format` | "json" \| "markdown" | ❌ | Output format (default: "json") |
| `use_gpu` | boolean | ❌ | Use GPU for inference (default: false) |

**Example:**

```json
{
  "source": "The quick brown fox jumps over the lazy dog.",
  "translation": "素早い茶色のキツネが怠惰な犬を飛び越える。",
  "source_lang": "en",
  "target_lang": "ja",
  "use_gpu": true
}
```

**Response:**

```json
{
  "score": 0.847,
  "errors": [],
  "summary": "Good quality (score: 0.847) with 0 error(s) detected."
}
```

### `xcomet_detect_errors`

Focuses on detecting and categorizing translation errors.

**Parameters:**

| Name | Type | Required | Description |
|------|------|----------|-------------|
| `source` | string | ✅ | Original source text |
| `translation` | string | ✅ | Translated text to analyze |
| `reference` | string | ❌ | Reference translation |
| `min_severity` | "minor" \| "major" \| "critical" | ❌ | Minimum severity to report (default: "minor") |
| `response_format` | "json" \| "markdown" | ❌ | Output format |
| `use_gpu` | boolean | ❌ | Use GPU for inference (default: false) |

### `xcomet_batch_evaluate`

Evaluates multiple translation pairs in a single request.

> **Performance Note**: With the persistent server architecture (v0.3.0+), the model stays loaded in memory. Batch evaluation processes all pairs efficiently without reloading the model.

**Parameters:**

| Name | Type | Required | Description |
|------|------|----------|-------------|
| `pairs` | array | ✅ | Array of `{source, translation, reference?}` objects (max 500) |
| `source_lang` | string | ❌ | Source language code |
| `target_lang` | string | ❌ | Target language code |
| `response_format` | "json" \| "markdown" | ❌ | Output format |
| `use_gpu` | boolean | ❌ | Use GPU for inference (default: false) |
| `batch_size` | number | ❌ | Batch size 1-64 (default: 8). Larger is faster but uses more memory |

**Example:**

```json
{
  "pairs": [
    {"source": "Hello", "translation": "こんにちは"},
    {"source": "Goodbye", "translation": "さようなら"}
  ],
  "use_gpu": true,
  "batch_size": 16
}
```

## 🔗 Integration with Other MCP Servers

xCOMET MCP Server is designed to work alongside other MCP servers for complete translation workflows:

```mermaid
sequenceDiagram
    participant Agent as AI Agent
    participant DeepL as DeepL MCP Server
    participant xCOMET as xCOMET MCP Server

    Agent->>DeepL: Translate text
    DeepL-->>Agent: Translation result
    Agent->>xCOMET: Evaluate quality
    xCOMET-->>Agent: Score + Errors
    Agent->>Agent: Decide: Accept or retry?
```

### Recommended Workflow

1. **Translate** using DeepL MCP Server (official)
2. **Evaluate** using xCOMET MCP Server
3. **Iterate** if quality is below threshold

### Example: DeepL + xCOMET Integration

Configure both servers in Claude Desktop:

```json
{
  "mcpServers": {
    "deepl": {
      "command": "npx",
      "args": ["-y", "@anthropic/deepl-mcp-server"],
      "env": {
        "DEEPL_API_KEY": "your-api-key"
      }
    },
    "xcomet": {
      "command": "npx",
      "args": ["-y", "xcomet-mcp-server"]
    }
  }
}
```

Then ask Claude:

> "Translate this text to Japanese using DeepL, then evaluate the translation quality with xCOMET. If the score is below 0.8, suggest improvements."

## ⚙️ Configuration

### Environment Variables

| Variable | Default | Description |
|----------|---------|-------------|
| `TRANSPORT` | `stdio` | Transport mode: `stdio` or `http` |
| `PORT` | `3000` | HTTP server port (when TRANSPORT=http) |
| `XCOMET_MODEL` | `Unbabel/XCOMET-XL` | xCOMET model to use |
| `XCOMET_PYTHON_PATH` | (auto-detect) | Python executable path (see below) |
| `XCOMET_PRELOAD` | `false` | Pre-load model at startup (v0.3.1+) |
| `XCOMET_DEBUG` | `false` | Enable verbose debug logging (v0.3.1+) |

### Model Selection

Choose a model based on your quality/performance needs:

| Model | Parameters | Size | Memory | Reference | Quality | Use Case |
|-------|------------|------|--------|-----------|---------|----------|
| `Unbabel/XCOMET-XL` | 3.5B | ~14GB | ~8-10GB | Optional | ⭐⭐⭐⭐ | Recommended for most use cases |
| `Unbabel/XCOMET-XXL` | 10.7B | ~42GB | ~20GB | Optional | ⭐⭐⭐⭐⭐ | Highest quality, requires more resources |
| `Unbabel/wmt22-comet-da` | 580M | ~2GB | ~3GB | **Required** | ⭐⭐⭐ | Lightweight, faster loading |

> **Important**: `wmt22-comet-da` requires a `reference` translation for evaluation. XCOMET models support referenceless evaluation.

> **Tip**: If you experience memory issues or slow model loading, try `Unbabel/wmt22-comet-da` for faster performance with slightly lower accuracy (but remember to provide reference translations).

**To use a different model**, set the `XCOMET_MODEL` environment variable:

```json
{
  "mcpServers": {
    "xcomet": {
      "command": "npx",
      "args": ["-y", "xcomet-mcp-server"],
      "env": {
        "XCOMET_MODEL": "Unbabel/XCOMET-XXL"
      }
    }
  }
}
```

### Python Path Auto-Detection

The server automatically detects a Python environment with `unbabel-comet` installed, in this order:

1. **`XCOMET_PYTHON_PATH`** environment variable (if set)
2. **pyenv** versions (`~/.pyenv/versions/*/bin/python3`), checking for the `comet` module
3. **Homebrew** Python (`/opt/homebrew/bin/python3`, `/usr/local/bin/python3`)
4. **Fallback**: the `python3` command

This ensures the server works correctly even when the MCP host (e.g., Claude Desktop) uses a different Python than your terminal.

**Example: Explicit Python path configuration**

```json
{
  "mcpServers": {
    "xcomet": {
      "command": "npx",
      "args": ["-y", "xcomet-mcp-server"],
      "env": {
        "XCOMET_PYTHON_PATH": "/Users/you/.pyenv/versions/3.11.0/bin/python3"
      }
    }
  }
}
```

## ⚡ Performance

### Persistent Server Architecture (v0.3.0+)

The server uses a **persistent Python FastAPI server** that keeps the xCOMET model loaded in memory:

| Request | Time | Notes |
|---------|------|-------|
| First request | ~25-90s | Model loading (varies by model size) |
| Subsequent requests | **~500ms** | Model already loaded |

This provides a **177x speedup** for consecutive evaluations compared to reloading the model each time.

### Eager Loading (v0.3.1+)

Enable `XCOMET_PRELOAD=true` to pre-load the model at server startup:

```json
{
  "mcpServers": {
    "xcomet": {
      "command": "npx",
      "args": ["-y", "xcomet-mcp-server"],
      "env": {
        "XCOMET_PRELOAD": "true"
      }
    }
  }
}
```

With preload enabled, **all requests are fast** (~500ms), including the first one.

```mermaid
graph LR
    A[MCP Request] --> B[Node.js Server]
    B --> C[Python FastAPI Server]
    C --> D[xCOMET Model<br/>in Memory]
    D --> C
    C --> B
    B --> A

    style D fill:#9f9
```

### Batch Processing Optimization

The `xcomet_batch_evaluate` tool processes all pairs with a single model load:

| Pairs | Estimated Time |
|-------|----------------|
| 10 | ~30-40 sec |
| 50 | ~1-1.5 min |
| 100 | ~2 min |

### GPU vs CPU Performance

| Mode | 100 Pairs (Estimated) |
|------|----------------------|
| CPU (batch_size=8) | ~2 min |
| GPU (batch_size=16) | ~20-30 sec |

> **Note**: GPU mode requires CUDA-compatible hardware and PyTorch with CUDA support. If no GPU is available, set `use_gpu: false` (the default).

### Best Practices

**1. Let the persistent server do its job**

With v0.3.0+, the model stays in memory, so multiple `xcomet_evaluate` calls are efficient:

```
✅ Fast: first call loads the model, subsequent calls reuse it
   xcomet_evaluate(pair1)   # ~90s   (model loads)
   xcomet_evaluate(pair2)   # ~500ms (model cached)
   xcomet_evaluate(pair3)   # ~500ms (model cached)
```

**2. For many pairs, use batch evaluation**

```
✅ Even faster: batch all pairs in one call
   xcomet_batch_evaluate(allPairs)  # optimal throughput
```

**3. Memory considerations**

- XCOMET-XL requires ~8-10GB RAM
- For large batches (up to 500 pairs), ensure sufficient memory
- If memory is limited, split into smaller batches (100-200 pairs)

### Auto-Restart (v0.3.1+)

The server automatically recovers from failures:

- Monitors health every 30 seconds
- Restarts after 3 consecutive health-check failures
- Makes up to 3 restart attempts before giving up

## 📊 Quality Score Interpretation

| Score Range | Quality | Recommendation |
|-------------|---------|----------------|
| 0.9 - 1.0 | Excellent | Ready for use |
| 0.7 - 0.9 | Good | Minor review recommended |
| 0.5 - 0.7 | Fair | Post-editing needed |
| 0.0 - 0.5 | Poor | Re-translation recommended |

## 🔍 Troubleshooting

### Common Issues

#### "No module named 'comet'"

**Cause**: The Python environment in use does not have `unbabel-comet` installed.

**Solution**:
```bash
# Check which Python is being used
python3 -c "import sys; print(sys.executable)"

# Install all required packages
pip install "unbabel-comet>=2.2.0" fastapi uvicorn

# Or specify the Python path explicitly
export XCOMET_PYTHON_PATH=/path/to/python3
```

#### Model download fails or times out

**Cause**: Large model files (~14GB for XL) require a stable internet connection.

**Solution**:
```bash
# Pre-download the model manually
python -c "from comet import download_model; download_model('Unbabel/XCOMET-XL')"
```

#### GPU not detected

**Cause**: PyTorch is not installed with CUDA support.

**Solution**:
```bash
# Check CUDA availability
python -c "import torch; print(torch.cuda.is_available())"

# If False, reinstall PyTorch with CUDA support
pip install torch --index-url https://download.pytorch.org/whl/cu118
```

#### Slow performance on Mac (MPS)

**Cause**: Mac MPS (Metal Performance Shaders) has compatibility issues with some operations.

**Solution**: The server automatically uses `num_workers=1` for Mac MPS compatibility. For best performance on Mac, use CPU mode (`use_gpu: false`).

#### High memory usage or crashes

**Cause**: XCOMET-XL requires ~8-10GB RAM.

**Solutions**:

1. **Use the persistent server** (v0.3.0+): The model loads once and stays in memory, avoiding repeated memory spikes
2. **Use a lighter model**: Set `XCOMET_MODEL=Unbabel/wmt22-comet-da` for lower memory usage (~3GB)
3. **Reduce batch size**: For large batches, process in smaller chunks (100-200 pairs)
4. **Close other applications**: Free up RAM before running large evaluations

```bash
# Check available memory
free -h            # Linux
vm_stat | head -5  # macOS
```

#### VS Code or IDE crashes during evaluation

**Cause**: High memory usage from the xCOMET model (~8-10GB for XL).

**Solution**:
- With v0.3.0+, the model loads once and stays in memory (no repeated loading)
- If memory is still an issue, use a lighter model: `XCOMET_MODEL=Unbabel/wmt22-comet-da`
- Close other memory-intensive applications before evaluation

### Getting Help

If you encounter issues:

1. Check the [GitHub Issues](https://github.com/shuji-bonji/xcomet-mcp-server/issues)
2. Enable debug logging (`XCOMET_DEBUG=true`) and check Claude Desktop's Developer Mode logs
3. Open a new issue with:
   - Your OS and Python version
   - The error message
   - Your configuration (without sensitive data)

## 🧪 Development

```bash
# Install dependencies
npm install

# Build TypeScript
npm run build

# Watch mode
npm run dev

# Test with MCP Inspector
npm run inspect
```

## 📋 Changelog

See [CHANGELOG.md](CHANGELOG.md) for version history and updates.

## 📝 License

MIT License - see [LICENSE](LICENSE) for details.

## 🙏 Acknowledgments

- [Unbabel](https://unbabel.com/) for the xCOMET model
- [Anthropic](https://anthropic.com/) for the MCP protocol
- [Model Context Protocol](https://modelcontextprotocol.io/) community

## 📚 References

- [xCOMET Paper](https://direct.mit.edu/tacl/article/doi/10.1162/tacl_a_00683/124263/xcomet-Transparent-Machine-Translation-Evaluation)
- [COMET Framework](https://github.com/Unbabel/COMET)
- [MCP Specification](https://spec.modelcontextprotocol.io/)
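The thresholds in the Quality Score Interpretation table can also be applied programmatically when post-processing `xcomet_evaluate` results, for example to implement the "retry if the score is below 0.8" workflow described earlier. A minimal sketch — the helper names and messages here are illustrative, not part of the server's API:

```python
def interpret_score(score: float) -> tuple[str, str]:
    """Map an xCOMET score in [0, 1] to the (quality, recommendation)
    tiers from the Quality Score Interpretation table."""
    if not 0.0 <= score <= 1.0:
        raise ValueError(f"score must be in [0, 1], got {score}")
    if score >= 0.9:
        return ("Excellent", "Ready for use")
    if score >= 0.7:
        return ("Good", "Minor review recommended")
    if score >= 0.5:
        return ("Fair", "Post-editing needed")
    return ("Poor", "Re-translation recommended")


def should_retry(score: float, threshold: float = 0.8) -> bool:
    """Decide whether to request a new translation, mirroring the
    DeepL + xCOMET iteration loop with a 0.8 acceptance threshold."""
    return score < threshold
```

For instance, the sample response above (`"score": 0.847`) maps to the "Good" tier, and with a 0.8 threshold no retry would be requested.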
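Similarly, the memory guidance for `xcomet_batch_evaluate` (split very large workloads into chunks of 100-200 pairs, with the tool capped at 500 per call) can be handled client-side before issuing requests. A sketch under the assumption that each pair is a `{source, translation}` dict as shown in the batch example; the helper name is illustrative:

```python
from typing import Dict, Iterator, List


def chunk_pairs(pairs: List[Dict], chunk_size: int = 200) -> Iterator[List[Dict]]:
    """Yield slices of at most chunk_size pairs, each suitable for a
    separate xcomet_batch_evaluate call (the tool accepts max 500)."""
    if not 1 <= chunk_size <= 500:
        raise ValueError("chunk_size must be between 1 and 500")
    for i in range(0, len(pairs), chunk_size):
        yield pairs[i : i + chunk_size]
```

Each yielded chunk stays within the tool's limits while keeping peak memory bounded.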