<p align="center">
  <img src="site/assets/logo.png" alt="AgentSys" width="120">
</p>

<h1 align="center">AgentSys</h1>

<p align="center">
  <strong>A modular runtime and orchestration system for AI agents.</strong>
</p>

> **Renamed from `awesome-slash`** — The `awesome-` prefix implies a curated list of links, but this project is a functional software suite and runtime. Please update your installs: `npm install -g agentsys`

<p align="center">
  <a href="https://www.npmjs.com/package/agentsys"><img src="https://img.shields.io/npm/v/agentsys.svg" alt="npm version"></a>
  <a href="https://www.npmjs.com/package/agentsys"><img src="https://img.shields.io/npm/dm/agentsys.svg" alt="npm downloads"></a>
  <a href="https://github.com/avifenesh/agentsys/actions/workflows/ci.yml"><img src="https://github.com/avifenesh/agentsys/actions/workflows/ci.yml/badge.svg" alt="CI"></a>
  <a href="https://github.com/avifenesh/agentsys/stargazers"><img src="https://img.shields.io/github/stars/avifenesh/agentsys.svg" alt="GitHub stars"></a>
  <a href="https://opensource.org/licenses/MIT"><img src="https://img.shields.io/badge/License-MIT-yellow.svg" alt="License: MIT"></a>
  <a href="https://avifenesh.github.io/agentsys/"><img src="https://img.shields.io/badge/Website-AgentSys-blue?style=flat&logo=github" alt="Website"></a>
  <a href="https://github.com/hesreallyhim/awesome-claude-code"><img src="https://awesome.re/mentioned-badge.svg" alt="Mentioned in Awesome Claude Code"></a>
</p>

<p align="center">
  <b>13 plugins · 42 agents · 28 skills · 26k lines of lib code · 3,357 tests · 3 platforms</b>
</p>

<p align="center">
  <a href="#commands">Commands</a> · <a href="#installation">Installation</a> · <a href="https://avifenesh.github.io/agentsys/">Website</a> · <a href="https://github.com/avifenesh/agentsys/discussions">Discussions</a>
</p>

<p align="center">
  <b>Built for Claude Code · Codex CLI · OpenCode</b>
</p>

<p align="center"><em>New skills, agents, and
integrations ship constantly. Follow for real-time updates:</em></p>

<p align="center">
  <a href="https://x.com/avi_fenesh"><img src="https://img.shields.io/badge/Follow-@avi__fenesh-1DA1F2?style=for-the-badge&logo=x&logoColor=white" alt="Follow on X"></a>
</p>

---

AI models can write code. That's not the hard part anymore. The hard part is everything around it — task selection, branch management, code review, artifact cleanup, CI, PR comments, deployment. **AgentSys is the runtime that orchestrates agents to handle all of it** — structured pipelines, gated phases, specialized agents, and persistent state that survives session boundaries.

---

> Building custom skills, agents, hooks, or MCP tools? [agnix](https://github.com/avifenesh/agnix) is the CLI + LSP linter that catches config errors before they fail silently: real-time IDE validation, auto-suggestions, auto-fix, and 155 rules for Cursor, Claude Code, Cline, Copilot, Codex, Windsurf, and more.

## What This Is

An agent orchestration system — 13 plugins, 42 agents, and 28 skills that compose into structured pipelines for software development.

Each agent has a single responsibility, a specific model assignment, and defined inputs and outputs. Pipelines enforce phase gates so agents can't skip steps. State persists across sessions so work survives interruptions.

The system runs on Claude Code, OpenCode, and Codex CLI. Install the plugins, get the runtime.

---

## The Approach

**Code does code work.
AI does AI work.**

- **Detection**: regex, AST analysis, static analysis—fast, deterministic, no tokens wasted
- **Judgment**: LLM calls for synthesis, planning, review—where reasoning matters
- **Result**: 77% fewer tokens for [/drift-detect](#drift-detect) vs multi-agent approaches, certainty-graded findings throughout

**Certainty levels exist because not all findings are equal:**

| Level | Meaning | Action |
|-------|---------|--------|
| HIGH | Definitely a problem | Safe to auto-fix |
| MEDIUM | Probably a problem | Needs context |
| LOW | Might be a problem | Needs human judgment |

This grading came from testing on 1,000+ repositories.

---

## Commands

<!-- GEN:START:readme-commands -->
| Command | What it does |
|---------|--------------|
| [`/next-task`](#next-task) | Task → exploration → plan → implementation → review → ship |
| [`/agnix`](#agnix) | **Lint agent configs** - 155 rules for Skills, Memory, Hooks, MCP across 10+ AI tools |
| [`/ship`](#ship) | Branch → PR → CI → reviews addressed → merge → cleanup |
| [`/deslop`](#deslop) | 3-phase detection pipeline, certainty-graded findings |
| [`/perf`](#perf) | 10-phase performance investigation with baselines and profiling |
| [`/drift-detect`](#drift-detect) | AST-based plan-vs-code analysis, finds what's documented but not built |
| [`/audit-project`](#audit-project) | Multi-agent code review, iterates until issues resolved |
| [`/enhance`](#enhance) | Analyzes prompts, agents, plugins, docs, hooks, skills |
| [`/repo-map`](#repo-map) | AST symbol and import mapping via ast-grep |
| [`/sync-docs`](#sync-docs) | Finds outdated references, stale examples, missing CHANGELOG entries |
| [`/learn`](#learn) | Research any topic, gather online sources, create a learning guide with a RAG index |
| [`/consult`](#consult) | Consult another AI CLI tool for a second opinion. Use it to cross-check ideas, get alternative approaches, or validate decisions with Gemini, Codex, Claude, OpenCode, or Copilot. |
| [`/debate`](#debate) | Structured debate between two AI tools with proposer/challenger roles and a verdict. Use when you want to "debate", "argue about", "compare perspectives", "stress-test an idea", play "devil's advocate", or pit tool vs tool. |
<!-- GEN:END:readme-commands -->

Each command works standalone. Together, they compose into end-to-end pipelines.

---

## Skills

<!-- GEN:START:readme-skills -->
28 skills included across the plugins:

| Category | Skills |
|----------|--------|
| **Performance** | `perf:perf-analyzer`, `perf:perf-baseline-manager`, `perf:perf-benchmarker`, `perf:perf-code-paths`, `perf:perf-investigation-logger`, `perf:perf-profiler`, `perf:perf-theory-gatherer`, `perf:perf-theory-tester` |
| **Enhancement** | `enhance:enhance-agent-prompts`, `enhance:enhance-claude-memory`, `enhance:enhance-cross-file`, `enhance:enhance-docs`, `enhance:enhance-hooks`, `enhance:enhance-orchestrator`, `enhance:enhance-plugins`, `enhance:enhance-prompts`, `enhance:enhance-skills` |
| **Workflow** | `next-task:discover-tasks`, `next-task:orchestrate-review`, `next-task:validate-delivery` |
| **Cleanup** | `deslop:deslop`, `sync-docs:sync-docs` |
| **Analysis** | `debate:debate`, `drift-detect:drift-analysis`, `repo-map:repo-mapping` |
| **Productivity** | `consult:consult` |
| **Learning** | `learn:learn` |
| **Linting** | `agnix:agnix` |
<!-- GEN:END:readme-skills -->

Skills are the reusable implementation units. Agents invoke skills; commands orchestrate agents.
When you install a plugin, its skills become available to all agents in that session.

---

## Quick Navigation

| Section | What's there |
|---------|--------------|
| [The Approach](#the-approach) | Why it's built this way |
| [Commands](#commands) | All 13 commands at a glance |
| [Skills](#skills) | 28 skills across plugins |
| [Command Details](#command-details) | Deep dive into each command |
| [How Commands Work Together](#how-commands-work-together) | Standalone vs integrated |
| [Design Philosophy](#design-philosophy) | The thinking behind the architecture |
| [Installation](#installation) | Get started |
| [Research & Testing](#research--testing) | What went into building this |
| [Documentation](#documentation) | Links to detailed docs |

---

## Command Details

### /next-task

**Purpose:** Complete task-to-production automation.

**What happens when you run it:**

1. **Policy Selection** - Choose the task source (GitHub issues, GitLab, local file), priority filter, and stopping point
2. **Task Discovery** - Shows the top 5 prioritized tasks; you pick one
3. **Worktree Setup** - Creates an isolated branch and working directory
4. **Exploration** - Deep codebase analysis to understand context
5. **Planning** - Designs the implementation approach
6. **User Approval** - You review and approve the plan (the last human interaction)
7. **Implementation** - Executes the plan
8. **Pre-Review** - Runs the [deslop](#deslop)-agent and test-coverage-checker
9. **Review Loop** - Multi-agent review iterates until clean
10. **Delivery Validation** - Verifies tests pass, the build passes, and requirements are met
11. **Docs Update** - Updates the CHANGELOG and related documentation
12. **[Ship](#ship)** - Creates the PR, monitors CI, addresses comments, merges

Phase 9 uses the `orchestrate-review` skill to spawn parallel reviewers (code quality, security, performance, test coverage) plus conditional specialists.

**Agents involved:**

| Agent | Model | Role |
|-------|-------|------|
| task-discoverer | sonnet | Finds and ranks tasks from your source |
| worktree-manager | haiku | Creates git worktrees and branches |
| exploration-agent | opus | Deep codebase analysis before planning |
| planning-agent | opus | Designs the step-by-step implementation plan |
| implementation-agent | opus | Writes the actual code |
| test-coverage-checker | sonnet | Validates that tests exist and are meaningful |
| delivery-validator | sonnet | Final checks before shipping |
| ci-monitor | haiku | Watches CI status |
| ci-fixer | sonnet | Fixes CI failures and review comments |
| simple-fixer | haiku | Executes mechanical edits |

**Cross-plugin agents:**

| Agent | Plugin | Role |
|-------|--------|------|
| deslop-agent | deslop | Removes AI artifacts before review |
| sync-docs-agent | sync-docs | Updates documentation |

**Usage:**

```bash
/next-task          # Start new workflow
/next-task --resume # Resume interrupted workflow
/next-task --status # Check current state
/next-task --abort  # Cancel and cleanup
```

[Full workflow documentation →](./docs/workflows/NEXT-TASK.md)

---

### /agnix

**Purpose:** Lint agent configurations before they break your workflow. The first dedicated linter for AI agent configs.

**[agnix](https://github.com/avifenesh/agnix)** is a standalone open-source project that provides the validation engine. This plugin integrates it into your workflow.

**The problem it solves:**

Agent configurations are code. They affect behavior, security, and reliability. But unlike application code, they have no linting.
You find out your SKILL.md is malformed when the agent fails. You discover your hooks have security issues when they're exploited. You realize your CLAUDE.md has conflicting rules when the AI behaves unexpectedly.

agnix catches these issues before they cause problems.

**What it validates:**

| Category | What It Checks |
|----------|----------------|
| **Structure** | Required fields, valid YAML/JSON, proper frontmatter |
| **Security** | Prompt injection vectors, overpermissive tools, exposed secrets |
| **Consistency** | Conflicting rules, duplicate definitions, broken references |
| **Best Practices** | Tool restrictions, model selection, trigger phrase quality |
| **Cross-Platform** | Compatibility across Claude Code, Cursor, Copilot, Codex, OpenCode, Gemini CLI, Cline, and more |

**155 validation rules** (57 auto-fixable) derived from:

- Official tool specifications (Claude Code, Cursor, GitHub Copilot, Codex CLI, OpenCode, Gemini CLI, and more)
- Research papers on agent reliability and prompt injection
- Real-world testing across 500+ repositories
- Community-reported issues and edge cases

**Supported files:**

| File Type | Examples |
|-----------|----------|
| Skills | `SKILL.md`, `*/SKILL.md` |
| Memory | `CLAUDE.md`, `AGENTS.md`, `.github/CLAUDE.md` |
| Hooks | `.claude/settings.json`, hooks configuration |
| MCP | `*.mcp.json`, MCP server configs |
| Cursor | `.cursor/rules/*.mdc`, `.cursorrules` |
| Copilot | `.github/copilot-instructions.md` |

**CI/CD Integration:**

agnix outputs SARIF format for GitHub Code Scanning.
Add it to your workflow:

```yaml
- name: Lint agent configs
  run: agnix --format sarif > results.sarif
- uses: github/codeql-action/upload-sarif@v3
  with:
    sarif_file: results.sarif
```

**Usage:**

```bash
/agnix                      # Validate current project
/agnix --fix                # Auto-fix fixable issues
/agnix --strict             # Treat warnings as errors
/agnix --target claude-code # Only Claude Code rules
/agnix --format sarif       # Output for GitHub Code Scanning
```

**Agent:** agnix-agent (sonnet model)

**External tool:** Requires the [agnix CLI](https://github.com/avifenesh/agnix)

```bash
npm install -g agnix    # Install via npm
# or
cargo install agnix-cli # Install via Cargo
# or
brew install agnix      # Install via Homebrew (macOS)
```

**Why use agnix:**

- Catch config errors before they cause agent failures
- Enforce security best practices across your team
- Maintain consistency as your agent configs grow
- Integrate validation into CI/CD pipelines
- Support multiple AI tools from one linter

---

### /ship

**Purpose:** Takes your current branch from "ready to commit" to "merged PR."

**What happens when you run it:**

1. **Pre-flight** - Detects the CI platform, deployment platform, and branch strategy
2. **Commit** - Stages and commits with a generated message (if there are uncommitted changes)
3. **Push & PR** - Pushes the branch, creates a pull request
4. **CI Monitor** - Waits for CI, retries on transient failures
5. **Review Wait** - Waits 3 minutes for auto-reviewers (Copilot, Claude, Gemini, Codex)
6. **Address Comments** - Handles every comment from every reviewer
7. **Merge** - Merges when all comments are resolved and CI passes
8. **Deploy** - Deploys and validates (if multi-branch workflow)
9. **Cleanup** - Removes the worktree, closes the issue, deletes the branch

**Platform Detection:**

| Type | Detected |
|------|----------|
| CI | GitHub Actions, GitLab CI, CircleCI, Jenkins, Travis |
| Deploy | Railway, Vercel, Netlify, Fly.io, Render |
| Project | Node.js, Python, Rust, Go, Java |

**Review Comment Handling:**

Every comment gets addressed. No exceptions. The workflow categorizes comments and handles each:

- Code fixes get implemented
- Style suggestions get applied
- Questions get answered
- False positives get explained

If something can't be fixed, the workflow replies explaining why and resolves the thread.

**Usage:**

```bash
/ship                   # Full workflow
/ship --dry-run         # Preview without executing
/ship --strategy rebase # Use rebase instead of squash
```

[Full workflow documentation →](./docs/workflows/SHIP.md)

---

### /deslop

**Purpose:** Finds AI slop—debug statements, placeholder text, verbose comments, TODOs—and removes it.

**How detection works:**

Three phases run in sequence:

1. **Phase 1: Regex Patterns** (HIGH certainty)
   - `console.log`, `print()`, `dbg!()`, `println!()`
   - `// TODO`, `// FIXME`, `// HACK`
   - Empty catch blocks, disabled linters
   - Hardcoded secrets (API keys, tokens)

2. **Phase 2: Multi-Pass Analyzers** (MEDIUM certainty)
   - Doc-to-code ratio (excessive comments)
   - Verbosity ratio (AI preambles)
   - Over-engineering patterns
   - Buzzword inflation
   - Dead code detection
   - Stub functions

3. **Phase 3: CLI Tools** (LOW certainty, optional)
   - jscpd, madge, escomplex (JS/TS)
   - pylint, radon (Python)
   - golangci-lint (Go)
   - clippy (Rust)

**Languages supported:** JavaScript/TypeScript, Python, Rust, Go, Java

**Usage:**

```bash
/deslop               # Report only (safe)
/deslop apply         # Fix HIGH certainty issues
/deslop apply src/ 10 # Fix 10 issues in src/
```

**Thoroughness levels:**

- `quick` - Phase 1 only (fastest)
- `normal` - Phase 1 + Phase 2 (default)
- `deep` - All phases, if tools are available

[Pattern reference →](./docs/reference/SLOP-PATTERNS.md)

---

### /perf

**Purpose:** Structured performance investigation with baselines, profiling, and evidence-backed decisions.

**10-phase methodology** (based on recorded real-world performance investigation sessions):

1. **Setup** - Confirm the scenario, success criteria, and benchmark command
2. **Baseline** - 60s minimum runs, PERF_METRICS markers required
3. **Breaking Point** - Binary search to find the failure threshold
4. **Constraints** - CPU/memory limits, measure delta vs baseline
5. **Hypotheses** - Generate up to 5 hypotheses with evidence and confidence
6. **Code Paths** - Use repo-map to identify entrypoints and hot files
7. **Profiling** - Language-specific tools (--cpu-prof, JFR, cProfile, pprof)
8. **Optimization** - One change per experiment, 2+ validation passes
9. **Decision** - Continue or stop based on measurable improvement
10. **Consolidation** - Final baseline, evidence log, investigation complete

**Agents and skills:**

| Component | Role |
|-----------|------|
| perf-orchestrator | Coordinates all phases |
| perf-theory-gatherer | Generates hypotheses from git history and code |
| perf-theory-tester | Validates hypotheses with controlled experiments |
| perf-analyzer | Synthesizes findings into recommendations |
| perf-code-paths | Maps entrypoints and likely hot paths |
| perf-investigation-logger | Structured evidence logging |

**Usage:**

```bash
/perf          # Start new investigation
/perf --resume # Resume previous investigation
```

**Phase flags (advanced):**

```bash
/perf --phase baseline --command "npm run bench" --version v1.2.0
/perf --phase breaking-point --param-min 1 --param-max 500
/perf --phase constraints --cpu 1 --memory 1GB
/perf --phase hypotheses --hypotheses-file perf-hypotheses.json
/perf --phase optimization --change "reduce allocations"
/perf --phase decision --verdict stop --rationale "no measurable improvement"
```

---

### /drift-detect

**Purpose:** Compares your documentation and plans to what's actually in the code.

**The problem it solves:**

Your roadmap says "user authentication: done." But is it actually implemented? Your GitHub issue says "add dark mode." Is it already in the codebase? Plans drift from reality. This command finds the drift.

**How it works:**

1. **JavaScript collectors** gather data (fast, token-efficient)
   - GitHub issues and their labels
   - Documentation files
   - Actual code exports and implementations

2. **Single Opus call** performs semantic analysis
   - Matches concepts, not strings ("user auth" matches `auth/`, `login.js`, `session.ts`)
   - Identifies what's implemented but not documented
   - Identifies what's documented but not implemented
   - Finds stale issues that should be closed

**Why this approach:**

Multi-agent collection wastes tokens on coordination. JavaScript collectors are fast and deterministic. One well-prompted LLM call does the actual analysis. Result: 77% token reduction vs multi-agent approaches.

**Tested on 1,000+ repositories** before release.

**Usage:**

```bash
/drift-detect               # Full analysis
/drift-detect --depth quick # Quick scan
```

---

### /audit-project

**Purpose:** Multi-agent code review that iterates until issues are resolved.

**What happens when you run it:**

Up to 10 specialized role-based agents run, depending on your project:

| Agent | When Active | Focus Area |
|-------|-------------|------------|
| code-quality-reviewer | Always | Code quality, error handling |
| security-expert | Always | Vulnerabilities, auth, secrets |
| performance-engineer | Always | N+1 queries, memory, blocking ops |
| test-quality-guardian | Always | Coverage, edge cases, mocking |
| architecture-reviewer | If 50+ files | Modularity, patterns, SOLID |
| database-specialist | If DB detected | Queries, indexes, transactions |
| api-designer | If API detected | REST, errors, pagination |
| frontend-specialist | If frontend detected | Components, state, UX |
| backend-specialist | If backend detected | Services, domain logic |
| devops-reviewer | If CI/CD detected | Pipelines, configs, secrets |

Findings are collected and categorized by severity (critical/high/medium/low). All non-false-positive issues get fixed automatically.
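That loop can be sketched in miniature. `runReviewers` and `applyFix` below are hypothetical stand-ins for the real reviewer agents and fixer, not the plugin's actual interfaces:

```javascript
// Illustrative sketch of the audit loop: review, fix everything that is not
// a false positive (most severe first), and repeat until a pass comes back clean.
// `runReviewers` and `applyFix` are hypothetical stand-ins for the real agents.
function severityRank(severity) {
  return ["critical", "high", "medium", "low"].indexOf(severity);
}

function auditUntilClean(runReviewers, applyFix, maxPasses = 10) {
  for (let pass = 1; pass <= maxPasses; pass++) {
    const findings = runReviewers()
      .filter((f) => !f.falsePositive)
      .sort((a, b) => severityRank(a.severity) - severityRank(b.severity));
    if (findings.length === 0) return { pass, clean: true };
    findings.forEach(applyFix); // most severe issues fixed first
  }
  return { pass: maxPasses, clean: false };
}
```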
The loop repeats until no open issues remain.

**Usage:**

```bash
/audit-project                   # Full review
/audit-project --quick           # Single pass
/audit-project --resume          # Resume from queue file
/audit-project --domain security # Security focus only
/audit-project --recent          # Only recent changes
```

[Agent reference →](./docs/reference/AGENTS.md#audit-project-plugin-agents)

---

### /enhance

**Purpose:** Analyzes your prompts, plugins, agents, docs, hooks, and skills for improvement opportunities.

**Seven analyzers run in parallel:**

| Analyzer | What it checks |
|----------|----------------|
| plugin-enhancer | Plugin structure, MCP tool definitions, security patterns |
| agent-enhancer | Agent frontmatter, prompt quality |
| claudemd-enhancer | CLAUDE.md/AGENTS.md structure, token efficiency |
| docs-enhancer | Documentation readability, RAG optimization |
| prompt-enhancer | Prompt engineering patterns, clarity, examples |
| hooks-enhancer | Hook frontmatter, structure, safety |
| skills-enhancer | SKILL.md structure, trigger phrases |

**Each finding includes:**

- Certainty level (HIGH/MEDIUM/LOW)
- Specific location (file:line)
- What's wrong
- How to fix it
- Whether it can be auto-fixed

**Auto-learning:** Detects obvious false positives (pattern docs, workflow gates) and saves them for future runs.
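A minimal version of that memory might look like the following sketch; the fingerprint fields and storage shape are assumptions for illustration, not the plugin's real schema:

```javascript
// Illustrative sketch of auto-learned suppression: remember confirmed false
// positives by a stable fingerprint and drop them from future runs.
// The fingerprint fields are an assumption, not the plugin's actual format.
function fingerprint(finding) {
  return `${finding.analyzer}:${finding.rule}:${finding.file}`;
}

function learnFalsePositives(findings, learned) {
  for (const f of findings) {
    if (f.falsePositive) learned.add(fingerprint(f));
  }
  return learned;
}

function filterLearned(findings, learned) {
  return findings.filter((f) => !learned.has(fingerprint(f)));
}
```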
Reduces noise over time without manual suppression files.

**Usage:**

```bash
/enhance                   # Run all analyzers
/enhance --focus=agent     # Just agent prompts
/enhance --apply           # Apply HIGH certainty fixes
/enhance --show-suppressed # Show what's being filtered
/enhance --no-learn        # Analyze but don't save false positives
```

---

### /repo-map

**Purpose:** Builds an AST-based map of symbols and imports for fast repo analysis.

**What it generates:**

- Cached file→symbols map (exports, functions, classes)
- Import graph for dependency hints

Output is cached at `{state-dir}/repo-map.json` and exposed via the MCP `repo_map` tool.

**Why it matters:**

Tools like `/drift-detect` and planners can use the map instead of re-scanning the repo every time.

**Usage:**

```bash
/repo-map init   # First-time map generation
/repo-map update # Incremental update
/repo-map status # Check freshness
```

**Required:** ast-grep (`sg`) must be installed.

---

### /sync-docs

**Purpose:** Sync documentation with actual code changes—find outdated refs, update the CHANGELOG, flag stale examples.

**The problem it solves:**

You refactor `auth.js` into `auth/index.js`; your README still says `import from './auth'`. You rename a function; three docs still reference the old name. You ship a feature; the CHANGELOG doesn't mention it. Documentation drifts from code.
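One of the simpler checks, the version mismatch, can be sketched like this (the regex and data shapes are illustrative, not the plugin's implementation):

```javascript
// Hypothetical sketch of a version-mismatch check: flag docs whose quoted
// version differs from the version in package.json. Shapes are illustrative.
function findVersionMismatches(pkgVersion, docs) {
  const versionPattern = /\bv?(\d+\.\d+(?:\.\d+)?)\b/g;
  const mismatches = [];
  for (const { file, text } of docs) {
    for (const match of text.matchAll(versionPattern)) {
      if (match[1] !== pkgVersion) {
        mismatches.push({ file, found: match[1], expected: pkgVersion });
      }
    }
  }
  return mismatches;
}
```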
This command finds the drift.

**What it detects:**

| Category | Examples |
|----------|----------|
| Broken references | Imports to moved/renamed files, deleted exports |
| Version mismatches | Doc says v2.0, package.json says v2.1 |
| Stale code examples | Import paths that no longer exist |
| Missing CHANGELOG | `feat:` and `fix:` commits without entries |

**Auto-fixable vs flagged:**

| Auto-fixable (apply mode) | Flagged for review |
|---------------------------|--------------------|
| Version number updates | Removed exports referenced in docs |
| CHANGELOG entries for commits | Code examples needing context |
| | Function renames |

**Usage:**

```bash
/sync-docs             # Check what docs need updates (safe)
/sync-docs apply       # Apply safe fixes
/sync-docs report src/ # Check docs related to src/
/sync-docs --all       # Full codebase scan
```

---

### /learn

**Purpose:** Research any topic online and create a comprehensive learning guide with RAG-optimized indexes.

**What it does:**

1. **Progressive Discovery** - Uses a funnel approach (broad → specific → deep) to find quality sources
2. **Quality Scoring** - Scores sources by authority, recency, depth, examples, and uniqueness
3. **Just-In-Time Extraction** - Fetches only high-scoring sources to save tokens
4. **Synthesis** - Creates a structured learning guide with examples and best practices
5. **RAG Index** - Updates the CLAUDE.md/AGENTS.md master index for future lookups
6. **Enhancement** - Runs enhance:enhance-docs and enhance:enhance-prompts

**Depth levels:**

| Depth | Sources | Use Case |
|-------|---------|----------|
| brief | 10 | Quick overview |
| medium | 20 | Default, balanced |
| deep | 40 | Comprehensive |

**Output structure:**

```
agent-knowledge/
  CLAUDE.md                # Master index (updated each run)
  AGENTS.md                # Index for OpenCode/Codex
  recursion.md             # Topic-specific guide
  resources/
    recursion-sources.json # Source metadata with quality scores
```

**Usage:**

```bash
/learn recursion                 # Default (20 sources)
/learn react hooks --depth=deep  # Comprehensive (40 sources)
/learn kubernetes --depth=brief  # Quick overview (10 sources)
/learn python async --no-enhance # Skip enhancement pass
```

**Agent:** learn-agent (opus model for research quality)

---

### /consult

**Purpose:** Get a second opinion from another AI CLI tool without leaving your current session.

**What it does:**

1. **Tool Detection** - Detects which AI CLI tools are installed (cross-platform)
2. **Interactive Picker** - If no tool is specified, shows only installed tools to choose from
3. **Effort Mapping** - Maps effort levels to per-provider models and reasoning flags
4. **Execution** - Runs the consultation with safe-mode defaults and a 120s timeout
5. **Session Continuity** - Saves session state for Claude and Gemini (supports `--continue`)

**Supported tools:**

| Tool | Default Model (high) | Reasoning Control |
|------|---------------------|-------------------|
| Claude | opus | max-turns |
| Gemini | gemini-3-pro | built-in |
| Codex | gpt-5.3-codex | model_reasoning_effort |
| OpenCode | github-copilot/claude-opus-4-6 | --variant |
| Copilot | (default) | none |

**Usage:**

```bash
/consult "Is this the right approach?" --tool=gemini --effort=high
/consult "Review for performance issues" --tool=codex
/consult "Suggest alternatives" --tool=claude --effort=max
/consult "Continue from where we left off" --continue
/consult "Explain this error" --context=diff --tool=gemini
```

**Agent:** consult-agent (sonnet model for orchestration)

---

### /debate

**Purpose:** Stress-test ideas through structured multi-round debate between two AI CLI tools.

**What it does:**

1. **Tool Detection** - Detects which AI CLI tools are installed (cross-platform)
2. **Interactive Picker** - If no tools are specified, prompts for proposer, challenger, effort, rounds, and context in a single batch question
3. **Proposer/Challenger Format** - The first tool argues for the topic; the second challenges with evidence
4. **Multi-Round Exchange** - Each round, the proposer defends and the challenger responds (1–5 rounds)
5. **Verdict** - The orchestrator delivers a final synthesis, picking a winner with reasoning

**Usage:**

```bash
# Natural language
/debate codex vs gemini about microservices vs monolith
/debate with claude and codex about our auth implementation
/debate thoroughly gemini vs codex about database schema design
/debate codex vs gemini 3 rounds about event sourcing

# Explicit flags
/debate "Should we use event sourcing?" --tools=claude,gemini --rounds=3 --effort=high
/debate "Valkey vs PostgreSQL for caching" --tools=codex,opencode

# With codebase context
/debate "Is our current approach correct?" --tools=gemini,codex --context=diff
```

**Options:**

| Flag | Description |
|------|-------------|
| `--tools=TOOL1,TOOL2` | Proposer and challenger (comma-separated) |
| `--rounds=N` | Number of debate rounds, 1–5 (default: 2) |
| `--effort=low\|medium\|high\|max` | Reasoning depth per tool call |
| `--context=diff\|file=PATH\|none` | Codebase context passed to both tools |

**Agent:** debate-orchestrator (opus model for orchestration)

---

## How Commands Work Together

**Standalone use:**

```bash
/deslop apply  # Just clean up your code
/sync-docs     # Just check if docs need updates
/ship          # Just ship this branch
/audit-project # Just review the codebase
```

**Integrated workflow:**

When you run [`/next-task`](#next-task), it orchestrates everything:

```
/next-task picks task → explores codebase → plans implementation
        ↓
implementation-agent writes code
        ↓
deslop-agent cleans AI artifacts
        ↓
Phase 9 review loop iterates until approved
        ↓
delivery-validator checks requirements
        ↓
sync-docs-agent syncs documentation
        ↓
/ship creates PR → monitors CI → merges
```

The workflow tracks state so you can resume from any point.

---

## Design Philosophy

<details>
<summary><strong>Architecture decisions and trade-offs</strong> (click to expand)</summary>

### The Actual Problem

Frontier models write good code. That's solved.
What's not solved:

- **Context management** - Models forget what they're doing mid-session
- **Compaction amnesia** - Long sessions get summarized, losing critical state
- **Task drift** - Without structure, agents wander from the actual goal
- **Skipped steps** - Agents skip reviews, tests, or cleanup when not enforced
- **Token waste** - Using LLM calls for work that static analysis can do faster
- **Babysitting** - Manually orchestrating each phase of development
- **Repetitive requests** - Asking for the same workflow every single session

### How This Addresses It

**1. One agent, one job, done extremely well**

Same principle as good code: single responsibility. The exploration-agent explores. The implementation-agent implements. Phase 9 spawns multiple focused reviewers. No agent tries to do everything. Specialized agents, each with a narrow scope and clear success criteria.

**2. Pipeline with gates, not a monolith**

Same principle as DevOps: each step must pass before the next begins. Can't push before review. Can't merge before CI passes. Hooks enforce this—agents literally cannot skip phases.

**3. Tools do tool work, agents do agent work**

If static analysis, a regex, or a shell command can do it, don't ask an LLM. Pattern detection uses pre-indexed regexes. File discovery uses glob. Platform detection uses file-existence checks. The LLM only handles what requires judgment.

**4. Agents don't need to know how tools work**

The slop detector returns findings with certainty levels. The agent doesn't need to understand the three-phase pipeline, the regex patterns, or the analyzer heuristics. Good tool design means the consumer doesn't need implementation details.

**5. Build tools where tools don't exist**

Many tasks lack existing tools. JavaScript collectors for drift-detect. Multi-pass analyzers for slop detection.
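As a toy illustration of the idea (the patterns and output shape here are invented for the sketch, not the real detector's):

```javascript
// Toy detector in this style: a deterministic regex scan that returns
// certainty-graded findings rather than raw text. Patterns are illustrative.
const PATTERNS = [
  { rule: "debug-log", regex: /console\.log\(/, certainty: "HIGH" },
  { rule: "todo-comment", regex: /\/\/\s*(TODO|FIXME|HACK)/, certainty: "HIGH" },
];

function detectSlop(source, file) {
  const findings = [];
  source.split("\n").forEach((line, i) => {
    for (const { rule, regex, certainty } of PATTERNS) {
      if (regex.test(line)) findings.push({ file, line: i + 1, rule, certainty });
    }
  });
  return findings;
}
```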
The result: agents receive structured data, not raw problems to figure out.

**6. Research-backed prompt engineering**

Documented techniques that measurably improve results:

- **Progressive disclosure** - Agents see only what's needed for the current step
- **Structured output** - JSON between delimiters, XML tags for sections
- **Explicit constraints** - What agents MUST NOT do matters as much as what they do
- **Few-shot examples** - Where patterns aren't obvious
- **Tool calling over generation** - Let the model use tools rather than generate tool-like output

**7. Validate plan and results, not every step**

Approve the plan. See the results. The middle is automated. One plan approval unlocks autonomous execution through implementation, review, cleanup, and shipping.

**8. Right model for the task**

Match model capability to task complexity:

- **opus** - Exploration, planning, implementation, review orchestration
- **sonnet** - Pattern matching, validation, discovery
- **haiku** - Git operations, file moves, CI polling

Quality compounds. Poor exploration → poor plan → poor implementation → review cycles. Early phases deserve the best model.

**9. Persistent state survives sessions**

Two JSON files track everything: what task, what phase. Sessions can die and resume. Multiple sessions run in parallel on different tasks using separate worktrees.

**10. Delegate everything automatable**

Agents don't just write code. They:

- Clean their own output (deslop-agent)
- Update documentation (sync-docs-agent)
- Fix CI failures (ci-fixer)
- Respond to review comments
- Check for plan drift ([/drift-detect](#drift-detect))
- Analyze their own prompts ([/enhance](#enhance))

If it can be specified, it can be delegated.

**11. Orchestrator stays high-level**

The main workflow orchestrator doesn't read files, search code, or write implementations.
It launches specialized agents and receives their outputs. This keeps the orchestrator's context window available for coordination rather than filled with file contents.

**12. Composable, not monolithic**

Every command works standalone. [`/deslop`](#deslop) cleans code without needing [`/next-task`](#next-task). [`/ship`](#ship) merges PRs without needing the full workflow. Pieces compose together, but each piece is useful on its own.

### What This Gets You

- **Run multiple sessions** - Different tasks in different worktrees, no interference
- **Fast iteration** - Approve plan, check results, repeat
- **Stay in the interesting parts** - Policy decisions, architecture choices, edge cases
- **Minimal review burden** - Most issues caught and fixed before you see the output
- **No repetitive requests** - The workflow you want, without asking each time
- **Scale horizontally** - More sessions, more tasks, same oversight level

</details>

---

## Installation

### Claude Code (Recommended)

```bash
/plugin marketplace add avifenesh/agentsys
/plugin install next-task@agentsys
/plugin install ship@agentsys
```

### All Platforms (npm)

```bash
npm install -g agentsys && agentsys
```

This launches an interactive installer for Claude Code, OpenCode, and Codex CLI.

```bash
# Non-interactive install
agentsys --tool claude               # Single tool
agentsys --tools "claude,opencode"   # Multiple tools
agentsys --development               # Dev mode (bypasses marketplace)
```

[Full installation guide →](./docs/INSTALLATION.md)

---

## Requirements

**Required:**
- Git
- Node.js 18+

**For GitHub workflows:**
- GitHub CLI (`gh`) authenticated

**For GitLab workflows:**
- GitLab CLI (`glab`) authenticated

**For /repo-map:**
- ast-grep (`sg`) installed

**For /agnix:**
- [agnix CLI](https://github.com/avifenesh/agnix) installed (`cargo install agnix-cli` or `brew install agnix`)

**Local diagnostics (optional):**

```bash
npm run detect   # Platform detection (CI, deploy, project type)
npm run verify   # Tool availability + versions
```

---

## Research & Testing

The system is built on research, not guesswork.

**Knowledge base** (`agent-docs/`): 8,000 lines of curated documentation from Anthropic, OpenAI, Google, and Microsoft covering:
- Agent architecture and design patterns
- Prompt engineering techniques
- Function calling and tool use
- Context efficiency and token optimization
- Multi-agent systems and orchestration
- Instruction following reliability

**Testing:**
- 1,818 tests passing
- Drift-detect validated on 1,000+ repositories
- E2E workflow testing across all commands
- Cross-platform validation (Claude Code, OpenCode, Codex CLI)

**Methodology:**
- `/perf` investigation phases based on recordings of real performance-investigation sessions
- Certainty levels derived from pattern analysis across repositories
- Token optimization measured and validated (77% reduction in drift-detect)

---

## Documentation

| Topic | Link |
|-------|------|
| Installation | [docs/INSTALLATION.md](./docs/INSTALLATION.md) |
| Cross-Platform Setup | [docs/CROSS_PLATFORM.md](./docs/CROSS_PLATFORM.md) |
| Usage Examples | [docs/USAGE.md](./docs/USAGE.md) |
| Architecture | [docs/ARCHITECTURE.md](./docs/ARCHITECTURE.md) |

### Workflow Deep-Dives

| Workflow | Link |
|----------|------|
| /next-task Flow | [docs/workflows/NEXT-TASK.md](./docs/workflows/NEXT-TASK.md) |
| /ship Flow | [docs/workflows/SHIP.md](./docs/workflows/SHIP.md) |

### Reference

| Topic | Link |
|-------|------|
| Slop Patterns | [docs/reference/SLOP-PATTERNS.md](./docs/reference/SLOP-PATTERNS.md) |
| Agent Reference | [docs/reference/AGENTS.md](./docs/reference/AGENTS.md) |

---

## Support

- **Issues:** [github.com/avifenesh/agentsys/issues](https://github.com/avifenesh/agentsys/issues)
- **Discussions:** [github.com/avifenesh/agentsys/discussions](https://github.com/avifenesh/agentsys/discussions)

---

MIT License | Made by [Avi Fenesh](https://github.com/avifenesh)