Testing AI Agent Skills
AI agent skills for software testing. Unit tests, integration tests, E2E testing, TDD workflows, and test automation.
137 listings
mcp-recorder — VCR.py for MCP servers
MCP Server: Record, replay, and verify Model Context Protocol interactions for deterministic testing. MCP servers break silently. Tool schemas change, prompts drift, responses shift. Without wire-level regression tests, you find out from your users. mcp-recorder captures the full protocol exchange into a cassette file and lets you test from both sides. Try it right now: a scenarios.yml and a public demo serv
Durable Objects
Create and review Cloudflare Durable Objects. Use when building stateful coordination (chat rooms, multiplayer games, booking systems), implementing RPC methods, SQLite storage, alarms, WebSockets, or reviewing DO code for best practices. Covers Workers integration, wrangler config, and testing with Vitest.
Postmancer
A standalone MCP server for API testing and management, allowing AI assistants to interact with RESTful APIs through natural language. Postmancer is an MCP (Model Context Protocol) server that enables AI assistants like Claude to make HTTP requests, manage collections of API endpoints, and test API responses. It provides similar functionality to tools like Postman or Insomnia but designed specific
Metasploit Framework
This skill should be used when the user asks to "use Metasploit for penetration testing", "exploit vulnerabilities with msfconsole", "create payloads with msfvenom", "perform post-exploitation", "use auxiliary modules for scanning", or "develop custom exploits". It provides comprehensive guidance for leveraging the Metasploit Framework in security assessments.
Ethical Hacking Methodology
This skill should be used when the user asks to "learn ethical hacking", "understand penetration testing lifecycle", "perform reconnaissance", "conduct security scanning", "exploit vulnerabilities", or "write penetration test reports". It provides comprehensive ethical hacking methodology and techniques.
Temporal Python Pro
Master Temporal workflow orchestration with Python SDK. Implements
Framework Migration Deps Upgrade
You are a dependency management expert specializing in safe, incremental upgrades of project dependencies. Plan and execute dependency updates with minimal risk, proper testing, and clear migration pa
Nestjs Expert
Nest.js framework expert specializing in module architecture, dependency injection, middleware, guards, interceptors, testing with Jest/Supertest, TypeORM/Mongoose integration, and Passport.js authentication. Use PROACTIVELY for any Nest.js application issues including architecture decisions, testing strategies, performance optimization, or debugging complex dependency injection problems. If a specialized expert is a better fit, I will recommend switching and stop.
Performance Testing Review Multi Agent Review
Use when conducting a multi-agent review of performance testing.
Backend Development Feature Development
Orchestrate end-to-end backend feature development from requirements to deployment. Use when coordinating multi-phase feature delivery across teams and services.
File Path Traversal Testing
This skill should be used when the user asks to "test for directory traversal", "exploit path traversal vulnerabilities", "read arbitrary files through web applications", "find LFI vulnerabilities", or "access files outside web root". It provides comprehensive file path traversal attack and testing methodologies.
Testrail Test Case
Rules: Persona
Test Automator
Master AI-powered test automation with modern frameworks,
Go Playwright
Expert capability for robust, stealthy, and efficient browser automation using Playwright Go.
Python
Rules: You are an AI assistant specialized in Python development.
Agent Skills for Context Engineering
This skill should be used when the user asks to "evaluate agent performance", "build test framework", "measure agent quality", "create evaluation rubrics", or mentions LLM-as-judge, multi-dimensional evaluation, agent testing, or quality gates for agent pipelines.
API Fuzzing for Bug Bounty
This skill should be used when the user asks to "test API security", "fuzz APIs", "find IDOR vulnerabilities", "test REST API", "test GraphQL", "API penetration testing", "bug bounty API testing", or needs guidance on API security assessment techniques.
Terraform Skill
Terraform infrastructure as code best practices
SQLMap Database Penetration Testing
This skill should be used when the user asks to "automate SQL injection testing," "enumerate database structure," "extract database credentials using sqlmap," "dump tables and columns from a vulnerable database," or "perform automated database penetration testing." It provides comprehensive guidance for using SQLMap to detect and exploit SQL injection vulnerabilities.
Firmware Analyst
Expert firmware analyst specializing in embedded systems, IoT
CipherTrust Manager MCP Server
MCP Server: This project implements an independently developed CipherTrust MCP (Model Context Protocol) server that allows AI assistants like Claude or Cursor to interact with CipherTrust Manager resources using the ksctl CLI. This is a
LLM Evaluation
Plugin: LLM evaluation and testing patterns including prompt testing, hallucination detection, benchmark creation, and quality metrics. Use when testing LLM applications, validating prompt quality, implementing systematic evaluation, or measuring LLM performance.
Setup Plugin
Plugin: Version 1.0.0. Purpose: project initialization and Claude Code configuration. The setup plugin provides commands for initializing Claude Code in new and existing projects, configuring user-level settings, and setting up project-specific configurations. Set up a new Python project with modern tooling and Claude framework integration: Poetry or pip + venv support; pytest, black, ruff, mypy; .clau
Python Projects Guide
Rules: You are an AI assistant specialized in Python development.