Test Agent<>Agent Cyber Skills for Real

Evaluate and improve your Claude or OpenAI skills.md using real cyber environments and expert evaluation criteria.


Supported Agent Providers

Anthropic: Claude
OpenAI: GPT / o-series
Custom: Any LLM

Get Started

$ docker compose - coming soon

No account needed. Runs locally in your own container.