Test Agent<>Agent
Cyber Skills
for Real
Evaluate and improve your
Claude or
OpenAI
skills.md using real cyber environments
and expert evaluation criteria.
Supported Agent Providers
Anthropic
Claude
OpenAI
GPT / o-series
Custom
Any LLM
Get Started
$
docker compose - coming soon
No account needed. Runs locally in your own container.