trophyBattle Arena

Compare two skills under identical conditions with multi-factor scoring.

The Battle Arena compares two skills using the same input and runtime conditions.

It gives users a structured way to evaluate alternatives.

How a battle works

  1. Select two skills

  2. Define the shared protocol and action context

  3. Execute both skills simultaneously

  4. Score both outputs across six dimensions

  5. Determine the winner by total score

  6. Save the result for history and analysis

Scoring dimensions

Each contestant is scored across:

  • Speed

  • Depth

  • Clarity

  • Protocol coverage

  • Docs quality

  • Execution readiness

Example battle request

Example battle response

Battle history

Recent results can be queried from the history endpoint.

History entries can show expanded score breakdowns for both contestants.

Why Arena matters

The Arena gives teams a repeatable benchmarking surface.

It reduces guesswork when choosing between skills or protocols.

Last updated