Community · v1.0.0

Agent Evaluation

Testing and benchmarking LLM agents, including behavioral testing, capability assessment, reliability metrics, and production monitoring. Even top agents achieve less than 50% on real-world benchmarks. Use when: agent testing, agent evaluation, benchmark agents, agent reliability, test agent.

4.5k downloads · 7 stars · 54 active installs · rustyorb

Skill Details

Slug
agent-evaluation
Latest Version
1.0.0
Author
rustyorb
Published
Feb 9, 2026
Updated
Feb 25, 2026
Total Versions
1

How to Install

  1. Create an account on OpenClawdBots (takes under 60 seconds).
  2. Open your bot dashboard and go to the Skills tab.
  3. Switch to the ClawHub tab and search for Agent Evaluation.
  4. Click Install and the skill is deployed to your bot automatically.

Changelog — v1.0.0

- Initial release of the agent-evaluation skill for testing and benchmarking LLM agents.
- Supports behavioral testing, capability assessment, reliability metrics, and production monitoring.
- Includes practical testing patterns: statistical test evaluation, behavioral contract testing, and adversarial testing.
- Highlights common anti-patterns and sharp edges in LLM agent evaluation.
- Designed for use alongside related skills such as multi-agent orchestration and autonomous agents.
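To illustrate the kind of pattern the changelog refers to, here is a minimal sketch of statistical test evaluation combined with a behavioral contract check. The `run_agent` function is a hypothetical stand-in for a real agent call (this skill's actual API is not documented here), and the 80% success rate and 0.6 threshold are illustrative assumptions only.

```python
import random


def run_agent(task: str) -> str:
    # Hypothetical stand-in for a real LLM agent invocation.
    # Succeeds ~80% of the time to simulate non-deterministic behavior;
    # replace with your agent's actual call.
    return "PASS" if random.random() < 0.8 else "FAIL"


def pass_rate(task: str, trials: int = 20) -> float:
    """Statistical test evaluation: run the same task repeatedly and
    report the fraction of successful runs, since a single run tells
    you little about a non-deterministic agent."""
    successes = sum(run_agent(task) == "PASS" for _ in range(trials))
    return successes / trials


if __name__ == "__main__":
    random.seed(0)  # fixed seed so the evaluation is reproducible
    rate = pass_rate("summarize this document", trials=50)
    print(f"pass rate over 50 trials: {rate:.2f}")
    # Behavioral contract: fail the suite if reliability drops below
    # an agreed threshold (0.6 here is an arbitrary example value).
    assert rate >= 0.6, "agent reliability below contract threshold"
```

The same loop extends naturally to adversarial testing: swap the fixed task for a set of adversarial prompts and require the contract to hold on each one.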