Agent evaluation and benchmarking for AgentsKit.
Coming soon. This package is scaffolded and will be implemented in a future release. See #15.
Planned usage (not yet implemented): eval.run(agent, testCases)
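Since the package is not implemented yet, the following is only a minimal sketch of what a call like this might do, not the AgentsKit API. The `Agent`, `TestCase`, and `EvalResult` types, the exact-match scoring, and the result shape are all assumptions made for illustration. The function is named `run` on its own because `eval` cannot be declared as an identifier in strict-mode modules.

```typescript
// Hypothetical types: nothing here is taken from the real package.
type Agent = (input: string) => Promise<string>;

interface TestCase {
  input: string;
  expected: string;
}

interface EvalResult {
  total: number;
  passed: number;
  failures: { input: string; expected: string; actual: string }[];
}

// Runs each test case through the agent in order and scores it by
// exact string match after trimming whitespace (an assumed scoring rule).
async function run(agent: Agent, testCases: TestCase[]): Promise<EvalResult> {
  const failures: EvalResult["failures"] = [];
  for (const { input, expected } of testCases) {
    const actual = await agent(input);
    if (actual.trim() !== expected.trim()) {
      failures.push({ input, expected, actual });
    }
  }
  return {
    total: testCases.length,
    passed: testCases.length - failures.length,
    failures,
  };
}
```

Under these assumptions, `await run(myAgent, [{ input: "2+2", expected: "4" }])` would resolve to an object with `total`, `passed`, and a `failures` list that can be inspected or printed. A real implementation would likely need pluggable scoring rather than exact match.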
Full documentation