Evaluation Frameworks

Frameworks for evaluating agents in production beyond accuracy: cost, latency, reliability, assurance.