Evaluations
Llama Stack Evaluation API for running evaluations on model and agent candidates.
📄️ Evaluate Rows
Evaluate a list of rows on a benchmark.
📄️ Run Eval
Run an evaluation on a benchmark.
📄️ Job Status
Get the status of a job.
📄️ Job Cancel
Cancel a job.
📄️ Job Result
Get the result of a job.