Llama Stack Evaluation API for running evaluations on model and agent candidates.
📄️ Evaluate a list of rows on a benchmark.
Evaluate a list of rows on a benchmark.
📄️ Evaluate a list of rows on a benchmark.
Evaluate a list of rows on a benchmark.
📄️ Get the status of a job.
Get the status of a job.
📄️ Cancel a job.
Cancel a job.
📄️ Get the status of a job.
Get the status of a job.
📄️ Cancel a job.
Cancel a job.
📄️ Get the result of a job.
Get the result of a job.
📄️ Get the result of a job.
Get the result of a job.
📄️ Run an evaluation on a benchmark.
Run an evaluation on a benchmark.
📄️ Run an evaluation on a benchmark.
Run an evaluation on a benchmark.