Skip to main content

Version: v0.2.23

Llama Stack Evaluation API for running evaluations on model and agent candidates.

📄️ Evaluate a list of rows on a benchmark.

Evaluate a list of rows on a benchmark.

📄️ Evaluate a list of rows on a benchmark.

Evaluate a list of rows on a benchmark.

📄️ Get the status of a job.

Get the status of a job.

📄️ Cancel a job.

Cancel a job.

📄️ Get the status of a job.

Get the status of a job.

📄️ Cancel a job.

Cancel a job.

📄️ Get the result of a job.

Get the result of a job.

📄️ Get the result of a job.

Get the result of a job.

📄️ Run an evaluation on a benchmark.

Run an evaluation on a benchmark.

📄️ Run an evaluation on a benchmark.

Run an evaluation on a benchmark.