Version: v0.4.3

Evaluations

Llama Stack Evaluation API for running evaluations on model and agent candidates.

📄️ Evaluate Rows

Evaluate a list of rows on a benchmark.

Run an evaluation on a benchmark.

Get the status of a job.

Cancel a job.

Get the result of a job.