mix scoria.eval (scoria v0.1.0)

Copy Markdown View Source

Runs LLM-as-judge evaluations over dataset items.

Options

  • --dataset - The UUID of the dataset to evaluate

Example

mix scoria.eval --dataset 00000000-0000-0000-0000-000000000000