Agentic.Strategy.Experiment
(agentic v0.2.2)
Copy Markdown
Experiment runner for head-to-head strategy comparison.
Runs the same prompts through multiple strategies with configurable repetitions, then computes comparison metrics.
Summary
Functions
Compare results across strategies, computing aggregate metrics.
Run an experiment, collecting results for each (prompt, strategy, repetition) triple.
Types
@type comparison() :: %{ strategy: atom(), run_count: non_neg_integer(), success_count: non_neg_integer(), success_rate: float(), avg_duration_ms: float(), avg_cost: float(), avg_tokens: non_neg_integer(), avg_tool_calls: non_neg_integer() }
@type result() :: %{ strategy: atom(), prompt: String.t(), repetition: pos_integer(), result: {:ok, map()} | {:error, term()}, duration_ms: non_neg_integer() }
Functions
@spec compare(t()) :: [comparison()]
Compare results across strategies, computing aggregate metrics.
Run an experiment, collecting results for each (prompt, strategy, repetition) triple.