mix llm_core.bench (llm_core v0.3.0)


Runs ALF routing and inference pipeline benchmarks.

Registers a lightweight test provider, configures a routing table that points at it, then sends the requested number of calls (--iterations) through the inference pipeline with configurable parallelism (--parallel). Reports total time, average latency, and throughput.
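To make the three reported figures concrete, here is a minimal sketch of how they could be computed for a batch of concurrent calls. It is not the task's actual implementation: `BenchSketch`, `run/3`, and the sleeping stand-in function are hypothetical names chosen for illustration, and the real task may measure things differently.

```elixir
defmodule BenchSketch do
  # Hypothetical helper, not part of llm_core: calls `fun` `iterations` times
  # with at most `parallel` calls in flight, then summarizes the timings.
  def run(fun, iterations, parallel) when iterations > 0 and parallel > 0 do
    {total_us, latencies_us} =
      :timer.tc(fn ->
        1..iterations
        |> Task.async_stream(
          fn _i ->
            # Time each call on its own, in microseconds.
            {call_us, _result} = :timer.tc(fun)
            call_us
          end,
          max_concurrency: parallel,
          timeout: :infinity
        )
        |> Enum.map(fn {:ok, call_us} -> call_us end)
      end)

    %{
      # Wall-clock time for the whole batch.
      total_ms: total_us / 1000,
      # Mean of the per-call durations.
      avg_latency_ms: Enum.sum(latencies_us) / iterations / 1000,
      # Completed calls per second of wall-clock time.
      throughput_per_s: iterations / max(total_us, 1) * 1_000_000
    }
  end
end

# Stand-in for one call through the inference pipeline (assumption).
stats = BenchSketch.run(fn -> Process.sleep(1) end, 20, 4)
IO.inspect(stats)
```

With parallelism above 1, total time is typically shorter than the sum of the per-call latencies, which is why throughput and average latency are reported separately.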

Examples

mix llm_core.bench
mix llm_core.bench --iterations 500 --parallel 8
mix llm_core.bench --mode packet