Experiment comparison matrix
Compare real retrieval configs using each config's latest completed, qrels-backed evaluation run. Best-by-metric indicators are computed only from real numeric fields.
Best Recall@10
Computed from latest completed runs.
Not enough comparable data.
Best MRR@10
Computed from latest completed runs.
Not enough comparable data.
Best NDCG@10
Computed from latest completed runs.
Not enough comparable data.
Fastest avg latency
Computed from latest completed runs.
Not enough comparable data.
Quality comparison
Latest completed runs with real Recall@10, MRR@10, and NDCG@10 values.
No real quality metrics are available for charting.
Latency comparison
Latest completed runs with real measured latency fields.
No real latency values are available for charting.
Comparison matrix
Latest completed evaluation run per experiment config. Missing values are shown as unavailable.
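The selection rule described here (one latest completed run per config, with best-by-metric picks drawn only from real numeric values) could be sketched roughly as follows. This is a minimal illustration, not the dashboard's actual implementation: the record fields (config_id, status, finished_at, recall_at_10, avg_latency_ms) and the two-value minimum for "enough comparable data" are assumptions made for the example.

```python
import math
from datetime import datetime

# Hypothetical run records; field names are assumptions, not the real API schema.
runs = [
    {"config_id": "bm25", "status": "completed", "finished_at": "2024-01-01T10:00:00",
     "recall_at_10": 0.61, "avg_latency_ms": 42.0},
    {"config_id": "bm25", "status": "completed", "finished_at": "2024-01-02T10:00:00",
     "recall_at_10": 0.64, "avg_latency_ms": None},
    {"config_id": "dense", "status": "completed", "finished_at": "2024-01-02T09:00:00",
     "recall_at_10": 0.70, "avg_latency_ms": 55.0},
    {"config_id": "dense", "status": "running", "finished_at": "2024-01-03T09:00:00",
     "recall_at_10": None, "avg_latency_ms": None},
    {"config_id": "hybrid", "status": "completed", "finished_at": "2024-01-02T11:00:00",
     "recall_at_10": float("nan"), "avg_latency_ms": 48.0},
]


def is_real_number(value):
    """True only for finite ints/floats; None, NaN, inf, bools, and strings are rejected."""
    return (
        isinstance(value, (int, float))
        and not isinstance(value, bool)
        and math.isfinite(value)
    )


def latest_completed_by_config(all_runs):
    """Keep, per config, the completed run with the most recent finish time."""
    latest = {}
    for run in all_runs:
        if run.get("status") != "completed":
            continue
        current = latest.get(run["config_id"])
        if current is None or (
            datetime.fromisoformat(run["finished_at"])
            > datetime.fromisoformat(current["finished_at"])
        ):
            latest[run["config_id"]] = run
    return latest


def best_by_metric(latest, field, higher_is_better=True, min_comparable=2):
    """Return the winning config id, or None when too few real values exist.

    min_comparable=2 is an assumed threshold for "not enough comparable data".
    """
    candidates = [
        (config_id, run[field])
        for config_id, run in latest.items()
        if is_real_number(run.get(field))
    ]
    if len(candidates) < min_comparable:
        return None
    pick = max if higher_is_better else min
    return pick(candidates, key=lambda pair: pair[1])[0]


latest = latest_completed_by_config(runs)
best_recall = best_by_metric(latest, "recall_at_10")
fastest = best_by_metric(latest, "avg_latency_ms", higher_is_better=False)
best_mrr = best_by_metric(latest, "mrr_at_10")  # no run has this field
```

In this sketch a missing or non-finite value simply drops a config out of that metric's comparison; it is never replaced with a default, which matches the "missing values stay unavailable" behaviour described above.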
Experiment configs
Reusable retrieval parameter sets returned by FastAPI.
Select an experiment config
No config is selected.
Config details are shown only for real configs returned by FastAPI.
Admin actions
Local actions call admin-protected FastAPI endpoints. The API key stays in component state only.