Benchmark workflows
This page summarizes the current benchmark methodology and the latest published benchmark snapshot.Methodology
The benchmark posture currently combines:- competitive workflow benchmarks in
benchmarks/competitive/results/omni_apps_sec_workflows_latest.json - rollup scoring in
benchmarks/competitive/results/scorecard.json - the human-readable report in
benchmarks/competitive/latest-report.md
2026-03-16T07:00:17.745659+00:00.
Current snapshot vs sec-api.io
mapping_lookup: Omni p50=264.52msvs sec-api.io p50=392.33msfiling_search: Omni p50=264.83msvs sec-api.io p50=441.72mssection_extract: Omni p50=655.76msvs sec-api.io p50=526.99msxbrl_to_json: Omni p50=268.3msvs sec-api.io p50=545.39ms
Current snapshot vs financialdatasets.ai
income_statement: Omni p50=263.22msvs financialdatasets.ai p50=396.24msbalance_sheet: Omni p50=263.22msvs financialdatasets.ai p50=360.7mscash_flow: Omni p50=263.22msvs financialdatasets.ai p50=328.26msfinancial_metrics: Omni p50=268.3msvs financialdatasets.ai p50=321.99mssec_filings: Omni p50=264.83msvs financialdatasets.ai p50=379.96msinsider_trades: Omni p50=258.2msvs financialdatasets.ai p50=330.97ms
Current quality posture
- API eval baseline: 75 questions, 72 passed, 3 gated, scored pass rate 100.0%
- agent eval sample: 5 questions, 5 passed, scored pass rate 100.0%
- search quality: filings top-1=
1.00, filings top-3=1.00, sections top-3=1.00
Heaviest current cases
filing_search_text_googl: p95=1389.03mslatest_section: p95=867.45ms, payload=69070 Bsection_search_jpm: p95=734.3msinsiders_amzn: p95=570.23mscompensation_amzn: p95=524.47ms, payload=3567 B
Run locally
Source artifacts
benchmarks/competitive/latest-report.mdbenchmarks/competitive/results/scorecard.jsonbenchmarks/competitive/results/omni_apps_sec_workflows_latest.json