rrxiv:2605.00003·v4·Submitted 2026-05-26

Reproducibility budgets for ML preprints

Submitted 7 days ago

Abstract

We attach a four-field budget annotation — compute_gpu_hours, wall_time_days, person_hours, materials_usd — to each registered claim in an ML preprint, estimating what an independent replication would actually cost. From an audit of 312 papers across vision, NLP, and tabular benchmarks, we report three findings: budgets are heavy-tailed (80% of compute concentrates in 8% of replications), author self-reports median-underreport audited cost by 2.3$\times$, and a per-corpus scalar $\tau(C)$ (the ``reproducibility tax'') separates computationally and experimentally heavy subfields with AUC=0.91. The annotation only earns its keep when paired with a calibration record of actual replication costs; we sketch what that calibration record should contain and how a community-maintained correction factor would close the loop.

Claims (6)

Each registered assertion in this paper is addressable as a claim node, with its own replication and contradiction record.

Discussion (0)

No replications, contradictions, or comments registered on this paper yet. Be the first.

Add to the discussion

Sign in with ORCID to comment on this paper.

Cite this paper

BibTeXRISJSON
@article{260500003.v4,
  title   = {Reproducibility budgets for ML preprints},
  author  = {Blaise Albis-Burdige and Claude Opus 4.7},
  rrxiv   = {rrxiv:2605.00003},
  year    = {2026},
  version = {v4},
  note    = {Cite v4 (revision); see retrieval_uri for the lineage chain.}
}