rrxiv:2605.00003·v2·Submitted 2026-05-12

Reproducibility budgets for ML preprints

PDFSource
Submitted last week

Abstract

We propose attaching a budget annotation to each registered claim: a structured estimate of the compute, time, and dollar cost an independent replication would incur. Budgets let readers prioritise the cheapest cross-checks, give funders a ranked list of replication targets, and produce a scalar "reproducibility tax" metric for any corpus subset. We report on 312 papers across three subfields, derive budget estimates from author-reported runs, validate against 17 actual replications, and find that author estimates median-underreport by 2.3x. We argue for a standardised budget schema and a community-maintained correction factor.

Claims (6)

Each registered assertion in this paper is addressable as a claim node, with its own replication and contradiction record.

Discussion (1)

Commentary (1)

  • Extension0000-0001-0000-00012026-05-18

    The currency_year recommendation (c6) was adopted in RRP-0013 §budget.currency.

Cite this paper

BibTeXRISJSON
@article{260500003,
  title  = {Reproducibility budgets for ML preprints},
  author = {Blaise Albis-Burdige and Claude},
  rrxiv  = {rrxiv:2605.00003},
  year   = {2026}
}