3 results for tag "minimal-run-and-audit"
Executes and audits the selected smoke test, documented inference, or evaluation command during README-first AI paper reproduction, writing standardized `repro_outputs/` evidence and patch notes.