proposed
reason
Manual finding added to frontier state
finding type
computational
proposed confidence
0.30
confidence basis
operator-supplied frontier prior; review required
e24/24 · finding.noted · reviewer:will-blair · 2026-06-10 · 6c12→d02f
Reviewable change
BENCHMARK CLAIM (miniF2F): Draft-Sketch-Prove (DSP) reports an improved miniF2F-test pass rate by drafting an informal proof, sketching a formal skeleton, then closing the gaps with an ATP. VERIFICATION STATE: author-reported; pipeline described; depends on the underlying ATP and the autoformalizer, both of which drift across versions. NOT re-run here. Open obligation: reproduce with pinned ATP + LLM versions.
accept gate
2 of 4 on record
timeline
vpr_f08818383df2a902
Manual finding added to frontier state
null→ac87dcff
vev_e5d45a5605897295
Manual finding added to frontier state
proposed
provenance
proposed by
reviewer:will-blair
actor type
human
created at
2026-06-10
target type
finding
affected
BENCHMARK CLAIM (miniF2F): Draft-Sketch-Prove (DSP) reports an improved miniF2F-test pass rate by drafting an informal proof, sketching a formal skeleton, then closing the gaps with an ATP. VERIFICATION STATE: author-reported; pipeline described; depends on the underlying ATP and the autoformalizer, both of which drift across versions. NOT re-run here. Open obligation: reproduce with pinned ATP + LLM versions.
vf_368ec6ffb5747092
Read-only frontier; diff not recomputed.
AI-for-science benchmark state receives a reviewable source, finding, caveat, replication, evaluation, or proof-affecting edit.
The packet names affected record objects, evidence, rationale, reviewer-facing fields, and expected proof impact.
Schema, provenance, benchmark, contradiction, and proof checks decide whether the request is ready to read.
A steward accepts, rejects, caveats, revises, or retracts the request under an inspectable identity.
Only the accepted event mutates frontier state. Atlases, constellations, and search update from that record state.
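The review flow described above can be sketched as a small state model. This is only an illustrative sketch, not the system's actual implementation: the names `Packet`, `Frontier`, `Decision`, `ready_to_read`, and `decide` are hypothetical, and the rule that all five checks must pass before a packet is readable is an assumption drawn from the wording of the steps.

```python
from dataclasses import dataclass, field
from enum import Enum


class Decision(Enum):
    ACCEPT = "accept"
    REJECT = "reject"
    CAVEAT = "caveat"
    REVISE = "revise"
    RETRACT = "retract"


# Assumed check names, taken from the step that lists the checks.
REQUIRED_CHECKS = ("schema", "provenance", "benchmark", "contradiction", "proof")


@dataclass
class Packet:
    """A reviewable change request (all field names are hypothetical)."""
    target: str        # affected record object, e.g. a finding id
    rationale: str
    proposed_by: str
    checks: dict = field(default_factory=dict)  # check name -> passed?

    def ready_to_read(self) -> bool:
        # Assumption: a missing check counts as a failing check.
        return all(self.checks.get(name, False) for name in REQUIRED_CHECKS)


@dataclass
class Frontier:
    findings: dict = field(default_factory=dict)  # current record state
    log: list = field(default_factory=list)       # every decision, with identity

    def decide(self, packet: Packet, steward: str, decision: Decision) -> bool:
        """Record a steward decision; return True if state was mutated."""
        if not packet.ready_to_read():
            raise ValueError("packet has failing or missing checks")
        # Every decision is logged under the steward's identity.
        self.log.append((steward, decision.value, packet.target))
        # Only an accepted event changes frontier state.
        if decision is Decision.ACCEPT:
            self.findings[packet.target] = packet.rationale
            return True
        return False
```

In this sketch a rejected or caveated packet still leaves a log entry under the steward's identity, but `findings` changes only on `Decision.ACCEPT`, which mirrors the statement that only the accepted event mutates frontier state.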