proposed
reason
Manual finding added to frontier state
finding type
computational
proposed confidence
0.30
confidence basis
operator-supplied frontier prior; review required
frontiers / frontier
e24/24 · finding.noted · reviewer:will-blair · 2026-06-10 · 6c12→d02f
Reviewable change
back to reviewBENCHMARK CLAIM (MiniF2F) — HyperTree Proof Search (HTPS, Lample et al.) REPORTS a miniF2F pass rate via learned best-first proof search. VERIFICATION STATE: author-reported; search budget and version-specific. NOT re-run here. Open obligation: re-run at the stated budget on a pinned split.
accept gate
2 of 4 on recordtimeline
vpr_8ebb01be4aedad3bManual finding added to frontier statenull→42db6392vev_03b2b7f5e7e0be96Manual finding added to frontier stateproposed
reason
Manual finding added to frontier state
finding type
computational
proposed confidence
0.30
confidence basis
operator-supplied frontier prior; review required
provenance
proposed by
reviewer:will-blair
actor type
human
created at
2026-06-10
target type
finding
affected
inspect finding →BENCHMARK CLAIM (MiniF2F) — HyperTree Proof Search (HTPS, Lample et al.) REPORTS a miniF2F pass rate via learned best-first proof search. VERIFICATION STATE: author-reported; search budget and version-specific. NOT re-run here. Open obligation: re-run at the stated budget on a pinned split.
vf_9a454a597ddee070Read-only frontier; diff not recomputed.
AI-for-science benchmark state receives a reviewable source, finding, caveat, replication, evaluation, or proof-affecting edit.
The packet names affected record objects, evidence, rationale, reviewer-facing fields, and expected proof impact.
Schema, provenance, benchmark, contradiction, and proof checks decide whether the request is ready to read.
A steward accepts, rejects, caveats, revises, or retracts the request under an inspectable identity.
Only the accepted event mutates frontier state. Atlases, constellations, and search update from that record state.
Jump to a section, signal, campaign, document, primitive, work path, frontier, record index, atlas, constellation, agent, capability, or full-state search.