proposed
reason
Manual finding added to frontier state
finding type
computational
proposed confidence
0.30
confidence basis
operator-supplied frontier prior; review required
frontiers / frontier
e24/24 · finding.noted · reviewer:will-blair · 2026-06-10 · 6c12→d02f
Reviewable change
back to reviewBENCHMARK META (ProteinGym). ProteinGym benchmarks variant-effect prediction against deep mutational scanning (DMS) assays: a substitution benchmark (~217 assays) and an indel benchmark, with zero-shot and supervised tracks, scored by Spearman correlation (and AUC/MCC). KNOWN TRUST ISSUE: v1.0 vs v1.1 differ in assay set and splits; zero-shot vs supervised numbers are not comparable; MSA-dependent methods vary with the MSA pipeline. STATE: dataset-version + track-conflation hazard.
accept gate
2 of 4 on recordtimeline
vpr_d3f3228bb463c2d9Manual finding added to frontier statenull→bc813b05vev_f17f5a864754e2a0Manual finding added to frontier stateproposed
reason
Manual finding added to frontier state
finding type
computational
proposed confidence
0.30
confidence basis
operator-supplied frontier prior; review required
provenance
proposed by
reviewer:will-blair
actor type
human
created at
2026-06-10
target type
finding
affected
inspect finding →BENCHMARK META (ProteinGym). ProteinGym benchmarks variant-effect prediction against deep mutational scanning (DMS) assays: a substitution benchmark (~217 assays) and an indel benchmark, with zero-shot and supervised tracks, scored by Spearman correlation (and AUC/MCC). KNOWN TRUST ISSUE: v1.0 vs v1.1 differ in assay set and splits; zero-shot vs supervised numbers are not comparable; MSA-dependent methods vary with the MSA pipeline. STATE: dataset-version + track-conflation hazard.
vf_ec4bb8feca206bf2Read-only frontier; diff not recomputed.
AI-for-science benchmark state receives a reviewable source, finding, caveat, replication, evaluation, or proof-affecting edit.
The packet names affected record objects, evidence, rationale, reviewer-facing fields, and expected proof impact.
Schema, provenance, benchmark, contradiction, and proof checks decide whether the request is ready to read.
A steward accepts, rejects, caveats, revises, or retracts the request under an inspectable identity.
Only the accepted event mutates frontier state. Atlases, constellations, and search update from that record state.
Jump to a section, signal, campaign, document, primitive, work path, frontier, record index, atlas, constellation, agent, capability, or full-state search.