record state
frontier-ownedReview status
This finding is part of accepted frontier state. Review events, reviewable changes, and proof state explain how it can change.
frontiers / frontier
Finding bundle
back to staterecord state
frontier-ownedThis finding is part of accepted frontier state. Review events, reviewable changes, and proof state explain how it can change.
finding statement
finding typeNo entity list is declared.
evidence
source-boundtheoretical · manual state transition
proof impact
packet context1 reviewable changes and 0 evaluation records are attached to this finding id.
Evidence and conditions
method
manual state transition
evidence type
theoretical
conditions
Provenance
source title
Scalable Oversight review (2024); Doubly-Efficient Debate (Brown-Cohen et al., ICML 2024)
authors
reviewer:will-blair
Scalable oversight approaches (iterated amplification, recursive reward modeling, debate) provide frameworks for human oversight of superhuman tasks, but they assume the honest strategy can simulate the AI system for exponentially many steps—an assumption that breaks for sufficiently advanced models.
vs_6930b4944d805a78 · manual_curation
outgoing
No outgoing links.
incoming
supports · vf_4927eb9384edd7ce
contradicts · vf_7bff72eaad13e7e2
events
vev_76ad0bcfebef4559finding.assertedManual finding added to frontier state
reviewer:will-blair · 2026-05-29
reviewable changes
vpr_316479af44e3c3affinding.addManual finding added to frontier state
applied · reviewer:will-blair · 2026-05-29
evaluations
No evaluation record targets this finding id.