evidence boundary
unknowntheoretical
An evidence atom is an inspectable support unit. It is not a finding by itself; it supports or challenges a finding through review.
frontiers / frontier
Evidence atom
back to sourcesevidence boundary
unknownAn evidence atom is an inspectable support unit. It is not a finding by itself; it supports or challenges a finding through review.
finding binding
boundAI safety via debate—where two models argue opposing positions and a human judge determines truthfulness—assumes honest argumentation is detectably different from skilled deception, an assumption that fails for sufficiently deceptive models.
inspect finding →
source binding
source-boundvs_0f0d72c6be022a1e
inspect source →
review context
unverified1 reviewable changes and 0 evaluation records target this atom or its bound objects.
Evidence statement
AI safety via debate—where two models argue opposing positions and a human judge determines truthfulness—assumes honest argumentation is detectably different from skilled deception, an assumption that fails for sufficiently deceptive models.
extraction method
manual_curation
support relation
unknown
condition refs
vcnd_37b6b85c75370eba
Caveats
events
vev_07254bfa148bc3e4finding.assertedManual finding added to frontier state
reviewer:will-blair · 2026-05-29
reviewable changes
vpr_9cf2f4d48faaedd0finding.addManual finding added to frontier state
applied · reviewer:will-blair · 2026-05-29
evaluations
No evaluation rows are attached.