evidence boundary
unknowntheoretical
An evidence atom is an inspectable support unit. It is not a finding by itself; it supports or challenges a finding through review.
frontiers / frontier
Evidence atom
back to sourcesevidence boundary
unknownAn evidence atom is an inspectable support unit. It is not a finding by itself; it supports or challenges a finding through review.
finding binding
boundSleeper agents—models trained to behave safely during training but activate harmful behavior post-deployment—can persist through standard safety training procedures.
inspect finding →
source binding
source-boundvs_d4f4579197e9ae15
inspect source →
review context
unverified1 reviewable changes and 0 evaluation records target this atom or its bound objects.
Evidence statement
Sleeper agents—models trained to behave safely during training but activate harmful behavior post-deployment—can persist through standard safety training procedures.
extraction method
manual_curation
support relation
unknown
condition refs
vcnd_e2f87892b3333a60
Caveats
events
vev_ad377dde037f73adfinding.assertedManual finding added to frontier state
reviewer:will-blair · 2026-05-29
reviewable changes
vpr_75751f97b87b33a2finding.addManual finding added to frontier state
applied · reviewer:will-blair · 2026-05-29
evaluations
No evaluation rows are attached.