record state
frontier-ownedReview status
This finding is part of accepted frontier state. Review events, reviewable changes, and proof state explain how it can change.
frontiers / frontier
Finding bundle
back to staterecord state
frontier-ownedThis finding is part of accepted frontier state. Review events, reviewable changes, and proof state explain how it can change.
finding statement
finding typeNo entity list is declared.
evidence
source-boundtheoretical · manual state transition
proof impact
packet context1 reviewable changes and 0 evaluation records are attached to this finding id.
Evidence and conditions
method
manual state transition
evidence type
theoretical
conditions
Provenance
source title
AI Alignment Survey (2023); Benchmark Validity literature
authors
reviewer:will-blair
Current safety benchmarks (MMLU, TruthfulQA, HumanEval) were designed for capability measurement, not safety; their validity as alignment indicators is contested and they do not measure scheming or deception.
vs_3549be2124e758a8 · manual_curation
outgoing
No outgoing links.
incoming
contradicts · vf_3f73e69072a0dafd
contradicts · vf_1897f0ee215aca32
contradicts · vf_0d42e2d04ee3cc14
events
vev_c88928b9bc8057d6finding.assertedManual finding added to frontier state
reviewer:will-blair · 2026-05-29
reviewable changes
vpr_7f6fdab5abada815finding.addManual finding added to frontier state
applied · reviewer:will-blair · 2026-05-29
evaluations
No evaluation record targets this finding id.