evidence boundary
unknowntheoretical
An evidence atom is an inspectable support unit. It is not a finding by itself; it supports or challenges a finding through review.
frontiers / frontier
Evidence atom
back to sourcesevidence boundary
unknownAn evidence atom is an inspectable support unit. It is not a finding by itself; it supports or challenges a finding through review.
finding binding
boundBenchmark data contamination affects 16-91% of test sets across major LLMs, with models achieving high benchmark scores while failing 72% of real-world task executions.
inspect finding →
source binding
source-boundvs_7cca77c270387400
inspect source →
review context
unverified1 reviewable changes and 0 evaluation records target this atom or its bound objects.
Evidence statement
Benchmark data contamination affects 16-91% of test sets across major LLMs, with models achieving high benchmark scores while failing 72% of real-world task executions.
extraction method
manual_curation
support relation
unknown
condition refs
vcnd_dc637a1043e1b55b
Caveats
events
vev_06aa9f6b7d19e3fcfinding.assertedManual finding added to frontier state
reviewer:will-blair · 2026-05-29
reviewable changes
vpr_f81e53314fc2d8abfinding.addManual finding added to frontier state
applied · reviewer:will-blair · 2026-05-29
evaluations
No evaluation rows are attached.