source boundary
frontier-owned · declared
A source record is provenance. It supports a finding only through evidence atoms, extraction spans, and reviewed finding bundles.
Source record
finding bindings
record context
Findings bound to this source through source ids, evidence atoms, provenance, or reviewed source-record slots.
evidence atoms
materialized
Evidence atoms pin exact spans, measurements, selectors, or curation assertions to the source.
review context
inspectable
1 reviewable change and 0 evaluations are attached through this source or its findings.
Locator and citation
external source
locator
title: Hierarchy of Agentic Capabilities paper (2025); Frontier Model Performance assessments
imported
2026-05-29T02:53:36.980508+00:00
extraction mode
manual_curation
authors
reviewer:will-blair
Caveats
No source-specific caveats are recorded.
Interactive evaluation environments (agentic task suites with tool use) reveal capability gaps: frontier models pass only 28% of practical multi-step tasks despite 80th-percentile benchmark performance.
events
vev_f5b1a6f83a707f74 · finding.asserted · Manual finding added to frontier state
reviewer:will-blair · 2026-05-29
reviewable changes
vpr_19bc37637b83e49c · finding.add · Manual finding added to frontier state
applied · reviewer:will-blair · 2026-05-29
evaluations
No evaluation rows are attached.