finding statement
In GPT-2 small, attention head L5H1 acts as an induction head (induction score > 0.9, replicated across five held-out random-repeat prompts and confirmed in distilgpt2).
frontiers / frontier
The accepted finding bundles: the reviewed findings that make up this frontier’s state. Each carries its statement, the evidence and confidence behind it, its review state, and its links to other findings.
by type
by review state
A finding bundle is the durable object. Source graphs, citation stance, candidate gaps, and agent summaries are derived signals until a review event accepts a reviewable change into this record.
finding statement
In GPT-2 small, attention head L5H1 acts as an induction head (induction score > 0.9, replicated across five held-out random-repeat prompts and confirmed in distilgpt2).
evidence
theoretical · manual state transition
provenance
manual finding
review state
The example finding is unreviewed. Frontier changes still pass through reviewable changes and accepted events.
derived signals
Links, candidate gaps, bridges, citation stance, nearby papers, and generated summaries route review. They do not rewrite the record by themselves.
42 findings
finding bundle
unreviewedvf_41d92edaba755cd1evidence unit
theoretical · manual state transition
source handle
manual finding
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_4bca49d448bf6d36evidence unit
computational · manual state transition
source handle
mechinterp circuit harness
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_61257c9696ec7855evidence unit
computational · manual state transition
source handle
mechinterp circuit harness
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_76ddacf8a33ee0cdevidence unit
computational · manual state transition
source handle
mechinterp circuit harness
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_23bfd31465df9981evidence unit
computational · manual state transition
source handle
mechinterp circuit harness
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_a75996be47f4909devidence unit
computational · manual state transition
source handle
mechinterp circuit harness
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_78e0a960c131c2beevidence unit
computational · manual state transition
source handle
mechinterp circuit harness
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_7c015dca286a122aevidence unit
computational · manual state transition
source handle
mechinterp circuit harness
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_f4c4057af4f9c42eevidence unit
computational · manual state transition
source handle
mechinterp circuit harness
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_6384b864e00c7479evidence unit
computational · manual state transition
source handle
mechinterp circuit harness
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_11f63ac26bc557f4evidence unit
computational · manual state transition
source handle
mechinterp circuit harness
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_59285c747f083c24evidence unit
computational · manual state transition
source handle
mechinterp causal sweep
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_d6c317a8ced07e36evidence unit
computational · manual state transition
source handle
mechinterp causal sweep
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_3595f5e61d02769devidence unit
computational · manual state transition
source handle
mechinterp causal sweep
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_da0f52be04ee54bcevidence unit
computational · manual state transition
source handle
mechinterp causal sweep
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_1565bc7a58ce0108evidence unit
computational · manual state transition
source handle
mechinterp causal sweep
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_3e68a029a451f01fevidence unit
computational · manual state transition
source handle
mechinterp causal sweep
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_4547fdc4c89640a3evidence unit
computational · manual state transition
source handle
mechinterp causal sweep
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_b9b16eba11ee2668evidence unit
computational · manual state transition
source handle
mechinterp causal sweep
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_f41d996b7c3aea03evidence unit
computational · manual state transition
source handle
mechinterp causal sweep
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_de8d1e8b8646db6aevidence unit
theoretical · manual state transition
source handle
Beyond Induction Heads (2025); Induction Heads & In-Context Learning (emergentmind.com)
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_182cb4f97418dc5devidence unit
theoretical · manual state transition
source handle
Quantifying LLM Attention-Head Stability (2026)
review state
downstream effect
1 downstream link
inspect finding →
finding bundle
unreviewedvf_ec692e5e87df4298evidence unit
theoretical · manual state transition
source handle
Evaluating Sparse Autoencoders for Monosemantic Representation (2024)
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_3b8b7f0eb1dd2305evidence unit
theoretical · manual state transition
source handle
Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures (2024)
review state
downstream effect
1 downstream link
inspect finding →
finding bundle
unreviewedvf_3cbd8304240a3219evidence unit
theoretical · manual state transition
source handle
Discovering Transformer Circuits via a Hybrid Attribution and Pruning Framework (2024)
review state
downstream effect
2 downstream links
inspect finding →
finding bundle
unreviewedvf_841cfc4181456ee4evidence unit
theoretical · manual state transition
source handle
Quantifying LLM Attention-Head Stability (2026)
review state
downstream effect
1 downstream link
inspect finding →
finding bundle
unreviewedvf_39e5f05fb44e2760evidence unit
theoretical · manual state transition
source handle
Beyond Induction Heads: In-Context Meta Learning Induces Multi-Phase Circuit Emergence (2025)
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_9e8edcb419fd0229evidence unit
theoretical · manual state transition
source handle
Circuit-Aware Reward Training: A Mechanistic Framework for Longtail Robustness in RLHF (2025)
review state
downstream effect
1 downstream link
inspect finding →
finding bundle
unreviewedvf_05353ab782524863evidence unit
theoretical · manual state transition
source handle
Evaluating Sparse Autoencoders for Monosemantic Representation (2024)
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_7a690fac11e87c30evidence unit
theoretical · manual state transition
source handle
Weight-sparse transformers have interpretable circuits (2024)
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_efe9ddeab6b12e54evidence unit
theoretical · manual state transition
source handle
Efficient Automated Circuit Discovery in Transformers using Contextual Decomposition (2024)
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_777e7fc8759edf7eevidence unit
theoretical · manual state transition
source handle
A Practical Review of Mechanistic Interpretability for Transformer-Based Language Models (2024)
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_f23ae921f4703f37evidence unit
theoretical · manual state transition
source handle
Open Problems in Mechanistic Interpretability (Jan 2025); Quantifying LLM Attention-Head Stability (2026)
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_b10bdbb1f34c381eevidence unit
theoretical · manual state transition
source handle
Beyond Induction Heads: In-Context Meta Learning Induces Multi-Phase Circuit Emergence (2025); Towards Universality: Studying Mechanistic Similarity Across Language Model Architectures (2024)
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_672bfc44704d40feevidence unit
theoretical · manual state transition
source handle
Quantifying LLM Attention-Head Stability: Implications for Circuit Universality (2026)
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_7945671662762af1evidence unit
theoretical · manual state transition
source handle
Discovering Transformer Circuits via a Hybrid Attribution and Pruning Framework (2024); Towards Automated Circuit Discovery for Mechanistic Interpretability (2024)
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_6d5cc01c380c814cevidence unit
theoretical · manual state transition
source handle
Evaluating Sparse Autoencoders for Monosemantic Representation (2024); Sparse Autoencoders Find Highly Interpretable Features in Language Models (2023)
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_22ce0bb4da3c4146evidence unit
theoretical · manual state transition
source handle
Beyond Induction Heads: In-Context Meta Learning Induces Multi-Phase Circuit Emergence (2025); Induction Heads in Transformers (emergentmind.com)
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_d3dd34cd06e3d5ceevidence unit
theoretical · manual state transition
source handle
Discovering Transformer Circuits via a Hybrid Attribution and Pruning Framework (2024); Transformer Circuit Faithfulness Metrics are not Robust (2024)
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_9d8e0c6b076d22e3evidence unit
theoretical · manual state transition
source handle
Open Problems in Mechanistic Interpretability (Jan 2025)
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_1251bfd72b49c1efevidence unit
computational · manual state transition
source handle
mechinterp causal sweep wave2
review state
downstream effect
no declared downstream links
inspect finding →
finding bundle
unreviewedvf_47c5956978a83e65evidence unit
computational · manual state transition
source handle
mechinterp causal sweep wave2
review state
downstream effect
no declared downstream links
inspect finding →