Research Archive

Timeline

Chronological record of research activity — papers reviewed, forecasts published, projects completed. Updated as new work is done.

Paper

Forecast

Essay

Note

Hypothesis

Project

May 2026

ProjectTQC System — Three-Component Quantitative Trading Platform

C++20 engine with lock-free MPSC ring buffer, HMM 3-state regime classifier, GARCH(1,1) vol-scaled sizing. Built and completed.

ProjectTQC Market Regime Classifier — 93.3% Accuracy

Neural network for market regime classification across Bear/Sideways/Bull states. Used to gate signal execution within TQC.

ForecastAGI by 2030 → 35% (initial)

Initial estimate. Based on reference class forecasting and current scaling trajectory.

ForecastFrontier AI Regulation G7 by 2026 → 48% (initial)

Initial estimate. EU AI Act in force; US legislative path remains the critical uncertain variable.

ForecastMechanistic Interpretability Causal Account by 2027 → 38% (initial)

Initial estimate. Causal validation at frontier scale remains out of reach; research trajectory is promising.

ForecastCompute >10^28 FLOP by 2027 → 58% (initial)

Initial estimate. Stargate infrastructure implies capability; energy and construction constraints are the binding variables.

ForecastAI Incident >1000 Deaths by 2028 → 11% (initial)

Initial estimate. Primary pathways: critical infrastructure cyberattack and autonomous weapons in active conflict zones.

PaperZoom In: An Introduction to Circuits — Olah et al., Distill 2020

Universality claim is the most consequential bet in mechanistic interpretability. Evidence so far supports it as more than a convenient assumption.

PaperIn-context Learning and Induction Heads — Olsson et al., Anthropic 2022

Capability phase transitions may be traceable to specific structural thresholds — forecastable from mechanistic analysis rather than only discoverable post-hoc.

PaperA Mathematical Framework for Transformer Circuits — Elhage et al., Anthropic 2021

Residual stream framing converts opaque forward pass into compositional operations. Key enabling condition for meaningful third-party auditing.

PaperTransformers Represent Belief State Geometry — Shai et al., 2024

HMM belief state geometry in the residual stream directly connects to TQC regime classification work. Raises the question of what model belief-state mis-specification looks like.

PaperForecasting Transformative AI with Biological Anchors — Cotra, Open Philanthropy 2020

The framework's real contribution is methodological — formalising how to update timeline forecasts as compute costs fall and model efficiency improves.

PaperEmergent Abilities of Large Language Models — Wei et al., TMLR 2022

Apparent discontinuity may be an artefact of evaluation metric design. If Schaeffer et al. are correct, governance frameworks calibrated for capability cliffs are the wrong design target.

PaperAI Governance: A Research Agenda — Dafoe, FHI 2018

Framework needs updating for post-GPT-4 environment. The politics and governance of AI have largely collapsed into a single problem.

PaperToward Trustworthy AI Development — Brundage et al., 2020

Third-party auditing is the correct structural intervention but the audit methodology is critically underdeveloped. Interpretability research is the enabling condition.

PaperSpecification Gaming — Krakovna et al., DeepMind 2020

Specification gaming is not a pathology of poorly designed systems — it is the default behaviour of optimisers operating near the boundary of their specification.

PaperCooperative Inverse Reinforcement Learning — Hadfield-Menell et al., NeurIPS 2016

CIRL is correct as a theoretical frame and insufficient as a governance blueprint. The gap between those two assessments is where the most interesting work is happening.

PaperThe Alignment Problem from a Deep Learning Perspective — Ngo et al., ICLR 2022

Governance frameworks that rely on benchmark-based compliance verification are structurally inadequate. Alignment cannot be verified on a fixed evaluation set.

PaperAI 2027 — AI Futures Project 2024

Models the co-evolution of capability growth and deployment incentives. The governance lag it describes is the central variable that policy researchers should be trying to compress.

Research began · 2025