Research Papers

Paper Database

Our research layer tracks papers from arXiv, NeurIPS, ICML, ICLR, and other sources, analyzing their impact on deployed models.

Impact Levels

🔴 Revolutionary

Paradigm-shifting work that redefines what’s possible

🟠 Significant

Major advancement with measurable improvements

🟡 Incremental

Small but meaningful improvement

🟢 Validation

Confirms or reproduces existing work

⚪ Negative

Disproves or challenges existing claims

Browse Papers

Chain of Thought at Scale

Achieving human-level mathematical reasoning
Impact: Significant | Models: GPT-4o, Claude 3 Opus

All Papers

Browse complete paper database

Recent Discoveries

January 2026

Contributing Papers

Found a paper that impacts model performance? We track:

Direct model improvements - Papers that improve specific models
Benchmark changes - New evaluation methods or datasets
Architecture innovations - New techniques applicable across models
Negative results - Papers that challenge existing assumptions

Each paper is cross-referenced with affected models and capabilities.

Papers are reviewed by our graduate research layer before publication

Overview

Models

Frontier Index

Applied Tasks

Research Papers

Paper Database

Impact Levels

Browse Papers

Chain of Thought at Scale

All Papers

Recent Discoveries

Contributing Papers

Overview

Models

Research Papers

Frontier Index

Applied Tasks

​Paper Database

​Impact Levels

​Browse Papers

Chain of Thought at Scale

All Papers

​Recent Discoveries

​Contributing Papers

Paper Database

Impact Levels

Browse Papers

Recent Discoveries

Contributing Papers