Baselight
Loading...Loading chart...
1-- CHART 2: Data Exhaustion Timeline
2SELECT 
3    year,
4    model_example,
5    dataset_size_tokens_billion,
6    total_available_stock_tokens_billion,
7    source_type
8FROM "@adlrocha.llm_stats.ai_training_data_exhaustion"
9ORDER BY year
yearmodel_exampledataset_size_tokens_billiontotal_available_stock_tokens_billionsource_type
2018GPT-11300000Historical actual
2019GPT-24300000Historical actual
2020GPT-3374300000Historical actual
2021Gopher300300000Historical actual
2022Chinchilla1400300000Historical actual
2022.3GPT-3.54000300000Historical actual
2023GPT-413000300000Historical actual
2023.5PaLM-24000300000Historical actual
2024Llama 4 Scout40000300000Actual from Meta
2025Projected75000300000Epoch AI projection
2026Projected150000300000Epoch AI projection
2027Projected250000300000Epoch AI projection exceeds stock
2028DATA WALL300000300000Epoch AI median exhaustion date

Share link

Anyone who has the link will be able to view this.