nanochat-omni/dev at 2c062aaa949536c1cff2ffb3df2ae4aeba20dc4b - nanochat-omni - Gitea: Git with a cup of tea

fam/nanochat-omni

Files

T

History

Andrej Karpathy f41dd3cbd7 auto-calculate optimal batch size. the original setting of 0.5M was only optimal for d12, but d26 prefers 1M and so on

2026-02-05 19:40:37 +00:00

..

estimate_gpt3_core.ipynb

add notebook on deriving the CORE estimates for the GPT-3 miniseries.

2026-01-05 18:40:28 +00:00

gen_synthetic_data.py

tune the synthetic data generation script. delete the king andrej stuff lol. also, upgrade to gemini 3

2026-02-02 01:45:59 +00:00

generate_logo.html

initial commit

2025-10-13 06:49:24 -07:00

LEADERBOARD.md

Typo fixes (#480 )

2026-02-05 19:12:50 +01:00

LOG.md

auto-calculate optimal batch size. the original setting of 0.5M was only optimal for d12, but d26 prefers 1M and so on

2026-02-05 19:40:37 +00:00

nanochat.png

Update logo

2025-10-14 14:19:44 -04:00

repackage_data_reference.py

initial commit

2025-10-13 06:49:24 -07:00

scaling_analysis.ipynb

add engram-lite, add log, tune scaling laws analysis scripts

2026-01-27 22:31:17 +00:00

scaling_laws_jan26.png

nuke midtraining from orbit, it's not as needed now that we have a BOS-aligned dataloader. Also change the README a lot. midtrianing is not yet fully properly erased across the board, but good enough for step 1

2026-01-31 19:12:25 +00:00