Logo
Explore Help
Sign In
fam/nanochat-omni
1
0
Fork 0
You've already forked nanochat-omni
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
Files
d96558bcb0dc11b546bebff79bc0f56fa944c362
nanochat-omni/dev
T
History
Andrej Karpathy 1076f97059 delete autocast, an unnecessary thorn in my side, manage dtypes directly
2026-03-04 23:55:30 +00:00
..
estimate_gpt3_core.ipynb
add notebook on deriving the CORE estimates for the GPT-3 miniseries.
2026-01-05 18:40:28 +00:00
gen_synthetic_data.py
tune the synthetic data generation script. delete the king andrej stuff lol. also, upgrade to gemini 3
2026-02-02 01:45:59 +00:00
generate_logo.html
initial commit
2025-10-13 06:49:24 -07:00
LEADERBOARD.md
Document new Leaderboard entry congrats @ddudek for pointing out ClimbMix, time to GPT-2 is now 2.01 hours, down from 2.76 previously
2026-03-04 20:02:07 +00:00
LOG.md
delete autocast, an unnecessary thorn in my side, manage dtypes directly
2026-03-04 23:55:30 +00:00
nanochat.png
Update logo
2025-10-14 14:19:44 -04:00
repackage_data_reference.py
document the legacy fineweb100b dataset and the new climbmix400b dataset
2026-03-03 17:24:31 +00:00
scaling_analysis.ipynb
add engram-lite, add log, tune scaling laws analysis scripts
2026-01-27 22:31:17 +00:00
scaling_laws_jan26.png
nuke midtraining from orbit, it's not as needed now that we have a BOS-aligned dataloader. Also change the README a lot. midtrianing is not yet fully properly erased across the board, but good enough for step 1
2026-01-31 19:12:25 +00:00
Powered by Gitea Version: 1.26.0+rc0 Page: 111ms Template: 14ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API