Logo
Explore Help
Sign In
fam/nanochat-omni
1
0
Fork 0
You've already forked nanochat-omni
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
Files
dfe7d39ce8f40c948222e493c7f39e39f0e9189e
nanochat-omni/dev
T
History
svlandeg dfe7d39ce8 Merge branch 'master' into fix/typo
2026-03-18 17:01:45 +01:00
..
estimate_gpt3_core.ipynb
add notebook on deriving the CORE estimates for the GPT-3 miniseries.
2026-01-05 18:40:28 +00:00
gen_synthetic_data.py
tune the synthetic data generation script. delete the king andrej stuff lol. also, upgrade to gemini 3
2026-02-02 01:45:59 +00:00
generate_logo.html
initial commit
2025-10-13 06:49:24 -07:00
LEADERBOARD.md
~1.5h :-)
2026-03-15 22:29:27 +01:00
LOG.md
delete autocast, an unnecessary thorn in my side, manage dtypes directly
2026-03-04 23:55:30 +00:00
nanochat.png
Update logo
2025-10-14 14:19:44 -04:00
repackage_data_reference.py
document the legacy fineweb100b dataset and the new climbmix400b dataset
2026-03-03 17:24:31 +00:00
scaling_analysis.ipynb
fix scaling laws scripts after the bigram embeddings were removed
2026-03-17 16:55:56 +00:00
scaling_laws_jan26.png
nuke midtraining from orbit, it's not as needed now that we have a BOS-aligned dataloader. Also change the README a lot. midtrianing is not yet fully properly erased across the board, but good enough for step 1
2026-01-31 19:12:25 +00:00
Powered by Gitea Version: 1.26.0+rc0 Page: 48ms Template: 3ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API