Logo
Explore Help
Sign In
fam/nanochat-omni
1
0
Fork 0
You've already forked nanochat-omni
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
Files
e569b59f92aea06bf8fc1c48489b3cc2e57189f4
nanochat-omni/scripts
T
History
Andrej Karpathy e569b59f92 delete torchao dependency, create our own exact API-matched version of Float8Linear, document it very well. for some poorly understood reason, the performance is not only ~identical but actually runs 3% faster. despite of it being significantly simpler and much less code. i don't fully understand why/how atm
2026-02-10 18:46:39 +00:00
..
base_eval.py
small touchups to the eval script, re-order items etc, cosmetic
2026-02-03 21:03:42 +00:00
base_train.py
delete torchao dependency, create our own exact API-matched version of Float8Linear, document it very well. for some poorly understood reason, the performance is not only ~identical but actually runs 3% faster. despite of it being significantly simpler and much less code. i don't fully understand why/how atm
2026-02-10 18:46:39 +00:00
chat_cli.py
remove leftover mid references (#491)
2026-02-02 08:33:46 -08:00
chat_eval.py
remove leftover mid references (#491)
2026-02-02 08:33:46 -08:00
chat_rl.py
remove leftover mid references (#491)
2026-02-02 08:33:46 -08:00
chat_sft.py
fix bug in chat_sft, the attention window must be preserved sigh
2026-02-01 20:58:44 +00:00
chat_web.py
remove leftover mid references (#491)
2026-02-02 08:33:46 -08:00
tok_eval.py
initial commit
2025-10-13 06:49:24 -07:00
tok_train.py
quick fix to not OOM main speedrun script
2026-01-26 22:31:42 +00:00
Powered by Gitea Version: 1.26.0+rc0 Page: 38ms Template: 4ms
Auto
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API