This website requires JavaScript.
Explore
Help
Sign In
fam
/
nanochat-omni
Watch
1
Star
0
Fork
0
You've already forked nanochat-omni
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
e569b59f92aea06bf8fc1c48489b3cc2e57189f4
nanochat-omni
/
scripts
T
History
Andrej Karpathy
e569b59f92
delete torchao dependency, create our own exact API-matched version of Float8Linear, document it very well. for some poorly understood reason, the performance is not only ~identical but actually runs 3% faster. despite of it being significantly simpler and much less code. i don't fully understand why/how atm
2026-02-10 18:46:39 +00:00
..
base_eval.py
small touchups to the eval script, re-order items etc, cosmetic
2026-02-03 21:03:42 +00:00
base_train.py
delete torchao dependency, create our own exact API-matched version of Float8Linear, document it very well. for some poorly understood reason, the performance is not only ~identical but actually runs 3% faster. despite of it being significantly simpler and much less code. i don't fully understand why/how atm
2026-02-10 18:46:39 +00:00
chat_cli.py
remove leftover mid references (
#491
)
2026-02-02 08:33:46 -08:00
chat_eval.py
remove leftover mid references (
#491
)
2026-02-02 08:33:46 -08:00
chat_rl.py
remove leftover mid references (
#491
)
2026-02-02 08:33:46 -08:00
chat_sft.py
fix bug in chat_sft, the attention window must be preserved sigh
2026-02-01 20:58:44 +00:00
chat_web.py
remove leftover mid references (
#491
)
2026-02-02 08:33:46 -08:00
tok_eval.py
initial commit
2025-10-13 06:49:24 -07:00
tok_train.py
quick fix to not OOM main speedrun script
2026-01-26 22:31:42 +00:00