This website requires JavaScript.
Explore
Help
Sign In
fam
/
nanochat-omni
Watch
1
Star
0
Fork
0
You've already forked nanochat-omni
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
2bb93b2ae4c8a4afc6a3d5741c934f0e0976b4c2
nanochat-omni
/
scripts
T
History
2bitbit
2bb93b2ae4
fix: correct minor typos in help text, README, and comments
2026-03-12 17:03:26 +08:00
..
base_eval.py
delete autocast, an unnecessary thorn in my side, manage dtypes directly
2026-03-04 23:55:30 +00:00
base_train.py
All of these improvements were developed by Claude running autonomously over ~2 days using autoresearch. I didn't touch anything - incredible. All tuning was done on d12 but generalized easily to larger models (e.g. d24 in particular). This means we will also get a new "Time to GPT-2" Leaderboard entry, which I will push separately.
2026-03-09 20:45:17 +00:00
chat_cli.py
delete autocast, an unnecessary thorn in my side, manage dtypes directly
2026-03-04 23:55:30 +00:00
chat_eval.py
delete autocast, an unnecessary thorn in my side, manage dtypes directly
2026-03-04 23:55:30 +00:00
chat_rl.py
delete autocast, an unnecessary thorn in my side, manage dtypes directly
2026-03-04 23:55:30 +00:00
chat_sft.py
fix: correct minor typos in help text, README, and comments
2026-03-12 17:03:26 +08:00
chat_web.py
delete autocast, an unnecessary thorn in my side, manage dtypes directly
2026-03-04 23:55:30 +00:00
tok_eval.py
initial commit
2025-10-13 06:49:24 -07:00
tok_train.py
fix: correct minor typos in help text, README, and comments
2026-03-12 17:03:26 +08:00