Andrej Karpathy
|
98eed6df18
|
bring back an assert guarding against bad param sizing
|
2026-02-05 18:14:30 +00:00 |
|
Andrej Karpathy
|
718e5e9d67
|
correctly reference NorMuon and fix misleading terms that i may have hastily ported over from modded-nanogpt
|
2026-02-05 01:39:26 +00:00 |
|
Andrej Karpathy
|
41bb2eac32
|
Combine AdamW and Muon into single MuonAdamW optimizer, cleaner, ty @chrisjmccormick for idea/help
|
2026-01-29 00:52:08 +00:00 |
|