Andrej Karpathy | 1ec0a34779 | at 28 and above we start to need batch size 8 | 2026-02-08 18:26:34 +00:00
Andrej Karpathy | ff46300720 | tune miniseries just a bit, fairly cosmetic, keep to even depths where the math works out nicely in model sizing | 2026-02-08 17:54:12 +00:00
Andrej Karpathy | 63bb5831e2 | something i've wanted to do for a while - move all .sh runs to their own directory so they don't pollute root dir | 2026-01-18 15:27:41 +00:00