Andrej Karpathy | aa95fb2e03 | make miniseries more generic and easier to run and less hard coded | 2026-01-12 02:54:35 +00:00 |
Andrej Karpathy | 3af4dcf6ee | also add scaling_laws.sh script if it's a useful reference | 2026-01-07 22:25:13 +00:00 |
Andrej Karpathy | ccf4b7f9bf | nudge hyperparameters of the base script with the results of the sweeps and miniseries. vocab size down to 32K. D:N ratio from 20 to 8. add miniseries script | 2026-01-07 22:11:59 +00:00 |