12 Commits

Author SHA1 Message Date
Mathieu Lacage a641b6ca96 MMLU main split is named auxiliary_train, not train 2026-03-13 13:19:10 +01:00
Sofie Van Landeghem 72b9064f9d remove leftover mid references (#491) 2026-02-02 08:33:46 -08:00
Sofie Van Landeghem d4ea28d4e2 Fix args in readme (#438): fix commands in readme, using new arg format; fix typo; add required -i flag to chat_eval example runs 2026-01-15 16:26:38 -08:00
Andrej d5759400f9 fixing two typos in comments 2025-12-08 20:03:08 -08:00
Eric Silberstein 024781f9df fixing two typos in comments 2025-11-19 15:12:53 -05:00
Eric Silberstein 97770700f2 change test/train split approach because random.seed(1) and random.seed(-1) do the same thing 2025-11-19 14:51:02 -05:00
svlandeg c72b8b2309 add explicit UTF-8 encoding 2025-11-03 21:27:12 +01:00
Andrej Karpathy 05a051dbe9 fix tokenization bug, there should be no space before first letter. sigh 2025-10-24 15:06:06 +00:00
Andrej Karpathy 8892470f29 add the SpellingBee task so that nanochat can count r in strawberry etc. along the way we had to add a bunch of new functionality, e.g. extend the calculator to support the count function of python. possibly the current TaskMixture uses way too many synthetic examples of SpellingBee because the eval gives us exactly 100% performance on spelling. We can tune this later to reclaim some wall clock time here I think 2025-10-24 14:02:48 +00:00
Andrej Karpathy 03cddd9878 actually let's not brick code on git pull. change error to warning 2025-10-21 15:13:25 +00:00
Andrej Karpathy fe5aed940b add personality to nanochat. breaks previous code on git pull and requires download of a new file from s3, but there is a helpful error message so hopefully it's ok 2025-10-21 15:04:58 +00:00
karpathy 3a5e0bc50b initial commit 2025-10-13 06:49:24 -07:00