fix: correct minor typos in help text, README, and comments

This commit is contained in:
2bitbit
2026-03-12 17:03:26 +08:00
parent f068604948
commit 2bb93b2ae4
3 changed files with 3 additions and 3 deletions
+1 -1
View File
@@ -177,7 +177,7 @@ val_dataset = TaskMixture([
SmolTalk(split="test"), # 24K rows in test set
MMLU(subset="all", split="test", stop=5200), # 14K rows in test set, use only 5.2K to match the train ratios
GSM8K(subset="main", split="test", stop=420), # 1.32K rows in test set, use only 420 to match the train ratios
]) # total: 24K + 14K + 1.32K ~= 39K rows
]) # total: 24K + 5.2K + 0.42K ~= 29.6K rows
# DataLoader is defined here, it emits inputs, targets : 2D tensors of shape (device_batch_size, max_seq_len)
# A big problem is that we don't know the final num_iterations in advance. So we create
# these two global variables and update them from within the data generator.