nudge hyperparameters of the base script with the results of the sweeps and miniseries. vocab size down to 32K. D:N ratio from 20 to 8. add miniseries script
This commit is contained in:
@@ -13,7 +13,9 @@ dependencies = [
|
||||
"python-dotenv>=1.2.1",
|
||||
"regex>=2025.9.1",
|
||||
"rustbpe>=0.1.0",
|
||||
"scipy>=1.15.3",
|
||||
"setuptools>=80.9.0",
|
||||
"tabulate>=0.9.0",
|
||||
"tiktoken>=0.11.0",
|
||||
"tokenizers>=0.22.0",
|
||||
"torch>=2.9.0",
|
||||
|
||||
Reference in New Issue
Block a user