dfc88334b6
Fix tok/sec metrics for base_train and mid_train when gradient accumulation is not 1
16 KiB
16 KiB
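Note on the commit title above: the diff itself is not shown here, so the following is only a minimal sketch of the kind of calculation the title refers to, not the actual change. It assumes the timed interval covers one full optimizer step, meaning all micro-batches of gradient accumulation are included. Under that assumption the token count in the numerator has to be multiplied by the number of accumulation steps, or throughput is under-reported by that factor. All names (`tokens_per_second`, `device_batch_size`, `seq_len`, `world_size`, `grad_accum_steps`, `step_seconds`) are hypothetical and are not taken from base_train or mid_train.

```python
def tokens_per_second(device_batch_size, seq_len, world_size,
                      grad_accum_steps, step_seconds):
    """Throughput for one optimizer step (hypothetical helper).

    Assumes step_seconds measures the whole optimizer step, i.e. all
    grad_accum_steps forward/backward passes on every rank.
    """
    # Tokens processed by a single forward/backward pass across all ranks.
    tokens_per_micro_step = device_batch_size * seq_len * world_size
    # One optimizer step runs grad_accum_steps such passes.
    tokens_per_optimizer_step = tokens_per_micro_step * grad_accum_steps
    return tokens_per_optimizer_step / step_seconds


# With 4 accumulation steps, leaving out the factor would report 1/4 of this.
print(tokens_per_second(32, 2048, 8, 4, 2.0))
```

With `grad_accum_steps=1` the two token counts coincide, which would explain why a metric that leaves out the factor looks correct in that case only.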