Files
nanochat-omni/nanochat
Alan d9678ff0f9 Save FP8 tensors in autograd ctx instead of full-precision inputs
Store quantized input/weight and their inverse scales in _Float8Matmul ctx to avoid re-quantization in backward and reduce saved-activation memory without changing numerics.
2026-02-15 14:31:54 +00:00
..
2025-10-13 06:49:24 -07:00
2025-10-13 06:49:24 -07:00
2025-10-13 06:49:24 -07:00
2025-10-13 06:49:24 -07:00
2025-11-14 11:20:25 +01:00