nanochat-omni

Files

T

Alan d9678ff0f9 Save FP8 tensors in autograd ctx instead of full-precision inputs

Store quantized input/weight and their inverse scales in _Float8Matmul ctx to avoid re-quantization in backward and reduce saved-activation memory without changing numerics.

2026-02-15 14:31:54 +00:00

__init__.py

initial commit

2025-10-13 06:49:24 -07:00

checkpoint_manager.py

remove leftover mid references (#491 )

2026-02-02 08:33:46 -08:00

common.py

use _PEAK_FLOPS_TABLE instead of if-else structure (#479 )

2026-01-31 19:45:06 -08:00

core_eval.py

initial commit

2025-10-13 06:49:24 -07:00

dataloader.py

slightly more efficient dataloader that reduces the number of python objects flying around and causing strain on runtime and garbage collector

2026-02-02 01:17:30 +00:00

dataset.py

initial commit

2025-10-13 06:49:24 -07:00

engine.py

fix: pass device_type to compute_init in engine.__main__ (#451 )

2026-01-19 17:19:51 -08:00

execution.py

nit delete redundant catch/raise in execute

2025-10-29 08:10:03 -07:00

flash_attention.py

Add Blackwell (SM100) GPU support via SDPA fallback (#475 )

2026-01-31 19:42:58 -08:00

fp8.py

Save FP8 tensors in autograd ctx instead of full-precision inputs

2026-02-15 14:31:54 +00:00

gpt.py

Fix generate() crash when top_k=0 (#467 )

2026-01-30 09:21:02 -08:00

logo.svg

initial commit

2025-10-13 06:49:24 -07:00

loss_eval.py

fix typos

2025-11-14 11:20:25 +01:00

optim.py

bring back an assert guarding against bad param sizing

2026-02-05 18:14:30 +00:00

report.py

remove leftover mid references (#491 )

2026-02-02 08:33:46 -08:00

tokenizer.py

adjust the comment on the regex pattern per recent experimnet see dev/LOG.md

2026-01-13 17:50:39 +00:00

ui.html

Fix conversation scroll to bottom on some browsers + remove duplicated padding (#348 )

2025-12-31 13:03:22 -08:00