Files
nanochat-omni/nanochat
Andrej Karpathy 3ba42e8135 Fix SDPA KV-cache decode to respect sliding window (#456)
SDPA fallback now respects sliding window during single-token KV-cache
decode by slicing K/V to the last (window + 1) tokens.

Also simplifies the mask building for chunk inference to properly apply
sliding window in that path as well.

Fixes #452

Co-Authored-By: Kartik Vashishta <kartikv776@gmail.com>
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-30 17:32:12 +00:00
..
2025-10-13 06:49:24 -07:00
2026-01-15 03:20:21 +00:00
2025-10-13 06:49:24 -07:00
2025-10-13 06:49:24 -07:00
2025-10-13 06:49:24 -07:00
2025-11-14 11:20:25 +01:00