x/nuonuo

Fam Zheng d923aa1e31 NuoNuo: Hippocampal memory module prototype

Hopfield + Hebbian hybrid memory system for LLMs.
Two nights of experiments (16 iterations), validated on LongMemEval (ICLR 2025).

Architecture:
- Single-hop: Two-Stage Hopfield (NN top-20 → softmax settle)
- Multi-hop: Hebbian W matrix with WTA pattern separation
- 64% on LongMemEval (500 questions), retrieval-only, no LLM dependency
- 4ms latency @ 20K memories, ~1GB VRAM

Key findings:
- Hopfield attention solved noise tolerance (20% → 100% vs flat Hebbian)
- WTA pattern separation enables 20K+ capacity
- Multi-hop associative chains (6 hops, CosSim=1.0) — RAG can't do this
- MiniLM-L6 is optimal (discrimination gap > absolute similarity)
- Paraphrase cue augmentation: 55% → 100% on synthetic, 36% → 64% on benchmark
- SNN encoder viable (CosSim 0.99) but not needed for current architecture

2026-04-07 10:37:24 +01:00

1.9 KiB

Raw Permalink Blame History

P2: Auto Paraphrase Generation

核心数据

策略	bg=0	bg=500	bg=2000	实现难度
None	95%	65%	55%	-
Heuristic (synonym swap)	95%	85%	75%	零成本
Oracle (hard cases only)	100%	95%	95%	需 LLM
Oracle (全覆盖)	100%	100%	100%	需 LLM

发现

Heuristic 已经很有价值：55% → 75%（+20pp），不需要 LLM
Oracle 全覆盖 = 100%：证明问题完全可通过 paraphrase 解决
大部分 failure 可被 paraphrase 修复：9 个 failure 中 8 个有 oracle fix

Failure 分类

类型	例子	原因	修复方式
词汇鸿沟	"Ship the release" ↔ "deploy" (cos=0.46)	完全不同的词	LLM paraphrase ✓
概念映射	"Need observability" ↔ "monitoring" (cos=0.26)	抽象→具体	LLM paraphrase ✓
领域知识	"Fix login issue" ↔ "auth bug" (cos=0.65)	需要知道 login=auth	LLM paraphrase ✓
竞争	"DB terrible" ↔ "DB slow" (cos=0.72) 但被 bg 抢走	cos 够高但 bg 更近	增加 augmentation 密度

实际部署策略

存储时（异步，不影响延迟）

1. 用户说了一句话
2. 提取 (cue, target)
3. 同步存原始 cue
4. 异步：LLM 生成 3-5 个 paraphrase → 追加存入

Heuristic fallback（LLM 不可用时）

当前 heuristic 规则已验证有效（+20pp），可以作为 baseline：

去除常见前缀 ("Can you", "I need to", "How do I")
同义词替换 (deploy↔release, database↔DB, fix↔resolve)
添加 "issue with X" 模式

LLM Prompt（待 Gateway 恢复后验证）

Generate 3-5 different ways a user might say this:
"The database is slow again"

Requirements:
- Same core meaning, different wording
- Include informal/colloquial versions
- Include technical jargon alternatives

1.9 KiB Raw Permalink Blame History Unescape Escape

P2: Auto Paraphrase Generation

核心数据

发现

Failure 分类

实际部署策略

存储时（异步，不影响延迟）

Heuristic fallback（LLM 不可用时）

LLM Prompt（待 Gateway 恢复后验证）

1.9 KiB

Raw Permalink Blame History