diff --git a/paper.md b/paper.md
index e5067ab..478a1d5 100644
--- a/paper.md
+++ b/paper.md
@@ -134,6 +134,7 @@ $$
 - overlap > 0.45 → **继续当前话题**
 - overlap < 0.20 且 new_ratio > 0.70 → **切换新话题**
 - 存在指代词（"这个"、"它"、"上面"等）→ **强制继续**
+- **中间地带**（0.20 ≤ overlap ≤ 0.45 且无指代词）→ **默认继续**（保守策略，避免误切换）
 
 ### 3.4 稀疏召回
 
@@ -146,7 +147,7 @@ $$
 其中：
 - $\text{lex}(x, q)$：基于 IDF-overlap 的词项重叠得分
 - $\text{exact}(b, q)$：英文术语、代码标识符、版本号完整命中加分
-- $\text{recency}(b)$：时间衰减因子（弱先验，仅微调）
+- $\text{recency}(b)$：新鲜度奖励因子（越新越大，仅作弱先验微调）
 
 用户侧消息权重（1.5）高于助手侧（0.7），因为用户消息的语义更代表对话意图。
 
@@ -172,12 +173,6 @@ $$
 \text{gain}(b \mid S) = \frac{\sum_{t \in \text{cov}(b) \setminus \text{covered}(S)} \text{idf}(t)}{\text{cost}(b)^\alpha}, \quad \alpha = 0.8
 $$
 
-### 3.6 句级裁剪（可选）
-
-对选中的块内部进一步按句子级别裁剪，保留覆盖了 query 锚点的句子，去除冗余内容。
-
----
-
 ## 4. 实现
 
 ### 4.1 项目结构
@@ -248,7 +243,7 @@ prompt = gate.build_prompt("锁的 TTL 怎么设")
 ### 5.3 性能分析
 
 在 2 核 2G CPU 环境下：
-- 单次 `select()` 调用延迟：< 10ms
+- 单次 `select()` 调用延迟：理论估算 < 10ms（未做专门性能基准测试）
 - 内存占用：< 50MB
 - 无外部模型依赖，纯 Python 标准库+re模块
 
@@ -310,7 +305,7 @@ prompt = gate.build_prompt("锁的 TTL 怎么设")
 
 [8] Radford, A., et al. (2019). Language Models are Unsupervised Multitask Learners. *OpenAI Technical Report*.
 
-[9] Liu, P. J., et al. (2019). MS MARCO: A Human Generated MAchine Reading COmprehension Dataset. *NeurIPS*.
+[9] Nguyen, P., et al. (2016). MS MARCO: A Human Generated MAchine Reading COmprehension Dataset. *NIPS*.
 
 [10] Karpukhin, V., et al. (2020). Dense Passage Retrieval for Open-Domain Question Answering. *EMNLP*.