feat(eval): internal-eval stack, judge fixes, and eval web overhaul

- Merge internal-eval into development.sh (single Celery/infra); internal-eval.sh
  wraps with LIFE_ECHO_WITH_INTERNAL_EVAL; EVAL_ATTACH_ONLY for attaching 8001
  when :8000 is already up; document in api/docs/internal-eval.md.
- Evaluation: transcript_for_judge, judge error surfacing, rubric/schema tweaks,
  execution_service and router updates; tests for judge and composite eval.
- Memory: ingest nested transaction for embedding/enrichment rollback safety.
- Conversation WS: logger.exception for pipeline errors (avoid loguru KeyError).
- app-eval-web: Playground saved replays, dialogue turns helper, hash user_id
  for Memoir; Memoir chapter baseline↔DB row compare with title heuristics;
  Stories page (#memoir-stories); Markdown + copy buttons; toolbar/panel UI;
  react-markdown; development proxy and fixture updates.
This commit is contained in:
Kevin
2026-04-07 17:15:01 +08:00
parent a50b72e7b5
commit 99543d04c6
47 changed files with 4968 additions and 1279 deletions

View File

@@ -14,7 +14,6 @@
#### 轮次 1 — 2026-02-12 10:52:57.64403+00
**用户:**
我的童年
@@ -1242,7 +1241,6 @@
#### 轮次 85 — 2026-03-01 09:46:41.862847+00
**用户:**
1989
@@ -1255,7 +1253,6 @@
#### 轮次 86 — 2026-03-01 09:46:54.57827+00
**用户:**
weifang
@@ -1338,32 +1335,6 @@ weifang
你在瑞士待了八年多,最喜欢哪个城市的生活氛围呀?是苏黎世的现代感,还是洛桑的湖畔宁静,或者南部那种意大利式的悠闲?
#### 轮次 92 — 2026-03-09 03:02:16.980767+00
**用户:**
你好
**AI:**
你好呀!很高兴见到你。
想先了解一下,你是哪一年出生的呢?
#### 轮次 93 — 2026-03-09 03:04:44.569861+00
**用户:**
1988年
**AI:**
88年呀那比我还大几岁呢正是最有魅力的年纪
你是在哪里出生的呀?是家乡的山水把你养大的吗?
---
## 人生故事stories
@@ -1968,3 +1939,4 @@ weifang
这便是我所珍视的信念:以清晰的底线守护灵魂的坐标,又以丰沛的包容去体验世界的纷繁。在原则与宽容之间,找到那种充满张力的平衡,生活方能既不失方向,又不乏温情。
---