life-echo

Author	SHA1	Message	Date
Kevin	204ae24697	Merge branch 'eval/elapsed-time-memoir-batch-chunk' into development	2026-04-10 10:27:41 +08:00
Kevin	ac49bc7f23	feat(eval): memoir A/B chapter judging and eval-web parity with dialogue - Judge baseline excerpt and library chapter separately; build_memoir_compare_summary for gate, nine-dim and leaf deltas. - Memoir SSE chapter payload: baseline_judge, compare_summary, baseline_judge_error. - MemoirJudgeOutput: loose score coercion and post-validate clamp; memoir judge prompt caps from settings. - app-eval-web: two-column MemoirScoreCard layout, MemoirCompareSummary, chapter blocks and CSS. - Add memoir_compare_summary, log_events, celery_log_context, memoir_pipeline_progress; tests and migration 0014. - Misc: memory/evidence and enrichment paths, task/orchestrator updates, internal-eval docs, env examples.	2026-04-10 10:25:15 +08:00
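yangshilin	e1341c6d18	feat: 1. Build a question-bank outline mapped to each life-stage slot 2. Encourage more everyday conversational language for empathy and summaries 3. Reduce the likelihood of truncation in the judge model 4. Strengthen emotional expression and contextual coherence in the final-draft quality dimensions	2026-04-09 15:32:35 +08:00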
Kevin	b0251e5b26	feat(eval): server-side replay/phase1 timing + memoir phase1 batch chunking - Replay and memoir-submit responses include started/finished UTC and elapsed_ms; Phase1 poll exposes Redis-backed submit time and elapsed_ms_since_submit. - Phase1 batch LLM splits segments by memoir_phase1_batch_llm_chunk_size with bisect fallback per chunk; Playground shows server timings. Made-with: Cursor	2026-04-09 13:39:04 +08:00
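Kevin	064ad2161d	refactor(eval+memoir): streamline internal-eval routes and services; strengthen composite/dialogue summary and judge capabilities - Interview: add interview_state_hints, wired to the orchestrator and prompts - Memoir: adjust story_pipeline_sync/state/memory/post_commit and Celery tasks - Infra: development Celery broker, compose/development scripts, dependency injection - eval-web: remove dataset/experiment/version pages and streaming polling; foreground the Playground - Docs and unit tests updated to match	2026-04-08 21:36:12 +08:00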
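Kevin	309a051038	feat: memoir evidence lineage and traceable internal evaluation; also align the local eval console with CI. Database and models: add several migrations (chapter evidence snapshots, dialogue lineage, memory fact/timeline lineage, etc.) so that final-draft ↔ dialogue/memory provenance is persisted in the schema. Business flow: sessions and WS, the memoir/story pipeline, memory writes, and enrichment now carry the lineage links and snapshots; add chapter evidence snapshots and the eval-side EvalTraceService, among other modules, to make it easier to assemble evidence packs for review. Internal eval: automated runs and manual memoir reviews share the same traceable evidence; rubric/judge scripts and docs adjusted to match. app-eval-web: Memoir/experiment details can expand to show the evidence summary and evidence_trace (including dialogue turn ids); the API port injected by the Vite proxy and development.sh matches the current default internal-eval port, so the page does not connect to the wrong service after a port change. Engineering misc: GitHub Actions and repository README updated; minor or follow-up changes across adapters and payment/quota/plan; added/expanded?	2026-04-08 15:37:09 +08:00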
Kevin	6772e1269c	feat(evaluation): memoir readiness, judge/replay updates, eval web playground Add memoir_readiness_service and router tests; extend judge schemas/services, replay_service, and conversation rubric; align story route agent, payload, prompts, and story_pipeline_sync; update agent logging, config, and DI. Document internal-eval; add replayDraft util and PlaygroundPage changes in app-eval-web.	2026-04-08 09:43:34 +08:00
Kevin	29dec8fe32	feat/ eval	2026-04-06 23:19:20 +08:00
Kevin	ca8bcc8489	feat(evaluation): session catalog, user export import, and eval web UI - Extend evaluation API: schemas, router, repo, admin and execution services - Improve user export markdown importer; add fixtures and importer tests - Session catalog repo/service updates; internal app wiring and docs - Add internal-eval.sh helper; refresh app-eval-web (App, styles, Vite)	2026-04-06 13:49:28 +08:00
Kevin	b75edacb5f	feat/ export data from the development container for evaluation	2026-04-03 14:44:46 +08:00

10 Commits