Kevin
ac49bc7f23
feat(eval): memoir A/B chapter judging and eval-web parity with dialogue
- Judge baseline excerpt and library chapter separately; build_memoir_compare_summary for gate, nine-dim and leaf deltas.
- Memoir SSE chapter payload: baseline_judge, compare_summary, baseline_judge_error.
- MemoirJudgeOutput: loose score coercion and post-validate clamp; memoir judge prompt caps from settings.
- app-eval-web: two-column MemoirScoreCard layout, MemoirCompareSummary, chapter blocks and CSS.
- Add memoir_compare_summary, log_events, celery_log_context, memoir_pipeline_progress; tests and migration 0014.
- Misc: memory/evidence and enrichment paths, task/orchestrator updates, internal-eval docs, env examples.
2026-04-10 10:25:15 +08:00
..
2026-03-18 17:18:23 +08:00
2026-04-10 10:25:15 +08:00
2026-03-26 12:13:36 +08:00
2026-04-08 21:36:12 +08:00
2026-04-10 10:25:15 +08:00
2026-03-30 11:53:04 +08:00
2026-04-10 10:25:15 +08:00
2026-03-26 15:51:24 +08:00
2026-03-20 15:15:35 +08:00
2026-04-08 21:36:12 +08:00
2026-03-30 13:54:35 +08:00
2026-03-26 12:13:36 +08:00
2026-04-08 15:37:09 +08:00
2026-04-03 13:44:11 +08:00
2026-04-10 10:25:15 +08:00
2026-04-10 10:25:15 +08:00
2026-04-10 10:25:15 +08:00
2026-04-10 10:25:15 +08:00
2026-04-03 10:12:59 +08:00
2026-03-30 10:46:35 +08:00
2026-04-08 15:37:09 +08:00
2026-03-18 17:18:23 +08:00
2026-03-19 14:36:40 +08:00
2026-03-30 10:46:35 +08:00
2026-04-08 15:37:09 +08:00
2026-03-18 17:18:23 +08:00
2026-03-26 12:13:36 +08:00
2026-04-08 15:37:09 +08:00
2026-04-08 15:37:09 +08:00