Commit Graph

3 Commits

Author SHA1 Message Date
Kevin
a50b72e7b5 feat(app-eval-web): 评测台 UI/UX 重构(侧栏导航、分页、数据集与实验能力)
- 采用 hash 路由与会话式壳层(Playground / Datasets / Experiments / Versions / Memoir)
- 抽取 api、types、hooks(轮询、通知、实验 SSE)与 NoticeContext
- Playground:基线/实际生成双栏、重放、流式自动评分与 ScoreCard
- Datasets:回归集与用例列表、Markdown/JSON 导入、会话快照
- Experiments:创建实验、提交运行、SSE 进度、DiffTable 与门禁展示
- 样式与无障碍:DM Sans + JetBrains Mono、侧栏响应式、? 快捷键帮助
2026-04-07 11:06:41 +08:00
Kevin
ca8bcc8489 feat(evaluation): session catalog, user export import, and eval web UI
- Extend evaluation API: schemas, router, repo, admin and execution services
- Improve user export markdown importer; add fixtures and importer tests
- Session catalog repo/service updates; internal app wiring and docs
- Add internal-eval.sh helper; refresh app-eval-web (App, styles, Vite)
2026-04-06 13:49:28 +08:00
Kevin
b75edacb5f feat/ 导出开发容器内的数据用于评估 2026-04-03 14:44:46 +08:00