feat: surgery pipeline API, video inference, voice confirm, and tests

- Add FastAPI routes for surgery start/end, results, pending confirmation (WAV upload), and health checks.
- Implement RTSP/Hikvision capture, consumable classification, session manager, MinIO/Baidu voice resolution, and DB persistence.
- Add documentation (client API, video backends, staging checklist) and sample camera/RTSP config.
- Add pytest suite (API contract, session manager, voice, repositories, pipeline persistence) and httpx dev dependency.
- Replace deprecated HTTP_422_UNPROCESSABLE_ENTITY with HTTP_422_UNPROCESSABLE_CONTENT.
- Fix SurgeryPipeline DB reads to use an explicit transaction with autobegin disabled.

Made-with: Cursor
This commit is contained in:
Kevin
2026-04-21 18:33:54 +08:00
parent d1a3d029ec
commit 04866559db
56 changed files with 7196 additions and 43 deletions

View File

@@ -1,8 +1,94 @@
# Copy to .env and adjust. Used by local `start.sh` and Docker Compose.
# Copy to `.env` and adjust. Loaded by pydantic-settings (field names → UPPER_SNAKE_CASE env vars).
# Used by local `start.sh` and Docker Compose. See also docs/video-backends.md.
# --- PostgreSQL ---
POSTGRES_USER=postgres
POSTGRES_PASSWORD=postgres
POSTGRES_DB=operation_room
POSTGRES_HOST=localhost
POSTGRES_PORT=35432
# Async SQLAlchemy URL (asyncpg). For local dev against docker-compose.dev.yml:
DATABASE_URL=postgresql+asyncpg://postgres:postgres@localhost:5432/operation_room
# Optional: full async SQLAlchemy URL (overrides POSTGRES_* when set and matches defaults logic — see Settings).
# DATABASE_URL=postgresql+asyncpg://postgres:postgres@localhost:35432/operation_room
# --- YOLO inference (internal, not exposed as HTTP) ---
# Weights default to bundled paths under app/resources/ if unset.
# CONSUMABLE_CLASSIFIER_WEIGHTS=/absolute/path/to/consumable_classifier.pt
CONSUMABLE_CLASSIFIER_IMGSZ=224
CONSUMABLE_CLASSIFIER_DEVICE=
CONSUMABLE_CLASSIFIER_TOPK=5
# TEAR_ACTION_WEIGHTS=/absolute/path/to/tear_action.pt
TEAR_ACTION_IMGSZ=224
TEAR_ACTION_DEVICE=
TEAR_ACTION_TOPK=5
# Device: empty → auto (macOS MPS if available; Linux CUDA if available). Docker image uses CPU torch unless you change it.
# --- Surgery recording API retries ---
# SURGERY_RECORDING_MAX_ATTEMPTS=3
# SURGERY_RECORDING_RETRY_DELAY_SECONDS=1.0
# --- Video: RTSP + optional Hikvision HCNetSDK (Linux x86_64 + glibc recommended) ---
# Client `camera_ids` must match keys in your RTSP map (sample IDs: or-cam-01, or-cam-02).
#
# VIDEO_DEFAULT_BACKEND=rtsp
# Values: rtsp | hikvision_sdk | auto (auto: SDK .so loaded and HIKVISION_SDK_ENABLED=true → prefer SDK)
#
# Per-camera backend override (JSON object):
# VIDEO_CAMERA_BACKEND_OVERRIDES_JSON={"or-cam-01":"rtsp","or-cam-02":"hikvision_sdk"}
#
# RTSP URL resolution (first match wins per camera_id):
# 1) VIDEO_RTSP_URLS_JSON_FILE — JSON file: {"or-cam-01":"rtsp://...","or-cam-02":"rtsp://..."}
# 2) merged with VIDEO_RTSP_URLS_JSON (inline JSON string; overrides same keys from file)
# 3) else VIDEO_RTSP_URL_TEMPLATE with {camera_id}
# Example file (committed): app/resources/camera_rtsp_urls.sample.json
# VIDEO_RTSP_URLS_JSON_FILE=app/resources/camera_rtsp_urls.sample.json
# In Docker (WORKDIR /app): VIDEO_RTSP_URLS_JSON_FILE=/app/app/resources/camera_rtsp_urls.sample.json
# WARNING: if VIDEO_RTSP_URLS_JSON_FILE is set but the path is missing, the app will fail to start.
#
# VIDEO_RTSP_URL_TEMPLATE=rtsp://user:pass@192.168.1.64:554/Streaming/Channels/101
# VIDEO_RTSP_URLS_JSON={"or-cam-01":"rtsp://user:pass@192.168.1.101:554/Streaming/Channels/101","or-cam-02":"rtsp://user:pass@192.168.1.102:554/Streaming/Channels/101"}
#
# VIDEO_OPEN_TIMEOUT_SEC=15
# 连续读帧失败次数达到阈值后释放 RTSP 并重连。
# VIDEO_READ_FAILURE_RECONNECT_THRESHOLD=15
# VIDEO_RECONNECT_BACKOFF_SECONDS=1.0
# VIDEO_INFERENCE_INTERVAL_SEC=2
# VIDEO_INFERENCE_CONFIDENCE_THRESHOLD=0.35
# 置信度 >= 此值且命中候选清单时自动记账vision
# VIDEO_AUTO_CONFIRM_CONFIDENCE=0.55
# 置信度处于 [VIDEO_VOICE_CONFIRM_MIN_CONFIDENCE, VIDEO_AUTO_CONFIRM_CONFIDENCE) 时入队待确认(客户端拉取 pending-confirmation
# VIDEO_VOICE_CONFIRM_MIN_CONFIDENCE=0.35
# 是否启用低置信度人工确认(客户端播报 + resolve 回传;服务端无麦克风/扬声器要求)。
# VOICE_CONFIRMATION_ENABLED=true
# VIDEO_VOICE_CONFIRM_DOCTOR_ID=voice
# (已弃用)服务端本机录音 / ffmpeg 音频输入;当前闭环不依赖。
# VOICE_RECORD_SECONDS=5
# VOICE_FFMPEG_INPUT=
# 停录后写库失败时,后台重试落库间隔(秒)。
# ARCHIVE_PERSIST_RETRY_INTERVAL_SECONDS=30
# VIDEO_DETAIL_COOLDOWN_SEC=15
# VIDEO_JPEG_QUALITY=85
# VIDEO_RESULT_DOCTOR_ID=vision
# --- Hikvision: mount vendor Linux x86_64 .so at runtime (do not commit proprietary binaries) ---
# HIKVISION_LIB_DIR=/opt/hikvision/lib
# Optional: single library path (overrides directory search in code)
# HIKVISION_LIB_PATH=
# HIKVISION_SDK_ENABLED=false
# HIKVISION_DEVICE_IP=
# HIKVISION_DEVICE_PORT=8000
# HIKVISION_USER=
# HIKVISION_PASSWORD=
# HIKVISION_CHANNEL=1
# After SDK login, OpenCV still pulls frames via RTSP; template placeholders: {ip} {user} {password} {channel} {camera_id}
# HIKVISION_PREVIEW_RTSP_TEMPLATE=rtsp://{user}:{password}@{ip}:554/Streaming/Channels/101
# Per-camera RTSP when using SDK path (same shape as VIDEO_RTSP_URLS_JSON):
# HIKVISION_CAMERA_RTSP_URLS_JSON={"or-cam-01":"rtsp://...","or-cam-02":"rtsp://..."}
# HIKVISION_SDK_FALLBACK_TO_RTSP=true
# --- Baidu Speech可选遗留当前手术闭环由客户端完成 TTS/ASR服务端可不配置---
# BAIDU_SPEECH_APP_ID=
# BAIDU_SPEECH_API_KEY=
# BAIDU_SPEECH_SECRET_KEY=
# BAIDU_SPEECH_CONNECTION_TIMEOUT_MS=
# BAIDU_SPEECH_SOCKET_TIMEOUT_MS=