feat(api): use Tencent ASR flash with 16k_zh_large and dev transcript logs

Replace CreateRecTask polling with recording-file flash API, add TENCENT_APP_ID,
remove server-side pydub slicing, and log ASR recognition text at INFO in development.

Co-authored-by: Cursor <cursoragent@cursor.com>
Kevin
2026-05-25 11:28:22 +08:00
parent 22d282dc01
commit 07979bfb09
22 changed files with 354 additions and 185 deletions


@@ -18,7 +18,7 @@ type StatusListener = (status: RecorderStatus) => void;
type RecordingCompleteListener = (uri: string, durationMs: number) => void;
/**
- * Tencent SentenceRecognition uses `EngSerViceType=16k_zh_large` and
+ * Tencent ASR flash (`16k_zh_large`); client sends ~15s m4a segments per upload.
* `VoiceFormat=m4a`, so record speech in that shape directly instead of
* relying on Expo's default 44.1 kHz stereo preset.
*/
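
The updated comment says the client uploads roughly 15 s m4a segments. As a rough illustration only, the sketch below plans such upload windows for a recording of known duration. `planSegments`, `Segment`, and `ASR_RECORDING_SHAPE` are hypothetical names that do not appear in this commit, and the mono channel count is an assumption inferred from the comment's contrast with the 44.1 kHz stereo preset.

```typescript
// Hypothetical sketch, not code from this commit.

// One planned upload window, in milliseconds from the start of the recording.
interface Segment {
  index: number;
  startMs: number;
  endMs: number;
}

// Assumed recording shape: 16 kHz to match `16k_zh_large`, m4a per the comment.
// Mono is an assumption; the commit only says the default stereo preset is avoided.
const ASR_RECORDING_SHAPE = {
  sampleRateHz: 16000,
  channels: 1,
  extension: ".m4a",
} as const;

// Assumed segment length, taken from the "~15s" wording in the comment.
const SEGMENT_MS = 15000;

// Split a recording of `durationMs` into consecutive windows of at most `segmentMs`.
function planSegments(durationMs: number, segmentMs: number = SEGMENT_MS): Segment[] {
  if (!Number.isFinite(segmentMs) || segmentMs <= 0) {
    throw new RangeError("segmentMs must be a positive number");
  }
  if (!Number.isFinite(durationMs) || durationMs <= 0) {
    return [];
  }
  const segments: Segment[] = [];
  let index = 0;
  for (let startMs = 0; startMs < durationMs; startMs += segmentMs) {
    // The last window is clipped to the end of the recording.
    segments.push({ index, startMs, endMs: Math.min(startMs + segmentMs, durationMs) });
    index += 1;
  }
  return segments;
}
```

For a 40 s recording, `planSegments(40000)` yields three windows, the last one ending at 40000 ms. Whether the real client cuts on fixed boundaries like this or on silence is not stated in the commit.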