Mac M1 optimizations, fix train pipeline, add Hey Cosmo wake word model
- Fix install_mac.sh: use venv + Python 3.12 (3.14 incompatible with ML libs) - Fix run_mac.sh: activate venv, add CPU thread optimization env vars - Fix agent.py: remove f-string from SYSTEM_PROMPT template (NameError on import) - Add missing deps: sounddevice, pydub, imageio-ffmpeg, omegaconf - Optimize for M1: torch.inference_mode, set_num_threads, OMP/MKL tuning - Switch to qwen2.5:3b for faster LLM responses on Mac - Switch Whisper to medium model with auto compute (small+int8 had poor Russian) - Add initial_prompt for better Russian transcription - Add open_app tool for native macOS app launching - Fix TTS: sanitize Latin text to Cyrillic for Silero compatibility - Fix wake word echo: add cooldown after TTS, reset model state, raise threshold - Make "Слушаю" TTS synchronous to avoid mic interference - Fix train Dockerfile: remove tensorflow/onnx2tf (only ONNX needed), fix deps - Fix train.sh: use wget for dataset download, add --shm-size=2g - Add trained hey_cosmo.onnx wake word model - Add TODO section to CLAUDE.md (ChatterBox TTS, Ollama Modelfile ideas) Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
This commit is contained in:
@@ -20,11 +20,13 @@ class Transcriber:
|
||||
"language": whisper_cfg["language"],
|
||||
"device": whisper_cfg["device"],
|
||||
"compute_type": whisper_cfg["compute_type"],
|
||||
# Подсказка для Whisper — улучшает распознавание русского
|
||||
"initial_prompt": whisper_cfg.get("initial_prompt", ""),
|
||||
# Silero VAD параметры
|
||||
"silero_sensitivity": 0.4,
|
||||
"webrtc_sensitivity": 3,
|
||||
"post_speech_silence_duration": audio_cfg["silence_duration"],
|
||||
"min_length_of_recording": 0.3,
|
||||
"min_length_of_recording": 0.5,
|
||||
"min_gap_between_recordings": 0.01,
|
||||
# Отключаем wake word в RealtimeSTT — используем свой
|
||||
"wakeword_backend": "none",
|
||||
|
||||
Reference in New Issue
Block a user