smart-home-tablet

daniil/smart-home-tablet

Fork 0

Commit Graph

Author	SHA1	Message	Date
Cosmo	c29da75c19	feat(voice/tts): route ElevenLabs through HTTP proxy for non-RU egress All checks were successful Deploy / deploy (push) Successful in 4m3s Details ElevenLabs Cloudflare returns 302 to a region-restricted help page when requested from a Russian IP. Tablet host (.60) is in RU, so the Stage 2 call was failing with 502 upstream. Fix: use https-proxy-agent when ELEVENLABS_PROXY (or generic HTTPS_PROXY / HTTP_PROXY) env var is set. Tinyproxy on .103 (non-RU egress host) acts as the tunnel. - package.json: add https-proxy-agent ^7.0.6 - app/api/voice/tts: switch from global fetch to node:https with explicit Agent (either direct or HttpsProxyAgent). Still streams MP3 back via Readable.toWeb so Next.js Response pipes it to the browser as audio arrives. Operational: set ELEVENLABS_PROXY=http://192.168.31.103:8888 in tablet.env after bringing tinyproxy up on .103.	2026-04-23 13:00:55 +00:00
Cosmo	a780fc7bd5	feat(voice): play TTS through tablet speakers via ElevenLabs proxy All checks were successful Deploy / deploy (push) Successful in 2m58s Details Stage 2 of voice integration — centralizes TTS on the tablet so the Python satellite no longer needs ElevenLabs credentials or mpv. - app/api/voice/tts — POST {text, agent}, proxies to ElevenLabs streaming endpoint with flash_v2_5 default, returns audio/mpeg. Per-agent voice id via COSMO_TTS_VOICE / LUSYA_TTS_VOICE env. - VoiceOverlay — on response/error events fetches TTS and plays via HTMLAudioElement; on wake event stops playback (barge-in). Dismiss timer extended by text length so long responses do not cut off. - Autoplay caveat: browser may block first playback until user taps anywhere on the page (FKB: enable Force Autoplay to bypass).	2026-04-23 12:52:26 +00:00
Cosmo	51c3d6016a	feat(voice): SSE bridge + Siri-blob overlay for wake-word script All checks were successful Deploy / deploy (push) Successful in 3m12s Details Adds the tablet side of voice assistant integration. External Python script (openWakeWord + Groq STT + OpenClaw) will POST state transitions to /api/voice/event with a bearer token, and the tablet shows a fullscreen overlay with Siri-style animated blob + current agent + recognized text / response text. - lib/voice-bus.ts — in-process EventEmitter singleton, preserved across hot reloads via globalThis - app/api/voice/event — POST, bearer-auth via VOICE_API_KEY env, validates event kind, broadcasts on voiceBus - app/api/voice/stream — GET, SSE endpoint, per-connection listener with 15s keep-alive ping and abort-signal cleanup - components/VoiceOverlay — full-screen overlay, 3-layer pulsing Siri blob, per-agent palette (cosmo indigo/violet, lusya pink/rose), auto-dismiss timeouts (wake=20s safety, response=6s, error=4s), auto-reconnect on SSE drop - middleware bypasses /api/voice/event so the script does not need a user auth cookie - VoiceOverlay mounted in HomePageInner outside tab routing so it appears on every view	2026-04-23 12:36:26 +00:00

Author

SHA1

Message

Date

Cosmo

c29da75c19

feat(voice/tts): route ElevenLabs through HTTP proxy for non-RU egress

Deploy / deploy (push) Successful in 4m3s

Details

ElevenLabs Cloudflare returns 302 to a region-restricted help page
when requested from a Russian IP. Tablet host (.60) is in RU, so the
Stage 2 call was failing with 502 upstream.

Fix: use https-proxy-agent when ELEVENLABS_PROXY (or generic HTTPS_PROXY
/ HTTP_PROXY) env var is set. Tinyproxy on .103 (non-RU egress host)
acts as the tunnel.

- package.json: add https-proxy-agent ^7.0.6
- app/api/voice/tts: switch from global fetch to node:https with
  explicit Agent (either direct or HttpsProxyAgent). Still streams
  MP3 back via Readable.toWeb so Next.js Response pipes it to the
  browser as audio arrives.

Operational: set ELEVENLABS_PROXY=http://192.168.31.103:8888 in
tablet.env after bringing tinyproxy up on .103.

2026-04-23 13:00:55 +00:00

Cosmo

a780fc7bd5

feat(voice): play TTS through tablet speakers via ElevenLabs proxy

Deploy / deploy (push) Successful in 2m58s

Details

Stage 2 of voice integration — centralizes TTS on the tablet so the
Python satellite no longer needs ElevenLabs credentials or mpv.

- app/api/voice/tts — POST {text, agent}, proxies to ElevenLabs
  streaming endpoint with flash_v2_5 default, returns audio/mpeg.
  Per-agent voice id via COSMO_TTS_VOICE / LUSYA_TTS_VOICE env.
- VoiceOverlay — on response/error events fetches TTS and plays via
  HTMLAudioElement; on wake event stops playback (barge-in). Dismiss
  timer extended by text length so long responses do not cut off.
- Autoplay caveat: browser may block first playback until user taps
  anywhere on the page (FKB: enable Force Autoplay to bypass).

2026-04-23 12:52:26 +00:00

Cosmo

51c3d6016a

feat(voice): SSE bridge + Siri-blob overlay for wake-word script

Deploy / deploy (push) Successful in 3m12s

Details

Adds the tablet side of voice assistant integration. External Python
script (openWakeWord + Groq STT + OpenClaw) will POST state transitions
to /api/voice/event with a bearer token, and the tablet shows a
fullscreen overlay with Siri-style animated blob + current agent +
recognized text / response text.

- lib/voice-bus.ts — in-process EventEmitter singleton, preserved
  across hot reloads via globalThis
- app/api/voice/event — POST, bearer-auth via VOICE_API_KEY env,
  validates event kind, broadcasts on voiceBus
- app/api/voice/stream — GET, SSE endpoint, per-connection listener
  with 15s keep-alive ping and abort-signal cleanup
- components/VoiceOverlay — full-screen overlay, 3-layer pulsing
  Siri blob, per-agent palette (cosmo indigo/violet, lusya pink/rose),
  auto-dismiss timeouts (wake=20s safety, response=6s, error=4s),
  auto-reconnect on SSE drop
- middleware bypasses /api/voice/event so the script does not need
  a user auth cookie
- VoiceOverlay mounted in HomePageInner outside tab routing so it
  appears on every view

2026-04-23 12:36:26 +00:00

3 Commits