
The latency on this (or lack thereof) is the best I've seen; I would love to know more about how it's achieved. I asked the bot and it claimed you're using Google's speech recognition, which I know supports streaming, but this result seems to have much lower lag than I remember Google's stuff being capable of.



> I asked the bot and it claimed you're using Google's speech recognition

That doesn't sound plausible. How can the LLM part know which speech recognition service is being used?


It's not entirely unlikely that the LLM is told exactly what its source data is, in the hope that it can correct transcription errors.


Or just because it's interesting and people might ask. I could imagine it being a hallucination, but it could also be an easter egg of sorts.


Apparently it uses the Web Speech API [1], not a specific service.

[1] https://developer.mozilla.org/en-US/docs/Web/API/SpeechRecog...
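For anyone curious what streaming recognition through the Web Speech API looks like, here is a minimal TypeScript sketch. It is not the project's actual code. The `streamTranscript` helper and the small `Recognizer` interfaces are hypothetical names introduced for illustration; only `continuous`, `interimResults`, `onresult`, `start()`, `resultIndex`, `isFinal` and `transcript` come from the API itself. Setting `interimResults` makes partial text arrive before a phrase is finalized, which is one plausible reason the lag feels low.

```typescript
// Minimal structural types covering only the parts of the Web Speech API
// used below (hypothetical names, declared here so the sketch is self-contained).
interface RecognitionAlternative { transcript: string; }
interface RecognitionResult { isFinal: boolean; 0: RecognitionAlternative; }
interface RecognitionEvent {
  resultIndex: number;
  results: ArrayLike<RecognitionResult>;
}
interface Recognizer {
  continuous: boolean;
  interimResults: boolean;
  onresult: ((event: RecognitionEvent) => void) | null;
  start(): void;
}

// Hypothetical helper: configures a recognizer for streaming and reports
// every new or updated result, flagged as interim or final.
function streamTranscript(
  recognizer: Recognizer,
  onText: (text: string, isFinal: boolean) => void,
): void {
  recognizer.continuous = true;      // keep listening across pauses
  recognizer.interimResults = true;  // deliver partial hypotheses early
  recognizer.onresult = (event) => {
    // resultIndex is the first result that changed in this event.
    for (let i = event.resultIndex; i < event.results.length; i++) {
      const result = event.results[i];
      onText(result[0].transcript, result.isFinal);
    }
  };
  recognizer.start();
}

// In a browser, the constructor is exposed as SpeechRecognition or the
// prefixed webkitSpeechRecognition; returns null where neither exists.
function createBrowserRecognizer(): Recognizer | null {
  const g = globalThis as any;
  const Ctor = g.SpeechRecognition ?? g.webkitSpeechRecognition;
  return Ctor ? (new Ctor() as Recognizer) : null;
}
```

In a supporting browser you would call `streamTranscript(recognizer, ...)` with the object returned by `createBrowserRecognizer()`. Which backend actually does the recognition is up to the browser, so the page itself would not necessarily know which service is behind it.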



