Find Related products on Amazon

Shop on Amazon

Show HN: Open-source, native audio turn detection model

Published on: 2025-06-28 22:20:48

Smart turn detection This is an open source, community-driven, native audio turn detection model. HuggingFace page: pipecat-ai/smart-turn Turn detection is one of the most important functions of a conversational voice AI technology stack. Turn detection means deciding when a voice agent should respond to human speech. Most voice agents today use voice activity detection (VAD) as the basis for turn detection. VAD segments audio into "speech" and "non-speech" segments. VAD can't take into account the actual linguistic or acoustic content of the speech. Humans do turn detection based on grammar, tone and pace of speech, and various other complex audio and semantic cues. We want to build a model that matches human expectations more closely than the VAD-based approach can. This is a truly open model (BSD 2-clause license). Anyone can use, fork, and contribute to this project. This model started its life as a work in progress component of the Pipecat ecosystem. Pipecat is an open source ... Read full article.