Voice Trigger Detection

Question

Voice Trigger Detection

I have a voice application that would be greatly improved if it were possible to use the “trigger word” to start recording sound. I don’t need the full mechanism of the speech text, just the ability to reliably / efficiently detect the trigger word.

I am wondering if there are any specialized speech mechanisms that support this particular use case, or any libraries / methods for developing such a single object detection mechanism. Ideally, I would like it to work in noisy environments, but it can be trained for a single user voice.

Pointers to scientific articles / topics would also be appreciated, so I know what to ask for.

+3

speech-recognition signal-processing voice voice-recording

sehugg May 23 '09 at 17:03

source share

5 answers

Paul gregoire · Answer 1 · 2010-09-05T19:27:47+0000

My Red5 project colleague created a similar demo using trigger words to trigger a search in the image repository. Saying "cat" caused the cat to appear for about a second. The client application was written in Flash, and the back-end in Red5, using the free Sphinx library. You, of course, could do what you want with Sphinx effortlessly.

Sphinx Project: http://cmusphinx.sourceforge.net/sphinx4/

Nils Pipenbrinck · Answer 2 · 2009-05-23T17:20:52+0000

, , .

- , -, :

- . - . , .

, ( ) , 80% . 80% - /. thresold , .

, .

ChrisW · Answer 3 · 2009-05-23T17:21:09+0000

O/S? , , Windows Vista. .

hlovdal · Answer 4 · 2009-05-23T21:54:29+0000

Linux. , , , , . , joeforker, .

reinaldo crespo · Answer 5 · 2010-05-05T00:11:57+0000

win32. OCX /.

I know this is not exactly the solution you are asking for, but you might think about the pedal. It is easy to program and will be very similar to a spoken word to start / stop recording. Check them out: www.pedalpower.com

Hope this helps,

Reynaldo.

Voice Trigger Detection

More articles: