How to use speech recognition from / to a video file?

Question

How can I program a speech recognition engine (using the Microsoft Speech SDK) to "listen" to a video file and save the recognized text to a file?

+3

c++ video speech-recognition

Yusuke Nov 09 '09 at 12:17

1 answer

Eric Brown · Accepted Answer · 2009-11-10T23:18:22+0000

This is very similar to this question and has a very similar answer. You need to extract the audio track, convert it to WAV format, and feed it to the in-process (inproc) recognizer.

However, this approach has the same problems I described earlier: the recognizer requires training, assumes a single voice, and assumes that the microphone is close to the speaker. If those conditions hold, you are likely to get pretty good results. If they do not (i.e., you are trying to transcribe a television show or, even worse, audio recorded by a camcorder), the results are likely to be unsatisfactory.
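
For the WAV conversion step, one cheap sanity check before handing the file to the recognizer is to confirm that the extracted audio really is an uncompressed PCM WAV. The following is only a minimal sketch in portable C++, not part of the Speech SDK: the names `WavFormat`, `parse_wav_format` and `is_uncompressed_pcm` are hypothetical, and it assumes the file has already been read into memory and uses the common RIFF layout with a standard `fmt ` chunk.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Fields of interest from a WAV "fmt " chunk (hypothetical helper type).
struct WavFormat {
    std::uint16_t audio_format;    // 1 means uncompressed PCM
    std::uint16_t channels;
    std::uint32_t sample_rate;     // samples per second
    std::uint16_t bits_per_sample;
};

// WAV files store integers in little-endian byte order.
static std::uint16_t read_u16(const unsigned char* p) {
    return static_cast<std::uint16_t>(p[0] | (p[1] << 8));
}

static std::uint32_t read_u32(const unsigned char* p) {
    return static_cast<std::uint32_t>(p[0]) |
           (static_cast<std::uint32_t>(p[1]) << 8) |
           (static_cast<std::uint32_t>(p[2]) << 16) |
           (static_cast<std::uint32_t>(p[3]) << 24);
}

// Walks the RIFF chunks until it finds "fmt " and copies its fields to `out`.
// Returns false if the data is not a RIFF/WAVE file or has no usable "fmt " chunk.
bool parse_wav_format(const std::vector<unsigned char>& data, WavFormat& out) {
    if (data.size() < 12) return false;
    if (std::memcmp(data.data(), "RIFF", 4) != 0) return false;
    if (std::memcmp(data.data() + 8, "WAVE", 4) != 0) return false;

    std::size_t pos = 12;  // first chunk starts right after the RIFF header
    while (pos + 8 <= data.size()) {
        const unsigned char* chunk = data.data() + pos;
        const std::uint32_t size = read_u32(chunk + 4);
        if (std::memcmp(chunk, "fmt ", 4) == 0) {
            // The basic PCM format block is 16 bytes long.
            if (size < 16 || pos + 8 + 16 > data.size()) return false;
            const unsigned char* f = chunk + 8;
            out.audio_format = read_u16(f);
            out.channels = read_u16(f + 2);
            out.sample_rate = read_u32(f + 4);
            out.bits_per_sample = read_u16(f + 14);
            return true;
        }
        // Skip this chunk; chunk bodies are padded to an even length.
        pos += 8 + static_cast<std::size_t>(size) + (size & 1u);
    }
    return false;
}

// True when the audio is plain PCM rather than a compressed codec.
bool is_uncompressed_pcm(const WavFormat& fmt) {
    return fmt.audio_format == 1;
}
```

If `parse_wav_format` fails or `is_uncompressed_pcm` returns false, the conversion step probably did not produce the kind of file you intended, and it is worth fixing that before looking at recognition quality.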
