How do I run speech recognition on a video file?

How can I program a speech recognition engine (using the Microsoft Speech SDK) to "listen" to a video file and save the recognized text to a file?

1 answer

This is very similar to an earlier question and has a very similar answer. You need to extract the audio track, convert it to WAV format, and feed it to the in-process (inproc) recognizer.
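The audio-extraction and WAV-conversion step could be sketched roughly like this in Python. It assumes the ffmpeg command-line tool is installed and on the PATH, which the answer does not specify. The 16 kHz, mono, 16-bit PCM target is also an assumption, chosen as a common input format for speech recognizers, and the function names are made up for illustration. The recognition step itself is not shown.

```python
import subprocess
import wave


def build_ffmpeg_command(video_path, wav_path, sample_rate=16000):
    """Build an ffmpeg argument list that keeps only the audio track
    and writes it as mono 16-bit PCM WAV (assumed recognizer input format)."""
    return [
        "ffmpeg",
        "-y",                    # overwrite the output file if it exists
        "-i", video_path,        # input video file
        "-vn",                   # drop the video stream
        "-acodec", "pcm_s16le",  # 16-bit little-endian PCM
        "-ar", str(sample_rate), # resample to the target rate
        "-ac", "1",              # downmix to a single channel
        wav_path,
    ]


def extract_audio(video_path, wav_path, sample_rate=16000):
    """Run ffmpeg; raises CalledProcessError if the conversion fails."""
    subprocess.run(
        build_ffmpeg_command(video_path, wav_path, sample_rate),
        check=True,
    )


def describe_wav(wav_path):
    """Read the WAV header so the format can be checked before recognition."""
    with wave.open(wav_path, "rb") as wav:
        return {
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "sample_rate": wav.getframerate(),
            "duration_seconds": wav.getnframes() / wav.getframerate(),
        }
```

A call such as `extract_audio("talk.mp4", "talk.wav")` followed by `describe_wav("talk.wav")` (the file names are placeholders) would produce the WAV file and confirm its format before it is handed to the recognizer.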

However, this approach has the same problems I described earlier: it requires training, assumes a single voice, and assumes that the microphone is close to the speaker. If those conditions hold, you are likely to get pretty good results. If they do not (for example, you are trying to transcribe a television show or, even worse, audio recorded on a camcorder), the results are likely to be unsatisfactory.


Source: https://habr.com/ru/post/1722315/

