I'm a superhero after hours, and I'm trying to create an application that analyzes audio for spoken keywords. (Think of emergency calls / 911) If the keyword is "robbery" and this word is pronounced in the provided audio, I would like to mark this file and possibly translate it into text.
What development libraries or software applications exist for this kind of thing? C ++ or Java libraries are preferred, but not required.
source share