I think Google's speech-to-text tools (automatic transcription of Google Voice voicemail, automatic video captioning on YouTube, etc.) are impressive.
I have looked hard to see whether Google provides access to this through an API, and it seems they don't (I don't blame them!). A cloud computing service offering speech-to-text would be pretty cool.
Is there some kind of "hack" I can use to get access to speech-to-text? My architecture basically comes down to this: a short 15-20 second wav / mp3 / other clip as input, and plain text as output.
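To make the input/output contract concrete, here is a rough sketch of the shape I have in mind. It is only an illustration under assumptions: `transcribe_clip`, `wav_duration_seconds`, `MAX_CLIP_SECONDS` and the `backend` callable are hypothetical names, not part of any real service, and the length check only handles WAV (an mp3 would need converting first).

```python
import wave
from typing import Callable

# Assumed upper bound, taken from the 15-20 second clips described above.
MAX_CLIP_SECONDS = 20.0


def wav_duration_seconds(path: str) -> float:
    """Return the duration of a WAV file in seconds using the standard library."""
    with wave.open(path, "rb") as wav_file:
        return wav_file.getnframes() / wav_file.getframerate()


def transcribe_clip(path: str, backend: Callable[[bytes], str]) -> str:
    """Check the clip length, then pass the raw file bytes to a speech-to-text backend.

    `backend` is a hypothetical placeholder: any callable that takes audio
    bytes and returns plain text. No real service's API is assumed here.
    """
    duration = wav_duration_seconds(path)
    if duration > MAX_CLIP_SECONDS:
        raise ValueError(
            f"clip is {duration:.1f}s, longer than {MAX_CLIP_SECONDS:.0f}s"
        )
    with open(path, "rb") as audio_file:
        # Strip surrounding whitespace so the caller gets clean plain text.
        return backend(audio_file.read()).strip()
```

Whatever service or hack ends up doing the recognition would slot in as `backend`; the rest of the application only ever sees a file path going in and a string coming out.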
Does anyone have any ideas?