Is there any speech in the text API or some kind of “hack” with which I can use Google speech for text tools?

Question

Is there any speech in the text API or some kind of “hack” with which I can use Google speech for text tools?

I think Google’s speech for text tools (automatic transcription of Google Voice voicemail, automatic video signing on YouTube, etc.) is impressive.

I really looked to see if Google provided access to it through the API, and it seems not (I don't blame them!). The cloud computing service providing a speech function for text will be pretty cool.

Is there some kind of “hack” that I can use to access speech in text. My architecture basically comes down to this - a short 15-20 seconds wav / mp3 / other clip as input, the output is plain text.

Any ideas from people?

+4

speech-recognition google-api

user245120 May 11, '10 at 23:11

source share

6 answers

Samuel neff · Answer 1 · 2010-11-06T23:57:22+0000

There are many text based APIs. Just because Google doesn't make them accessible doesn't mean you're out of luck.

Here is a good example for C #. You can search for others for your platform if it is not .NET.

http://cmusphinx.sourceforge.net/

Westy92 · Answer 2 · 2011-04-16T01:32:19+0000

Check it out: http://mikepultz.com/2011/03/accessing-google-speech-api-chrome-11/

I am currently trying to implement an API in PHP.

- Seth

Peter Moffatt · Answer 3 · 2010-12-09T11:47:55+0000

It is available in HTML5 via Chrome 8 or Opera: https://docs.google.com/View?id=dcfg79pz_5dhnp23f5&pli=1

Google voice technology is also available through the Android API on your Android phone.

Other products, such as Sphinx, are speech recognition engines that work best in specific areas, rather than “unconditional” speech text.

Theo · Answer 4 · 2011-07-18T17:28:38+0000

Here's a newer, more “official” version of Peter Moffatt's proposal:

http://lists.w3.org/Archives/Public/public-xg-htmlspeech/2011Feb/att-0020/api-draft.html

And the google related ad:

http://chrome.blogspot.com/2011/03/talking-to-your-computer-with-html5.html

wizgot · Answer 5 · 2013-05-06T06:23:56+0000

You can take a look at the following implementation using C # - I used Mike Pultz's link.

https://github.com/seigneur/Voice-Biometrics I used Sox to convert to flac, created a small SOX script to break it into pieces.

Havoc · Answer 6 · 2014-12-02T04:14:10+0000

If you really need google output ... Here is the Hack method

Have you thought about creating a messaging engine? Essentially, it calls your voicemail google ... plays mp3.

Run the output through https://code.google.com/p/google-voice-java/

The best answers.

Is there any speech in the text API or some kind of “hack” with which I can use Google speech for text tools?

More articles: