I have some ideas for voice-controlled applications. Unfortunately, based on what I saw from Siri and Google Voice Actions, the technology is not quite there yet. Even in a completely calm environment, accuracy is so poor that it is often easier to type into a telephone.
One way to alleviate the problem would be to limit the system to a few commands specially selected for sound very different, in contrast to transferring sound to a service and simply returning text.
So, I have the following requirements:
- Very high accuracy when accessing a limited set of commands
- It is desirable that it works on mobile devices, but only libraries on the computer can be useful.
- Offline is again preferred, but not necessary.
- No need to be open source - licensing is fine
Is there such an API or software?
source share