The maximum time limit for a single speech recognition session is about 60 seconds. This means that you can get Google to process the offer, but then you need to end this session and start a new one.
If you can use JavaScript, then you should use the SpeechRecognition API - take a look at the simple example I wrote in this tutorial
source share