I want to develop a module which will use a speech to text support in Android. I found out many documentation and demos related to RecognizerIntent and such others. But I found that all of such demos just fetch the voice till 10 secs or so. But I want my demo to run for more than 5-10 minutes. I don't have any issue if that is not running offline, as my app is always working online.
I have also looked in to Pocketsphinx on Android, but that didn't worked out well. Also, that gave support just for Android Studio and not on Eclipse.
I have seen many apps giving the facility to convert speech to text for 5-10 mins continuously, such as: Speech To Text Notepad.
Can anyone suggest any other library of demo code with which this can be achieved ? TIA.
In general, long audio speech recognition is a challenging problem, so you barely find anything open for that. Instead, I'd suggest you to apply one of the audio segmentation algorithms and recognize them separately. Also, if you have text transcript along with your audio and just want to obtain the time frames (e.g. for the video captioning problem), then the task becomes much easier, and you can try long audio alignment for that.
Please refer this Android Speech Recognition Without Dialog In A Custom Activity.
try override method onEndOfSpeech and restart service again using
speechRecognizer.startListening(recognizerIntent)
I got the same result as the app you referenced Speech To Text Notepad, here is my activity
I have successfully accomplished this with the help of Google Cloud Speech API. They have also added a demo here.
Google Cloud Speech-to-Text enables developers to convert audio to text by applying powerful neural network models in an easy to use API. The API recognizes 120 languages and variants, to support your global user base. You can enable voice command-and-control, transcribe audio from call centers, and more. It can process real-time streaming or pre-recorded audio, using Google’s machine learning technology.
You can transcribe the text of users dictating to an application’s microphone, enable command-and-control through voice, or transcribe audio files, among many other use cases. Recognize audio uploaded in the request, and integrate with your audio storage on Google Cloud Storage, by using the same technology Google uses to power its own products.