API or SDK for speech to text (speech recognition)

Posted 2019-01-23 09:38

Question:

Hi, I am looking for a speech recognition API or SDK that recognizes speech spoken by the user and returns it as text.

A detailed description follows:

In my application I need to play an audio file whose text I already have. While the audio plays, the word currently being spoken (in the audio file) should be highlighted.

So if I can get each word from the API or SDK as it is spoken, I can highlight it.

I have also googled a lot for such an API and came across ceedvocalsdk, but it is not available as a free trial.

If someone can suggest another approach, API, or SDK that suits my requirement, I will be very thankful.

Answer 1:

You can try

http://www.politepix.com/openears/

As for speed, it should be fast; you are probably not using it properly. As I understand it, you already have the text, so you need to build a grammar from that text.
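
To illustrate what building a grammar from the known text could start from, here is a minimal sketch in Python. It only extracts the distinct words of the text, in order of first appearance, as a restricted vocabulary. The function name build_vocabulary and the uppercase convention are assumptions made for illustration; this is not the OpenEars API, and how the word list is handed to a recognizer depends on the library you choose.

```python
import re


def build_vocabulary(text):
    """Return the distinct words of `text`, uppercased, in first-seen order.

    Hypothetical helper: a restricted word list like this is a typical
    starting point for a recognizer grammar or language model.
    """
    # Keep runs of letters and apostrophes; drop punctuation and digits.
    words = re.findall(r"[A-Za-z']+", text)
    seen = set()
    vocabulary = []
    for word in words:
        key = word.upper()
        if key not in seen:
            seen.add(key)
            vocabulary.append(key)
    return vocabulary


if __name__ == "__main__":
    print(build_vocabulary("When audio starts playing, the word is highlighted."))
```

Restricting the recognizer to the words that actually occur in the transcript is what usually makes recognition against a known text fast.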



Answer 2:

You can take a look at https://github.com/KingOfBrian/VocalKit, but I have not tried it myself.



Answer 3:

You can also try Nexiwave.com.

I think the function you are looking for is what we call TimeStamping: http://nexiwave.com/index.php/applications/for-transcription-companies

It basically takes an audio file and its text, and we then put a timestamp on each sentence and word.
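
Once per-word timestamps are available, the highlighting side might look like the following Python sketch. The timing data, its (start, end, word) layout, and the name word_index_at are all assumptions made for illustration; they are not the output format of any particular service.

```python
from bisect import bisect_right

# Hypothetical alignment output: (start_seconds, end_seconds, word).
WORD_TIMINGS = [
    (0.00, 0.40, "When"),
    (0.40, 0.75, "audio"),
    (0.75, 1.20, "starts"),
    (1.50, 2.00, "playing"),
]


def word_index_at(timings, t):
    """Return the index of the word being spoken at time `t`, or None.

    `timings` must be sorted by start time. None is returned before the
    first word, after the last word, and during pauses between words.
    """
    starts = [start for start, _, _ in timings]
    # Index of the last word whose start time is <= t.
    i = bisect_right(starts, t) - 1
    if i < 0:
        return None
    _, end, _ = timings[i]
    return i if t < end else None


if __name__ == "__main__":
    for t in (0.5, 1.3, 1.7):
        print(t, word_index_at(WORD_TIMINGS, t))
```

In an app, you would call something like this from a timer or playback-progress callback with the player's current position and highlight the word at the returned index.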

Ben