问题:

Hi I want to have a speech recognition api or sdk which recognises the speech spoken by the user and gives it's text form.

Detailed Description is as follows:

In my application I need to play an audio file and text of which is already there with me. When audio starts playing the word should be highlighted which is spoken(from the audio file).

So if I am able to get the word from api or sdk then it is possible to highlight it.

Apart from I googled a lot for api and I came across ceedvocalsdk but it's not available for free trial.

If someone can provide any idea other than this suiting to my requirement or api or sdk , I will be highly Thankful.

回答1:

You can try

http://www.politepix.com/openears/

As for speed, it should be fast, you probably don't use it properly. As I understood you have text already and you need to build grammar from this text.