API or SDK for speech to text(speech recognition )

2019-01-23 09:22发布

Hi I want to have a speech recognition api or sdk which recognises the speech spoken by the user and gives it's text form.

Detailed Description is as follows:

In my application I need to play an audio file and text of which is already there with me. When audio starts playing the word should be highlighted which is spoken(from the audio file).

So if I am able to get the word from api or sdk then it is possible to highlight it.

Apart from I googled a lot for api and I came across ceedvocalsdk but it's not available for free trial.

If someone can provide any idea other than this suiting to my requirement or api or sdk , I will be highly Thankful.

3条回答
男人必须洒脱
2楼-- · 2019-01-23 10:11

You can try

http://www.politepix.com/openears/

As for speed, it should be fast, you probably don't use it properly. As I understood you have text already and you need to build grammar from this text.

查看更多
我命由我不由天
3楼-- · 2019-01-23 10:12

You can take a look at https://github.com/KingOfBrian/VocalKit, but I have not tried it myself.

查看更多
贪生不怕死
4楼-- · 2019-01-23 10:16

You can also try Nexiwave.com.

I think the function you are looking for is what we can TimeStamping: http://nexiwave.com/index.php/applications/for-transcription-companies

It basically take an audio and the text, we then put timestamp on each sentence and word.

Ben

查看更多
登录 后发表回答