Speech recognition (web) services?

2019-02-24 00:56发布

问题:

I have a buffer of audio and I'd like to perform speech recognition/transcription on it. I have limited CPU and RAM locally so I want to perform recognition on a server.

Are there any (web) services that allow me to do this?

My searches so far have led nowhere...

回答1:

Google has just introduced browser-based access to its speech engine through HTML5.

http://slides.html5rocks.com/#speech-input

To get this page to work, I launched the Chromium browser as follows in Ubuntu:

$ chromium-browser --enable-speech-input

I believe that the idea is to be able to build applications that use Google's speech recognizer, but I haven't had a chance to look deeply into it.

Another interesting project is WAMI from MIT: http://wami.csail.mit.edu



回答2:

Lumenvox offers such a service but seems expensive for your needs.