I have a project in which I have created a chat program between a client and host, and I have to embed Speech to Text in it. Is there any way by which I can embed Google Speech to Text API in my program ??
相关问题
- how to define constructor for Python's new Nam
- streaming md5sum of contents of a large remote tar
- Can we recover audio from MFCC coefficients?
- How to get the background from multiple images by
- Evil ctypes hack in python
You can try Nexiwave's free speech-to-text api. Here is the python sample: http://nexiwave.com/api_samples/nexiwave_py.txt. Also check the API guide: http://nexiwave.com/index.php/119-integrate-in-5-minutes . Pretty straightforward.
You'd have to register first to use the free plan.
There is a package in PyPI called Speech Recognition which looks like it will do this. The live (i.e. via microphone) API looks fantastically simple.
It also has capabilities for transcribing WAV files, running as a background process, providing confidence values for the transcription, etc.
This is probably what you do not want to use in your case, but for other's who may need this for a one-off project, I hacked together a simple python client a while ago that uses the APIs built into Chrome for voice search:
https://github.com/korylprince/python-google-transcribe
For it to work, you must have 16000Hz encoded FLACs, and they have to be fairly short.
Also, like the comments mention, the API is unofficial, so who knows when it will stop working.