I'd like to build an app that uses voice recognition. I've seen big companies like Google etc implement this feature, but I'm curious about doing it on a start-up level. Anyone looked into this? Are there any tools out there for us to do this?
相关问题
- Can we recover audio from MFCC coefficients?
- CALayer - backgroundColor flipped?
- Core Data lightweight migration crashes after App
- How can I implement password recovery in an iPhone
- how do you prevent page scroll in textarea on mobi
相关文章
- Could I create “Call” button in HTML 5 IPhone appl
- How to embed Google Speech to Text API in Python p
- Unable to process app at this time due to a genera
- How do you detect key up / key down events from a
- “Storyboard.storyboard” could not be opened
- Open iOS 11 Files app via URL Scheme or some other
- Can keyboard of type UIKeyboardTypeNamePhonePad be
- Can not export audiofiles via “open in:” from Voic
OpenEars looks promising... http://www.politepix.com/openears/
Based on Pocket Sphinx.
The Dragon Mobile SDK from Nuance does what is asked for. You need an internet connection to be able to send the audio to Nuance's server and you get a list of text responses. You can then decide what to do with the text responses (e.g. ask your user to choose the one he meant or perform some action). Here is the link:
http://dragonmobile.nuancemobiledeveloper.com/
If you start here at wikipedia, you'll get a good list engines (http://en.wikipedia.org/wiki/Speech_recognition#Commercial_software.2Fmiddleware)
As I write this (June 24, 2009) it looks to me that are two viable open source solutions
Both have been used in iphone apps, but the iphone friendly source isn't readily available.
As I edit this (8 July, 2009) I recently learned that Loquendo (http://www.loquendo.com/en/) has voice recognition and speech synthesis (ASR & TTS) for the iphone.
The best approach will probably be to: