I just want to know if there is any build in libraries or external libraries in Java or C# that allow me to take an audio file and parse it and extract the text from it.
I need to make an application to do so, but I don't know from where I can start.
You can use SoX (the Swiss Army knife of sound processing programs) to convert audio file to text file with numeric values corresponding to sound frequency/volume.
I have done it for a previous project but don't know the exact command options.
Here is a link to the project: http://sox.sourceforge.net/Main/HomePage
For Java, it seems there is a solution from Sun: javax.speech.recognition
Here is a complete example using C# and System.Speech
The code can be divided into 2 main parts:
configuring the SpeechRecognitionEngine object (and its required elements) handling the SpeechRecognized and SpeechHypothesized events.
Step 1: Configuring the SpeechRecognitionEngine
At this point your object is ready to start transcribing audio from the microphone. You need to handle some events though, in order to actually get access to the results.
Step 2: Handling the SpeechRecognitionEngine Events
That’s it. If you want to use a pre-recorded .wav file instead of a microphone, you would use
instead of
There are a bunch of different options in these classes and they are worth exploring in more detail.
http://ellismis.com/2012/03/17/converting-or-transcribing-audio-to-text-using-c-and-net-system-speech/
You might check Microsoft Speech API. I think they provide a SDK that you can use for your objective.
Here are some of your options: