Requirement:
I want a system/algorithm that is similar to the call system in the car that you say a name that you want to call, it will find out that name match your voice. My requirement is that I will use pre-recorded voices instead of Name.
e.g.: User record target voices for several words in non-English language, e.g.:
- "Keyboard-JP-01.wav" => Keyboard
- "Monitor-JP-01.wav" => Monitor
After some training or even no training, when the user speak a word again, the speech recognizer should able to identify which word it is.
Known tools:
I searched existing speech recognition algorithm, but cannot find one, most of them are recognizing existing languages (kind of speech to text).
It also different from phonetic algorithm which seems only for English.
Thanks