Recognition by comparison with pre-recorded words

2019-06-12 20:36发布

站内文章 / 移动开发

44 0

贼婆χ

女 | 书童

私信

可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效，请关闭广告屏蔽插件后再试):

问题:

Requirement:

I want a system/algorithm that is similar to the call system in the car that you say a name that you want to call, it will find out that name match your voice. My requirement is that I will use pre-recorded voices instead of Name.

e.g.: User record target voices for several words in non-English language, e.g.:

"Keyboard-JP-01.wav" => Keyboard
"Monitor-JP-01.wav" => Monitor

After some training or even no training, when the user speak a word again, the speech recognizer should able to identify which word it is.

Known tools:

I searched existing speech recognition algorithm, but cannot find one, most of them are recognizing existing languages (kind of speech to text).

It also different from phonetic algorithm which seems only for English.

Thanks

回答1:

Dynamic Time Warping is a traditional and fairly simple algorithm for this kind of speech recognition, though I'm sure there must be better methods nowadays.