I want to train my neural network with different sounds. However, the size of each sounds are different. Does anyone know how to train a neural network with different size of input? Thanks.
可以将文章内容翻译成中文,广告屏蔽插件可能会导致该功能失效(如失效,请关闭广告屏蔽插件后再试):
问题:
回答1:
There is no way to classify inputs of different sizes, but you can transform your signal into a sequence of fixed-size feature vectors (or into a sequence of fixed-size pieces of the original sound). For a sound we usually employ MFCCs or just a spectrogram. Thus, you need to apply methods that operate on sequences. It can be a recurrent neural network, or you can employ a feed-forward network and then post-process its outputs for each frame somehow.