Beats per minute from real-time audio input

I'd like to write a simple C# application to monitor the line-in audio and give me the current (well, the rolling average) beats per minute.

I've seen this gamedev article, and that was absolutely no help. I went through and tried to implement what he was doing but it just wasn't working.

I know there have to be tons of solutions for this, because lots of DJ software does it, but I'm not having any luck in finding any open-source library or instructions on doing it myself.

Tags: c# audio signal-processing

8 answers

放我归山

#2 · 2019-01-10 00:15

There's an excellent project called Dancing Monkeys, which procedurally generates DDR dance steps from music. A large part of what it does is based on (necessarily very accurate) beat analysis, and their project paper goes into much detail describing the various beat detection algorithms and their suitability to the task. They include references to the original papers for each of the algorithms. They've also published the matlab code for their solution. I'm sure that between those you can find what you need.

It's all available here: http://monket.net/dancing-monkeys-v2/Main_Page

唯我独甜

#3 · 2019-01-10 00:16

I found this library, which seems to have a pretty solid implementation for detecting beats per minute: http://soundtouchdotnet.codeplex.com/

It's based on SoundTouch (http://www.surina.net/soundtouch/index.html), which is used in quite a few DJ projects (http://www.surina.net/soundtouch/applications.html).

疯言疯语

#4 · 2019-01-10 00:17

First of all, what Hallgrim is producing is not the power spectral density function. Statistical periodicities in any signal can be brought out through an autocorrelation function. The Fourier transform of the autocorrelation function is the power spectral density (PSD). Dominant peaks in the PSD, other than the one at 0 Hz, correspond to the effective periodicity in the signal (in Hz)...
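To make the autocorrelation idea concrete, here is a minimal sketch. It is not this answer's exact method: instead of transforming the autocorrelation into a PSD, it simply picks the lag with the strongest autocorrelation. The class name, the loudness envelope input, the frame rate, and the BPM search range are all assumptions made for illustration.

```csharp
using System;

public static class AutocorrelationTempo
{
    // envelope: one loudness value per frame (assumed input).
    // frameRate: envelope frames per second (assumed input).
    public static double EstimateBpm(double[] envelope, double frameRate,
                                     double minBpm = 60, double maxBpm = 180)
    {
        // Remove the mean so the constant (0 Hz) component does not dominate.
        double mean = 0;
        foreach (double v in envelope) mean += v;
        mean /= envelope.Length;

        // Lags, in frames, that correspond to the allowed tempo range.
        int minLag = Math.Max(1, (int)Math.Round(60.0 * frameRate / maxBpm));
        int maxLag = Math.Min(envelope.Length - 1,
                              (int)Math.Round(60.0 * frameRate / minBpm));

        int bestLag = minLag;
        double bestScore = double.NegativeInfinity;
        for (int lag = minLag; lag <= maxLag; lag++)
        {
            double sum = 0;
            for (int n = 0; n + lag < envelope.Length; n++)
                sum += (envelope[n] - mean) * (envelope[n + lag] - mean);

            // Normalise by the number of terms so long lags are not penalised.
            double score = sum / (envelope.Length - lag);
            if (score > bestScore) { bestScore = score; bestLag = lag; }
        }

        // Convert the winning lag (frames per beat) to beats per minute.
        return 60.0 * frameRate / bestLag;
    }
}
```

Lags at whole multiples of the beat period score about as well as the beat period itself, so the search range probably needs to be narrower than one octave to avoid half-tempo or double-tempo answers.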

smile是对你的礼貌

#5 · 2019-01-10 00:17

The easy way to do it is to have the user tap a button in rhythm with the beat, then divide the number of taps by the elapsed time.
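A tap-tempo counter along those lines could look roughly like this. The class and member names are made up for the example, and the caller is assumed to supply timestamps in seconds from some monotonic clock (a Stopwatch, for instance).

```csharp
using System;
using System.Collections.Generic;

public sealed class TapTempo
{
    private readonly List<double> tapTimes = new List<double>();

    // Record one tap; timeSeconds is a reading from any monotonic clock.
    public void Tap(double timeSeconds)
    {
        tapTimes.Add(timeSeconds);
    }

    // Beats per minute over all taps so far. Returns 0 until two taps exist.
    public double Bpm
    {
        get
        {
            if (tapTimes.Count < 2) return 0;
            double elapsed = tapTimes[tapTimes.Count - 1] - tapTimes[0];
            if (elapsed <= 0) return 0;

            // N taps span N - 1 beat intervals.
            return 60.0 * (tapTimes.Count - 1) / elapsed;
        }
    }
}
```

Note that N taps span only N - 1 intervals, so dividing the raw tap count by the elapsed time would overestimate the tempo slightly for short runs.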

啃猪蹄的小仙女

#6 · 2019-01-10 00:20

Calculate a power spectrum with a sliding-window FFT. Take 1024 samples:

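// Assumes stream is an IEnumerable<double> of samples; Take/ToArray need "using System.Linq;".
double[] signal = stream.Take(1024).ToArray();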

Feed it to an FFT algorithm:

// FFT stands for whichever FFT routine you use; it fills in both output arrays.
double[] real = new double[signal.Length];
double[] imag = new double[signal.Length];
FFT(signal, out real, out imag);

You will get a real part and an imaginary part. Do NOT throw away the imaginary part; process it in the same way as the real part. While it is true that the imaginary part is pi / 2 out of phase with the real part, it still contains 50% of the spectrum information.

EDIT:

Calculate the power as opposed to the amplitude so that you have a high number when it is loud and close to zero when it is quiet:

for (int i = 0; i < real.Length; i++) real[i] = real[i] * real[i];

Similarly for the imaginary part.

for (int i = 0; i < imag.Length; i++) imag[i] = imag[i] * imag[i];

Now you have a power spectrum for the last 1024 samples. The low-numbered bins hold the low frequencies, and the bins up to the middle of the array hold progressively higher ones; for a real-valued input, the upper half of the array mirrors the lower half.
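For reference, the per-bin power is the squared real part plus the squared imaginary part. The sketch below shows that with a naive O(N^2) DFT, chosen only so the example needs no FFT library; the class name is an assumption, and a real FFT should replace the inner loops in practice.

```csharp
using System;

public static class PowerSpectrum
{
    // Returns the power in bins 0 .. N/2 of a real-valued signal.
    // Naive DFT for illustration only; far too slow for real-time use.
    public static double[] Compute(double[] signal)
    {
        int n = signal.Length;
        double[] power = new double[n / 2 + 1];
        for (int k = 0; k < power.Length; k++)
        {
            double re = 0, im = 0;
            for (int t = 0; t < n; t++)
            {
                double angle = -2.0 * Math.PI * k * t / n;
                re += signal[t] * Math.Cos(angle);
                im += signal[t] * Math.Sin(angle);
            }

            // Power combines both parts; neither is discarded.
            power[k] = re * re + im * im;
        }
        return power;
    }
}
```

Only bins 0 to N/2 are returned because, for real input, the remaining bins repeat the same values in reverse order.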

If you want to find BPM in popular music you should probably focus on the bass. You can pick up the bass intensity by summing the lower part of the power spectrum. Which numbers to use depends on the sampling frequency:

double bassIntensity = 0;
// real[i] and imag[i] were squared above, so their sum is the power in bin i.
for (int i = 8; i < 96; i++) bassIntensity += real[i] + imag[i];
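To choose the bin range for a given sampling frequency, it helps to convert between bin indices and Hz: bin k of an N-point FFT is centred at k * sampleRate / N. The helper below is a small sketch of that conversion; the class and method names are invented, and the 44.1 kHz rate in the usage note is an assumption, since the answer does not state one.

```csharp
using System;

public static class FftBins
{
    // Centre frequency in Hz of FFT bin number "bin".
    public static double BinToHz(int bin, double sampleRate, int fftSize)
    {
        return bin * sampleRate / fftSize;
    }

    // Bin whose centre frequency is closest to the given frequency.
    public static int HzToBin(double hz, double sampleRate, int fftSize)
    {
        return (int)Math.Round(hz * fftSize / sampleRate);
    }
}
```

At an assumed 44.1 kHz with a 1024-point FFT, each bin is about 43 Hz wide, so bins 8 to 95 cover roughly 345 Hz to 4.1 kHz. If that is higher than the band you want, FftBins.HzToBin(200, 44100, 1024) gives bin 5 as an upper limit for a 200 Hz cutoff.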

Now do the same again, but move the window forward by 256 samples before you calculate the next spectrum. You end up with one bassIntensity value for every 256 samples.

This is a good input for your BPM analysis. When the bass is quiet you do not have a beat and when it is loud you have a beat.
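One simple way to turn that series of bass intensities into a tempo is sketched below. This step is not part of the answer itself: the thresholding rule, the sensitivity factor, and the use of the median gap are all assumptions chosen to keep the example short.

```csharp
using System;
using System.Collections.Generic;

public static class BeatIntervals
{
    // intensities: one bass intensity per hop (assumed input).
    // hopSeconds: time between values, e.g. 256 / sampleRate.
    public static double EstimateBpm(double[] intensities, double hopSeconds,
                                     double sensitivity = 1.5)
    {
        double mean = 0;
        foreach (double v in intensities) mean += v;
        mean /= intensities.Length;
        double threshold = mean * sensitivity;

        // Count a beat each time the intensity rises through the threshold.
        List<int> onsets = new List<int>();
        for (int i = 1; i < intensities.Length; i++)
            if (intensities[i] > threshold && intensities[i - 1] <= threshold)
                onsets.Add(i);

        if (onsets.Count < 2) return 0;

        // Use the median gap so a few missed or extra beats do not skew the result.
        List<int> gaps = new List<int>();
        for (int i = 1; i < onsets.Count; i++) gaps.Add(onsets[i] - onsets[i - 1]);
        gaps.Sort();
        double medianGap = gaps[gaps.Count / 2];

        return 60.0 / (medianGap * hopSeconds);
    }
}
```

A fixed threshold like this is fragile on real music, where loudness drifts over time; comparing each value with a local average over the last second or so may work better.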

Good luck!

Rolldiameter

#7 · 2019-01-10 00:21

This is by no means an easy problem. I'll try to give you an overview only.

What you could do is something like the following:

1. Compute the average (root-mean-square) loudness of the signal over blocks of, say, 5 milliseconds. (Having never done this before, I don't know what a good block size would be.)
2. Take the Fourier transform of the "blocked" signal, using the FFT algorithm.
3. Find the component in the transformed signal that has the largest magnitude.

A Fourier transform is basically a way of computing the strength of all frequencies present in the signal. If you do that over the "blocked" signal, the frequency of the beat will hopefully be the strongest one.
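The three steps above could be sketched as follows. The names are made up for the example, a naive DFT stands in for the FFT to keep it self-contained, and the search is limited to an assumed 60 to 180 BPM range, which also skips the 0 Hz bin that would otherwise always win.

```csharp
using System;

public static class EnvelopeSpectrumTempo
{
    // Step 1: RMS loudness of each block of blockSize samples.
    public static double[] RmsBlocks(double[] samples, int blockSize)
    {
        double[] rms = new double[samples.Length / blockSize];
        for (int b = 0; b < rms.Length; b++)
        {
            double sumSquares = 0;
            for (int i = 0; i < blockSize; i++)
            {
                double s = samples[b * blockSize + i];
                sumSquares += s * s;
            }
            rms[b] = Math.Sqrt(sumSquares / blockSize);
        }
        return rms;
    }

    // Steps 2 and 3: strongest frequency of the envelope within a tempo range.
    public static double EstimateBpm(double[] envelope, double blocksPerSecond,
                                     double minBpm = 60, double maxBpm = 180)
    {
        int n = envelope.Length;
        double bestBpm = 0, bestMagnitude = -1;
        for (int k = 1; k <= n / 2; k++)
        {
            // Bin k sits at k * blocksPerSecond / n Hz; times 60 gives BPM.
            double bpm = 60.0 * k * blocksPerSecond / n;
            if (bpm < minBpm || bpm > maxBpm) continue;

            double re = 0, im = 0;
            for (int t = 0; t < n; t++)
            {
                double angle = -2.0 * Math.PI * k * t / n;
                re += envelope[t] * Math.Cos(angle);
                im += envelope[t] * Math.Sin(angle);
            }
            double magnitude = Math.Sqrt(re * re + im * im);
            if (magnitude > bestMagnitude) { bestMagnitude = magnitude; bestBpm = bpm; }
        }
        return bestBpm;
    }
}
```

The tempo resolution is 60 * blocksPerSecond / n BPM per bin, so ten seconds of envelope gives steps of 6 BPM; a longer analysis window is needed for finer estimates.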

Maybe you need to apply a filter first, to focus on specific frequencies (like the bass) that usually contain the most information about the BPM.
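As one possible pre-filter, a first-order low-pass keeps the bass and attenuates higher frequencies. This is only a sketch under assumed parameters: the class name and the cutoff value are illustrative, and a steeper filter may well be needed in practice.

```csharp
using System;

public static class OnePoleLowPass
{
    // Returns a low-pass filtered copy of samples.
    public static double[] Apply(double[] samples, double cutoffHz, double sampleRate)
    {
        // Smoothing factor of a first-order RC low-pass: alpha = dt / (RC + dt).
        double dt = 1.0 / sampleRate;
        double rc = 1.0 / (2.0 * Math.PI * cutoffHz);
        double alpha = dt / (rc + dt);

        double[] output = new double[samples.Length];
        double state = 0;
        for (int i = 0; i < samples.Length; i++)
        {
            // Move the output a fraction alpha of the way towards the input.
            state += alpha * (samples[i] - state);
            output[i] = state;
        }
        return output;
    }
}
```

For example, OnePoleLowPass.Apply(samples, 200, 44100) would pass most energy below about 200 Hz, with a gentle roll-off of 6 dB per octave above that.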
