Speedup a short to float cast?-第2页回答

Speedup a short to float cast?

2019-01-24 14:21发布

I have a short to float cast in C++ that is bottlenecking my code.

The code translates from a hardware device buffer which is natively shorts, this represents the input from a fancy photon counter.

float factor=  1.0f/value;
for (int i = 0; i < W*H; i++)//25% of time is spent doing this
{
    int value = source[i];//ushort -> int
    destination[i] = value*factor;//int*float->float
}

A few details

Value should go from 0 to 2^16-1, it represents the pixel values of a highly sensitive camera
I'm on a multicore x86 machine with an i7 processor (i7 960 which is SSE 4.2 and 4.1).
Source is aligned to an 8 bit boundary (a requirement of the hardware device)
W*H is always divisible by 8, most of the time W and H are divisible by 8

This makes me sad, is there anything I can do about it?

I am using Visual Studios 2012...

标签： c++ x86 type-conversion sse

7条回答

Anthone

2楼-- · 2019-01-24 15:01

You could try to approximate the expression

float factor = 1.0f/value;

by an fraction numerator/denomitator where both numerator and denominator are ints. This can be done to the precision you need in your application like

int denominator = 10000;
int numerator = factor * denominator;

Then you can do your computation in integer arithmetics like

int value = source[i];
destination[i] = (value * numerator) / numerator;

You have to take care of overflows, perhaps you need to switch to long (or even long long on 64bit systems) for the calculation.

0人赞添加讨论(0) 举报

上一页 1 2

Speedup a short to float cast?

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间