Multithreaded image processing in C++

I am working on a program which manipulates images of different sizes. Many of these manipulations read pixel data from an input and write to a separate output (e.g. blur). This is done on a per-pixel basis.

Such image manipulations are very stressful on the CPU. I would like to use multithreading to speed things up. How would I do this? I was thinking of creating one thread per row of pixels.

I have several requirements:

Executable size must be minimized. In other words, I can't use massive libraries. What's the most light-weight, portable threading library for C/C++?
Executable size must be minimized. I was thinking of having a function forEachRow(fp* ) which runs a thread for each row, or even a forEachPixel(fp* ) where fp operates on a single pixel in its own thread. Which is best?
- Should I use normal functions or functors or functionoids or some lambda functions or ... something else?
- Some operations use optimizations which require information from the previous pixel processed. This makes forEachRow favorable. Would using forEachPixel be better even considering this?
Would I need to lock my read-only and write-only arrays?
- The input is only read from, but many operations require input from more than one pixel in the array.
- The output is only written once per pixel.
Speed is also important (of course), but optimizing executable size takes precedence.

Thanks.

More information on this topic for the curious: C++ Parallelization Libraries: OpenMP vs. Thread Building Blocks

Tags: c++ multithreading optimization image-processing parallel-processing

16 answers

beautiful°

Reply #2 · 2019-02-03 10:23

It is quite possible that the bottleneck is not the CPU but memory bandwidth, in which case multithreading won't help much. Try to minimize memory access and work on limited memory blocks so that more data stays in cache. I had a similar problem a while ago and decided to optimize my code to use SSE instructions. The speed increase was almost 4x on a single thread!
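To illustrate the "limited memory blocks" idea, here is a minimal sketch, not a measured implementation. It assumes an 8-bit grayscale image and two made-up passes (brighten by 40, then threshold at 128). The function name brightenThenThreshold and the rowsPerStrip default are hypothetical. Instead of running each pass over the whole image, both passes run on a small strip of rows, so the intermediate buffer is only one strip in size.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Runs two passes (brighten, then threshold) strip by strip so the
// intermediate data is only rowsPerStrip rows, not a full image.
// rowsPerStrip is a tuning knob, not a measured optimum.
void brightenThenThreshold(const std::vector<std::uint8_t>& in,
                           std::vector<std::uint8_t>& out,
                           std::size_t width, std::size_t height,
                           std::size_t rowsPerStrip = 16)
{
    assert(in.size() == width * height);
    assert(rowsPerStrip > 0);
    out.resize(in.size());

    // Scratch space for one strip only.
    std::vector<std::uint8_t> scratch(width * rowsPerStrip);

    for (std::size_t y0 = 0; y0 < height; y0 += rowsPerStrip) {
        const std::size_t rows = std::min(rowsPerStrip, height - y0);
        const std::size_t count = rows * width;
        const std::uint8_t* src = in.data() + y0 * width;
        std::uint8_t* dst = out.data() + y0 * width;

        // Pass 1: brighten by 40, saturating at 255.
        for (std::size_t i = 0; i < count; ++i)
            scratch[i] = static_cast<std::uint8_t>(std::min(255, src[i] + 40));

        // Pass 2: threshold at 128, reading the strip-sized scratch buffer.
        for (std::size_t i = 0; i < count; ++i)
            dst[i] = scratch[i] >= 128 ? 255 : 0;
    }
}
```

Whether this actually beats two full-image passes depends on the image size and the target's cache, so it is worth timing both on the real hardware.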


We Are One

Reply #3 · 2019-02-03 10:26

Can I ask which platform you're writing this for? I'm guessing that, because executable size is an issue, you're not targeting a desktop machine. In that case, does the platform have multiple cores or hyperthreading? If not, adding threads to your application could have the opposite effect and slow it down...
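One way to hedge against a single-core target is to ask the runtime how many hardware threads there are and fall back to a plain loop when there is only one. The sketch below assumes a C++11 compiler with std::thread. The helper forEachRow borrows its name from the question, but its signature and the 3-tap blurRows example are made up here. Each worker gets a contiguous band of rows rather than a single row. No locking is used, because the input is only read and every output pixel is written by exactly one thread.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <thread>
#include <vector>

// Calls rowFn(y) for every row in [0, height). Rows are split into
// contiguous bands, one band per worker thread.
template <typename RowFn>
void forEachRow(std::size_t height, RowFn rowFn)
{
    const unsigned hw = std::thread::hardware_concurrency(); // 0 if unknown
    const std::size_t workers =
        std::min<std::size_t>(hw == 0 ? 1 : hw, height);

    if (workers <= 1) {              // single core (or empty image): no threads
        for (std::size_t y = 0; y < height; ++y) rowFn(y);
        return;
    }

    const std::size_t band = (height + workers - 1) / workers;
    std::vector<std::thread> pool;
    pool.reserve(workers);
    for (std::size_t begin = 0; begin < height; begin += band) {
        const std::size_t end = std::min(begin + band, height);
        pool.emplace_back([begin, end, &rowFn] {
            for (std::size_t y = begin; y < end; ++y) rowFn(y);
        });
    }
    for (auto& t : pool) t.join();
}

// Hypothetical use: a horizontal 3-tap box blur that clamps at the edges.
void blurRows(const std::vector<std::uint8_t>& in,
              std::vector<std::uint8_t>& out,
              std::size_t width, std::size_t height)
{
    out.resize(in.size());
    forEachRow(height, [&](std::size_t y) {
        const std::uint8_t* src = in.data() + y * width;   // read-only
        std::uint8_t* dst = out.data() + y * width;        // this row only
        for (std::size_t x = 0; x < width; ++x) {
            const std::size_t l = (x == 0) ? 0 : x - 1;
            const std::size_t r = (x + 1 == width) ? x : x + 1;
            dst[x] = static_cast<std::uint8_t>((src[l] + src[x] + src[r]) / 3);
        }
    });
}
```

Working row by row also keeps the "previous pixel" optimizations mentioned in the question usable inside each row. Note that with GCC or Clang on older toolchains you may need to link with -pthread.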


The star"

Reply #4 · 2019-02-03 10:27

If your compiler doesn't support OpenMP, another option is to use a library approach. Both Intel's Threading Building Blocks and the Microsoft Concurrency Runtime are available (VS 2010).

There is also a set of interfaces called the Parallel Pattern Library, which both libraries support and which includes a templated parallel_for call. So instead of:

#pragma omp parallel for 
for (i=0; i < numPixels; i++) 
{ ...}

you would write:

parallel_for(0,numPixels,1,ToGrayScale());

where ToGrayScale is a functor or a pointer to a function. (Note: if your compiler supports lambda expressions, which it likely doesn't, you can inline the functor as a lambda expression.)

parallel_for(0,numPixels,1,[&](int i)
{
   pGrayScaleBitmap[i] = (unsigned char)
       (pRGBBitmap[i].red * 0.299 +
        pRGBBitmap[i].green * 0.587 +
        pRGBBitmap[i].blue * 0.114);
});

-Rick
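For comparison, here is the OpenMP form from above written out as a self-contained function. This is only a sketch: the RGB struct and the toGrayScale name are assumptions, not taken from any library. If the compiler has no OpenMP support, or it is not enabled (e.g. -fopenmp on GCC, /openmp on MSVC), the pragma is ignored and the loop simply runs on one thread.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical pixel layout; the real bitmap format may differ.
struct RGB {
    std::uint8_t red;
    std::uint8_t green;
    std::uint8_t blue;
};

// Converts each RGB pixel to an 8-bit luma value using the weights
// from the answer above. Each iteration touches only its own pixel,
// so the loop can be split across threads without locks.
void toGrayScale(const std::vector<RGB>& rgb, std::vector<std::uint8_t>& gray)
{
    gray.resize(rgb.size());
    const RGB* src = rgb.data();
    std::uint8_t* dst = gray.data();
    const long numPixels = static_cast<long>(rgb.size());

    // Signed loop index: older OpenMP versions require it.
    #pragma omp parallel for
    for (long i = 0; i < numPixels; ++i) {
        dst[i] = static_cast<std::uint8_t>(src[i].red * 0.299 +
                                           src[i].green * 0.587 +
                                           src[i].blue * 0.114);
    }
}
```

The OpenMP runtime does add to the executable or its dependencies, so it is worth checking the size impact on the actual target.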


混吃等死

Reply #5 · 2019-02-03 10:29

I would recommend boost::thread and boost::gil (the Generic Image Library). Because quite a lot of templates are involved, I'm not sure whether the code size will still be acceptable for you. But it's part of Boost, so it is probably worth a look.


我欲成王，谁敢阻挡

Reply #6 · 2019-02-03 10:29

To optimize simple image transformations, you are far better off using SIMD vector math than trying to multi-thread your program.
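As a rough illustration of what that can look like, here is a sketch that brightens 8-bit pixels 16 at a time with SSE2 intrinsics. It assumes an x86 target. The brighten function and the EXAMPLE_HAVE_SSE2 macro are names invented for this example, and on other targets the code falls back to the scalar loop.

```cpp
#include <cstddef>
#include <cstdint>

#if defined(__SSE2__) || defined(_M_X64)
#include <emmintrin.h>          // SSE2 intrinsics
#define EXAMPLE_HAVE_SSE2 1     // hypothetical macro, local to this sketch
#endif

// Adds `amount` to every 8-bit pixel, saturating at 255.
void brighten(const std::uint8_t* in, std::uint8_t* out,
              std::size_t count, std::uint8_t amount)
{
    std::size_t i = 0;

#ifdef EXAMPLE_HAVE_SSE2
    // Broadcast the amount into all 16 byte lanes.
    const __m128i add = _mm_set1_epi8(static_cast<char>(amount));
    for (; i + 16 <= count; i += 16) {
        // Unaligned load/store, so the buffers need no special alignment.
        const __m128i px =
            _mm_loadu_si128(reinterpret_cast<const __m128i*>(in + i));
        _mm_storeu_si128(reinterpret_cast<__m128i*>(out + i),
                         _mm_adds_epu8(px, add)); // unsigned saturating add
    }
#endif

    // Scalar tail (and the whole loop when SSE2 is unavailable).
    for (; i < count; ++i) {
        const unsigned sum = static_cast<unsigned>(in[i]) + amount;
        out[i] = static_cast<std::uint8_t>(sum > 255 ? 255 : sum);
    }
}
```

Intrinsics add very little to executable size compared with a threading library, which may matter given the size constraint in the question.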


趁早两清

Reply #7 · 2019-02-03 10:29

You could also use libraries like IPP or the Cassandra Vision C++ API, which are usually much better optimized than your own code.
