void NetClass::Modulate(vector<synapse>& synapses)
{
    int size  = synapses.size();
    int split = 200 * 0.5;          // = 100, centers the range around zero

    for (int w = 0; w < size; w++)
        if (synapses[w].active)
            // uniform value in roughly [-0.1, 0.1)
            synapses[w].rmod = ((rand_r(seedp) % 200 - split) / 1000.0);
}
The function rand_r(seedp) is seriously bottlenecking my program. Specifically, it's slowing me down by 3x when run serially, and by 4.4x when run on 16 cores. rand() is not an option because it's even worse. Is there anything I can do to streamline this? If it will make a difference, I think I can sustain a loss in statistical randomness. Would pre-generating (before execution) a list of random numbers and then loading them onto the thread stacks be an option?
It depends on how good the statistical randomness needs to be. For high quality, the Mersenne twister, or its SIMD variant, is a good choice. You can generate and buffer a large block of pseudo-random numbers at a time, and each thread can have its own state vector. The Park-Miller-Carta PRNG is extremely simple - these guys even implemented it as a CUDA kernel.
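For illustration, here is a minimal sketch of the per-thread buffering idea (not part of the original answer, and assuming a C++11 compiler with <random>): each thread owns its own Mersenne twister state and refills a local block of values at a time, so the generator's cost is amortized and nothing is shared between cores. The class name RandomBlock and the block size are arbitrary choices.

#include <cstddef>
#include <random>
#include <vector>

// Hypothetical helper: one instance per thread, so no state is shared.
class RandomBlock
{
public:
    explicit RandomBlock(unsigned seed, std::size_t blockSize = 4096)
        : engine(seed), dist(-0.1, 0.1), buffer(blockSize), pos(blockSize) {}

    // Returns the next buffered value, refilling the whole block when it runs out.
    double next()
    {
        if (pos == buffer.size()) {
            for (std::size_t i = 0; i < buffer.size(); ++i)
                buffer[i] = dist(engine);
            pos = 0;
        }
        return buffer[pos++];
    }

private:
    std::mt19937 engine;                          // Mersenne twister, one state vector per thread
    std::uniform_real_distribution<double> dist;  // roughly the range of (rand_r % 200 - 100) / 1000.0
    std::vector<double> buffer;                   // pre-generated block
    std::size_t pos;                              // next unread position
};

Each thread would construct its own RandomBlock with a distinct seed and call next() in the inner loop instead of rand_r.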
Do you absolutely need to have one shared random generator?
I had a similar contention problem a while ago; the solution that worked best for me was to create a new Random instance (I was working in C#) for each thread. They're dead cheap anyway.
If you seed them properly, so that you don't create duplicate seeds, you should be fine. Then you won't have any shared state, so you don't need the thread-safe function.
Regards GJ
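The same idea translated to C++ as a rough sketch (the original answer was C#, and this assumes a C++11 compiler with thread_local support): each thread lazily gets its own engine, seeded from the clock mixed with the thread id so that concurrent threads don't end up with duplicate seeds.

#include <chrono>
#include <random>
#include <thread>

// Sketch only: one engine per thread, seeded once on first use in that thread.
double NextRmod()
{
    thread_local std::mt19937 engine(
        static_cast<unsigned>(
            std::chrono::high_resolution_clock::now().time_since_epoch().count()) ^
        static_cast<unsigned>(std::hash<std::thread::id>()(std::this_thread::get_id())));
    thread_local std::uniform_real_distribution<double> dist(-0.1, 0.1);
    return dist(engine);   // roughly the range of the original (rand_r(seedp) % 200 - 100) / 1000.0
}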
Have a look at Boost: http://www.boost.org/doc/libs/1_47_0/doc/html/boost_random.html It has a number of options which vary in complexity (= speed) and randomness (cycle length). If you don't need maximum randomness, you might get away with a simple Mersenne Twister.
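A hedged sketch of what that could look like with the Boost.Random headers from the linked 1.47 documentation (one engine per thread; the seed value here is just a placeholder):

#include <boost/random/mersenne_twister.hpp>
#include <boost/random/uniform_real_distribution.hpp>

int main()
{
    boost::random::mt19937 engine(42u);   // seed each thread's engine differently
    boost::random::uniform_real_distribution<double> dist(-0.1, 0.1);

    double r = dist(engine);   // roughly the range of (rand_r(seedp) % 200 - 100) / 1000.0
    (void)r;                   // silence the unused-variable warning in this standalone sketch
    return 0;
}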
Maybe you don't have to call it in every iteration? You could initialize an array of pre-generated random values and make successive use of it...
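A small sketch of that approach (the names and table size are arbitrary), which trades statistical quality for speed, as the asker said would be acceptable: fill the table once before the run, then just cycle through it with a per-thread cursor.

#include <cstddef>
#include <cstdlib>
#include <vector>

static std::vector<double> table;   // filled once, then only ever read

void FillTable(unsigned int seed, std::size_t n)
{
    table.resize(n);
    for (std::size_t i = 0; i < n; ++i)
        table[i] = ((rand_r(&seed) % 200) - 100) / 1000.0;
}

// Each thread keeps its own cursor, so the hot loop touches no shared, mutable state.
inline double NextFromTable(std::size_t& cursor)
{
    double v = table[cursor];
    if (++cursor == table.size())
        cursor = 0;                 // wrap around and reuse the values
    return v;
}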
I think you can use OpenMP to parallelize this; a sketch is given below the explanation.
The problem is that the seedp variable (and its memory location) is shared among several threads. The processor cores must synchronize their caches each time they access this ever-changing value, which hampers performance. The solution is for every thread to work with its own seedp, which avoids the cache synchronization.
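A hedged sketch of the OpenMP version this answer describes, applied to the Modulate loop from the question (it assumes seedp is an accessible unsigned int* member of NetClass, as implied by the original call to rand_r(seedp)): every thread derives its own private seed once, so the inner loop touches no shared random state.

#include <cstdlib>
#include <vector>
#include <omp.h>

void NetClass::Modulate(std::vector<synapse>& synapses)
{
    const int size  = static_cast<int>(synapses.size());
    const int split = 100;   // 200 * 0.5

    #pragma omp parallel
    {
        // Private copy per thread: offsetting by the thread number keeps the seeds distinct.
        unsigned int seed = *seedp + static_cast<unsigned int>(omp_get_thread_num());

        #pragma omp for
        for (int w = 0; w < size; w++)
            if (synapses[w].active)
                synapses[w].rmod = ((rand_r(&seed) % 200 - split) / 1000.0);
    }
}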