Is there any way to optimize sincos calls in CUDA?

Posted 2019-02-28 09:16

I'm writing a program in CUDA that makes a huge number of calls to the sincos() function, using double precision. I'm afraid this is one of the biggest bottlenecks in the code, and I cannot reduce the number of calls to the function.

Is there any decent approximation to sincos in CUDA or in a library I can import? I am also quite concerned with the accuracy, so the better the approximation is, the happier my code will be.

I've also thought about building a lookup table or approximating the values with their Taylor series, but I'd like some opinions before going down that road.

Tags: cuda trigonometry sin cos

1 answer

霸刀☆藐视天下

#2 · 2019-02-28 09:53

A pretty fast and accurate sincos function is available in the CUDA math API. Just include math.h. Or use sincosf, the single-precision version, if that will work for you. (I'm aware that you said double precision in your question. I'm just pointing out the options.)
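To make the suggestion concrete, here is a minimal sketch of calling the built-in double-precision sincos() from a kernel and comparing the results against the host's sin() and cos(). The kernel name `sincos_kernel`, the helper `max_sincos_error`, the test angles, and the launch configuration are all illustrative assumptions, not anything taken from the question's code.

```cuda
#include <cstdio>
#include <cmath>
#include <vector>
#include <cuda_runtime.h>

// Each thread computes sin and cos of one angle with a single sincos() call.
__global__ void sincos_kernel(const double* angles, double* s, double* c, int n)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        sincos(angles[i], &s[i], &c[i]);
    }
}

// Hypothetical helper: runs the kernel on n test angles and returns the
// largest absolute deviation from the host's sin/cos, or -1.0 on a CUDA error.
double max_sincos_error(int n)
{
    std::vector<double> angles(n), s(n), c(n);
    for (int i = 0; i < n; ++i) angles[i] = 0.001 * i;  // arbitrary test inputs

    const size_t bytes = n * sizeof(double);
    double *d_angles = nullptr, *d_s = nullptr, *d_c = nullptr;
    bool ok = cudaMalloc(&d_angles, bytes) == cudaSuccess
           && cudaMalloc(&d_s, bytes) == cudaSuccess
           && cudaMalloc(&d_c, bytes) == cudaSuccess
           && cudaMemcpy(d_angles, angles.data(), bytes,
                         cudaMemcpyHostToDevice) == cudaSuccess;

    if (ok) {
        const int threads = 256;
        const int blocks = (n + threads - 1) / threads;
        sincos_kernel<<<blocks, threads>>>(d_angles, d_s, d_c, n);
        // The blocking device-to-host copies also wait for the kernel to finish.
        ok = cudaGetLastError() == cudaSuccess
          && cudaMemcpy(s.data(), d_s, bytes, cudaMemcpyDeviceToHost) == cudaSuccess
          && cudaMemcpy(c.data(), d_c, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;
    }
    cudaFree(d_angles);
    cudaFree(d_s);
    cudaFree(d_c);
    if (!ok) return -1.0;

    double max_err = 0.0;
    for (int i = 0; i < n; ++i) {
        max_err = std::fmax(max_err, std::fabs(s[i] - std::sin(angles[i])));
        max_err = std::fmax(max_err, std::fabs(c[i] - std::cos(angles[i])));
    }
    return max_err;
}

int main()
{
    const double err = max_sincos_error(1 << 20);
    if (err < 0.0) { std::printf("CUDA error\n"); return 1; }
    std::printf("max abs deviation from host sin/cos: %.3e\n", err);
    return 0;
}
```

Swapping `double` for `float` and `sincos` for `sincosf` gives the single-precision variant mentioned above, if the accuracy loss is acceptable for your use case.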

If you can use sincospif instead of sincosf, @njuffa has worked his magic here, which may interest you.
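As a rough illustration of when the sincospi family fits: if the angle is naturally a multiple of π (for example a phase of 2πk/n), you can pass the factor without π and let the function handle it. The sketch below uses sincospi, the double-precision counterpart of sincospif in the CUDA math API, since the question asks for double precision. The kernel name `twiddle_kernel`, the helper `compute_twiddles`, and the choice of n are assumptions made for this example only.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Fills re[k] = cos(2*pi*k/n) and im[k] = sin(2*pi*k/n).
// sincospi(x, &s, &c) evaluates sin(pi*x) and cos(pi*x), so pi is never
// multiplied into the argument explicitly.
__global__ void twiddle_kernel(double* re, double* im, int n)
{
    int k = blockIdx.x * blockDim.x + threadIdx.x;
    if (k < n) {
        double s, c;
        sincospi(2.0 * k / n, &s, &c);
        re[k] = c;
        im[k] = s;
    }
}

// Hypothetical helper: computes n values on the device and copies them back.
// Returns false if any CUDA call fails.
bool compute_twiddles(int n, std::vector<double>& re, std::vector<double>& im)
{
    re.assign(n, 0.0);
    im.assign(n, 0.0);
    const size_t bytes = n * sizeof(double);
    double *d_re = nullptr, *d_im = nullptr;
    bool ok = cudaMalloc(&d_re, bytes) == cudaSuccess
           && cudaMalloc(&d_im, bytes) == cudaSuccess;
    if (ok) {
        const int threads = 128;
        const int blocks = (n + threads - 1) / threads;
        twiddle_kernel<<<blocks, threads>>>(d_re, d_im, n);
        ok = cudaGetLastError() == cudaSuccess
          && cudaMemcpy(re.data(), d_re, bytes, cudaMemcpyDeviceToHost) == cudaSuccess
          && cudaMemcpy(im.data(), d_im, bytes, cudaMemcpyDeviceToHost) == cudaSuccess;
    }
    cudaFree(d_re);
    cudaFree(d_im);
    return ok;
}

int main()
{
    std::vector<double> re, im;
    if (!compute_twiddles(8, re, im)) { std::printf("CUDA error\n"); return 1; }
    for (int k = 0; k < 8; ++k)
        std::printf("k=%d  cos=% .17g  sin=% .17g\n", k, re[k], im[k]);
    return 0;
}
```

Whether this is actually faster than sincos() in your kernel depends on the GPU and the argument range, so it is worth profiling both.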

This question and this question may also interest you.
