Is just-in-time (jit) compilation of a CUDA kernel

2020-07-14 05:43发布

Does CUDA support JIT compilation of a CUDA kernel?

I know that OpenCL offers this feature.

I have some variables which are not changed during runtime (i.e. only depend on the input file), therefore I would like to define these values with a macro at kernel compile time (i.e at runtime).

If I define these values manually at compile time my register usage drops from 53 to 46, what greatly improves performance.

标签： cuda jit

2条回答

Fickle 薄情

2楼-- · 2020-07-14 06:27

It became available with nvrtc library of cuda 7.0. By this library you can compile your cuda codes during runtime.

http://devblogs.nvidia.com/parallelforall/cuda-7-release-candidate-feature-overview/

Bu what kind of advantages you can gain? In my view, i couldn't find so much dramatic advantages of dynamic compilation.

0人赞添加讨论(0) 举报

Fickle 薄情

3楼-- · 2020-07-14 06:38

If it is feasible for you to use Python, you can use the excellent pycuda module to compile your kernels at runtime. Combined with a templating engine such as Mako, you will have a very powerful meta-programming environment that will allow you to dynamically tune your kernels for whatever architecture and specific device properties happen to be available to you (obviously some things will be difficult to make fully dynamic and automatic).

You could also consider just maintaining a few distinct versions of your kernel with different parameters, between which your program could choose at runtime based on whatever input you are feeding to it.

0人赞添加讨论(0) 举报

Is just-in-time (jit) compilation of a CUDA kernel

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间