CUDA function call-able by either the device or ho

2019-01-25 11:59发布

I have a re-useable function in some CUDA code that needs to be called from both the device and the host. Is there an appropriate qualifier for this?

e.g. what's the correct definition for func1 in this case:

int func1 (int a, int b) {
    return a+b;
}

__global__ devicecode (float *A) {
    int i = blockDim.x * blockIdx.x + threadIdx.x;
    A[i] = func1(i,i);
}

void main() {
    // Normal cuda memory set-up

    // Call func1 from inside main:
    int j = func1(2,4)

    // Normal cuda memory copy / program run / retrieve data
}

So far I can only get this to work by having the function twice: once explicitly for the device and once for the host. Is there a better way?

标签： c++ function scope cuda

1条回答

孤傲高冷的网名

2楼-- · 2019-01-25 12:43

From the CUDA Programming Guide:

The __device__ and __host__ qualifiers can be used together however, in which case the function is compiled for both the host and the device.

0人赞添加讨论(0) 举报

CUDA function call-able by either the device or ho

采纳回答

编辑标签

举报内容

检举类型

检举原因

检举说明(必填)

打开微信“扫一扫”，打开网页后点击屏幕右上角分享按钮

付费偷看金额在0.1-10元之间