How do I feed a 2-dimensional array into a kernel

Posted 2019-04-03 00:27

I have created a numpy array of float32s with shape (64, 128), and I want to send it to the GPU. How do I do that? What arguments should my kernel function accept? float** myArray?

I have tried directly sending the array as it is to the GPU, but pycuda complains that objects are being accessed...

Tags: pycuda

1 answer

Animai°情兽

#2 · 2019-04-03 00:59

Two-dimensional arrays in numpy/PyCUDA are stored in pitched linear memory in row-major order by default, so the kernel only needs a flat pointer and the row pitch in elements (lda). Something like this:

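__global__
void kernel(float* a, int lda, ...)
{
    // Row index from the y dimension, column index from the x dimension,
    // so neighbouring threads in x read neighbouring elements.
    int row = threadIdx.y + blockDim.y * blockIdx.y;
    int col = threadIdx.x + blockDim.x * blockIdx.x;

    // Row-major layout: lda is the row pitch in elements
    // (the number of columns for a contiguous array).
    float val = a[row * lda + col];

    ....
}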

to access a numpy ndarray (wrapped in pycuda.driver.In, Out, or InOut) or a PyCUDA gpuarray passed to the kernel from Python. The kernel argument is a flat float*, not a float**.
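A minimal host-side sketch of how this could be launched from PyCUDA follows. The kernel name `scale2d`, the `rows`, `cols`, and `factor` parameters, and the helpers `launch_config` and `scale_on_gpu` are all assumptions made for illustration; they do not come from the answer above, which leaves the remaining kernel arguments and body unspecified. The 16x16 block size is likewise just a common choice.

```python
import numpy as np

# Kernel source compiled at runtime by PyCUDA. The name scale2d and the
# rows/cols/factor parameters are illustrative, not from the original answer.
KERNEL_SRC = r"""
__global__ void scale2d(float *a, int lda, int rows, int cols, float factor)
{
    int row = threadIdx.y + blockDim.y * blockIdx.y;
    int col = threadIdx.x + blockDim.x * blockIdx.x;
    if (row < rows && col < cols)        // guard threads outside the array
        a[row * lda + col] *= factor;    // row-major flat index
}
"""


def launch_config(rows, cols, block=(16, 16, 1)):
    """Return (block, grid) covering a rows x cols array (x = columns, y = rows)."""
    grid = ((cols + block[0] - 1) // block[0],
            (rows + block[1] - 1) // block[1],
            1)
    return block, grid


def scale_on_gpu(a, factor=2.0):
    """Multiply a 2D array by factor on the GPU and return a float32 copy."""
    # Imported here so the helpers above can be used without a GPU present.
    import pycuda.autoinit  # noqa: F401  (creates a CUDA context on import)
    import pycuda.driver as drv
    from pycuda.compiler import SourceModule

    # Copy into a C-contiguous float32 array so the caller's data is untouched.
    a = np.array(a, dtype=np.float32, order="C")
    rows, cols = a.shape
    lda = a.strides[0] // a.itemsize     # row stride in elements
    block, grid = launch_config(rows, cols)
    func = SourceModule(KERNEL_SRC).get_function("scale2d")
    # drv.InOut copies the array to the device and back after the launch.
    func(drv.InOut(a), np.int32(lda), np.int32(rows), np.int32(cols),
         np.float32(factor), block=block, grid=grid)
    return a
```

For the (64, 128) array in the question, `launch_config(64, 128)` gives 16x16 blocks on an (8, 4, 1) grid, and `lda` works out to 128. Scalar arguments are passed as numpy scalars (`np.int32`, `np.float32`) so that PyCUDA knows their size.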
