Thanks Ian and Andreas,
About the algorithm: memory isn't a huge concern, so if I'm doing this
operation on an array of length N I don't mind having a permanently
allocated extra array of length N for storing the indices, even though
I'll probably only ever use the first 20 elements of it. At the moment,
this is what I'm doing in my C code to save going through the array twice.
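In (pretend) Python, the single-pass version I have in mind looks roughly
like this (the function name and the preallocated J are just for
illustration, not anything from my actual code):

```python
def threshold_indices(x, x0, J):
    """Single pass over x: store the index of every element above x0
    in the preallocated array J, and return how many were stored.
    Only J[:count] is meaningful afterwards; the rest of J is scratch."""
    count = 0
    for i, xi in enumerate(x):
        if xi > x0:
            J[count] = i
            count += 1
    return count
```

So J is allocated once at full length, but after each call only the
first `count` entries matter.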
About the GPU code, I think what you're saying is that I should have an
array x, say, a global-memory array J, and a global index j into J, and
then do something like:
__global__ void threshold(double *x, double x0)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (x[i] > x0) {
        atomicInc(&j, N);  /* j, J and N global, as above */
        J[j] = i;
    }
}
(Bear with me if I'm way off; as I said, I've only just started
programming with CUDA.)
Isn't there a danger that at the end of the atomicInc instruction,
before the J[j]=i instruction, another thread could do a second
atomicInc, and so one of the elements of J would be skipped? It's true
that this would be a rare event, but it's almost certain to happen
eventually. Ah, although maybe the idea is to have global_j be the
global index, and then do:

    unsigned int j = atomicInc(&global_j, N);
    J[j] = i;

I guess this would work even in that case, since atomicInc returns the
old value of global_j, so each thread reserves its own slot in J before
writing to it?
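To convince myself, here's a little CPU simulation of that
reserve-then-write pattern in Python, with a lock standing in for the
hardware atomic (everything here is made up for illustration):

```python
import threading

class AtomicCounter:
    """CPU stand-in for CUDA's atomicInc: increments and returns the
    OLD value, like `unsigned int j = atomicInc(&global_j, N);`."""
    def __init__(self):
        self._value = 0
        self._lock = threading.Lock()

    def fetch_inc(self):
        with self._lock:
            old = self._value
            self._value += 1
            return old

def demo(n_threads=8, per_thread=100):
    counter = AtomicCounter()
    J = [None] * (n_threads * per_thread)

    def worker(tid):
        for k in range(per_thread):
            i = tid * per_thread + k   # this thread's "element index"
            j = counter.fetch_inc()    # reserve a unique slot first...
            J[j] = i                   # ...then write into it

    threads = [threading.Thread(target=worker, args=(t,))
               for t in range(n_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return J
```

Because each thread writes to the slot whose old value it received, no
entry of J is ever skipped or overwritten, however the threads interleave.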
One last technical question: I think I see how the
pycuda.driver.mem_alloc function works, but how do I refer to this
memory in the CUDA code? (I don't think there's an example that
demonstrates this in the pycuda release.) The NVIDIA CUDA documentation
talks about having to manage the global memory by offsets, so I would
guess you do something like this (based on the NVIDIA docs):
extern __device__ int J0[];

__global__ void threshold(double *x, double x0)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    int *J = (int*)J0;
    unsigned int *global_j = (unsigned int*)&J[N];  /* N assumed known */
    if (x[i] > x0) {
        unsigned int j = atomicInc(global_j, N);
        J[j] = i;
    }
}
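Or, alternatively, maybe the mem_alloc'd memory can just be passed
straight through as an ordinary kernel argument, sidestepping the offset
bookkeeping? Pure guesswork on my part, but something like this (the
kernel string and all the names are mine, and I haven't checked the
pycuda API details):

```python
import numpy as np

# CUDA source as a string, to be compiled at runtime by pycuda's
# SourceModule. Here J and the counter are ordinary kernel arguments
# rather than an extern array managed by offsets.
KERNEL_SOURCE = """
__global__ void threshold(double *x, double x0, int *J,
                          unsigned int *global_j)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (x[i] > x0) {
        unsigned int j = atomicInc(global_j, 0xffffffffu);
        J[j] = i;
    }
}
"""

def run_threshold(x, x0):
    """Launch the kernel above; needs a CUDA-capable GPU and pycuda
    (double support also needs the right -arch flag, I believe)."""
    import pycuda.autoinit            # creates a context on the first GPU
    import pycuda.driver as drv
    from pycuda.compiler import SourceModule

    n = len(x)
    mod = SourceModule(KERNEL_SOURCE)
    threshold = mod.get_function("threshold")

    # mem_alloc returns an object that (I think) can be passed directly
    # as a kernel argument, standing in for the device pointer.
    J_gpu = drv.mem_alloc(n * np.dtype(np.int32).itemsize)
    j_gpu = drv.mem_alloc(np.dtype(np.uint32).itemsize)
    drv.memcpy_htod(j_gpu, np.zeros(1, dtype=np.uint32))

    threshold(drv.In(np.asarray(x, dtype=np.float64)), np.float64(x0),
              J_gpu, j_gpu, block=(n, 1, 1), grid=(1, 1))

    count = np.empty(1, dtype=np.uint32)
    drv.memcpy_dtoh(count, j_gpu)
    J = np.empty(n, dtype=np.int32)
    drv.memcpy_dtoh(J, J_gpu)
    return J[:count[0]]
```

If that's how it works, there would be no need for the extern-array
trick at all. Someone please correct me if this is nonsense.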
Is that right? I'll go and have a play around with this now, but I
figure it probably won't work so I'm getting my question in early. ;-)
Dan
p.s. apologies if this posts twice, I sent it from the wrong email
address before but maybe it will go through anyway.
_______________________________________________
PyCuda mailing list
[email protected]
http://tiker.net/mailman/listinfo/pycuda_tiker.net