Sorry for the noise. I casted correctly my param, but then I made a
division. So the type changed. For those that looked in the code this
is the correct line should be in it:

    args += [numpy.intc(i / dtype.itemsize) for i in gpu_val.strides]

It was:

    args += [numpy.intc(i) / dtype.itemsize for i in gpu_val.strides]

Fred

2011/11/18 Frédéric Bastien <[email protected]>:
> Hi,
>
> I have a small example that when run, crash with this error:
>
> [...]
>
>    fct(*args, **d)
>  File 
> "/u/bastienf/repos/pycuda.git/build.fc9/lib.linux-x86_64-2.5/pycuda/driver.py",
> line 187, in function_call
>    func.launch_grid(*grid)
> LaunchError: cuLaunchGrid failed: launch out of resources
>
> ----------------------------------------------------------------------
>
>
> The problem is that I use only 1 thread per block and only 1 block.
> The code in the gpu function is very simple: "Z[0] = 0;". I think I
> pass correctly the parameters when calling the gpu function. Do
> someone have any idea what could be wrong? I join the example of the
> crash.
>
> thanks
>
> Frédéric Bastien
>

_______________________________________________
PyCUDA mailing list
[email protected]
http://lists.tiker.net/listinfo/pycuda

Reply via email to