I do not have a solution for this problem, or even a good guess at what the
problem is, so I ask you to wait for the next release. If the problem remains,
I will take a look at cuda-gdb and see whether it is usable, or write a kernel
that generates more debugging information.

cuda-gdb should be able to print information about the error, so if you want to
invest time, you can try it out. In the case of the failed
cudaThreadSynchronize, it should at least be able to print the source file and
line number of the instruction that was responsible for the error. The error in
the memcpy and the no_device_found are a different story, as no code is
executed on the GPU in those cases.

Maybe your drivers are too old. Also, if you are on a 32-bit system, you have
to compile with -malign-double, which is enabled by default in
Makefile.local.dist.

Could you post your NVIDIA driver version, CPU architecture and Linux version?
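
In case it helps, here is a minimal sketch for collecting those details in one
go. It assumes a Linux system where the NVIDIA kernel module exposes its
version under /proc/driver/nvidia/version and creates one /dev/nvidiaN node per
card. The function name report_system_info is only illustrative and is not part
of the a51table tooling.

```shell
#!/bin/sh
# Print CPU architecture, kernel version, NVIDIA driver version and the
# GPU device nodes. Missing pieces are reported instead of aborting.

report_system_info() {
    echo "cpu arch: $(uname -m)"
    echo "kernel:   $(uname -sr)"

    # The NVIDIA kernel module publishes its version here when it is loaded
    # (assumption: standard proprietary driver on Linux).
    if [ -r /proc/driver/nvidia/version ]; then
        echo "driver:   $(head -n 1 /proc/driver/nvidia/version)"
    else
        echo "driver:   /proc/driver/nvidia/version not readable (module not loaded?)"
    fi

    # One node per GPU is expected: /dev/nvidia0, /dev/nvidia1, ...
    # If the glob matches nothing it stays literal and the -e test fails.
    for node in /dev/nvidia[0-9]*; do
        if [ -e "$node" ]; then
            echo "device:   $node"
        else
            echo "device:   no /dev/nvidia* nodes found"
        fi
    done
}

report_system_info
```

With two cards you would expect two "device:" lines. A missing node would fit
the "could not open the device file /dev/nvidia0" message quoted below.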

> Trying again gave me a similar error:
> $ ./a51table --condition rounds:rounds=32 --roundfunc
> xor:condition=distinguished_point::bits=15:generator=lfsr::tablesize=32::advance=139584
> --implementation sharedmem --algorithm A51 --device
> cuda:operations=512 --work random:prefix=11,0 --consume
> file:prefix=data:append --logger normal generate --chains 380000000
> --chainlength 3000000 --intermediate filter:runlength=512
> Initialize implementation sharedmem...
> 106 chains done, current rate 1.77 chains/sec (interval: 00:01:00)
> 6633 chains done, current rate 108.78 chains/sec (interval: 00:01:00)
> 10350 chains done, current rate 61.95 chains/sec (interval: 00:01:00)
> 14632 chains done, current rate 71.37 chains/sec (interval: 00:01:00)
> 19810 chains done, current rate 86.30 chains/sec (interval: 00:01:00)
> ../tmto/device/cuda/working_set_methods.hpp(38)[void
> tmto::device::cuda::working_set::simple_host<T,
> Round>::copyToDevice(int) [with T =
> tmto::device::combined_work_item<tmto::algorithm::A51::data_type,
> tmto::configuration::state::state<void, void,
> tmto::condition::tag::rounds,
> tmto::round_function::arguments::selector<tmto::round_function::tag::xor_,
> tmto::condition::tag::distinguished_point,
> tmto::round_function::generator::tag::sharedmem<tmto::round_function::gen
> 
> Trying one more time I got
> $ ./a51table --condition rounds:rounds=32 --roundfunc
> xor:condition=distinguished_point::bits=15:generator=lfsr::tablesize=32::advance=139584
> --implementation sharedmem --algorithm A51 --device
> cuda:operations=512 --work random:prefix=11,0 --consume
> file:prefix=data:append --logger normal generate --chains 380000000
> --chainlength 3000000 --intermediate filter:runlength=512
> NVIDIA: could not open the device file /dev/nvidia0 (Input/output error).
> Initialize implementation sharedmem...
> ../tmto/round_function/generator/sharedmem_methods.hpp(12)[void
> tmto::round_function::generator::host_part<tmto::round_function::generator::tag::sharedmem<Real>
> >::copyToDevice() const [with Real =
> tmto::round_function::generator::tag::lfsr]]: cuda error: no
> CUDA-capable device is available
> 
> I'm running on two GeForce GTX 260s.
> 
> Regards Kugg
> 
> On 10/4/09, Christoffer Jerkeby <[email protected]> wrote:
> > Hi, I got the same error. I was using the configuration generated from
> > http://reflextor.com/cgi-bin/a51/a51id.cgi .
> >
> > $ ./a51table --condition rounds:rounds=32 --roundfunc
> > xor:condition=distinguished_point::bits=15:generator=lfsr::tablesize=32::advance=139584
> > --implementation sharedmem --algorithm A51 --device
> > cuda:operations=512 --work random:prefix=11,0 --consume
> > file:prefix=data:append --logger normal generate --chains 380000000
> > --chainlength 3000000 --intermediate filter:runlength=512
> >
> > Initialize implementation sharedmem...
> > 148 chains done, current rate 2.47 chains/sec (interval: 00:01:00)
> > 6639 chains done, current rate 108.18 chains/sec (interval: 00:01:00)
> > 10356 chains done, current rate 61.95 chains/sec (interval: 00:01:00)
> > 14655 chains done, current rate 71.65 chains/sec (interval: 00:01:00)
> > 19769 chains done, current rate 85.23 chains/sec (interval: 00:01:00)
> > 24015 chains done, current rate 70.77 chains/sec (interval: 00:01:00)
> > 28610 chains done, current rate 76.58 chains/sec (interval: 00:01:00)
> > ../tmto/device/cuda/host_side_methods.hpp(76)[void
> > tmto::device::cuda::cudaSynchronize()]: cuda error: unspecified launch
> > failure
> >
> > Regards Kugg
> >
> > On 10/2/09, Sascha Krissler <[email protected]> wrote:
> >> Gotta love those specific CUDA error codes.
> >> Does it happen more than just once?
> >> Did you use any form of signaling through the FIFO, like changing the
> >> number of operations?
> >> If it happens more frequently, does it always happen on the same card,
> >> and at which positions (chains done)?
> >>
> >>> Hi,
> >>>
> >>> After some time (around 2 hours) I get this error:
> >>>
> >>> 1334412 chains done, current rate 141.42 chains/sec (interval: 00:01:00)
> >>> ../tmto/device/cuda/host_side_methods.hpp(76)[void
> >>> tmto::device::cuda::cudaSynchronize()]: cuda error: unspecified launch
> >>> failure
> >>>
> >>> This happens in only one process; the other processes on this machine
> >>> are still running.
> >>>
> >>> _______________________________________________
> >>> A51 mailing list
> >>> [email protected]
> >>> http://lists.lists.reflextor.com/cgi-bin/mailman/listinfo/a51
> >>>
> >>
> >>
> >>
> >>
> >
> 



