Re: [A51] Error

Christoffer Jerkeby Tue, 20 Oct 2009 11:36:20 -0700

Hey sorry for my late reply, I suspect that this problem persist on
other cuda applicaitons.
How ever here is the information:
1. Linux version
Linux kakmonstret 2.6.28-11-server #42-Ubuntu SMP Fri Apr 17 02:45:36
UTC 2009 x86_64 GNU/Linux


2. CPU info
vendor_id       : AuthenticAMD
model name      : AMD Athlon(tm) II X2 240 Processor
cpu MHz         : 2809.543

3. GPU and driver info
Device 0: "GeForce GTX 260"
  CUDA Driver Version:                           2.30
  CUDA Runtime Version:                          2.30
  CUDA Capability Major revision number:         1
  CUDA Capability Minor revision number:         3
  Total amount of global memory:                 938803200 bytes
  Clock rate:                                    1.46 GHz

Device 1: "GeForce GTX 260"
  CUDA Driver Version:                           2.30
  CUDA Runtime Version:                          2.30
  CUDA Capability Major revision number:         1
  CUDA Capability Minor revision number:         3
  Total amount of global memory:                 939261952 bytes
  Clock rate:                                    1.46 GHz

Ill try with the newer versions from: http://www.nvidia.com/object/cuda_get.html
and will report back if it works.

Regards Kugg

On Mon, Oct 5, 2009 at 6:20 PM, Sascha Krissler <[email protected]> wrote:
> i do not have a solution for this problem or a good guess what the problem
> is, so i ask you to wait for the next release and if the problem remains i 
> will
> take a look at cuda-gdb and see whether it is usable or write a kernel that 
> generates
> more debugging information.
> cuda-gdb should be able to print information about the error, so if you want 
> to invest
> time, you can try it out. it should be able to at least print the source file 
> line number
> of the instruction that was responsible for the error in the case of the 
> failed cudaThreadSynchronize,
> the error in the memcpy and the no_device_found are a different story as no 
> code is
> executed on the GPU in that case.
> maybe your drivers are too old. also if you are on a 32bit system you have to 
> compile
> with -malign-double as enabled by default in the Makefile.local.dist.
> Maybe you can post nvidia driver version, cpu arch and linux version.
>
>> Trying again gave me a similar error:
>> $ ./a51table --condition rounds:rounds=32 --roundfunc
>> xor:condition=distinguished_point::bits=15:generator=lfsr::tablesize=32::advance=139584
>> --implementation sharedmem --algorithm A51 --device
>> cuda:operations=512 --work random:prefix=11,0 --consume
>> file:prefix=data:append --logger normal generate --chains 380000000
>> --chainlength 3000000 --intermediate filter:runlength=512
>> Initialize implementation sharedmem...
>> 106 chains done, current rate 1.77 chains/sec (interval: 00:01:00)
>> 6633 chains done, current rate 108.78 chains/sec (interval: 00:01:00)
>> 10350 chains done, current rate 61.95 chains/sec (interval: 00:01:00)
>> 14632 chains done, current rate 71.37 chains/sec (interval: 00:01:00)
>> 19810 chains done, current rate 86.30 chains/sec (interval: 00:01:00)
>> ../tmto/device/cuda/working_set_methods.hpp(38)[void
>> tmto::device::cuda::working_set::simple_host<T,
>> Round>::copyToDevice(int) [with T =
>> tmto::device::combined_work_item<tmto::algorithm::A51::data_type,
>> tmto::configuration::state::state<void, void,
>> tmto::condition::tag::rounds,
>> tmto::round_function::arguments::selector<tmto::round_function::tag::xor_,
>> tmto::condition::tag::distinguished_point,
>> tmto::round_function::generator::tag::sharedmem<tmto::round_function::gen
>>
>> Trying one more time I got
>> $ ./a51table --condition rounds:rounds=32 --roundfunc
>> xor:condition=distinguished_point::bits=15:generator=lfsr::tablesize=32::advance=139584
>> --implementation sharedmem --algorithm A51 --device
>> cuda:operations=512 --work random:prefix=11,0 --consume
>> file:prefix=data:append --logger normal generate --chains 380000000
>> --chainlength 3000000 --intermediate filter:runlength=512
>> NVIDIA: could not open the device file /dev/nvidia0 (Input/output error).
>> Initialize implementation sharedmem...
>> ../tmto/round_function/generator/sharedmem_methods.hpp(12)[void
>> tmto::round_function::generator::host_part<tmto::round_function::generator::tag::sharedmem<Real>
>> >::copyToDevice() const [with Real =
>> tmto::round_function::generator::tag::lfsr]]: cuda error: no
>> CUDA-capable device is available
>>
>> Im running on two GeForce GTX 260's
>>
>> Regards Kugg
>>
>> On 10/4/09, Christoffer Jerkeby <[email protected]> wrote:
>> > Hi I got the same error, I was using the configuration generated from
>> > http://reflextor.com/cgi-bin/a51/a51id.cgi .
>> >
>> > $ ./a51table --condition rounds:rounds=32 --roundfunc
>> > xor:condition=distinguished_point::bits=15:generator=lfsr::tablesize=32::advance=139584
>> > --implementation sharedmem --algorithm A51 --device
>> > cuda:operations=512 --work random:prefix=11,0 --consume
>> > file:prefix=data:append --logger normal generate --chains 380000000
>> > --chainlength 3000000 --intermediate filter:runlength=512
>> >
>> > Initialize implementation sharedmem...
>> > 148 chains done, current rate 2.47 chains/sec (interval: 00:01:00)
>> > 6639 chains done, current rate 108.18 chains/sec (interval: 00:01:00)
>> > 10356 chains done, current rate 61.95 chains/sec (interval: 00:01:00)
>> > 14655 chains done, current rate 71.65 chains/sec (interval: 00:01:00)
>> > 19769 chains done, current rate 85.23 chains/sec (interval: 00:01:00)
>> > 24015 chains done, current rate 70.77 chains/sec (interval: 00:01:00)
>> > 28610 chains done, current rate 76.58 chains/sec (interval: 00:01:00)
>> > ../tmto/device/cuda/host_side_methods.hpp(76)[void
>> > tmto::device::cuda::cudaSynchronize()]: cuda error: unspecified launch
>> > failure
>> >
>> > Regards Kugg
>> >
>> > On 10/2/09, Sascha Krissler <[email protected]> wrote:
>> >> gotta love those specific cuda error codes.
>> >> does it happen more than just once?
>> >> did you use any form of signaling through the fifo, like change number of
>> >> operations?
>> >> (if it happens more frequently) does it always happen on the same card?
>> >> at which positions? (chains done).
>> >>
>> >>> Hi,
>> >>>
>> >>> after some time (around 2 hours) i get this error:
>> >>>
>> >>> 1334412 chains done, current rate 141.42 chains/sec (interval: 00:01:00)
>> >>> ../tmto/device/cuda/host_side_methods.hpp(76)[void
>> >>> tmto::device::cuda::cudaSynchronize()]: cuda error: unspecified launch
>> >>> failure
>> >>>
>> >>> this happens only on 1 process, other processes on this machine are
>> >>> still running..
>> >>>
>> >>> _______________________________________________
>> >>> A51 mailing list
>> >>> [email protected]
>> >>> http://lists.lists.reflextor.com/cgi-bin/mailman/listinfo/a51
>> >>>
>> >>
>> >>
>> >> ______________________________________________________
>> >> GRATIS für alle WEB.DE-Nutzer: Die maxdome Movie-FLAT!
>> >> Jetzt freischalten unter http://movieflat.web.de
>> >>
>> >> _______________________________________________
>> >> A51 mailing list
>> >> [email protected]
>> >> http://lists.lists.reflextor.com/cgi-bin/mailman/listinfo/a51
>> >>
>> >
>>
>
>
> ______________________________________________________
> GRATIS für alle WEB.DE-Nutzer: Die maxdome Movie-FLAT!
> Jetzt freischalten unter http://movieflat.web.de
>
> _______________________________________________
> A51 mailing list
> [email protected]
> http://lists.lists.reflextor.com/cgi-bin/mailman/listinfo/a51
>
_______________________________________________
A51 mailing list
[email protected]
http://lists.lists.reflextor.com/cgi-bin/mailman/listinfo/a51

Re: [A51] Error

Reply via email to