Hi guys,

I've followed the steps to install the NVIDIA driver, the development kit
and the examples. Everything seems to be working fine.

NVIDIA's deviceQuery program reports:

./deviceQuery Starting...

  CUDA Device Query (Runtime API) version (CUDART static linking)

There is 1 device supporting CUDA

Device 0: "GeForce 8300 GS"
   CUDA Driver Version:                           3.20
   CUDA Runtime Version:                          3.20
   CUDA Capability Major/Minor version number:    1.1
   Total amount of global memory:                 133496832 bytes
   Multiprocessors x Cores/MP = Cores:            1 (MP) x 8 (Cores/MP) = 8 (Cores)
   Total amount of constant memory:               65536 bytes
   Total amount of shared memory per block:       16384 bytes
   Total number of registers available per block: 8192
   Warp size:                                     32
   Maximum number of threads per block:           512
   Maximum sizes of each dimension of a block:    512 x 512 x 64
   Maximum sizes of each dimension of a grid:     65535 x 65535 x 1
   Maximum memory pitch:                          2147483647 bytes
   Texture alignment:                             256 bytes
   Clock rate:                                    0.92 GHz
   Concurrent copy and execution:                 No
   Run time limit on kernels:                     Yes
   Integrated:                                    No
   Support host page-locked memory mapping:       Yes
   Compute mode:                                  Default (multiple host threads can use this device simultaneously)
   Concurrent kernel execution:                   No
   Device has ECC support enabled:                No
   Device is using TCC driver mode:               No

deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 3.20, CUDA Runtime Version = 3.20, NumDevs = 1, Device = GeForce 8300 GS


OK. Next step is CUDATopology. Let's see if it recognizes the board.


I followed the instructions for compiling it and running it standalone.
In a nutshell:

$ x10c++ -O -NO_CHECKS -STATIC_CALLS CUDATopology.x10 -o CUDATopology
$ X10RT_ACCELS=ALL ../../bin/runx10 CUDATopology


... and it says:


Dumping places at place: (Place 0)
Place: (Place 0)
   Parent: (Place 0)
   NumChildren: 0
   Is a Host place



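Only the host place is listed, with NumChildren: 0, so the GPU is not
being picked up as a child place. What am I doing wrong?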


I've also tried to rebuild the distribution files using the recipe below:


$ cd x10.dist
$ ant dist -DX10RT_CUDA=true -DNO_CHECKS=true -Doptimize=true


... and I got this:


build-x10rt:
      [exec] g++ -g -DNDEBUG -DNO_PLACE_CHECKS -O2 -finline-functions -ansi -pedantic -Wall -Wextra -Wno-long-long -Wno-unused-parameter -pthread -Iinclude -Icommon -I"/opt/JavaIDE/jdk1.6.0_21-linux-x86_64"/include -I"/opt/JavaIDE/jdk1.6.0_21-linux-x86_64"/include/linux -DENABLE_CUDA -isystem/usr/local/cuda/include -shared -fPIC -c common/x10rt_cuda.cc -o common/x10rt_cuda.o
      [exec] common/x10rt_cuda.cc: In function ‘void<unnamed>::cu_safe(CUresult, const char*, int)’:
      [exec] common/x10rt_cuda.cc:112: error: ‘CUDA_ERROR_POINTER_IS_64BIT’ was not declared in this scope
      [exec] common/x10rt_cuda.cc:114: error: ‘CUDA_ERROR_SIZE_IS_64BIT’ was not declared in this scope
      [exec] common/x10rt_cuda.cc:42: warning: enumeration value ‘CUDA_ERROR_UNSUPPORTED_LIMIT’ not handled in switch
      [exec] common/x10rt_cuda.cc:42: warning: enumeration value ‘CUDA_ERROR_SHARED_OBJECT_SYMBOL_NOT_FOUND’ not handled in switch
      [exec] common/x10rt_cuda.cc:42: warning: enumeration value ‘CUDA_ERROR_SHARED_OBJECT_INIT_FAILED’ not handled in switch
      [exec] common/x10rt_cuda.cc:42: warning: enumeration value ‘CUDA_ERROR_OPERATING_SYSTEM’ not handled in switch
      [exec] common/x10rt_cuda.cc: In function ‘void x10rt_cuda_register_msg_receiver(x10rt_cuda_ctx*, x10rt_msg_type, void (*)(const x10rt_msg_params*, size_t*, size_t*, size_t*, size_t*, char**, size_t*, char**), void (*)(const x10rt_msg_params*, size_t, size_t, size_t, size_t, char*, size_t, char*), const char*, const char*)’:
      [exec] common/x10rt_cuda.cc:375: error: cannot convert ‘unsigned int*’ to ‘size_t*’ for argument ‘2’ to ‘CUresult cuModuleGetGlobal_v2(CUdeviceptr*, size_t*, CUmod_st*, const char*)’
      [exec] common/x10rt_cuda.cc: In function ‘void x10rt_cuda_probe(x10rt_cuda_ctx*)’:
      [exec] common/x10rt_cuda.cc:751: warning: dereferencing type-punned pointer will break strict-aliasing rules
      [exec] common/x10rt_cuda.cc:752: warning: dereferencing type-punned pointer will break strict-aliasing rules
      [exec] make: *** [common/x10rt_cuda.o] Error 1

BUILD FAILED
/home/rgomes/developer/sources/x10/x10-trunk/x10.dist/build.xml:214: The following error occurred while executing this line:
/home/rgomes/developer/sources/x10/x10-trunk/x10.runtime/build.xml:281: exec returned: 2


Any ideas?


Some more information:


$ which nvcc
/usr/local/cuda/bin/nvcc

$ echo $LD_LIBRARY_PATH
:/usr/local/cuda/lib64:/usr/local/cuda/lib

$ ant -version
Apache Ant version 1.7.1 compiled on June 27 2008

$ mvn -version
Apache Maven 2.2.1 (r801777; 2009-08-06 20:16:01+0100)
Java version: 1.6.0_21
Java home: /opt/JavaIDE/jdk1.6.0_21-linux-x86_64/jre
Default locale: en_GB, platform encoding: UTF-8
OS name: "linux" version: "2.6.32-5-amd64" arch: "amd64" Family: "unix"


Thanks a lot :)


-- 
Richard Gomes
M: +44(77)9955-6813
http://tinyurl.com/frgomes
twitter: frgomes

JQuantLib is a library for Quantitative Finance written in Java.
http://www.jquantlib.org/
twitter: jquantlib
