Dear All, Unfortunately, a reinstall is not an option for our machine, so taking Lloyd's findings to heart in that a wait for a new NVidia driver is necessary we are currently using the quantian version of the Knoppix line DVD (http://dirk.eddelbuettel.com/quantian.html) to dual boot the machine.
DX on the latest release of quantian (0.6.9.1) is working fine, but of course has the 32 bit speed reductions Lloyd mentions. Of course, it reads and writes to the hard drives OK. So once loaded and having DX running isn't to slow. Peter -----Original Message----- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Lloyd A Treinish Sent: 25 October 2004 05:15 To: opendx2-dev@lists.berlios.de Subject: Re: [opendx-dev] compile on 2cpu amd_64; rhel3 Here's a further update, for everyone's reference. The 64-bit version of the nvidia driver is definitely broken. But there are some workarounds for now. As a test, I installed the 32-bit version of RHEL-WS3.0 with the latest version of the standard x86 version of the nvidia driver. The RH9 4.3.2 rpm on the web site installed right away and everything works. Compared to the version I built from the July 30 4.3.3 tarball for the 64-bit version of RHEL, various modules run anywhere from 10% to 50% slower with the 32-bit RHEL. Nevertheless, the machine is still pretty fast. For anyone else that ends of doing this as a workaround, there is a known 32-bit RedHat issue on some Opteron machines if they have more than 4GB of RAM because the area of memory reserved for caching the AGP aperture ends up above that space, so AGPGART ends up with no memory and video performance disappears. The workaround is restricting the memory with 'mem=4095M' as a kernel option in the grub menu. Obviously, any additional memory is wasted, but the 32-bit kernel wouldn't be addressing it anyway... |---------+--------------------------------------> | | Lloyd A | | | Treinish/Watson/[EMAIL PROTECTED] | | | Sent by: | | | [EMAIL PROTECTED]| | | son.ibm.com | | | | | | | | | 10/19/2004 02:26 PM | | | Please respond to | | | opendx-dev | | | | |---------+--------------------------------------> >----------------------------------------------------------------------- ---------------------------------| | | | To: opendx2-dev@lists.berlios.de | | cc: | | Subject: Re: [opendx-dev] compile on 2cpu amd_64; rhel3 | | | >----------------------------------------------------------------------- ---------------------------------| Here's an update, for everyone's reference. I have been able to successfully make a few different builds with the July 30 4.3.3 tarball for the Opteron RHEL-WS3.0 system. The first was with large-arena and SMP support: ./configure --with-netcdf --with-large-arenas --enable-dependency-tracking --enable-smp-linux --includedir=/usr/include:/usr/local/include --libdir=/usr/lib64:/usr/local/lib --x-libraries=/usr/X11R6/lib64 --x-includes=/usr/X11R6/include The second was with small arenas and no SMP support. In both cases, I built a version of the libnetcdf for this platform. ./configure --with-netcdf --enable-dependency-tracking --includedir=/usr/include:/usr/local/include --libdir=/usr/lib64:/usr/local/lib --x-libraries=/usr/X11R6/lib64 --x-includes=/usr/X11R6/include Unfortunately, dxexec from the SMP build hangs when two cpus are requested. With p1, it works fine. In addition, requests for memory beyond 2GB remained clipped at 2 GB. Both versions appear to perform similarly when running with a single cpu and less than 2GB of memory (pretty fast). The hardware rendering problem is the same in both versions, which suggests problems from nvidia. I tried building with and without pointing specifically to the gl includes from nvidia, although it didn't seem to matter. I have also sent an e-mail to nvidia's support group, but I have not received a response yet. Lloyd A Treinish/Watson/[EMAIL PROTECTED] To: Sent by: opendx2-dev@lists.berlios.de [EMAIL PROTECTED] cc: ibm.com Subject: Re: [opendx-dev] compile on 2cpu amd_64; rhel3 10/17/2004 07:07 PM Please respond to opendx-dev I figured that was the situation. I have not had the chance to look into it in much more detail. I just booted the machine for the first time on Friday and tried the few things that I cited in my posting. Unfortunately, I don't have the time to put a lot of effort into this. So, I'll do some more experimentation and report any useful findings. However, I see the most serious problem as being with the OpenGL drivers, for which I don't expect an immediate solution. Admittedly, I don't have a lot of experience on 64-bit platforms. I have found that under AIX I didn't have a lot of these problems. On the other hand, I only use those systems for production visualization with software rendering. |---------+--------------------------------------> | | David Thompson | | | <[EMAIL PROTECTED]>| | | Sent by: | | | [EMAIL PROTECTED]| | | son.ibm.com | | | | | | | | | 10/17/2004 05:40 PM | | | Please respond to | | | opendx-dev | | | | |---------+--------------------------------------> >----------------------------------------------------------------------- ------------------------------------| | | | To: opendx2-dev@lists.berlios.de | | cc: | | Subject: Re: [opendx-dev] compile on 2cpu amd_64; rhel3 | | | >----------------------------------------------------------------------- ------------------------------------| With linux and shared libraries, you need to use ldconfig to help the os find them. As for the library incompatibility, this is going to be a 64 bit OS library versus a 32 bit OS library. The libtool guys have been discussing how to handle some of this--but they still have not come to any kind of consensus. As of now, no one has come forward to start working on any of the 64 bit os issues beyond the SGI. So unless someone is willing to undertake this or pay someone (like myself) to undertake it--I wish you luck. David >Coincidently, I just got a similar box as a loaner for a couple of >weeks and I am seeing similar problems. It's a dual 2.2GHz AMD Opteron >system running RHEL-WS3.0 for x86_64 with an SMP kernel >(2.4.21-20.ELsmp). It has >gcc v3.2.3 (from Red Hat Linux 3.2.3-42). The system has an nVidia >Quadro FX3000 with the latest driver from nvidia.com for this >architecture. I started by using the latest tarball (July 30, 2004) >for OpenDX 4.3.3. My initial attempt was to make a build for large >arena and SMP enabled as well >as with netCDF. I had to build a netCDF library for this platform, but >it seems to work ok. Anyway, the dx build creates an executable but >reports similar loader problems: > /usr/bin/ld: skipping incompatible /usr/X11R6/lib/libXpm.so when > searching for -lXpm > /usr/bin/ld: skipping incompatible /usr/X11R6/lib/libXpm.a when > searching for -lXpm > /usr/bin/ld: skipping incompatible /usr/X11R6/lib/libXp.so when > searching for -lXp > /usr/bin/ld: skipping incompatible /usr/X11R6/lib/libXp.a when searching > for -lXp > /usr/bin/ld: skipping incompatible /usr/X11R6/lib/libXmu.so when > searching for -lXmu > /usr/bin/ld: skipping incompatible /usr/X11R6/lib/libXmu.a when > searching for -lXmu > /usr/bin/ld: skipping incompatible /usr/X11R6/lib/libXext.so when > searching for -lXext > /usr/bin/ld: skipping incompatible /usr/X11R6/lib/libXext.a when > searching for -lXext > /usr/bin/ld: skipping incompatible /usr/X11R6/lib/libXt.so when > searching for -lXt > /usr/bin/ld: skipping incompatible /usr/X11R6/lib/libXt.a when searching > for -lXt > /usr/bin/ld: skipping incompatible /usr/X11R6/lib/libX11.so when > searching for -lX11 > /usr/bin/ld: skipping incompatible /usr/X11R6/lib/libX11.a when > searching for -lX11 > /usr/bin/ld: skipping incompatible /usr/X11R6/lib/libSM.so when > searching for -lSM > /usr/bin/ld: skipping incompatible /usr/X11R6/lib/libSM.a when searching > for -lSM > /usr/bin/ld: skipping incompatible /usr/X11R6/lib/libICE.so when > searching for -lICE > /usr/bin/ld: skipping incompatible /usr/X11R6/lib/libICE.a when > searching for -lICE > >I've noticed that this occurs whether it's pointing to /usr/X11R6/lib >or /usr/X11R6/lib64. I was going to experiment with other building >options (with and without large arenas, with and without SMP support), >although I'll doubt it will matter since it seems to be just a ui >problem. > >I have also seen problems with hardware rendering. A few from the >sample set seem to work fine (i.e, changing rendering to hardware mode >first or others like TextureMap). When I run something more complex, >some screen objects are visible (e.g., Captions) while others are not >(e.g., labels on color bars). When I rotate a 3d object, the surfaces >will disappear as if an opaque triangle colored with the background >color is draw in front, which grows with further rotation. The problem >is repeatable. Software rendering seems ok, but I've not done >exhaustive testing. My initial guess >as to the rendering (not the loader) problem is with the driver from nVidia >(see >http://www.nvnews.net/vbulletin/showthread.php?s=1a67d4b4573fcbe9d9001c >1ef411a486&t=34005, > for example). > >As an alternative, I also tried to install the RH9 4.3.3 rpm, which >works just fine on RHEL-WS3 Intel boxes. At installation it complains >about libxml and libpng being missing, although are there, and >LD_LIBRARY_PATH and LIBPATH point to the right place. I can use >--force and --nodeps to get beyond that. None of the executables will >run, complaining about not being able to find libpng even with the >environment variables set correctly. > >I am also open to any suggestions on either of these issues as I >continue to look into them. > >Thanks. > > > >|---------+--------------------------------------> >| | "Peter Connolly" | >| | <[EMAIL PROTECTED]| >| | lsruhe.de> | >| | Sent by: | >| | [EMAIL PROTECTED]| >| | son.ibm.com | >| | | >| | | >| | 10/13/2004 04:27 PM | >| | Please respond to | >| | opendx-dev | >| | | >|---------+--------------------------------------> > >>---------------------------------------------------------------------- -------------------------------------| > | >| > | To: <opendx2-dev@lists.berlios.de> >| > | cc: >| > | Subject: [opendx-dev] compile on 2cpu amd_64; rhel3 >| > | >| > >>---------------------------------------------------------------------- -------------------------------------| > > > > >Dear All, > >I have been struggling for a while getting a working OpenDX >insatallation on a new graphics station (2cpu amd64 with nvidia fx5700; >latest nvidia drivers ). I have given up on sorting out the dependency >issues with the farious rpms and am trying the CVS source. > >It built sucessfully built but with a series of linking errors e.g.: > >/usr/bin/ld: skipping incompatible /usr/X11R6/lib/libX11.a when >searching for -lX11 > >After make install, dx comes up and I all the samples I have tried run >fine. > >However when running a dx.net of my own I only get a blank image window >and my colour maps are also black. Even more strangly, the colour bars >in the image window are fine! It appears that the exec is doing it's >job OK and generating the data for image to plot since there is a cpu >load and I get a file full of data when I send the data plottted to >export. > >The exact same net works fine on all the Win2K/XP boxes we have running >the Viz version of DX. > >Has any one seen this before or have any ideas where I might look to >try and sort this out? > >Thanks in advance > >Peter > >PS >the complete list of "incompatible libraries is: > > >libXpm.so when searching for -lXpm >libXp.a when searching for -lXp >libGLU.so when searching for -lGLU >libGL.a when searching for -lGL >libXmu.so when searching for -lXmu >libXt.a when searching for -lXt >libdpstk.so when searching for -ldpstk >libdps.a when searching for -ldps >libXext.so when searching for -lXext >libSM.a when searching for -lSM >libICE.so when searching for -lICE >libX11.a when searching for -lX11