Nikita,

Did you try to contact SystemImager developers? It seems to me that they have 
much more experience to track and understand this kind of problems.

Regards,

----- "Nikita Andreev" <n...@kemsu.ru> a écrit :

> Dear Geoffroy,
> 
> Thanks for the tip. I've changed rsync verbosity level at the server
> and at client side but it didn't show any helpful information. I see
> how rsync downloads file after file and at some point connection
> breaks. I even tried to strace server side:
> 
> [pid  6316]
> lstat("usr/share/doc/postgresql-8.1.18/html/release-7-4-5.html",
> {st_mode=S_IFREG|0644, st_size=3239, ...}) = 0
> [pid  6316] stat("/etc/localtime", {st_mode=S_IFREG|0644,
> st_size=1944, ...}) = 0
> [pid  6316] write(4, "2011/02/16 01:21:52 [6316] [send"..., 107) =
> 107
> [pid  6316] select(4, NULL, [3], NULL, {60, 0}) = 1 (out [3], left
> {60, 0})
> [pid  6316] write(3,
> "\35\0\0\7:%\22release-7-2-6.htmlV\16\0\0\373\246\240"..., 33) = -1
> ECONNRESET (Connection reset by peer)
> [pid  6316] stat("/etc/localtime", {st_mode=S_IFREG|0644,
> st_size=1944, ...}) = 0
> [pid  6316] write(4, "2011/02/16 01:21:52 [6316] rsync"..., 119) =
> 119
> [pid  6316] rt_sigaction(SIGUSR1, {0x1, [], SA_RESTORER,
> 0x35e98302d0}, NULL, 8) = 0
> [pid  6316] rt_sigaction(SIGUSR2, {0x1, [], SA_RESTORER,
> 0x35e98302d0}, NULL, 8) = 0
> [pid  6316] stat("/etc/localtime", {st_mode=S_IFREG|0644,
> st_size=1944, ...}) = 0
> [pid  6316] write(4, "2011/02/16 01:21:52 [6316] rsync"..., 115) =
> 115
> [pid  6316] stat("/etc/localtime", {st_mode=S_IFREG|0644,
> st_size=1944, ...}) = 0
> [pid  6316] write(4, "2011/02/16 01:21:52 [6316] _exit"..., 96) = 96
> [pid  6316] exit_group(12)              = ?
> Process 6316 detached
> <... select resumed> )                  = ? ERESTARTNOHAND (To be
> restarted)
> --- SIGCHLD (Child exited) @ 0 (0) ---
> wait4(-1, NULL, WNOHANG, NULL)          = 6316
> wait4(-1, NULL, WNOHANG, NULL)          = -1 ECHILD (No child
> processes)
> rt_sigreturn(0xffffffffffffffff)        = -1 EINTR (Interrupted system
> call)
> select(6, [4 5], NULL, NULL, NULL)      = ? ERESTARTNOHAND (To be
> restarted)
> 
> It doesn't seem to be helpful though.
> 
> At client side rsync has version 3.0.0pre6, protocol version 30.PR15
> and at server side it's rsync version 2.6.8, protocol version 29 (I'm
> using UYOK BTW, since systemimager's initrd doesn't support node
> Ethernet cards). So it's either due to protocol versions mismatch,
> undiscovered driver bug or rsync bug. I could try to change rsync in
> initrd to another version but I would face pure hell of a library
> dependency problems.
> 
> Can you suggest something in this situation?
> 
> Regards,
> Nikita
> 
> 
> -----Original Message-----
> From: geoffroy.val...@free.fr [mailto:geoffroy.val...@free.fr] 
> Sent: Tuesday, February 15, 2011 11:56 PM
> To: Nikita Andreev
> Cc: oscar-users@lists.sourceforge.net
> Subject: Re: [Oscar-users] Problem with GUI on CentOS 5.5
> 
> Nikita,
> 
> Try to have a look a the scripts in /var/lib/systemimager/scripts, you
> will find all the scripts using rsync for the installation of the
> clients.
> 
> On my side, I will have another look at the latest modification of the
> setup_pxe script and see if it is not a tricky corner case.
> 
> Thanks,
> 
> ----- "Nikita Andreev" <n...@kemsu.ru> a écrit :
> 
> > Yes, I've realized that changing to earlier OSCAR probably won't
> help.
> > And yes I tried to restart rsync (even the whole system). It's
> strange
> > since everything is ok with [boot] module and [scripts] module.
> > 
> > And 100 percent it's not a cable issue. It's a blade chassis and
> > interconnect module is integrated into it. It doesn't have cables
> at
> > all.
> > 
> > Could you tell me where I can change actual client rsync commands
> > which connect and download data from OSCAR server?
> > 
> > Regards,
> > Nikita
> > 
> > -----Original Message-----
> > From: geoffroy.val...@free.fr [mailto:geoffroy.val...@free.fr] 
> > Sent: Tuesday, February 15, 2011 10:17 PM
> > To: Nikita Andreev
> > Cc: oscar-users@lists.sourceforge.net
> > Subject: Re: [Oscar-users] Problem with GUI on CentOS 5.5
> > 
> > Nikita,
> > 
> > The code behind rsync is exactly the same in unstable and the other
> > repository. The only difference is a little patch to setup PXE. Did
> > you try to restart the rsync daemon to see if it fixes the problem?
> I
> > can see that the connection is reset, kind of like if a headnode
> drops
> > the connection at some point (which of course creates problems).
> > 
> > But again, i do not think switching away from unstable will change
> > anything for that specific problem.
> > 
> > Thanks,
> > 
> > ----- "Nikita Andreev" <n...@kemsu.ru> a écrit :
> > 
> > > Dear Geoffroy,
> > > 
> > > I'm using unstable as you suggested. At the moment I have rsync
> > > problems. PXE installation fails when trying to download actual
> OS
> > > image even though it works fine with 'boot' and 'scripts' targets
> > > before that.
> > > 
> > > rsync error message:
> > > 
> > > 2011/02/15 13:43:12 [5881] rsync on centos-5.5-x86_64/ from n5
> > > (192.168.111.5)
> > > 2011/02/15 13:43:28 [5881] rsync: writefd_unbuffered failed to
> write
> > 4
> > > bytes [sender]: Connection reset by peer (104)
> > > 2011/02/15 13:43:28 [5881] rsync error: error in rsync protocol
> > data
> > > stream (code 12) at io.c(1119) [sender=2.6.8]
> > > 
> > > or 
> > > 
> > > 2011/02/15 12:22:14 [5249] rsync on centos-5.5-x86_64/ from n5
> > > (192.168.111.5)
> > > 2011/02/15 12:22:32 [5249] rsync: read error: Connection reset by
> > peer
> > > (104)
> > > 2011/02/15 12:22:32 [5249] rsync error: error in rsync protocol
> > data
> > > stream (code 12) at io.c(614) [sender=2.6.8]
> > > 
> > > I'm thinking of reverting back to some older OSCAR versions in
> hope
> > > this issue won't arise. Kinda stuck at this point and dunno what
> to
> > > try next.
> > > 
> > > Regards,
> > > Nikita
> > > 
> > > 
> > > -----Original Message-----
> > > From: geoffroy.val...@free.fr [mailto:geoffroy.val...@free.fr] 
> > > Sent: Tuesday, February 15, 2011 8:14 PM
> > > To: Nikita Andreev
> > > Cc: oscar-users@lists.sourceforge.net
> > > Subject: Re: [Oscar-users] Problem with GUI on CentOS 5.5
> > > 
> > > Nikita,
> > > 
> > > There is almost no difference so why do you want to do that? I
> ask
> > the
> > > question because there is no well defined way to do so and i will
> > send
> > > time on this kind of things only for good reasons.
> > > Also, i am about to release 6.1.1 so we will have almost no
> > difference
> > > between unstable and the other repos.
> > > 
> > > Regards,
> > > 
> > > ----- "Nikita Andreev" <n...@kemsu.ru> a écrit :
> > > 
> > > > Dear Geoffroy,
> > > > 
> > > > Could you tell how I can revert back to 6.1.0 from unstable
> > without
> > > > full OS reinstall. 
> > > > 
> > > > Regards,
> > > > Nikita
> > > > 
> > > > -----Original Message-----
> > > > From: geoffroy.val...@free.fr [mailto:geoffroy.val...@free.fr] 
> > > > Sent: Monday, February 14, 2011 5:01 PM
> > > > To: Nikita Andreev
> > > > Cc: oscar-users@lists.sourceforge.net
> > > > Subject: Re: [Oscar-users] Problem with GUI on CentOS 5.5
> > > > 
> > > > Nikita,
> > > > 
> > > > Once again:
> > > > - LAM has an issue, i did not look at it yet.
> > > > - Torque/MAUI has issues, i pushed new RPMs into unstable; i do
> > not
> > > > know if you are using unstable, if so, it will be nice to send
> me
> > > the
> > > > output otherwise i cannot progress (if you are not using
> > unstable,
> > > no
> > > > solution at the moment with Torque/MAUI). At the moment, i do
> not
> > > even
> > > > know which version of the RPMs for Torque/MAUI we are trying to
> > > > install so i cannot do more.
> > > > 
> > > > Regards,
> > > > 
> > > > 
> > > > ----- "Nikita Andreev" <n...@kemsu.ru> a écrit :
> > > > 
> > > > > I've resolved an issue with yum update. But it doesn't solve
> > any
> > > > other
> > > > > issues I'm still having. Switcher configuration doesn't work.
> > It
> > > > > doesn't see any of MPI implementations event when I check all
> > > three
> > > > of
> > > > > them. Step 3 bails out with LAM dependency problem. Then if I
> > > > unselect
> > > > > LAM it bails out with TORQUE dependency problem. Then if I
> > > unselect
> > > > > TORQUE it bails out with MAUI dependency problem since MAUI
> > > depends
> > > > on
> > > > > TORQUE. If I unselect MAUI then I have successful install. I
> > > think
> > > > I
> > > > > will go for it and install all other packages manually.
> > > > > 
> > > > > Thank you for support so far.
> > > > > 
> > > > > Regards,
> > > > > Nikita
> > > > > 
> > > > > -----Original Message-----
> > > > > From: geoffroy.val...@free.fr [mailto:geoffroy.val...@free.fr]
> 
> > > > > Sent: Sunday, February 13, 2011 11:18 AM
> > > > > To: Nikita Andreev
> > > > > Cc: oscar-users@lists.sourceforge.net
> > > > > Subject: Re: [Oscar-users] Problem with GUI on CentOS 5.5
> > > > > 
> > > > > Nikita,
> > > > > 
> > > > > It will be very useful if you give me the command you execute
> > at
> > > > the
> > > > > same time, typically from the bash prompt to the end of the
> > > output.
> > > > > 
> > > > > With the command, it seems to me that you try to
> remove/install
> > > only
> > > > a
> > > > > single RPM. Can you please try to remove all the torque RPMs
> > with
> > > a
> > > > > single command, something like "rpm -e torque-oscar
> > > > > torque-oscar-server torque-oscar-scheduler".
> > > > > 
> > > > > Regards,
> > > > > 
> > > > > ----- "Nikita Andreev" <n...@kemsu.ru> a écrit :
> > > > > 
> > > > > > Output follows:
> > > > > > 
> > > > > > --> Finished Dependency Resolution
> > > > > > torque-server-2.1.8-3oscar.x86_64 from oscar has depsolving
> > > > > problems
> > > > > >   --> Missing Dependency: torque = 2.1.8-3oscar is needed
> by
> > > > > package
> > > > > > torque-server-2.1.8-3oscar.x86_64 (oscar)
> > > > > > torque-server-2.1.8-3oscar.x86_64 from oscar has depsolving
> > > > > problems
> > > > > >   --> Missing Dependency: libtorque.so.0()(64bit) is needed
> > by
> > > > > package
> > > > > > torque-server-2.1.8-3oscar.x86_64 (oscar)
> > > > > > Error: Missing Dependency: torque = 2.1.8-3oscar is needed
> by
> > > > > package
> > > > > > torque-server-2.1.8-3oscar.x86_64 (oscar)
> > > > > > Error: Missing Dependency: libtorque.so.0()(64bit) is
> needed
> > by
> > > > > > package torque-server-2.1.8-3oscar.x86_64 (oscar)
> > > > > >  You could try using --skip-broken to work around the
> problem
> > > > > >  You could try running: package-cleanup --problems
> > > > > >                         package-cleanup --dupes
> > > > > >                         rpm -Va --nofiles --nodigest
> > > > > > The program package-cleanup is found in the yum-utils
> > package.
> > > > > > 
> > > > > > Regards,
> > > > > > Nikita
> > > > > > 
> > > > > > -----Original Message-----
> > > > > > From: geoffroy.val...@free.fr
> [mailto:geoffroy.val...@free.fr]
> > 
> > > > > > Sent: Saturday, February 12, 2011 9:48 PM
> > > > > > To: Nikita Andreev
> > > > > > Cc: oscar-users@lists.sourceforge.net
> > > > > > Subject: Re: [Oscar-users] Problem with GUI on CentOS 5.5
> > > > > > 
> > > > > > Nikita,
> > > > > > 
> > > > > > What is the output when you run that command?
> > > > > > 
> > > > > > Thanks,
> > > > > > 
> > > > > > ----- "Nikita Andreev" <n...@kemsu.ru> a écrit :
> > > > > > 
> > > > > > > Geoffroy,
> > > > > > > 
> > > > > > > I've tried that. I've removed all torque-related
> packages.
> > > Same
> > > > > > error
> > > > > > > pops up.
> > > > > > > 
> > > > > > > Regards,
> > > > > > > 
> > > > > > > -----Original Message-----
> > > > > > > From: geoffroy.val...@free.fr
> > [mailto:geoffroy.val...@free.fr]
> > > 
> > > > > > > Sent: Saturday, February 12, 2011 11:19 AM
> > > > > > > To: Nikita Andreev
> > > > > > > Cc: oscar-users@lists.sourceforge.net
> > > > > > > Subject: Re: [Oscar-users] Problem with GUI on CentOS 5.5
> > > > > > > 
> > > > > > > Nikita,
> > > > > > > 
> > > > > > > Try to manually remove the RPMs:
> > > > > > > yum remove torque-oscar torque-oscar-server
> > > > > torque-oscar-scheduler
> > > > > > > 
> > > > > > > Then, you should be good.
> > > > > > > 
> > > > > > > Regards,
> > > > > > > 
> > > > > > > ----- "Nikita Andreev" <n...@kemsu.ru> a écrit :
> > > > > > > 
> > > > > > > > Dear Geoffroy,
> > > > > > > > 
> > > > > > > > While trying to update to new package versions from
> > > > repository
> > > > > I
> > > > > > > get
> > > > > > > > dependency resolution problem. I tried to manually
> remove
> > > all
> > > > > > > torque
> > > > > > > > related packages and update again but with no luck.
> Find
> > > logs
> > > > > > > > attached.
> > > > > > > > 
> > > > > > > > Sincerely,
> > > > > > > > Nikita
> > > > > > > > 
> > > > > > > > -----Original Message-----
> > > > > > > > From: geoffroy.val...@free.fr
> > > [mailto:geoffroy.val...@free.fr]
> > > > 
> > > > > > > > Sent: Saturday, February 12, 2011 1:22 AM
> > > > > > > > To: Nikita Andreev
> > > > > > > > Cc: oscar-users@lists.sourceforge.net
> > > > > > > > Subject: Re: [Oscar-users] Problem with GUI on CentOS
> 5.5
> > > > > > > > 
> > > > > > > > Nikita,
> > > > > > > > 
> > > > > > > > Then unselect Torque and give it a try.
> > > > > > > > 
> > > > > > > > About your question OS/ version of OSCAR, there is no
> > clear
> > > > > > answer:
> > > > > > > it
> > > > > > > > all depends which OSCAR packages you select (as i said
> > > > before).
> > > > > > If
> > > > > > > you
> > > > > > > > want to make sure you can deploy the compute nodes,
> just
> > > > select
> > > > > > the
> > > > > > > > core OSCAR packages and try to deploy your cluster. So
> > far,
> > > i
> > > > > had
> > > > > > > no
> > > > > > > > bug reports about the core itself, only a few
> > configuration
> > > > > files
> > > > > > > were
> > > > > > > > off.
> > > > > > > > 
> > > > > > > > If you want to install all the OSCAR packages,
> including
> > > > OSCAR
> > > > > > > > packages that were developed by contributors for which
> i
> > > did
> > > > > not
> > > > > > > > control the quality yet, i am pretty sure that you will
> > not
> > > > > find
> > > > > > a
> > > > > > > > single configuration.
> > > > > > > > OSCAR is NOT a product, nobody can guarantee service
> and
> > > > > overlook
> > > > > > > all
> > > > > > > > OSCAR packages at the moment.
> > > > > > > > 
> > > > > > > > Regards,
> > > > > > > > 
> > > > > > > > ----- "Nikita Andreev" <n...@kemsu.ru> a écrit :
> > > > > > > > 
> > > > > > > > > > Can you log in to the nodes?
> > > > > > > > > 
> > > > > > > > > Since step 3 fails I cannot proceed to nodes
> > > installation.
> > > > So
> > > > > I
> > > > > > > > have
> > > > > > > > > no nodes installed and hence I cannot login to any of
> > > them.
> > > > > > > > > 
> > > > > > > > > Could you advise me at this point some prior CentOS
> (or
> > > any
> > > > > > other
> > > > > > > > OS)
> > > > > > > > > + OSCAR versions pair which I can possibly try?
> > > > > > > > > 
> > > > > > > > > Best of luck,
> > > > > > > > > Nikita
> > > > > > > > > 
> > > > > > > > > -----Original Message-----
> > > > > > > > > From: geoffroy.val...@free.fr
> > > > [mailto:geoffroy.val...@free.fr]
> > > > > 
> > > > > > > > > Sent: Friday, February 11, 2011 8:55 PM
> > > > > > > > > To: Nikita Andreev
> > > > > > > > > Cc: oscar-users@lists.sourceforge.net
> > > > > > > > > Subject: Re: [Oscar-users] Problem with GUI on CentOS
> > 5.5
> > > > > > > > > 
> > > > > > > > > Dear Nikita,
> > > > > > > > > 
> > > > > > > > > A configuration script from Torque (package i am
> > > currently
> > > > > > trying
> > > > > > > > to
> > > > > > > > > fix) failed, i will have a look as soon as possible.
> > > > > Otherwise,
> > > > > > > > please
> > > > > > > > > do not pay attention to the messages "Error: No
> > matching
> > > > > > Packages
> > > > > > > > to
> > > > > > > > > list"
> > > > > > > > > 
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> http://svn.oscar.openclustergroup.org/trac/osca/wiki/faq_install#DuringStep4:BuildOSCARClientImagethefollowingmessageappearsmanytimes:Error:NomatchingPackagestolist.
> > > > > > > > > 
> > > > > > > > > Can you log in to the nodes?
> > > > > > > > > 
> > > > > > > > > Regards,
> > > > > > > > > 
> > > > > > > > > ----- "Nikita Andreev" <n...@kemsu.ru> a écrit :
> > > > > > > > > 
> > > > > > > > > > Dear Geoffroy,
> > > > > > > > > > 
> > > > > > > > > > I've tried to unselect LAM and another error has
> > showed
> > > > up.
> > > > > > > Find
> > > > > > > > > logs
> > > > > > > > > > attached. GUI says that installation was
> > unsuccessful.
> > > > > > Console
> > > > > > > > says
> > > > > > > > > > that installation is complete but I clearly see
> > errors
> > > > > above
> > > > > > in
> > > > > > > > > > console log.
> > > > > > > > > > 
> > > > > > > > > > Regards,
> > > > > > > > > > Nikita
> > > > > > > > > > 
> > > > > > > > > > -----Original Message-----
> > > > > > > > > > From: geoffroy.val...@free.fr
> > > > > [mailto:geoffroy.val...@free.fr]
> > > > > > 
> > > > > > > > > > Sent: Thursday, February 10, 2011 10:41 PM
> > > > > > > > > > To: Nikita Andreev
> > > > > > > > > > Cc: oscar-users@lists.sourceforge.net
> > > > > > > > > > Subject: Re: [Oscar-users] Problem with GUI on
> CentOS
> > > 5.5
> > > > > > > > > > 
> > > > > > > > > > Dear Nikita,
> > > > > > > > > > 
> > > > > > > > > > Can you please unselect LAM and try to deploy the
> > > > cluster?
> > > > > I
> > > > > > > > opened
> > > > > > > > > a
> > > > > > > > > > ticket about LAM, i will have a look as soon as
> > > possible
> > > > (i
> > > > > > do
> > > > > > > > not
> > > > > > > > > > know why there is a dependency to libtorque, it
> does
> > > not
> > > > > make
> > > > > > > > much
> > > > > > > > > > sense to me).
> > > > > > > > > > 
> > > > > > > > > >
> > > > http://svn.oscar.openclustergroup.org/trac/oscar/ticket/600
> > > > > > > > > > 
> > > > > > > > > > Regards,
> > > > > > > > > > 
> > > > > > > > > > ----- "Nikita Andreev" <n...@kemsu.ru> a écrit :
> > > > > > > > > > 
> > > > > > > > > > > Dear Geoffroy,
> > > > > > > > > > > 
> > > > > > > > > > > I've reinstalled OS and chosen unstable repo.
> > Another
> > > > > > > > dependency
> > > > > > > > > > > problem has shown up. Find logs attached.
> > > > > > > > > > > 
> > > > > > > > > > > Switcher still doesn't see MPI implementations.
> > Don't
> > > > > know
> > > > > > if
> > > > > > > > it
> > > > > > > > > > has
> > > > > > > > > > > something to do with this dependency problem.
> > > > > > > > > > > 
> > > > > > > > > > > Glad to be of help.
> > > > > > > > > > > 
> > > > > > > > > > > Regards,
> > > > > > > > > > > Nikita
> > > > > > > > > > > 
> > > > > > > > > > > -----Original Message-----
> > > > > > > > > > > From: Nikita Andreev [mailto:n...@kemsu.ru] 
> > > > > > > > > > > Sent: Thursday, February 10, 2011 11:55 AM
> > > > > > > > > > > To: geoffroy.val...@free.fr
> > > > > > > > > > > Cc: oscar-users@lists.sourceforge.net
> > > > > > > > > > > Subject: Re: [Oscar-users] Problem with GUI on
> > CentOS
> > > > 5.5
> > > > > > > > > > > 
> > > > > > > > > > > > To really fix step 3, a modification of a
> script
> > > was
> > > > > > > needed.
> > > > > > > > If
> > > > > > > > > > you
> > > > > > > > > > > are using Torque, i even advice you to switch to
> > > > > > "unstable".
> > > > > > > > > > > 
> > > > > > > > > > > > To really fix step 4, the scripts must be fixed
> > so
> > > > that
> > > > > > > > > > everything
> > > > > > > > > > > can run smoothly. Otherwise, OSCAR detects that a
> > > > script
> > > > > > > failed
> > > > > > > > > and
> > > > > > > > > > > therefore cannot complete the step. Again, if you
> > > > accept
> > > > > of
> > > > > > > > using
> > > > > > > > > > the
> > > > > > > > > > > updated packages, switcher will not catch errors
> > > > anymore
> > > > > > and
> > > > > > > > > > > everything should run smoothly.
> > > > > > > > > > > 
> > > > > > > > > > > Image creation just failed with error Error:
> > Missing
> > > > > > > > Dependency:
> > > > > > > > > > > torque-oscar = 2.1.10-4 is needed by package
> > > > > > > > > > > torque-oscar-modulefile-2.1.10-4.noarch
> > > > > > > > (unstable_rhel-5-x86_64)
> > > > > > > > > > > 
> > > > > > > > > > > I will try to test unstable ASAP.
> > > > > > > > > > > 
> > > > > > > > > > > > Also, when you question the competences of
> > someone,
> > > > > this
> > > > > > > > could
> > > > > > > > > be
> > > > > > > > > > > interpreted as insulting, there is nothing
> > surprising
> > > > > > there.
> > > > > > > So
> > > > > > > > > > before
> > > > > > > > > > > to question someone's expertise, the minimum you
> > > could
> > > > do
> > > > > > is
> > > > > > > to
> > > > > > > > > > check
> > > > > > > > > > > the position of the person in the project.
> > > > > > > > > > > 
> > > > > > > > > > > Again, my apologies. I really didn't mean it.
> > > > > > > > > > > 
> > > > > > > > > > > Regards,
> > > > > > > > > > > 
> > > > > > > > > > > ----- "Nikita Andreev" <n...@kemsu.ru> a écrit :
> > > > > > > > > > > 
> > > > > > > > > > > > Dear Geoffroy,
> > > > > > > > > > > > 
> > > > > > > > > > > > I do not know where I've insulted you, but
> anyway
> > > if
> > > > > I've
> > > > > > > > said
> > > > > > > > > > > > anything inappropriate I would like to
> apologize.
> > I
> > > > > > greatly
> > > > > > > > > > > appreciate
> > > > > > > > > > > > your work as an OSCAR developer and I didn't
> mean
> > > to
> > > > > > offend
> > > > > > > > you
> > > > > > > > > > in
> > > > > > > > > > > any
> > > > > > > > > > > > way. You're doing a great job.
> > > > > > > > > > > > 
> > > > > > > > > > > > To my question. It's seems that there is some
> > > > > > > > misunderstanding.
> > > > > > > > > > > > Regarding problem with Torque in step 3 as I
> said
> > > in
> > > > > > first
> > > > > > > > > letter
> > > > > > > > > > I
> > > > > > > > > > > > solved it just by rebuilding opkg-torque-server
> > > > > package.
> > > > > > I
> > > > > > > > > > > downloaded
> > > > > > > > > > > > it and changed .spec file so that it depended
> on
> > > > > > > > > > > > torque-oscar-modulefile instead of
> > > torque-modulefile.
> > > > > And
> > > > > > > > step
> > > > > > > > > 3
> > > > > > > > > > > went
> > > > > > > > > > > > fine from that point.
> > > > > > > > > > > > 
> > > > > > > > > > > > Regarding step 4. I had GUI issue with error
> > > message
> > > > in
> > > > > > > > console
> > > > > > > > > I
> > > > > > > > > > > > quoted below. 
> > > > > > > > > > > > 
> > > > > > > > > > > > I've just updated OSCAR from 6.0.3 to 6.1.0 a
> > > minute
> > > > > ago
> > > > > > by
> > > > > > > > > `yum
> > > > > > > > > > > > update` command and now GUI works fine. I'm
> able
> > to
> > > > run
> > > > > > > step
> > > > > > > > 4
> > > > > > > > > > and
> > > > > > > > > > > > waiting right now for image to be created.
> > > > > > > > > > > > 
> > > > > > > > > > > > The only issue I still have is switcher. When I
> > go
> > > > into
> > > > > > > > > > configuring
> > > > > > > > > > > > switcher window from GUI I see: “No pkg_config
> > were
> > > > > > > supplied
> > > > > > > > by
> > > > > > > > > > any
> > > > > > > > > > > > OSCAR packages – nothing to configure”. I
> choose
> > > > > openmpi,
> > > > > > > > mpich
> > > > > > > > > > and
> > > > > > > > > > > > lam to install so it has to see those mpi
> > > > > > implementations.
> > > > > > > > > > > > 
> > > > > > > > > > > > Regards,
> > > > > > > > > > > > Nikita
> > > > > > > > > > > > 
> > > > > > > > > > > > -----Original Message-----
> > > > > > > > > > > > From: geoffroy.val...@free.fr
> > > > > > > [mailto:geoffroy.val...@free.fr]
> > > > > > > > 
> > > > > > > > > > > > Sent: Thursday, February 10, 2011 10:15 AM
> > > > > > > > > > > > To: Nikita Andreev
> > > > > > > > > > > > Cc: oscar-users@lists.sourceforge.net
> > > > > > > > > > > > Subject: Re: [Oscar-users] Problem with GUI on
> > > CentOS
> > > > > 5.5
> > > > > > > > > > > > 
> > > > > > > > > > > > Nikita,
> > > > > > > > > > > > 
> > > > > > > > > > > > As i said, you may want to try to _not_ install
> > the
> > > > > > > > third-part
> > > > > > > > > > > > packages that are creating the problems, the
> core
> > > of
> > > > > > OSCAR
> > > > > > > > > should
> > > > > > > > > > > work
> > > > > > > > > > > > pretty well (a user found a problem in C3 but
> we
> > > > > already
> > > > > > > have
> > > > > > > > a
> > > > > > > > > > fix
> > > > > > > > > > > > that should be available very shortly). It is
> not
> > a
> > > > > > > question
> > > > > > > > of
> > > > > > > > > > > > version of distribution but only that i cannot
> > fix
> > > > all
> > > > > > the
> > > > > > > > bugs
> > > > > > > > > > > right
> > > > > > > > > > > > away.
> > > > > > > > > > > > 
> > > > > > > > > > > > So please, try to new packages. Obviously the
> > > > problems
> > > > > > you
> > > > > > > > are
> > > > > > > > > > > facing
> > > > > > > > > > > > are because of these issues and if you are
> still
> > > > > blocked,
> > > > > > > it
> > > > > > > > > > means
> > > > > > > > > > > > that you did not really fixed them. I try to be
> > > > > reactive,
> > > > > > > to
> > > > > > > > > help
> > > > > > > > > > > you,
> > > > > > > > > > > > so could you please try to do what i suggest to
> > > you?
> > > > > That
> > > > > > > > will
> > > > > > > > > > > really
> > > > > > > > > > > > help.
> > > > > > > > > > > > So to summarize, i pushed new RPMs for Torque
> in
> > > > > > unstable,
> > > > > > > i
> > > > > > > > > > really
> > > > > > > > > > > > advice you to test them. I am about to push new
> > > > > packages
> > > > > > > for
> > > > > > > > > > > Ganglia
> > > > > > > > > > > > too, i really advice you to use these packages
> > too.
> > > > If
> > > > > > you
> > > > > > > do
> > > > > > > > > not
> > > > > > > > > > > want
> > > > > > > > > > > > to use these packages, i will not be able to
> help
> > > > you.
> > > > > > > > > > > > 
> > > > > > > > > > > > BTW, if you fixed a problem by yourself, could
> > you
> > > at
> > > > > > least
> > > > > > > > > > > explained
> > > > > > > > > > > > what you did? Otherwise i do not see how OSCAR
> > > could
> > > > > get
> > > > > > > > > better;
> > > > > > > > > > > this
> > > > > > > > > > > > is an open source project, if i do not get
> > > feedback,
> > > > > > there
> > > > > > > is
> > > > > > > > > no
> > > > > > > > > > > point
> > > > > > > > > > > > of continuing on my side, i could maintain
> OSCAR
> > on
> > > > my
> > > > > > side
> > > > > > > > for
> > > > > > > > > > my
> > > > > > > > > > > own
> > > > > > > > > > > > needs and do not distribute it.
> > > > > > > > > > > > 
> > > > > > > > > > > > BTW, i am an OSCAR developer and the project
> > chair,
> > > > and
> > > > > i
> > > > > > > do
> > > > > > > > > not
> > > > > > > > > > > > appreciate the fact that you question my
> position
> > > in
> > > > > the
> > > > > > > > > project,
> > > > > > > > > > > this
> > > > > > > > > > > > is almost insutling. My name is official
> > displayed
> > > on
> > > > > the
> > > > > > > > > > website,
> > > > > > > > > > > and
> > > > > > > > > > > > not difficult to find:
> > > > > > > > > > > >
> > > > > > http://svn.oscar.openclustergroup.org/trac/oscar/wiki/Team
> > > > > > > > > > > > 
> > > > > > > > > > > > Regards,
> > > > > > > > > > > > 
> > > > > > > > > > > > ----- "Nikita Andreev" <n...@kemsu.ru> a écrit :
> > > > > > > > > > > > 
> > > > > > > > > > > > > > Based on your logs, many packages failed to
> > > > > install:
> > > > > > > > > > > > > disable-services, ganglia, lam, maui, mpich,
> > > > > ntpconfig,
> > > > > > > > > > openmpi,
> > > > > > > > > > > > > opium, torque
> > > > > > > > > > > > > 
> > > > > > > > > > > > > Please look below. I fixed torque problem by
> > > myself
> > > > > and
> > > > > > > > > > > everything
> > > > > > > > > > > > > went fine until step 4. disable-services,
> > > ganglia,
> > > > > lam,
> > > > > > > > maui,
> > > > > > > > > > > > mpich,
> > > > > > > > > > > > > ntpconfig, openmpi, opium and torque installs
> > > just
> > > > > > fine.
> > > > > > > > > > > > > 
> > > > > > > > > > > > > > These packages are third-party packages and
> > > based
> > > > > on
> > > > > > > the
> > > > > > > > > > logs,
> > > > > > > > > > > i
> > > > > > > > > > > > > cannot tell you why they were not installed
> > > > > > successfully.
> > > > > > > > So
> > > > > > > > > > the
> > > > > > > > > > > > first
> > > > > > > > > > > > > thing i advice you to do is to unselect these
> > > > > packages
> > > > > > > > using
> > > > > > > > > > > > selector
> > > > > > > > > > > > > and try to deploy again the cluster.
> > > > > > > > > > > > > 
> > > > > > > > > > > > > Proceed through the logs down below and you
> > will
> > > > find
> > > > > > > that
> > > > > > > > > > actual
> > > > > > > > > > > > > error is some GUI bug in step 4. It pops up
> > just
> > > > > after
> > > > > > I
> > > > > > > > push
> > > > > > > > > > the
> > > > > > > > > > > > > button in GUI.
> > > > > > > > > > > > > 
> > > > > > > > > > > > > > Finally, about your question about PBS,
> > > remember
> > > > > that
> > > > > > > > this
> > > > > > > > > is
> > > > > > > > > > a
> > > > > > > > > > > > free
> > > > > > > > > > > > > software and that i am working on OSCAR on my
> > > free
> > > > > time
> > > > > > > > > > > > > 
> > > > > > > > > > > > > I understand.
> > > > > > > > > > > > > 
> > > > > > > > > > > > > So the point is that I can't install OSCAR on
> > top
> > > > of
> > > > > > > CentOS
> > > > > > > > > > 5.5.
> > > > > > > > > > > > Since
> > > > > > > > > > > > > it's a clean vanilla installation it should
> be
> > > > common
> > > > > > for
> > > > > > > > > > CentOS
> > > > > > > > > > > > 5.5
> > > > > > > > > > > > > for everyone. I fully understand that there
> is
> > no
> > > > > > > official
> > > > > > > > > > > support
> > > > > > > > > > > > > from OSCAR and I'm on my own. But OSCAR have
> to
> > > > > install
> > > > > > > at
> > > > > > > > > > least
> > > > > > > > > > > on
> > > > > > > > > > > > > some versions of CentOS. Can you give my some
> > > clue?
> > > > > > Maybe
> > > > > > > I
> > > > > > > > > > > should
> > > > > > > > > > > > try
> > > > > > > > > > > > > CentOS 5.4? It's a tedious and time consuming
> > > > process
> > > > > > of
> > > > > > > > > > > > reinstalling
> > > > > > > > > > > > > everything that's why I would be much more
> > > > optimistic
> > > > > > if
> > > > > > > > this
> > > > > > > > > > > > advice
> > > > > > > > > > > > > will come from OSCAR developers.
> > > > > > > > > > > > > 
> > > > > > > > > > > > > Regards,
> > > > > > > > > > > > > Nikita
> > > > > > > > > > > > > 
> > > > > > > > > > > > > ----- "Nikita Andreev" <n...@kemsu.ru> a écrit
> :
> > > > > > > > > > > > > 
> > > > > > > > > > > > > > Dear Geoffroy,
> > > > > > > > > > > > > > 
> > > > > > > > > > > > > > I want to emphasize that main problem is
> not
> > > > > Torque.
> > > > > > It
> > > > > > > > > seems
> > > > > > > > > > > > like
> > > > > > > > > > > > > > it's a problem with GUI which doesn't allow
> > me
> > > to
> > > > > > > install
> > > > > > > > > > OSCAR
> > > > > > > > > > > > at
> > > > > > > > > > > > > > all. Neither with Torque nor without it.
> > > > > > > > > > > > > > 
> > > > > > > > > > > > > > Which version of OSCAR prior to version
> 6.1.0
> > > is
> > > > > > > stable?
> > > > > > > > > > > > > > 
> > > > > > > > > > > > > > Are all CentOS versions supported or only
> > > > specific
> > > > > > > > > versions?
> > > > > > > > > > > > > > 
> > > > > > > > > > > > > > By the way, why Torque won't be supported?
> > > Which
> > > > > PBS
> > > > > > > will
> > > > > > > > > be
> > > > > > > > > > > > > supported
> > > > > > > > > > > > > > then?
> > > > > > > > > > > > > > 
> > > > > > > > > > > > > > Thank you for your time.
> > > > > > > > > > > > > > 
> > > > > > > > > > > > > > Sincerely,
> > > > > > > > > > > > > > Nikita
> > > > > > > > > > > > > > 
> > > > > > > > > > > > > > -----Original Message-----
> > > > > > > > > > > > > > From: geoffroy.val...@free.fr
> > > > > > > > > [mailto:geoffroy.val...@free.fr]
> > > > > > > > > > 
> > > > > > > > > > > > > > Sent: Wednesday, February 09, 2011 1:56 AM
> > > > > > > > > > > > > > To: oscar-users@lists.sourceforge.net
> > > > > > > > > > > > > > Cc: n...@kemsu.ru
> > > > > > > > > > > > > > Subject: Re: [Oscar-users] Problem with GUI
> > on
> > > > > CentOS
> > > > > > > 5.5
> > > > > > > > > > > > > > 
> > > > > > > > > > > > > > Hi,
> > > > > > > > > > > > > > 
> > > > > > > > > > > > > > I am having problems with Torque since a
> > while
> > > > > > > > > (dependencies
> > > > > > > > > > > > issues
> > > > > > > > > > > > > > and problems building the binary packages).
> > > > > > > > Unfortunately,
> > > > > > > > > i
> > > > > > > > > > do
> > > > > > > > > > > > not
> > > > > > > > > > > > > > have the time to fix that specific issue at
> > the
> > > > > > moment,
> > > > > > > > > since
> > > > > > > > > > i
> > > > > > > > > > > > am
> > > > > > > > > > > > > > focusing on the release of oscar-6.1.0
> (which
> > > > will
> > > > > > > _not_
> > > > > > > > > > > support
> > > > > > > > > > > > > > Torque at first).
> > > > > > > > > > > > > > 
> > > > > > > > > > > > > > A ticket is open related to that problem:
> > > > > > > > > > > > > >
> > > > > > > >
> > http://svn.oscar.openclustergroup.org/trac/oscar/ticket/572
> > > > > > > > > > > > > > 
> > > > > > > > > > > > > > Once OSCAR-6.1.0 will be released, i will
> > focus
> > > > on
> > > > > > > fixing
> > > > > > > > > all
> > > > > > > > > > > > known
> > > > > > > > > > > > > > issues related to OPKGs, the goal of 6.1.0
> > > being
> > > > to
> > > > > > > > provide
> > > > > > > > > a
> > > > > > > > > > > > > robust
> > > > > > > > > > > > > > OSCAR core.
> > > > > > > > > > > > > > 
> > > > > > > > > > > > > > Regards,
> > > > > > > > > > > > > > 
> > > > > > > > > > > > > > ----- "Nikita Andreev" <n...@kemsu.ru> a
> écrit
> > :
> > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > I’m trying to deploy a cluster from
> CentOS
> > > 5.5
> > > > > > x86_64
> > > > > > > > > with
> > > > > > > > > > > > OSCAR
> > > > > > > > > > > > > > > 6.0.5. It’s a fresh install. I had a
> > problem
> > > > with
> > > > > > > > > > > > > torque-modulefile
> > > > > > > > > > > > > > > dependency with the following error:
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > opkg-torque-server-2.1.13-1.noarch from
> > > > > > > > > > > unstable_rhel-5-x86_64
> > > > > > > > > > > > > has
> > > > > > > > > > > > > > > depsolving problems
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > --> Missing Dependency: torque-modulefile
> > is
> > > > > needed
> > > > > > > by
> > > > > > > > > > > package
> > > > > > > > > > > > > > > opkg-torque-server-2.1.13-1.noarch
> > > > > > > > > (unstable_rhel-5-x86_64)
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > I’ve resolved it by manually downloading
> > > > > > > > > opkg-torque-server
> > > > > > > > > > > and
> > > > > > > > > > > > > > > rebuilding with torque-oscar-modulefile
> > > > > dependency
> > > > > > > > which
> > > > > > > > > I
> > > > > > > > > > > > > believe
> > > > > > > > > > > > > > is
> > > > > > > > > > > > > > > correct.
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > At the moment I have two issues I can’t
> > > resolve
> > > > > by
> > > > > > > > > myself:
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 1. When I go into configuring switcher
> from
> > > the
> > > > > GUI
> > > > > > I
> > > > > > > > > see:
> > > > > > > > > > > “No
> > > > > > > > > > > > > > > pkg_config were supplied by any OSCAR
> > > packages
> > > > –
> > > > > > > > nothing
> > > > > > > > > to
> > > > > > > > > > > > > > > configure”. There should be something to
> > > > > configure
> > > > > > > > since
> > > > > > > > > I
> > > > > > > > > > > > choose
> > > > > > > > > > > > > > > openmpi, mpich and lam to install.
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 2. I can’t invoke 4th step. I get the
> > > following
> > > > > > error
> > > > > > > > > (find
> > > > > > > > > > > log
> > > > > > > > > > > > > > > attached):
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > Tk::Error: Can't set -options to
> > > > > `ARRAY(0xd34c710)'
> > > > > > > for
> > > > > > > > > > > > > > > Tk::Optionmenu=HASH(0xd360660): No -label
> > at
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> /usr/lib64/perl5/vendor_perl/5.8.8/x86_64-linux-thread-multi/Tk/Widget.pm
> > > > > > > > > > > > > > > line 256
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > If CentOS 5.5 isn’t supported then please
> > > > provide
> > > > > > me
> > > > > > > > with
> > > > > > > > > > > > > supported
> > > > > > > > > > > > > > > version. Is it 5.4?
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > Another question would be why all
> necessary
> > > > > > packages
> > > > > > > > like
> > > > > > > > > > > > torque,
> > > > > > > > > > > > > > > maui, etc .. are under “Experimental”
> > package
> > > > > set?
> > > > > > Is
> > > > > > > > > OSCAR
> > > > > > > > > > > > 6.0.5
> > > > > > > > > > > > > > an
> > > > > > > > > > > > > > > unstable version and I should rollback to
> > > > version
> > > > > > 5?
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > Regards,
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > Nikita
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > > 
> > > > > > > > > > > > > > >
> > > > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > >
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> ------------------------------------------------------------------------------
> > > > > > > > > > > > > > > The ultimate all-in-one performance
> > toolkit:
> > > > > > Intel(R)
> > > > > > > > > > > Parallel
> > > > > > > > > > > > > > Studio
> > > > > > > > > > > > > > > XE:
> > > > > > > > > > > > > > > Pinpoint memory and threading errors
> before
> > > > they
> > > > > > > > happen.
> > > > > > > > > > > > > > > Find and fix more than 250 security
> defects
> > > in
> > > > > the
> > > > > > > > > > > development
> > > > > > > > > > > > > > cycle.
> > > > > > > > > > > > > > > Locate bottlenecks in serial and parallel
> > > code
> > > > > that
> > > > > > > > limit
> > > > > > > > > > > > > > > performance.
> > > > > > > > > > > > > > > http://p.sf.net/sfu/intel-dev2devfeb
> > > > > > > > > > > > > > >
> > > _______________________________________________
> > > > > > > > > > > > > > > Oscar-users mailing list
> > > > > > > > > > > > > > > Oscar-users@lists.sourceforge.net
> > > > > > > > > > > > > > >
> > > > > > > >
> https://lists.sourceforge.net/lists/listinfo/oscar-users
> > > > > > > > > > > 
> > > > > > > > > > > 
> > > > > > > > > > > 
> > > > > > > > > > >
> > > > > > > > > >
> > > > > > > > >
> > > > > > > >
> > > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> ------------------------------------------------------------------------------
> > > > > > > > > > > The ultimate all-in-one performance toolkit:
> > Intel(R)
> > > > > > > Parallel
> > > > > > > > > > Studio
> > > > > > > > > > > XE:
> > > > > > > > > > > Pinpoint memory and threading errors before they
> > > > happen.
> > > > > > > > > > > Find and fix more than 250 security defects in
> the
> > > > > > > development
> > > > > > > > > > cycle.
> > > > > > > > > > > Locate bottlenecks in serial and parallel code
> that
> > > > limit
> > > > > > > > > > > performance.
> > > > > > > > > > > http://p.sf.net/sfu/intel-dev2devfeb
> > > > > > > > > > > _______________________________________________
> > > > > > > > > > > Oscar-users mailing list
> > > > > > > > > > > Oscar-users@lists.sourceforge.net
> > > > > > > > > > >
> > > https://lists.sourceforge.net/lists/listinfo/oscar-users

------------------------------------------------------------------------------
The ultimate all-in-one performance toolkit: Intel(R) Parallel Studio XE:
Pinpoint memory and threading errors before they happen.
Find and fix more than 250 security defects in the development cycle.
Locate bottlenecks in serial and parallel code that limit performance.
http://p.sf.net/sfu/intel-dev2devfeb
_______________________________________________
Oscar-users mailing list
Oscar-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/oscar-users

Reply via email to