Greetings, Here's what I did. 1) unpack ge tarballs into /opt/ge on all hosts 2) configure grid master 3) scp /opt/ge/default to all hosts 4) verify ssh works back and forth among all hosts as root 5) run ./start_gui_installer -debug 6) Install all execution hosts
This is shared nothing, so there are no filesystems shared among the systems. Are there any other configurations which I need to do? I did this a few months ago, but I'm wondering if I missed something this time around. qrsh, and qlogin work for some of the hosts. qsh works for most of the hosts. I'm seeing errors like this on the qmaster host: 04/01/2011 16:22:48|schedu|qmasterhost|E|unable to find job 1197 from the scheduler order package 04/01/2011 16:23:03|schedu|qmasterhost |E|could not find job "1197" in master list 04/01/2011 16:23:03|schedu|qmasterhost |E|callback function for event "48. EVENT DEL JOB 1197.1" failed And seeing messages like this on execution hosts: 04/01/2011 16:06:49| main|exehost1|W|reaping job "1190" ptf complains: Job does not exist 04/01/2011 16:06:49| main|exehost1|E|can't open file active_jobs/1190.1/error: No such file or directory Thanks, Bill _______________________________________________ users mailing list [email protected] https://gridengine.org/mailman/listinfo/users
