Am 25.09.2012 um 14:26 schrieb Jérémie Dubois-Lacoste: > Hi All, > > We recently reinstalled our cluster and we have some serious issues. > Contrary to our previous installation, we now installed a fully 64bits > system. We use Rocks cluster 6\CentOS 6.3, > and SGE 6.2u5. > > The memory values reported by SGE are very high compared > to the actual need of every jobs, and many get killed because > they exceed the limit, while they should not. > I found this thread about too low memory reports: > http://comments.gmane.org/gmane.comp.clustering.gridengine.users/19303 > > But I didn't find anything about too high memory reports... > > > Here is a simple test to make it clear: > > I submit a very stupid python script "minimal.py", wich is just: > ----- > import time > > time.sleep(30) > print("done") > ----- > > * I tried to run it directly to check the memory consumption with: > $ /usr/bin/time -v python minimal.py > And I get: Maximum resident set size (kbytes): 15376 > > > * Then, when submitting the jobs with: > qsub -m ase -M <my_mail> -b y -N memTest -o test.out -e test.err -cwd > "python minimal.py" > I go checking on the computation node where it gets scheduled and I "top": > PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND > 20240 myName 23 3 114m 3844 1832 S 0.0 0.0 0:00.14 python minimal.py
The virtual size is listed here as 114m as well. -- Reuti > So I understand it uses 3.8Mb of RAM. > > > * But from the e-mail I get when the jobs terminate: > Job 1879536 (memTest) Complete > User = myName > Queue = [email protected] > Host = compute-3-0.local > Start Time = 09/25/2012 13:46:45 > End Time = 09/25/2012 13:47:15 > User Time = 00:00:00 > System Time = 00:00:00 > Wallclock Time = 00:00:30 > CPU = 00:00:00 > Max vmem = 114.441M > Exit Status = 0 > > > It says 114Mb, I don't understand this huge difference. > > > The consequence is that most of the jobs get killed by "fakely" (I presume) > exceeding the hard memory limit. Any clue is welcome! > > > Sincerely, > > Jérémie > > _______________________________________________ > users mailing list > [email protected] > https://gridengine.org/mailman/listinfo/users > _______________________________________________ users mailing list [email protected] https://gridengine.org/mailman/listinfo/users
