Hi Russell,
If I understand your workload correctly, and your "next generation"
model (~5M tasks) still requires ~40 hours to process on a single
machine, then your tasks average 28.8 ms each (40 hours / 5M tasks)!
Please correct me if I misunderstood your workload characteristics.
With task lengths in this range, you are looking at dispatch and
execution rates of about 34.72 tasks/sec per node (1000 ms / 28.8
ms/task). If you have 100 nodes, you need 3,472 tasks/sec of overall
system throughput to keep all 100 nodes busy with 28.8 ms tasks.
Typical production LRMs (local resource managers) have throughput in
the ~1 job/sec range, and development versions of these LRMs are
pushing 10~20 jobs/sec.
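As a quick sanity check, here is the arithmetic above in a few lines of Java (the constants simply restate the numbers from this thread):

```java
// Back-of-the-envelope check of the throughput numbers above.
public class ThroughputEstimate {
    public static void main(String[] args) {
        double totalMs = 40 * 3600 * 1000.0;     // 40 hours in milliseconds
        double tasks = 5_000_000.0;              // ~5M independent tasks
        double msPerTask = totalMs / tasks;      // 28.8 ms per task
        double perNodeRate = 1000.0 / msPerTask; // ~34.72 tasks/sec per node
        double systemRate = perNodeRate * 100;   // ~3,472 tasks/sec for 100 nodes
        System.out.printf("%.1f ms/task, %.2f tasks/sec/node, %.0f tasks/sec system%n",
                msPerTask, perNodeRate, systemRate);
    }
}
```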
Our own work on the Falkon project
(http://people.cs.uchicago.edu/~iraicu/research/Falkon/) works with
existing LRMs and has achieved rates in the ~500 tasks/sec range. We
have also scaled Falkon to 2M queued tasks with 1.5GB of memory, and
it should scale to your workload size of 5M tasks with a proportional
increase in memory (roughly 3.75GB). We are now working to improve
throughput further by parallelizing the Falkon architecture! BTW,
Falkon is implemented in Java and uses the Globus Toolkit 4. We have
not tested it on Windows, but there is nothing inherent that would
stop it from working in a Windows environment (with the exception of
some scripts, perhaps).
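To make the dispatch model concrete, here is a minimal in-process master/worker sketch in Java. To be clear, this is NOT Falkon code, just an illustration of the pattern: a pool of workers draining a stream of short tasks, the way a dispatcher would keep many nodes busy (the class and method names are my own, and the thread/task counts are scaled-down stand-ins):

```java
import java.util.concurrent.*;
import java.util.concurrent.atomic.AtomicLong;

// Minimal master/worker sketch (not Falkon's implementation): a shared
// worker pool drains many short tasks, analogous to a dispatcher
// keeping cluster nodes busy with fine-grained work.
public class MasterWorkerSketch {
    static long runTasks(int workers, long taskCount) throws InterruptedException {
        ExecutorService pool = Executors.newFixedThreadPool(workers);
        AtomicLong completed = new AtomicLong();
        for (long i = 0; i < taskCount; i++) {
            // Each Runnable stands in for one ~29 ms model task.
            Runnable task = () -> completed.incrementAndGet();
            pool.submit(task);
        }
        pool.shutdown();
        pool.awaitTermination(1, TimeUnit.MINUTES);
        return completed.get();
    }

    public static void main(String[] args) throws InterruptedException {
        // 4 worker threads stand in for the ~100 office PCs;
        // 100,000 trivial tasks stand in for the 5M model tasks.
        System.out.println("completed " + runTasks(4, 100_000) + " tasks");
    }
}
```

The key point the sketch shows is that at 28.8 ms per task, dispatch overhead dominates unless the dispatcher itself is very fast, which is exactly why LRM throughput matters here.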
Feel free to write me off-list if you have more Falkon specific questions.
Ioan
Russell Miles wrote:
I am a Database Admin at a Metropolitan Planning Organization, so we
process many complex, resource-intensive models focusing on things
like transportation and air quality. We are planning the next
generation of our modeling technology and wish to incorporate
distributed computing into the mix, since the current models can take
up to 40 hours to process on a single workstation/server. The "next
generation" model will consist of around 5 million independent tasks
whose results will be combined once all of the tasks are completed. We
wish to spread this processing over the 100 or so PCs we have in the
office, utilizing their idle CPU time.
I'm looking for some very specific advice, but all the information you
can give would be much appreciated.
1) We're trying to decide which language to develop our models in so
that they most easily coexist with grid computing code. My research
has shown that Java and .NET are the two most widely used grid
computing bases. Which do you recommend, or is there some other
technology you would suggest?
2) What third-party package, open-source package, or other software
would you recommend to most efficiently implement this solution,
focusing on performance? My research has shown that Digipede, Platform
Computing, and Alchemi are some of the more popular grid computing
platforms that work on Windows... what do you think of these? We are
open to Linux/UNIX as well, but for ease of implementation, Windows is
what we're currently running.
I appreciate any info you can provide and look forward to hearing back
from you,
Russell
--
============================================
Ioan Raicu
Ph.D. Student
============================================
Distributed Systems Laboratory
Computer Science Department
University of Chicago
1100 E. 58th Street, Ryerson Hall
Chicago, IL 60637
============================================
Email: [EMAIL PROTECTED]
Web: http://www.cs.uchicago.edu/~iraicu
http://dsl.cs.uchicago.edu/
============================================