Sorry not path, but patch.

Louis

On 12-11-08 02:46 PM, Louis Letourneau wrote:
> With the path I went from 6*120hrs(walltime was hit each time) to:
> resources_used.walltime=47:35:27
> 
> geez...I should have asked earlier :-)
> 
> 
> thanks
> Louis
> 
> On 12-11-05 04:52 PM, Sébastien Boisvert wrote:
>> On 11/05/2012 02:32 PM, Louis Letourneau wrote:
>>> Darn I forgot about that bug and I saw it pass on the mailing list too.
>>>
>>> Sorry for the post then.
>>>
>>
>> Not sure I sent an email about that patch ;-)
>>   
>>> I saw your mini-rank posting, I think it's a wonderful idea especially
>>> since there are more and more cores per nodes now.
>>>
>>> Is the infinit loop fixed in the mini-rank codebase?
>>
>> It is not an infinite loop, it just that a loop over a k-mer with
>> a coverage of 99999 (a large coverage) takes a while with all the messages.
>>
>>> If not should I just apply the patch?
>>>
>>
>> Yeah, the patch fixes the long running time in the scaffolding.
>>
>>> Again, thanks for the great work.
>>>
>>
>> Thanks for the testing !
>>
>>> Louis
>>>
>>> On 12-11-05 12:17 PM, Sébastien Boisvert wrote:
>>>> On 11/05/2012 11:14 AM, Louis Letourneau wrote:
>>>>> I have assembled 2 >2.5G genomes (not the same, both mammals) in about
>>>>> 48hrs using 2025cores. This works great.
>>>>>
>>>>
>>>> Nice.
>>>>
>>>>> I'm trying to assemble a fish and I am having issues I don't quite know
>>>>> how to debug.
>>>>>
>>>>> The fish is about 1.9Gb in size and not diploid.
>>>>>
>>>>> If I run Ray using the paired + mates using k31 I was able to assemble
>>>>> it in 168hours (I needed to restart after 120hours because of
>>>>> walltime...thanks for the checkpoints :-) ).
>>>>>
>>>>
>>>> That's quite long, what's the latency ?
>>>>
>>>> We are working on a new programming model called "mini-ranks" to better use
>>>> super computer with a lot of nodes, but also with a lot of cores per node.
>>>>
>>>> Ray uses RayPlatform, and RayPlatform uses MPI. In the new model, 
>>>> RayPlatform
>>>> uses "mini-ranks".
>>>>
>>>> The current model in RayPlatform is to use pure MPI programming model, 
>>>> which
>>>> can be really bad on some super computer if there is just one network card
>>>> on each node that must serve lots of MPI processes.
>>>>
>>>> If you are interested, we have a experimental branch called for mini-ranks 
>>>> that
>>>> can use only 1 MPI process per node, and as many IEEE POSIX threads (one 
>>>> for each
>>>> mini-ranks).
>>>>
>>>> With mini-ranks, the routing code in RayPlatform will become obsolete !
>>>>
>>>> Some latency results:
>>>>
>>>>
>>>> Table 1: Comparison of MPI ranks with mini-ranks on the Colosse
>>>> super-computer at Laval University.
>>>> +-------+---------------------------------------------------+
>>>> | Cores | Average round-trip latency (us)                   |
>>>> +-------+-----------------------+---------------------------+
>>>> |       | MPI ranks             | mini-ranks                |
>>>> |       | (pure MPI)            | (MPI + pthread)           |
>>>> +-------+-----------------------+---------------------------+
>>>> | 8     | 11.25 +/- 0           | 24.1429 +/- 0             |
>>>> | 16    | 35.875 +/- 6.92369    | 43.0179 +/- 8.76275       |
>>>> | 32    | 66.3125 +/- 6.76387   | 41.7143 +/- 1.23924       |
>>>> | 64    | 90 +/- 16.5265        | 37.75 +/- 6.41984         |
>>>> | 128   | 126.562 +/- 25.0116   | 43.0179 +/- 8.76275       |
>>>> | 256   | 203.637 +/- 67.4579   | 44.6429 +/- 6.11862       |
>>>> +-------+-----------------------+---------------------------+
>>>>
>>>> If you want to try that:
>>>>
>>>> git clone g...@github.com:sebhtml/RayPlatform.git
>>>> cd RayPlatform;git checkout minirank-model;cd ..
>>>> git clone g...@github.com:sebhtml/ray.git
>>>> cd ray;git checkout minirank-model;
>>>> make
>>>>
>>>> then, to run on 100 nodes, with 24 cores per node:
>>>>
>>>>
>>>> mpiexec -n 100 -bynode Ray -mini-ranks-per-rank 23 \
>>>> ...
>>>>
>>>> Notes:
>>>>
>>>> 1. the -bynode is necessary in Open-MPI because the default is -byslot.
>>>> -byslot will work also if the job scheduler presents the slots in a by-node
>>>> round-robin strategy.
>>>>
>>>> 2. It is important to throw 23 mini-ranks per MPI process and not 24
>>>> because each MPI process has a communication thread too and you
>>>> don't want to oversubscribe the folks on the CPU at all.
>>>>
>>>> 3. The mini-rank code contains 0 (zero) locks, 0 mutexes, 0 spinlocks, 0 
>>>> semaphore.
>>>> The code is non-blocking, and lock-free which is why it works so well.
>>>>
>>>> 4. This work should be merged once I have made additional sanity checks.
>>>>
>>>> 5. If you want to look at the code, the class MessageQueue is particularly 
>>>> interesting.
>>>>
>>>>> It worked (although the assembly wasn't great, possibly due to a lot of
>>>>> repeats), but took way longer than the bigger genomes.
>>>>>
>>>>> I'm trying the same without the mates. I also changed the kmer from k31
>>>>> to k61.
>>>>>
>>>>> I hit walltime 5 times now, 120hours each and it's not finished.
>>>>>
>>>>> The variables that changed are kmer and no mates.
>>>>>
>>>>> The first run ran many steps in the log.
>>>>> Since the first wall time, the only output I seem to be having
>>>>>
>>>>> Rank X: gathering scaffold links [Y/2987] [Z/7166]
>>>>>
>>>>> (X,Y,Z varies of course)
>>>>>
>>>>
>>>> Known bug where the Ray stalls on repeats too long...
>>>>
>>>> https://github.com/sebhtml/ray/issues/91
>>>>
>>>> This is because of a bug (1 month old, actually). I have a patch in the 
>>>> queue,
>>>> but I am not satisfied by its impact overall. The patch fixes the problem 
>>>> of running
>>>> time though. I will solves this bug in the scaffolder when I have time.
>>>> Meanwhile, you can use the patch, which solves the problem, but it's a 
>>>> dirty hack.
>>>>
>>>> You can test this patch:
>>>>
>>>> wget 
>>>> http://downloads.sourceforge.net/project/denovoassembler/Ray-v2.1.0.tar.bz2
>>>> tar -xjf Ray-v2.1.0.tar.bz2
>>>> cd Ray-v2.1.0
>>>> wget 
>>>> https://github.com/sebhtml/patches/raw/master/ray/human-seb-from-13efb22270e4f563c9cafc.patc
>>>> patch -p1 < human-seb-from-13efb22270e4f563c9cafc.patch
>>>>
>>>> make ...
>>>>
>>>>> I was using a version compiled from sources for the polytope routing.
>>>>>
>>>>
>>>> As I said, "mini-ranks" *will* supercede the virtual routing subsystem. 
>>>> The problem with
>>>> virtual routing is that it increases the number of physical hops. With 
>>>> mini-ranks,
>>>> it is not the case at all.
>>>>
>>>>> Any ideas?
>>>>>
>>>>
>>>> To wrap-up:
>>>>
>>>> 1. Try mini-ranks;
>>>> 2. Try the patch;
>>>>
>>>>
>>>> p.s.: I should resume the patchwork, branch merging once I am done 
>>>> implementing
>>>> the reviewers' concerns for my Debian package and Fedora package for Ray.
>>>>
>>>> p.s.2: For your information, our paper about Ray Meta should appear 
>>>> somewhere in the
>>>> near future, it is in re-review (the reviewers are assessing our revised 
>>>> manuscript).
>>>>
>>>>> Louis
>>>>>
>>>>> ------------------------------------------------------------------------------
>>>>> LogMeIn Central: Instant, anywhere, Remote PC access and management.
>>>>> Stay in control, update software, and manage PCs from one command center
>>>>> Diagnose problems and improve visibility into emerging IT issues
>>>>> Automate, monitor and manage. Do more in less time with Central
>>>>> http://p.sf.net/sfu/logmein12331_d2d
>>>>> _______________________________________________
>>>>> Denovoassembler-users mailing list
>>>>> Denovoassembler-users@lists.sourceforge.net
>>>>> https://lists.sourceforge.net/lists/listinfo/denovoassembler-users
>>>>>
>>>>
>>>>
>>>
>>> ------------------------------------------------------------------------------
>>> LogMeIn Central: Instant, anywhere, Remote PC access and management.
>>> Stay in control, update software, and manage PCs from one command center
>>> Diagnose problems and improve visibility into emerging IT issues
>>> Automate, monitor and manage. Do more in less time with Central
>>> http://p.sf.net/sfu/logmein12331_d2d
>>> _______________________________________________
>>> Denovoassembler-users mailing list
>>> Denovoassembler-users@lists.sourceforge.net
>>> https://lists.sourceforge.net/lists/listinfo/denovoassembler-users
>>>
>>
>>
> 
> ------------------------------------------------------------------------------
> Everyone hates slow websites. So do we.
> Make your web apps faster with AppDynamics
> Download AppDynamics Lite for free today:
> http://p.sf.net/sfu/appdyn_d2d_nov
> _______________________________________________
> Denovoassembler-users mailing list
> Denovoassembler-users@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/denovoassembler-users
> 

------------------------------------------------------------------------------
Everyone hates slow websites. So do we.
Make your web apps faster with AppDynamics
Download AppDynamics Lite for free today:
http://p.sf.net/sfu/appdyn_d2d_nov
_______________________________________________
Denovoassembler-users mailing list
Denovoassembler-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/denovoassembler-users

Reply via email to