On 01/12/13 04:12 AM, Sucheta Tripathy wrote: > > I am trying to run Ray for a while now on our Illumina data. It is run in > parallel mode and memory is not a constraint now because we have a cluster > with 16 X 16 GB memory in each node. >
The problem is this: tmp1 and tmp2 don't contain the same number of sequences. Using paired files with different number of sequences will cause crash in Ray 2.2.0 and earlier versions. If you use Ray 2.3.0, Ray will throw an error. Also, lane1_1b and lane_1_2b are empty (0 sequences). You may want to check that out too. > my Command: > mpiexec -n 64 Ray \ > -k \ > 31 \ > -disable-recycling \ > -p \ > /scratch/sucheta/tmp1.fastq \ > /scratch/sucheta/tmp2.fastq \ > -p \ > /scratch/sucheta/lane1_1b.fq \ > /scratch/sucheta/lane1_2b.fq \ > -o \ > rayTest > > Content of NumberOfSequences.txt > Files: 4 > > FileNumber: 0 > FilePath: /scratch/sucheta/tmp1.fastq > NumberOfSequences: 73013670 > FirstSequence: 0 > LastSequence: 73013669 > > FileNumber: 1 > FilePath: /scratch/sucheta/tmp2.fastq > NumberOfSequences: 72339350 > FirstSequence: 73013670 > LastSequence: 145353019 > > FileNumber: 2 > FilePath: /scratch/sucheta/lane1_1b.fq > NumberOfSequences: 0 > > FileNumber: 3 > FilePath: /scratch/sucheta/lane1_2b.fq > NumberOfSequences: 0 > > Summary > NumberOfSequences: 145353020 > FirstSequence: 0 > LastSequence: 145353019 > > > However, after running for a while Ray terminates: This is the last few lines > of STDOUT: > > --- > > Speed RAY_SLAVE_MODE_EXTENSION 1183 units/second > Rank 9: assembler memory usage: 850432 KiB > Rank 9 is changing direction. > Rank 12: assembler memory usage: 826944 KiB > Rank 12 starts on seed 1200, length is 477, flow 0 [1200/6451] > Rank 12 starts on seed 1201, length is 476, flow 0 [1201/6451] > Rank 10 traversed 1273646 nucleotide symbols > Rank 10: assembler memory usage: 869460 KiB > Rank 10 starts on seed 534, length is 906, flow 0 [534/6579] > Rank 31 reached 2270 vertices from seed 491, flow 1 > Speed RAY_SLAVE_MODE_EXTENSION 1174 units/second > Rank 31: assembler memory usage: 866768 KiB > Rank 31 is changing direction. > Rank 2 traversed 1910058 nucleotide symbols > Rank 2 reached 1269 vertices from seed 1007, flow 2 > Speed RAY_SLAVE_MODE_EXTENSION 1435 units/second > Rank 2: assembler memory usage: 843432 KiB > Rank 2 (extension done) NumberOfFlows: 2 > Rank 2 FlowedVertices: 0 576 1 1269 2 1269 > Rank 2: assembler memory usage: 839332 KiB > Rank 2 starts on seed 1008, length is 576, flow 0 [1008/6537] > Terminated > --- > > It creates the following output files before crashing: > > CoverageDistributionAnalysis.txt NumberOfSequences.txt > CoverageDistribution.txt ParallelPaths.txt > degreeDistribution.txt Plugins > ElapsedTime.txt RayCommand.txt > FilePartition.txt RayPlatform_Version.txt > GraphPartition.txt RayVersion.txt > LibraryData.xml Scheduling > LibraryStatistics.txt SeedLengthDistribution.txt > NetworkTest.txt SequencePartition.txt > > > Any help in this regard will be much appreciated! > > > -- > Sucheta Tripathy, Ph.D > Scientist, Ramalingaswamy Fellow, > Indian Institute of Chemical Biology, > Kolkata, India. > https://sites.google.com/site/suchetalab/ > https://twitter.com/tsucheta ------------------------------------------------------------------------------ Sponsored by Intel(R) XDK Develop, test and display web and hybrid apps with a single code base. Download it for free now! http://pubads.g.doubleclick.net/gampad/clk?id=111408631&iu=/4140/ostg.clktrk _______________________________________________ Denovoassembler-users mailing list Denovoassembler-users@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/denovoassembler-users