Thanks Sébastien! I tried fixing that as well. Now both my libraries have
good-quality reads and an equal number of sequences. Here is the content of
the NumberOfSequences.txt file:
Files: 2
FileNumber: 0
FilePath: /scratch/sucheta/out1.fastq
NumberOfSequences: 69393976
FirstSequence: 0
LastSequence: 69393975
FileNumber: 1
FilePath: /scratch/sucheta/out2.fastq
NumberOfSequences: 69393976
FirstSequence: 69393976
LastSequence: 138787951
Summary
NumberOfSequences: 138787952
FirstSequence: 0
LastSequence: 138787951
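
The equal-count check on this file can also be scripted. The following is only a minimal sketch in Python, not part of Ray: the function names are made up for illustration, it assumes the "Key: value" layout shown above, and it assumes that consecutive file numbers (0/1, 2/3, ...) are the two halves of each -p pair, which only holds when every input is given with -p.

```python
def parse_number_of_sequences(text):
    """Parse the 'Key: value' layout shown above into a list of per-file dicts."""
    files = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if line == "Summary":
            break  # per-file entries end where the summary block starts
        if ":" not in line:
            continue  # blank lines and anything without a key
        key, value = (part.strip() for part in line.split(":", 1))
        if key == "FileNumber":
            current = {"FileNumber": int(value)}
            files.append(current)
        elif current is not None:
            # Numeric fields become ints; FilePath stays a string.
            current[key] = int(value) if value.isdigit() else value
    return files


def mismatched_pairs(files):
    """Return (left, right) paths whose sequence counts differ.

    Assumption: files 0/1, 2/3, ... are the two members of each -p pair.
    """
    bad = []
    for i in range(0, len(files) - 1, 2):
        left, right = files[i], files[i + 1]
        if left.get("NumberOfSequences") != right.get("NumberOfSequences"):
            bad.append((left["FilePath"], right["FilePath"]))
    return bad
```

Calling mismatched_pairs(parse_number_of_sequences(open("rayTest1/NumberOfSequences.txt").read())) should give an empty list when every pair is balanced.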
The ParallelPaths.txt file is empty (0 bytes), and I am not sure what is
making Ray crash. In the output directory, I see the following files:
-rw-r--r-- 1 sucheta iicb 332 Dec 6 07:48 CoverageDistributionAnalysis.txt
-rw-r--r-- 1 sucheta iicb 259819 Dec 6 07:48 CoverageDistribution.txt
-rw-r--r-- 1 sucheta iicb 672 Dec 6 09:28 degreeDistribution.txt
-rw-r--r-- 1 sucheta iicb 1541 Dec 6 15:41 ElapsedTime.txt
-rw-r--r-- 1 sucheta iicb 164 Dec 6 06:23 FilePartition.txt
-rw-r--r-- 1 sucheta iicb 3545 Dec 6 09:27 GraphPartition.txt
-rw-r--r-- 1 sucheta iicb 6496 Dec 6 15:41 LibraryData.xml
-rw-r--r-- 1 sucheta iicb 389 Dec 6 15:41 LibraryStatistics.txt
-rw-r--r-- 1 sucheta iicb 2388 Dec 6 06:18 NetworkTest.txt
-rw-r--r-- 1 sucheta iicb 352 Dec 6 06:23 NumberOfSequences.txt
-rw-r--r-- 1 sucheta iicb 0 Dec 6 15:42 ParallelPaths.txt
drwxr-x--- 2 sucheta iicb 4096 Dec 6 16:20 Plugins
-rw-r--r-- 1 sucheta iicb 116 Dec 6 06:18 RayCommand.txt
-rw-r--r-- 1 sucheta iicb 18 Dec 6 06:18 RayPlatform_Version.txt
-rw-r--r-- 1 sucheta iicb 10 Dec 6 06:18 RayVersion.txt
drwxr-x--- 2 sucheta iicb 4096 Dec 6 06:18 Scheduling
-rw-r--r-- 1 sucheta iicb 25434 Dec 6 15:02 SeedLengthDistribution.txt
-rw-r--r-- 1 sucheta iicb 2859 Dec 6 06:23 SequencePartition.txt
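
Zero-byte outputs such as ParallelPaths.txt above can be picked out of a listing like this automatically. This is a small sketch in Python using only the standard library; the function name empty_files is made up for illustration, and the directory name rayTest1 is taken from the -o option of the command.

```python
import os


def empty_files(directory):
    """Return the sorted names of regular files in `directory` with size 0."""
    return sorted(
        entry.name
        for entry in os.scandir(directory)
        if entry.is_file() and entry.stat().st_size == 0
    )


if __name__ == "__main__":
    # "rayTest1" is the output directory passed to -o in the command.
    if os.path.isdir("rayTest1"):
        for name in empty_files("rayTest1"):
            print(name)
```

An empty file only shows which step had not written anything when the run stopped. It does not by itself explain the crash.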
Here is my command line:
mpiexec -n 96 Ray \
-k \
21 \
-p \
/scratch/sucheta/out1.fastq \
/scratch/sucheta/out2.fastq \
-o \
rayTest1
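
Before launching a run like this, the two files given to -p can be compared directly. Below is a minimal sketch in Python that assumes plain, uncompressed FASTQ with exactly four lines per record (no wrapped sequence or quality lines). The function names are hypothetical and nothing here comes from Ray itself.

```python
def count_fastq_records(path):
    """Count records, assuming exactly four lines per FASTQ record."""
    with open(path, "rb") as handle:
        lines = sum(1 for _ in handle)
    if lines % 4 != 0:
        # A truncated or multi-line FASTQ breaks the four-line assumption.
        raise ValueError(f"{path}: {lines} lines is not a multiple of 4")
    return lines // 4


def same_record_count(left_path, right_path):
    """True when both files of a pair hold the same number of records."""
    return count_fastq_records(left_path) == count_fastq_records(right_path)
```

For the command above this would be called as same_record_count("/scratch/sucheta/out1.fastq", "/scratch/sucheta/out2.fastq"). It only compares counts and does not check that read names actually pair up.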
On Fri, Dec 6, 2013 at 9:10 PM, Sébastien Boisvert <
sebastien.boisver...@ulaval.ca> wrote:
> On 01/12/13 04:12 AM, Sucheta Tripathy wrote:
>
>>
>> I have been trying to run Ray on our Illumina data for a while now. It
>> runs in parallel mode, and memory is not a constraint now because we have
>> a cluster with 16 X 16 GB memory in each node.
>>
>>
> The problem is this: tmp1 and tmp2 don't contain the same number of
> sequences.
>
> Using paired files with different numbers of sequences will cause a crash
> in Ray 2.2.0 and earlier versions.
> If you use Ray 2.3.0, Ray will throw an error.
>
>
> Also, lane1_1b and lane1_2b are empty (0 sequences). You may want to
> check that out too.
>
>
>> My command:
>> mpiexec -n 64 Ray \
>> -k \
>> 31 \
>> -disable-recycling \
>> -p \
>> /scratch/sucheta/tmp1.fastq \
>> /scratch/sucheta/tmp2.fastq \
>> -p \
>> /scratch/sucheta/lane1_1b.fq \
>> /scratch/sucheta/lane1_2b.fq \
>> -o \
>> rayTest
>>
>> Content of NumberOfSequences.txt
>> Files: 4
>>
>> FileNumber: 0
>> FilePath: /scratch/sucheta/tmp1.fastq
>> NumberOfSequences: 73013670
>> FirstSequence: 0
>> LastSequence: 73013669
>>
>> FileNumber: 1
>> FilePath: /scratch/sucheta/tmp2.fastq
>> NumberOfSequences: 72339350
>> FirstSequence: 73013670
>> LastSequence: 145353019
>>
>> FileNumber: 2
>> FilePath: /scratch/sucheta/lane1_1b.fq
>> NumberOfSequences: 0
>>
>> FileNumber: 3
>> FilePath: /scratch/sucheta/lane1_2b.fq
>> NumberOfSequences: 0
>>
>> Summary
>> NumberOfSequences: 145353020
>> FirstSequence: 0
>> LastSequence: 145353019
>>
>>
>> However, after running for a while, Ray terminates. These are the last
>> few lines of STDOUT:
>>
>> ---
>>
>> Speed RAY_SLAVE_MODE_EXTENSION 1183 units/second
>> Rank 9: assembler memory usage: 850432 KiB
>> Rank 9 is changing direction.
>> Rank 12: assembler memory usage: 826944 KiB
>> Rank 12 starts on seed 1200, length is 477, flow 0 [1200/6451]
>> Rank 12 starts on seed 1201, length is 476, flow 0 [1201/6451]
>> Rank 10 traversed 1273646 nucleotide symbols
>> Rank 10: assembler memory usage: 869460 KiB
>> Rank 10 starts on seed 534, length is 906, flow 0 [534/6579]
>> Rank 31 reached 2270 vertices from seed 491, flow 1
>> Speed RAY_SLAVE_MODE_EXTENSION 1174 units/second
>> Rank 31: assembler memory usage: 866768 KiB
>> Rank 31 is changing direction.
>> Rank 2 traversed 1910058 nucleotide symbols
>> Rank 2 reached 1269 vertices from seed 1007, flow 2
>> Speed RAY_SLAVE_MODE_EXTENSION 1435 units/second
>> Rank 2: assembler memory usage: 843432 KiB
>> Rank 2 (extension done) NumberOfFlows: 2
>> Rank 2 FlowedVertices: 0 576 1 1269 2 1269
>> Rank 2: assembler memory usage: 839332 KiB
>> Rank 2 starts on seed 1008, length is 576, flow 0 [1008/6537]
>> Terminated
>> ---
>>
>> It creates the following output files before crashing:
>>
>> CoverageDistributionAnalysis.txt NumberOfSequences.txt
>> CoverageDistribution.txt ParallelPaths.txt
>> degreeDistribution.txt Plugins
>> ElapsedTime.txt RayCommand.txt
>> FilePartition.txt RayPlatform_Version.txt
>> GraphPartition.txt RayVersion.txt
>> LibraryData.xml Scheduling
>> LibraryStatistics.txt SeedLengthDistribution.txt
>> NetworkTest.txt SequencePartition.txt
>>
>>
>> Any help in this regard will be much appreciated!
>>
>>
>> --
>> Sucheta Tripathy, Ph.D
>> Scientist, Ramalingaswamy Fellow,
>> Indian Institute of Chemical Biology,
>> Kolkata, India.
>> https://sites.google.com/site/suchetalab/
>> https://twitter.com/tsucheta
>>
>
>
--
Sucheta Tripathy, Ph.D
Scientist, Ramalingaswamy Fellow,
Indian Institute of Chemical Biology,
Kolkata, India.
https://sites.google.com/site/suchetalab/
https://twitter.com/tsucheta
_______________________________________________
Denovoassembler-users mailing list
Denovoassembler-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/denovoassembler-users