On Monday 01 Nov 2004 14:49, Can Gencer wrote:
> Hello everyone,
>
> We are trying to parse a quite large multiple BLAST results file (around
> 4GB), and the computer available has 1GB of RAM. However, when the code
> in the cookbook is used (
> "http://www.biojava.org/docs/bj_in_anger/BlastParser.htm";), using the
> BlastLikeSAXParser it will give out an OutOfMemory exception after a
> short while, and when I monitor the system during the parsing, I don't
> see the memory usage going up significantly. It is the
> parse(InputSource) method that throws the exception. Is there a way to
> solve this problem ?
>
This is probably not the answer you want but I'm parsing BLAST files at 
least as large as yours without this problem using the BlastXMLParserFacade 
class.  Perhaps it may be a temporary workaround until someone who 
understands the other parser responds, I certainly don't.

There is also a alpha/beta-quality parser filter framework that could 
perhaps be used with the XML parser framework in CVS.

Regards,
David Huen
P.S. A number of fixes have gone into the XML parsing for NCBI Blastn (the 
only part I use, the other parts may work too)software in CVS which may 
make it workable for you now.  In particular, the irritating DTD related 
bug appears to be worked around.
_______________________________________________
Biojava-l mailing list  -  [EMAIL PROTECTED]
http://biojava.org/mailman/listinfo/biojava-l

Reply via email to