On 06/27/2014 12:31 PM, Ketil Malde wrote:

> performance of the blastxml library, 
> which parses Blast XML output files.

(I'm interested in XML processing as well -
also large files, though not for bio stuff)

can you show a test case (actual source code,
XML input data, and your performance measurements)?

what is meant by "the parsing is lazy" exactly?
You want a BlastResult with a lazy list of results
(containing BlastRecords with a lazy list of hits, etc)?

but you still want to accept only valid files?
then it seems to me that you have to look at every character.
(e.g., you need to see the closing tag before you can build the root node.)

you can defer some part of the work to later, but then you'll pay for it
the moment you access, e.g., the last element of the results list?
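
to make the trade-off concrete, here is a minimal sketch on a toy token
stream. the Token type and both functions are made up for illustration;
none of this is the blastxml API or any real XML parser. lazyItems hands
out results as it goes and never checks for the closing tag, while
strictItems cannot decide between Left and Right (and so cannot release
a single item) before it has walked the whole input.

```haskell
-- Toy stand-in for an XML token stream (hypothetical, for illustration only).
data Token = Item Int | Close
  deriving (Show, Eq)

-- Lazy variant: yields items as tokens are consumed, and never reports
-- whether the closing token was actually present.
lazyItems :: [Token] -> [Int]
lazyItems (Item n : ts) = n : lazyItems ts
lazyItems _             = []

-- Validating variant: the Left/Right outcome depends on the final token,
-- so no item is available until the whole input has been traversed.
strictItems :: [Token] -> Either String [Int]
strictItems (Item n : ts) = (n :) <$> strictItems ts
strictItems [Close]       = Right []
strictItems (Close : _)   = Left "content after closing tag"
strictItems []            = Left "missing closing tag"

main :: IO ()
main = do
  let unclosed = map Item [1 ..]                       -- infinite, never closed
  print (take 3 (lazyItems unclosed))                  -- terminates anyway
  print (strictItems (map Item [1, 2, 3] ++ [Close]))  -- well-formed input
  print (strictItems (map Item [1, 2, 3]))             -- truncated input
```

the first print terminates even though the input never ends, which is
exactly what the validating version cannot offer: calling strictItems on
the unclosed stream would not return.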

- J.
