Re: very large xml-file parsing

Nick Burch Sat, 13 Sep 2014 06:30:13 -0700

On Sat, 13 Sep 2014, Mugat Gurkowsky wrote:

i am trying to use tika in combination with lucene to parse and index ofvery large xml-files. so far, without success, because of memorylimitations. tika's BodyContentHandler seems to try to copy the wholecontent in memory, which doesn't work as files are several giga-byteslarge.

It depends on what the BodyContentHandler is doing with the resultingcontent. Make sure whatever is downstream of it is doing streaming notbuffering


Nick

Re: very large xml-file parsing

Reply via email to