tika-user  

Re: JavaHeapSpace - Parsing 4GB of data recursively

Daniel Knapp
Thu, 26 Nov 2009 05:02:48 -0800


Hi,

On Thu, Nov 26, 2009 at 1:07 PM, Daniel Knapp
<daniel.kn...@mni.fh-giessen.de> wrote:
Okay, thanks for that hint. But should i deal with the extracted content?
Actually i'm using the following Code:
[...]
StringBuilder sb = new StringBuilder();
while ((tmp = br.readLine()) != null) {
sb.append(tmp);
}
[...]
But think this can't be the best solution, my memory gets fuller and fuller.
Is this so exotic or do i overlook a detail?

Your memory gets filled because you buffer the extracted text into a
StringBuilder instance.

How about something like this instead:

   String tmp;
   while ((tmp = br.readLine()) != null) {
       System.out.print(tmp);
   }

Yes, this works but i need to send the content to Solr so i need something to store the data or not?


BR,

Jukka Zitting