How can parsing a 5 MB file take 3 minutes?

Clemens Wyss DEV Sun, 22 Dec 2013 01:08:38 -0800

I have a 3 MB PDF file (and others) whose content takes 3 minutes to
extract. In my test I am using AutoDetectParser (and PDFParser).
I have built Tika from source, i.e. I am using the 1.5 snapshot.


Can anybody explain why/how this is possible?

Where/how can I send the document in question?

Regards
Clemens
