RE: NonSequentialPDFParser

Allison, Timothy B. Mon, 02 Dec 2013 07:02:51 -0800

Does the speedup only help if you are trying to parse an individual page vs the 
entire document?  If so, is partial parsing a use case for Tika?  If this has 
the same performance on the full document as the regular parser, does it have 
lower memory overhead?

-----Original Message-----
From: Hong-Thai Nguyen [mailto:[email protected]] 
Sent: Monday, December 02, 2013 9:18 AM
To: [email protected]
Subject: NonSequentialPDFParser

Hi all,
NonSequentialPDFParser may increase 45% parsing performance on PDF extraction. 
Should we integrate in Tika ?
https://issues.apache.org/jira/browse/PDFBOX-1104

Thanks,

Hong-Thai

RE: NonSequentialPDFParser

Reply via email to