[ 
https://issues.apache.org/jira/browse/TIKA-4017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17712536#comment-17712536
 ] 

Tim Allison commented on TIKA-4017:
-----------------------------------

I ran timing tests on parsing all PDFs and threw out the first run.  The 
results suggest that, once class loading is out of the way, counting 
incremental updates is not much slower than not counting them.

I still wouldn't be comfortable turning this on by default.

 
||Without counting updates||Counting updates||
|1318|1485|
|1134|1296|
|1026|1137|
|1009|1142|
|1103|1066|
|1013|1163|
|988|1024|
|1024|1048|
|1029|1012|
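
As a rough summary of the table above, the sketch below totals each column and reports the relative overhead of counting updates. The numbers are copied from the table; the comment does not state their units, so none are assumed here. The names `without_counting`, `counting`, and `relative_overhead` are made up for this illustration and are not part of Tika.

```python
from statistics import mean

# Timings copied from the table above, one entry per run.
# The units are not stated in the comment, so none are assumed.
without_counting = [1318, 1134, 1026, 1009, 1103, 1013, 988, 1024, 1029]
counting = [1485, 1296, 1137, 1142, 1066, 1163, 1024, 1048, 1012]


def relative_overhead(baseline, candidate):
    """Return the extra total time of candidate as a fraction of baseline."""
    return (sum(candidate) - sum(baseline)) / sum(baseline)


overhead = relative_overhead(without_counting, counting)

# Count the runs in which counting updates was actually the faster column.
faster_runs = sum(1 for b, c in zip(without_counting, counting) if c < b)

print(f"mean without counting: {mean(without_counting):.1f}")
print(f"mean counting:         {mean(counting):.1f}")
print(f"relative overhead:     {overhead:.1%}")
print(f"runs where counting was faster: {faster_runs} of {len(counting)}")
```

On these nine rows the overhead works out to roughly 7.6%, and counting updates was the faster column in 2 of the 9 runs, which fits the "not much slower" reading. With so few runs this is only an indication.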

> Add optional detection and parsing of incremental updates in PDF
> ----------------------------------------------------------------
>
>                 Key: TIKA-4017
>                 URL: https://issues.apache.org/jira/browse/TIKA-4017
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)
