[
https://issues.apache.org/jira/browse/TIKA-2864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16838802#comment-16838802
]
Tim Allison edited comment on TIKA-2864 at 5/13/19 6:48 PM:
------------------------------------------------------------
The problem went away.
I still notice that {{application/vnd.ms-equation}} and
{{applicatoin/vnd.ms-graph}} files are taking longer than they used
to...something to dig into for 1.22 or 2.0...:
||MimeA||MimeB||MillisA||MillisB||%Change||
|application/vnd.ms-equation|application/vnd.ms-equation|93060|192447|207%|
|application/vnd.ms-graph|application/vnd.ms-graph|39678|50188|126%|
|text/csv; charset=ISO-8859-1|text/csv; charset=ISO-8859-1;
delimiter=comma|145843|177244|121%|
|application/x-tika-msoffice-embedded;
format=comp_obj|application/x-tika-msoffice-embedded;
format=comp_obj|25656|31069|121%|
|text/plain; charset=windows-1252|text/csv; charset=windows-1252;
delimiter=comma|166834|199847|120%|
|text/plain; charset=windows-1252|text/tsv; charset=windows-1252;
delimiter=tab|85061|97840|115%|
|text/csv; charset=windows-1252|text/csv; charset=windows-1252;
delimiter=comma|132954|146280|110%|
|text/plain; charset=ISO-8859-1|text/tsv; charset=ISO-8859-1;
delimiter=tab|219477|235333|107%|
was (Author: [email protected]):
The problem went away.
I still notice that {{application/vnd.ms-equation}} and
{{applicatoin/vnd.ms-graph}} files are taking longer than they used
to...something to dig into for 1.22 or 2.0...:
||MimeA||MimeB||MillisA||MillisB||%Change||
|application/vnd.ms-equation|application/vnd.ms-equation|93060|192447|207%|
|application/vnd.ms-graph|application/vnd.ms-graph|39678|50188|126%|
|text/csv; charset=ISO-8859-1|text/csv; charset=ISO-8859-1;
delimiter=comma|145843|177244|121%|
|application/x-tika-msoffice-embedded;
format=comp_obj|application/x-tika-msoffice-embedded;
format=comp_obj|25656|31069|121%|
|text/plain; charset=windows-1252|text/csv; charset=windows-1252;
delimiter=comma|166834|199847|120%|
|text/plain; charset=windows-1252|text/tsv; charset=windows-1252;
delimiter=tab|85061|97840|115%|
|text/csv; charset=windows-1252|text/csv; charset=windows-1252;
delimiter=comma|132954|146280|1.10%|
|text/plain; charset=ISO-8859-1|text/tsv; charset=ISO-8859-1;
delimiter=tab|219477|235333|107%|
> Fix regression in RFC822 parsing time
> -------------------------------------
>
> Key: TIKA-2864
> URL: https://issues.apache.org/jira/browse/TIKA-2864
> Project: Tika
> Issue Type: Task
> Reporter: Tim Allison
> Assignee: Tim Allison
> Priority: Blocker
> Fix For: 1.21
>
>
> In running the regression tests, we found a 1000x slowdown in rfc files on
> the full batch run. When we try to reproduce this locally, we can only
> replicate 10x, even multithreaded, but we can at least replicate a 10x
> slowdown.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)