Tim Allison commented on TIKA-2434:

[~StefanKah], yep, sorry, that question was for you.

PDFs, like MSOffice files and MSG files, can contain attachments.  In MSOffice 
and MSG files, these "regular" attachments are often represented by an icon 
somewhere in the document; for PDFs, Acrobat Reader lists them in a separate 
attachments pane.  Users can attach a Word file, a zip file, a jpeg, or anything 
else to a PDF.

Inline attachments are image files that are used to render the page or parts of 
the page; rather than showing an icon, the application renders the image itself.  
Inline attachments are far, far more common in PDFs than regular attachments.

> Language detection slow, cpu intensive, CLI interrupts work
> -----------------------------------------------------------
>                 Key: TIKA-2434
>                 URL: https://issues.apache.org/jira/browse/TIKA-2434
>             Project: Tika
>          Issue Type: Bug
>          Components: cli
>    Affects Versions: 1.16
>         Environment: OS X 10.11.6, JRE 1.8.0_25
>            Reporter: Stefan Karner
>
> Since version 1.16, when using tika -l FILE, it takes a lot longer than e.g. 
> 1.15.
> Also, when batch processing a bunch of files in the background, the Java 
> runtime icon pops up when processing the next file, stealing the input focus 
> from whatever other application I'm currently working on, thus constantly 
> interrupting my work.
> Also, the Java runtime uses from 100% to 400% CPU when executing Tika.
