Tilman Hausherr created TIKA-2492:
-------------------------------------
Summary: Remove pdfdebugger from tika
Key: TIKA-2492
URL: https://issues.apache.org/jira/browse/TIKA-2492
Project: Tika
Issue Type: Improvement
Components: packaging
Reporter: Tilman Hausherr
Priority: Minor
PDFDebugger isn't needed in Tika, but it gets pulled in as a dependency of
pdfbox-tools (because pdfbox-tools contains the command line interface, which
calls the PDFBox command line tools).
Thus I suggest that the tika parser pom be changed like this:
{code}
 <dependency>
   <groupId>org.apache.pdfbox</groupId>
   <artifactId>pdfbox-tools</artifactId>
   <version>${pdfbox.version}</version>
   <exclusions>
     <exclusion>
       <groupId>commons-logging</groupId>
       <artifactId>commons-logging</artifactId>
     </exclusion>
+    <exclusion>
+      <groupId>org.apache.pdfbox</groupId>
+      <artifactId>pdfbox-debugger</artifactId>
+    </exclusion>
   </exclusions>
 </dependency>
{code}
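A quick way to sanity-check the edited pom is to list the exclusions declared on the pdfbox-tools dependency. The snippet below is only a rough sketch: the fragment is a hand-copied, hypothetical stand-in for the real pom (which declares the Maven POM namespace, so the lookups would need namespace handling there), and the helper name excluded_artifacts is made up for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical pom fragment mirroring the proposed change. The ${pdfbox.version}
# placeholder is left unresolved because only the exclusions matter here.
POM_FRAGMENT = """
<dependency>
  <groupId>org.apache.pdfbox</groupId>
  <artifactId>pdfbox-tools</artifactId>
  <version>${pdfbox.version}</version>
  <exclusions>
    <exclusion>
      <groupId>commons-logging</groupId>
      <artifactId>commons-logging</artifactId>
    </exclusion>
    <exclusion>
      <groupId>org.apache.pdfbox</groupId>
      <artifactId>pdfbox-debugger</artifactId>
    </exclusion>
  </exclusions>
</dependency>
"""


def excluded_artifacts(dependency_xml):
    """Return the (groupId, artifactId) pairs excluded by one <dependency> element."""
    dependency = ET.fromstring(dependency_xml)
    return {
        (exclusion.findtext("groupId"), exclusion.findtext("artifactId"))
        for exclusion in dependency.iter("exclusion")
    }


if __name__ == "__main__":
    # Print each excluded artifact as groupId:artifactId, sorted for stable output.
    for group_id, artifact_id in sorted(excluded_artifacts(POM_FRAGMENT)):
        print(f"{group_id}:{artifact_id}")
```

This only confirms that the exclusion is declared; whether the jar really disappears from tika-app still has to be checked against the built artifact.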
This saves you 200KB in tika-app. That's not much, but every bit of weight loss
counts :-)
It should also be possible to get it removed from tika-bundle, but I don't know
how to remove it properly. Just removing it from "Embed-Dependency" isn't
enough.