Tilman Hausherr created TIKA-2492:
-------------------------------------

             Summary: Remove pdfdebugger from tika
                 Key: TIKA-2492
                 URL: https://issues.apache.org/jira/browse/TIKA-2492
             Project: Tika
          Issue Type: Improvement
          Components: packaging
            Reporter: Tilman Hausherr
            Priority: Minor


PDFDebugger isn't needed in tika but it is a dependency in pdfbox-tools 
(because that one contains the command line interface, which calls the PDFBox 
command line tools).

Thus I suggest that the tika parser pom be changed like this:
{code}
    <dependency>
      <groupId>org.apache.pdfbox</groupId>
      <artifactId>pdfbox-tools</artifactId>
      <version>${pdfbox.version}</version>
      <exclusions>
        <exclusion>
          <groupId>commons-logging</groupId>
          <artifactId>commons-logging</artifactId>
        </exclusion>
+        <exclusion>
+          <groupId>org.apache.pdfbox</groupId>
+          <artifactId>pdfbox-debugger</artifactId>
+        </exclusion>
      </exclusions>
{code}
This saves you 200KB in tika-app. That's not much, but every weight loss counts 
:-)

It should also be possible to get it removed from tika-bundle, but I don't know 
how to remove it properly. Just removing it from "Embed-Dependency" isn't 
enough.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to