Hi,

On Wed, Sep 23, 2009 at 3:36 AM, Marshall Schor wrote:
> The Tika project gets around this by not having any binary distribution;
> so they never distribute (themselves) these parts.

See [1] for the LICENSE and NOTICE files that cover the full set of
dependencies. Tika currently does not ship a binary that embeds those
dependencies because of some unresolved PDFBox licensing issues. These
are fixed in the PDFBox 0.8.0-incubating release that's just become
available on [2] and [3]. For now (until Tika 0.5 is released) you can
work around the PDFBox issue by explicitly excluding the PDFBox
dependency.

> I believe we also need to get an Export 5D002 Registration for the Tika
> Annotator, and also for the Sandbox Distribution which includes it.

The export notification for Tika is there for the PDF decryption
support. If you exclude the PDFBox dependency or avoid configuring Tika
to activate the decryption feature, then you won't be using that
functionality and, by my current understanding, you don't need to do the
export control registration.

[1] https://svn.apache.org/repos/asf/lucene/tika/tags/0.4/tika-app/src/main/resources/META-INF/
[2] http://incubator.apache.org/pdfbox/download.html
[3] http://repo2.maven.org/maven2/org/apache/pdfbox/pdfbox/0.8.0-incubator/

BR,

Jukka Zitting
