Hi,

On Wed, Sep 23, 2009 at 3:36 AM, Marshall Schor <[email protected]> wrote:
> The Tika project gets around this by not having any binary distribution;
> so they never distribute (themselves) these parts.

See [1] for the LICENSE and NOTICE files that cover the full set of
depencencies.

Tika currently does not ship a binary that embeds those dependencies
because of some unresolved PDFBox licensing issues. These are fixed in
the PDFBox 0.8.0-incubating release that's just become available on
[2] and [3].

For now (until Tika 0.5 is released) you can work around the PDFBox
issue by explicitly excluding the PDFBox dependency.

> I believe we also need to get an Export 5D002 Registration for the Tika
> Annotator, and also for the Sandbox Distribution which includes it.

The export notification for Tika is there for the PDF decryption
support. If you exclude the PDFBox dependency or avoid configuring
Tika to activate the decryption feature, then you won't be using that
functionality and by my current understanding you don't need to do the
export control registration.

[1] 
https://svn.apache.org/repos/asf/lucene/tika/tags/0.4/tika-app/src/main/resources/META-INF/
[2] http://incubator.apache.org/pdfbox/download.html
[3] http://repo2.maven.org/maven2/org/apache/pdfbox/pdfbox/0.8.0-incubator/

BR,

Jukka Zitting

Reply via email to