Author: siren Date: Mon Mar 23 06:41:13 2009 New Revision: 757327 URL: http://svn.apache.org/viewvc?rev=757327&view=rev Log: NUTCH-722 remove JAI libs
Removed: lucene/nutch/trunk/src/plugin/parse-pdf/lib/jai_codec.jar lucene/nutch/trunk/src/plugin/parse-pdf/lib/jai_core.jar Modified: lucene/nutch/trunk/CHANGES.txt lucene/nutch/trunk/README.txt lucene/nutch/trunk/src/plugin/parse-pdf/plugin.xml Modified: lucene/nutch/trunk/CHANGES.txt URL: http://svn.apache.org/viewvc/lucene/nutch/trunk/CHANGES.txt?rev=757327&r1=757326&r2=757327&view=diff ============================================================================== --- lucene/nutch/trunk/CHANGES.txt (original) +++ lucene/nutch/trunk/CHANGES.txt Mon Mar 23 06:41:13 2009 @@ -380,6 +380,8 @@ 143. NUTCH-715 - Subcollection plugin doesn't work with default subcollections.xml file (Dmitry Lihachev via siren) + +144. NUTCH-722 - Nutch contains JAI jars that we cannot redistribute Release 0.9 - 2007-04-02 Modified: lucene/nutch/trunk/README.txt URL: http://svn.apache.org/viewvc/lucene/nutch/trunk/README.txt?rev=757327&r1=757326&r2=757327&view=diff ============================================================================== --- lucene/nutch/trunk/README.txt (original) +++ lucene/nutch/trunk/README.txt Mon Mar 23 06:41:13 2009 @@ -1,5 +1,19 @@ Apache Nutch README +Important note: Due to licensing issues we cannot provide two libraries that +are normally provided with PDFBox (jai_core.jar, jai_codec.jar), the parser +library we use for parsing PDF files. If you encounter unexpected problems when +working with PDF files please + +1. download the two missing libraries from: + http://pdfbox.cvs.sourceforge.net/viewvc/pdfbox/pdfbox/external/ + +2. Put them to directory src/plugin/parse-pdf/lib +3. follow the instructions in file src/plugin/parse-pdf/plugin.xml +4. Rebuild nutch. + + + Interesting files include: Modified: lucene/nutch/trunk/src/plugin/parse-pdf/plugin.xml URL: http://svn.apache.org/viewvc/lucene/nutch/trunk/src/plugin/parse-pdf/plugin.xml?rev=757327&r1=757326&r2=757327&view=diff ============================================================================== --- lucene/nutch/trunk/src/plugin/parse-pdf/plugin.xml (original) +++ lucene/nutch/trunk/src/plugin/parse-pdf/plugin.xml Mon Mar 23 06:41:13 2009 @@ -30,8 +30,12 @@ <library name="FontBox-0.2.0-dev.jar"/> <library name="JempBox-0.2.0-dev.jar"/> <library name="bcprov-jdk14-132.jar"/> + <!-- Uncomment the following two lines after you have downloaded the + libraries, see README.txt for more details.--> + <!-- <library name="jai_codec.jar"/> <library name="jai_core.jar"/> + --> </runtime> <requires>