svn commit: r478421 - /lucene/nutch/

2006-11-23 Thread mattmann
Author: mattmann Date: Wed Nov 22 17:27:24 2006 New Revision: 478421 URL: http://svn.apache.org/viewvc?view=revrev=478421 Log: - ignore pesky eclipse .project and .classpath files, and .settings directory Modified: lucene/nutch/ (props changed) Propchange: lucene/nutch

svn commit: r478925 - /lucene/nutch/trunk/src/java/org/apache/nutch/metadata/package.html

2006-11-24 Thread mattmann
Author: mattmann Date: Fri Nov 24 09:27:34 2006 New Revision: 478925 URL: http://svn.apache.org/viewvc?view=revrev=478925 Log: - javadoc package description for metadata subsystem Added: lucene/nutch/trunk/src/java/org/apache/nutch/metadata/package.html Added: lucene/nutch/trunk/src/java

svn commit: r478933 - /lucene/nutch/trunk/src/java/org/apache/nutch/metadata/Metadata.java

2006-11-24 Thread mattmann
Author: mattmann Date: Fri Nov 24 09:54:28 2006 New Revision: 478933 URL: http://svn.apache.org/viewvc?view=revrev=478933 Log: - remove unnecessary comments - optimize counting of non-null values by performing it inline rather than calling a function and creating a new object Modified

svn commit: r500090 - /lucene/nutch/trunk/conf/nutch-default.xml

2007-01-25 Thread mattmann
Author: mattmann Date: Thu Jan 25 18:02:13 2007 New Revision: 500090 URL: http://svn.apache.org/viewvc?view=revrev=500090 Log: - add comment about enabling protocol-httpclient in order to support HTTPS Modified: lucene/nutch/trunk/conf/nutch-default.xml Modified: lucene/nutch/trunk/conf

svn commit: r500093 - /lucene/nutch/trunk/conf/nutch-default.xml

2007-01-25 Thread mattmann
Author: mattmann Date: Thu Jan 25 18:03:41 2007 New Revision: 500093 URL: http://svn.apache.org/viewvc?view=revrev=500093 Log: - forgot period Modified: lucene/nutch/trunk/conf/nutch-default.xml Modified: lucene/nutch/trunk/conf/nutch-default.xml URL: http://svn.apache.org/viewvc/lucene

svn commit: r501312 - /lucene/nutch/trunk/bin/

2007-01-29 Thread mattmann
Author: mattmann Date: Mon Jan 29 21:19:53 2007 New Revision: 501312 URL: http://svn.apache.org/viewvc?view=revrev=501312 Log: - ignore hadoop-config.sh (generated file) Modified: lucene/nutch/trunk/bin/ (props changed) Propchange: lucene/nutch/trunk/bin

svn commit: r501315 - in /lucene/nutch/trunk: ./ lib/ src/java/org/apache/nutch/net/ src/java/org/apache/nutch/plugin/ src/java/org/apache/nutch/protocol/ src/java/org/apache/nutch/scoring/ src/java/o

2007-01-29 Thread mattmann
Author: mattmann Date: Mon Jan 29 21:55:03 2007 New Revision: 501315 URL: http://svn.apache.org/viewvc?view=revrev=501315 Log: Fix for NUTCH-390 Javadoc warnings Modified: lucene/nutch/trunk/CHANGES.txt lucene/nutch/trunk/lib/commons-logging-1.0.4.jar lucene/nutch/trunk/src/java/org

svn commit: r516660 - in /lucene/nutch/trunk: ./ src/plugin/protocol-file/src/java/org/apache/nutch/protocol/file/ src/plugin/protocol-file/src/test/ src/plugin/protocol-file/src/test/org/ src/plugin/

2007-03-09 Thread mattmann
Author: mattmann Date: Fri Mar 9 22:52:31 2007 New Revision: 516660 URL: http://svn.apache.org/viewvc?view=revrev=516660 Log: fix for NUTCH-384 (contributed by Heiko Dietze) Added: lucene/nutch/trunk/src/plugin/protocol-file/src/test/ lucene/nutch/trunk/src/plugin/protocol-file/src/test

svn commit: r525011 - /lucene/nutch/tags/release-0.9/

2007-04-02 Thread mattmann
Author: mattmann Date: Mon Apr 2 20:23:57 2007 New Revision: 525011 URL: http://svn.apache.org/viewvc?view=revrev=525011 Log: - remove tagged release - start release of Nutch 0.9 from scratch - see mailing list conversations regarding change in process Removed: lucene/nutch/tags/release

svn commit: r525020 - /lucene/nutch/trunk/src/site/src/documentation/content/xdocs/index.xml

2007-04-02 Thread mattmann
Author: mattmann Date: Mon Apr 2 21:04:29 2007 New Revision: 525020 URL: http://svn.apache.org/viewvc?view=revrev=525020 Log: - prep for 0.9 rc Modified: lucene/nutch/trunk/src/site/src/documentation/content/xdocs/index.xml Modified: lucene/nutch/trunk/src/site/src/documentation/content

svn commit: r525021 - /lucene/nutch/branches/branch-0.9/

2007-04-02 Thread mattmann
Author: mattmann Date: Mon Apr 2 21:13:50 2007 New Revision: 525021 URL: http://svn.apache.org/viewvc?view=revrev=525021 Log: Nutch 0.9 release maintenance branch and rc working copy Added: lucene/nutch/branches/branch-0.9/ - copied from r525020, lucene/nutch/trunk/

svn commit: r526024 - /lucene/nutch/tags/release-0.9/

2007-04-05 Thread mattmann
Author: mattmann Date: Thu Apr 5 19:10:16 2007 New Revision: 526024 URL: http://svn.apache.org/viewvc?view=revrev=526024 Log: Nutch 0.9 release Added: lucene/nutch/tags/release-0.9/ - copied from r526023, lucene/nutch/trunk/

svn commit: r526035 - in /lucene/nutch/trunk: conf/nutch-default.xml default.properties

2007-04-05 Thread mattmann
Author: mattmann Date: Thu Apr 5 19:36:56 2007 New Revision: 526035 URL: http://svn.apache.org/viewvc?view=revrev=526035 Log: - update for new development, Nutch 1.0-dev Modified: lucene/nutch/trunk/conf/nutch-default.xml lucene/nutch/trunk/default.properties Modified: lucene/nutch

svn commit: r526036 - /lucene/nutch/trunk/CHANGES.txt

2007-04-05 Thread mattmann
Author: mattmann Date: Thu Apr 5 19:38:15 2007 New Revision: 526036 URL: http://svn.apache.org/viewvc?view=revrev=526036 Log: - update for new development, Nutch 1.0-dev Modified: lucene/nutch/trunk/CHANGES.txt Modified: lucene/nutch/trunk/CHANGES.txt URL: http://svn.apache.org/viewvc

svn commit: r548076 - in /lucene/nutch/trunk: CHANGES.txt src/java/org/apache/nutch/fetcher/Fetcher.java src/java/org/apache/nutch/fetcher/Fetcher2.java src/java/org/apache/nutch/indexer/Indexer.java

2007-06-17 Thread mattmann
Author: mattmann Date: Sun Jun 17 10:19:14 2007 New Revision: 548076 URL: http://svn.apache.org/viewvc?view=revrev=548076 Log: - fix for NUTCH-443 (contributed by Dogacan) Modified: lucene/nutch/trunk/CHANGES.txt lucene/nutch/trunk/src/java/org/apache/nutch/fetcher/Fetcher.java

svn commit: r548730 - in /lucene/nutch/trunk: ./ conf/ src/java/org/apache/nutch/metadata/ src/plugin/ src/plugin/feed/ src/plugin/feed/lib/ src/plugin/feed/sample/ src/plugin/feed/src/ src/plugin/fee

2007-06-19 Thread mattmann
Author: mattmann Date: Tue Jun 19 07:01:02 2007 New Revision: 548730 URL: http://svn.apache.org/viewvc?view=revrev=548730 Log: fix for NUTCH-444 Added: lucene/nutch/trunk/src/java/org/apache/nutch/metadata/Feed.java lucene/nutch/trunk/src/plugin/feed/ lucene/nutch/trunk/src/plugin

svn commit: r583016 - in /lucene/nutch/trunk: ./ conf/ lib/ src/java/org/apache/nutch/parse/ src/java/org/apache/nutch/protocol/ src/java/org/apache/nutch/util/mime/ src/plugin/index-more/src/java/org

2007-10-08 Thread mattmann
Author: mattmann Date: Mon Oct 8 17:23:38 2007 New Revision: 583016 URL: http://svn.apache.org/viewvc?rev=583016view=rev Log: - fix for NUTCH-562 Added: lucene/nutch/trunk/conf/tika-mimetypes.xml lucene/nutch/trunk/lib/tika-0.1-dev.jar (with props) Removed: lucene/nutch/trunk/conf

svn commit: r620811 - in /lucene/nutch/trunk: ./ lib/ src/java/org/apache/nutch/parse/ src/java/org/apache/nutch/protocol/ src/java/org/apache/nutch/util/ src/plugin/index-more/src/java/org/apache/nut

2008-02-12 Thread mattmann
Author: mattmann Date: Tue Feb 12 06:08:50 2008 New Revision: 620811 URL: http://svn.apache.org/viewvc?rev=620811view=rev Log: - fix for NUTCH-608 Added: lucene/nutch/trunk/lib/tika-0.1-incubating.jar (with props) lucene/nutch/trunk/src/java/org/apache/nutch/util/MimeUtil.java

svn commit: r663092 - in /lucene/nutch/trunk: CHANGES.txt conf/tika-mimetypes.xml src/java/org/apache/nutch/util/MimeUtil.java

2008-06-04 Thread mattmann
Author: mattmann Date: Wed Jun 4 06:40:19 2008 New Revision: 663092 URL: http://svn.apache.org/viewvc?rev=663092view=rev Log: - fix for NUTCH-618 Modified: lucene/nutch/trunk/CHANGES.txt lucene/nutch/trunk/conf/tika-mimetypes.xml lucene/nutch/trunk/src/java/org/apache/nutch/util

svn commit: r892350 - in /lucene/nutch/trunk: ./ src/plugin/protocol-httpclient/src/test/org/apache/nutch/protocol/httpclient/ src/test/org/apache/nutch/crawl/ src/test/org/apache/nutch/fetcher/

2009-12-18 Thread mattmann
Author: mattmann Date: Fri Dec 18 19:01:23 2009 New Revision: 892350 URL: http://svn.apache.org/viewvc?rev=892350view=rev Log: - fix for NUTCH-777 Upgrading to jetty6 broke unit tests Modified: lucene/nutch/trunk/CHANGES.txt lucene/nutch/trunk/src/plugin/protocol-httpclient/src/test/org

svn commit: r909268 [2/2] - in /lucene/nutch/trunk: ./ conf/ src/java/org/apache/nutch/parse/ src/plugin/ src/plugin/parse-tika/ src/plugin/parse-tika/src/ src/plugin/parse-tika/src/java/ src/plugin/p

2010-02-11 Thread mattmann
Added: lucene/nutch/trunk/src/plugin/parse-tika/src/test/org/apache/nutch/tika/TestMSWordParser.java URL: http://svn.apache.org/viewvc/lucene/nutch/trunk/src/plugin/parse-tika/src/test/org/apache/nutch/tika/TestMSWordParser.java?rev=909268view=auto

svn commit: r909269 - in /lucene/nutch/trunk/src/plugin/parse-tika: lib/ sample/

2010-02-11 Thread mattmann
Author: mattmann Date: Fri Feb 12 06:59:40 2010 New Revision: 909269 URL: http://svn.apache.org/viewvc?rev=909269view=rev Log: - 2nd part of NUTCH-766 Tika parser Added: lucene/nutch/trunk/src/plugin/parse-tika/lib/ lucene/nutch/trunk/src/plugin/parse-tika/lib/asm-3.1.jar (with props

svn commit: r931419 - in /lucene/nutch/trunk: conf/nutch-default.xml default.properties

2010-04-06 Thread mattmann
Author: mattmann Date: Wed Apr 7 03:40:46 2010 New Revision: 931419 URL: http://svn.apache.org/viewvc?rev=931419view=rev Log: - prep for 1.1 release Modified: lucene/nutch/trunk/conf/nutch-default.xml lucene/nutch/trunk/default.properties Modified: lucene/nutch/trunk/conf/nutch

svn commit: r931420 - /lucene/nutch/trunk/CHANGES.txt

2010-04-06 Thread mattmann
Author: mattmann Date: Wed Apr 7 03:41:29 2010 New Revision: 931420 URL: http://svn.apache.org/viewvc?rev=931420view=rev Log: - prep for 1.1 release Modified: lucene/nutch/trunk/CHANGES.txt Modified: lucene/nutch/trunk/CHANGES.txt URL: http://svn.apache.org/viewvc/lucene/nutch/trunk

svn commit: r931421 - /lucene/nutch/trunk/src/site/src/documentation/content/xdocs/index.xml

2010-04-06 Thread mattmann
Author: mattmann Date: Wed Apr 7 03:45:43 2010 New Revision: 931421 URL: http://svn.apache.org/viewvc?rev=931421view=rev Log: Release 1.1: step 3/4 from http://bit.ly/d5ugid Modified: lucene/nutch/trunk/src/site/src/documentation/content/xdocs/index.xml Modified: lucene/nutch/trunk/src

svn commit: r935453 - in /lucene/nutch/trunk: CHANGES.txt src/java/org/apache/nutch/crawl/Crawl.java

2010-04-18 Thread mattmann
Author: mattmann Date: Mon Apr 19 05:36:40 2010 New Revision: 935453 URL: http://svn.apache.org/viewvc?rev=935453view=rev Log: - fix for NUTCH-812 Crawl.java incorrectly uses the Generator API resulting in NPE Modified: lucene/nutch/trunk/CHANGES.txt lucene/nutch/trunk/src/java/org

svn commit: r935454 - in /lucene/nutch/tags: 1.1-rc1/ 1.1/

2010-04-18 Thread mattmann
Author: mattmann Date: Mon Apr 19 05:37:57 2010 New Revision: 935454 URL: http://svn.apache.org/viewvc?rev=935454view=rev Log: - rename initial tag to include rc1 label, since I will cut a new RC for Nutch 1.1 that includes NUTCH-812 Added: lucene/nutch/tags/1.1-rc1/ - copied from

svn commit: r937933 - /lucene/nutch/tags/1.1/

2010-04-25 Thread mattmann
Author: mattmann Date: Mon Apr 26 05:02:16 2010 New Revision: 937933 URL: http://svn.apache.org/viewvc?rev=937933view=rev Log: Nutch 1.1 release candidate #2 - applied NUTCH-812 Crawl.java incorrectly uses the Generator API resulting in NPE Added: lucene/nutch/tags/1.1/ - copied from

svn commit: r942427 - in /lucene/nutch/trunk: CHANGES.txt build.xml

2010-05-08 Thread mattmann
Author: mattmann Date: Sat May 8 18:02:37 2010 New Revision: 942427 URL: http://svn.apache.org/viewvc?rev=942427view=rev Log: - fix for NUTCH-816 Add zip target to build.xml Modified: lucene/nutch/trunk/CHANGES.txt lucene/nutch/trunk/build.xml Modified: lucene/nutch/trunk/CHANGES.txt

svn commit: r942428 - /lucene/nutch/trunk/CHANGES.txt

2010-05-08 Thread mattmann
Author: mattmann Date: Sat May 8 18:03:12 2010 New Revision: 942428 URL: http://svn.apache.org/viewvc?rev=942428view=rev Log: - release prep, 3rd time's a charm. Modified: lucene/nutch/trunk/CHANGES.txt Modified: lucene/nutch/trunk/CHANGES.txt URL: http://svn.apache.org/viewvc/lucene

svn commit: r942429 - /lucene/nutch/tags/1.1/

2010-05-08 Thread mattmann
Author: mattmann Date: Sat May 8 18:03:49 2010 New Revision: 942429 URL: http://svn.apache.org/viewvc?rev=942429view=rev Log: About to retag for rc3. Removed: lucene/nutch/tags/1.1/