svn commit: r563894 [2/2] - in /lucene/nutch/trunk: ./ src/java/org/apache/nutch/analysis/ src/java/org/apache/nutch/crawl/ src/java/org/apache/nutch/fetcher/ src/java/org/apache/nutch/html/ src/java/

2007-08-08 Thread dogacan
Modified: lucene/nutch/trunk/src/java/org/apache/nutch/tools/DmozParser.java URL: http://svn.apache.org/viewvc/lucene/nutch/trunk/src/java/org/apache/nutch/tools/DmozParser.java?view=diff&rev=563894&r1=563893&r2=563894 ==

svn commit: r563807 - in /lucene/nutch/trunk: CHANGES.txt src/java/org/apache/nutch/crawl/Injector.java src/java/org/apache/nutch/net/UrlValidator.java src/test/org/apache/nutch/crawl/TestInjector.jav

2007-08-08 Thread dogacan
Author: dogacan Date: Wed Aug 8 03:57:11 2007 New Revision: 563807 URL: http://svn.apache.org/viewvc?view=rev&rev=563807 Log: NUTCH-522 - Use URLValidator in the Injector. Modified: lucene/nutch/trunk/CHANGES.txt lucene/nutch/trunk/src/java/org/apache/nutch/crawl/Injector.java lucene

svn commit: r563777 - in /lucene/nutch/trunk: CHANGES.txt src/java/org/apache/nutch/crawl/CrawlDatum.java src/java/org/apache/nutch/metadata/Metadata.java src/java/org/apache/nutch/parse/ParseData.jav

2007-08-08 Thread dogacan
Author: dogacan Date: Wed Aug 8 00:33:23 2007 New Revision: 563777 URL: http://svn.apache.org/viewvc?view=rev&rev=563777 Log: NUTCH-535 - ParseData's contentMeta accumulates unnecessary values during parse. Modified: lucene/nutch/trunk/CHANGES.txt lucene/nutch/trunk/src/java/org/apache/n