svn commit: r209663 [2/12] - in /lucene/nutch/branches/mapred: conf/ site/ src/java/org/apache/nutch/crawl/ src/java/org/apache/nutch/fetcher/ src/java/org/apache/nutch/parse/ src/java/org/apache/nutc

2005-07-07 Thread cutting
Modified: lucene/nutch/branches/mapred/src/plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/da.ngp URL: http://svn.apache.org/viewcvs/lucene/nutch/branches/mapred/src/plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/da.ngp?rev=209663r1=209662r2=209663view=diff

svn commit: r209663 [5/12] - in /lucene/nutch/branches/mapred: conf/ site/ src/java/org/apache/nutch/crawl/ src/java/org/apache/nutch/fetcher/ src/java/org/apache/nutch/parse/ src/java/org/apache/nutch/protocol/ src/java/org/apache/nutch/segment/ src/j...

2005-07-07 Thread cutting
Modified: lucene/nutch/branches/mapred/src/plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/en.ngp URL: http://svn.apache.org/viewcvs/lucene/nutch/branches/mapred/src/plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/en.ngp?rev=209663r1=209662r2=209663view=diff

svn commit: r209663 [7/12] - in /lucene/nutch/branches/mapred: conf/ site/ src/java/org/apache/nutch/crawl/ src/java/org/apache/nutch/fetcher/ src/java/org/apache/nutch/parse/ src/java/org/apache/nutch/protocol/ src/java/org/apache/nutch/segment/ src/j...

2005-07-07 Thread cutting
Modified: lucene/nutch/branches/mapred/src/plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/fi.ngp URL: http://svn.apache.org/viewcvs/lucene/nutch/branches/mapred/src/plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/fi.ngp?rev=209663r1=209662r2=209663view=diff

svn commit: r209663 [3/12] - in /lucene/nutch/branches/mapred: conf/ site/ src/java/org/apache/nutch/crawl/ src/java/org/apache/nutch/fetcher/ src/java/org/apache/nutch/parse/ src/java/org/apache/nutch/protocol/ src/java/org/apache/nutch/segment/ src/j...

2005-07-07 Thread cutting
Modified: lucene/nutch/branches/mapred/src/plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/de.ngp URL: http://svn.apache.org/viewcvs/lucene/nutch/branches/mapred/src/plugin/languageidentifier/src/java/org/apache/nutch/analysis/lang/de.ngp?rev=209663r1=209662r2=209663view=diff

svn commit: r209495 - in /lucene/nutch/branches/mapred/src: java/org/apache/nutch/io/ java/org/apache/nutch/mapred/ test/org/apache/nutch/mapred/

2005-07-06 Thread cutting
Author: cutting Date: Wed Jul 6 11:37:44 2005 New Revision: 209495 URL: http://svn.apache.org/viewcvs?rev=209495view=rev Log: Add unit test for SequenceFile InputFormat. Fix code to pass unit test. SequenceFile now inserts sync marks after a fixed number of bytes rather than after a fixed

svn commit: r209522 - /lucene/nutch/branches/mapred/src/java/org/apache/nutch/io/SequenceFile.java

2005-07-06 Thread cutting
Author: cutting Date: Wed Jul 6 14:48:39 2005 New Revision: 209522 URL: http://svn.apache.org/viewcvs?rev=209522view=rev Log: Fix sorting to work with new sync method. Sorting prefixes temporary data with its length, which is hard to compute with syncs. So syncs are no longer stored

<    1   2