Author: jerome
Date: Mon Apr 3 13:57:46 2006
New Revision: 391150
URL: http://svn.apache.org/viewcvs?rev=391150&view=rev
Log:
No longer dump the parse-mspowerpoint unit test results to a file for visual checks.
Modified:
lucene/nutch/trunk/src/plugin/parse-mspowerpoint/src/test/org/apache/nutch/par
Author: ab
Date: Mon Apr 3 07:36:19 2006
New Revision: 391055
URL: http://svn.apache.org/viewcvs?rev=391055&view=rev
Log:
Forgot to properly initialize the score.
Modified:
lucene/nutch/trunk/src/java/org/apache/nutch/crawl/CrawlDbReducer.java
Author: ab
Date: Mon Apr 3 06:35:34 2006
New Revision: 391044
URL: http://svn.apache.org/viewcvs?rev=391044&view=rev
Log:
Make sure we use new values for score, metadata, fetch interval
and fetch time.
Modified:
lucene/nutch/trunk/src/java/org/apache/nutch/crawl/CrawlDbReducer.java
Author: ab
Date: Mon Apr 3 04:19:43 2006
New Revision: 391003
URL: http://svn.apache.org/viewcvs?rev=391003&view=rev
Log:
Add a -topN option to the reader. This collects the indicated number of
top-scoring URLs in a CrawlDB into a sorted list. Such a list is useful for
identifying scoring problems
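
For illustration only, the idea behind a top-N listing can be sketched as follows. This is not the code committed in r391003: the class name TopNSketch, the method topN, and the in-memory Map of URL to score are all assumptions made for the example, whereas the real reader works against a CrawlDB. The sketch keeps a min-heap of at most N entries and then sorts the survivors in descending score order.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.PriorityQueue;

// Hypothetical helper for illustration; not the Nutch CrawlDbReader code.
final class TopNSketch {

    /** Returns up to n (url, score) entries, highest score first. */
    static List<Map.Entry<String, Float>> topN(Map<String, Float> scores, int n) {
        List<Map.Entry<String, Float>> result = new ArrayList<>();
        if (n <= 0) {
            return result;
        }
        // Min-heap ordered by score: the head is the weakest entry kept so far.
        PriorityQueue<Map.Entry<String, Float>> heap =
            new PriorityQueue<>(Map.Entry.<String, Float>comparingByValue());
        for (Map.Entry<String, Float> e : scores.entrySet()) {
            // Copy the entry so the result does not alias the input map.
            heap.offer(new AbstractMap.SimpleImmutableEntry<>(e.getKey(), e.getValue()));
            if (heap.size() > n) {
                heap.poll(); // drop the current lowest score
            }
        }
        result.addAll(heap);
        // Heap iteration order is unspecified, so sort explicitly, best first.
        result.sort(Map.Entry.<String, Float>comparingByValue().reversed());
        return result;
    }
}
```

Calling TopNSketch.topN(scores, 10) on such a map would return the ten highest-scoring URLs, best first. Keeping only N entries in the heap bounds memory to N regardless of how many URLs are scanned.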