Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "bin/nutch_readdb" page has been changed by LewisJohnMcgibbney:
http://wiki.apache.org/nutch/bin/nutch_readdb?action=diff&rev1=7&rev2=8

Comment:
Reflects changes to Nutch 1.3, this is not complete, content is still required to substantiate on the parameters we can pass

- readdb is an alias for org.apache.nutch.db.WebDBReader
+ Readdb is an alias for org.apache.nutch.crawl.CrawlDbReader
- The WebDBReader implements all the read-only parts of accessing our web database. All the writing ones can be found in WebDBWriter.
+ The CrawlDbReader implements all the read-only parts of accessing our web database. It provides us with a read utility for the CrawlDB.
+ Usage:
+ {{{
- Usage: bin/nutch org.apache.nutch.db.WebDBReader (-local | -ndfs <namenode:port>) <db> [-pageurl url] | [-pagemd5 md5] | [-dumppageurl] | [-dumppagemd5] | [-toppages <k>] | [-linkurl url] | [-linkmd5 md5] | [-dumplinks] | [-stats]
+ bin/nutch org.apache.nutch.crawl.CrawlDbReader (-local | -ndfs <namenode:port>) <db> [-pageurl url] | [-pagemd5 md5] | [-dumppageurl] | [-dumppagemd5] | [-toppages <k>] | [-linkurl url] | [-linkmd5 md5] | [-dumplinks] | [-stats]
+ }}}
+
+ '''(-local | -ndfs <namenode:port>)''':
+
+ '''<db>''':
+
+ '''[-pageurl url]''':
+
+ '''[-pagemd5 md5]''':
+
+ '''[-dumppageurl]''':
+
+ '''[-dumppagemd5]''':
+
+ '''[-toppages <k>]''':
+
+ '''[-linkurl url]''':
+
+ '''[-linkmd5 md5]''':
+
+ '''[-dumplinks]''':
+
+ '''[-stats]''':
  CommandLineOptions
-
- (Actually this looks out of date. You might be looking for org.apache.nutch.crawl.CrawlDbReader instead)
-

