Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "bin/nutch_readdb" page has been changed by LewisJohnMcgibbney:
http://wiki.apache.org/nutch/bin/nutch_readdb?action=diff&rev1=7&rev2=8

Comment:
Reflects changes to Nutch 1.3, this is not complete, content is still required 
to substantiate on the parameters we can pass

- readdb is an alias for org.apache.nutch.db.WebDBReader
+ Readdb is an alias for org.apache.nutch.crawl.CrawlDbReader
  
- The WebDBReader implements all the read-only parts of accessing our web 
database. All the writing ones can be found in WebDBWriter.
+ The CrawlDbReader implements all the read-only parts of accessing our web 
database. It provides us with a read utility for the CrawlDB.
  
+ Usage: 
+ {{{
- Usage: bin/nutch org.apache.nutch.db.WebDBReader (-local | -ndfs 
<namenode:port>) <db> [-pageurl url] | [-pagemd5 md5] | [-dumppageurl] | 
[-dumppagemd5] | [-toppages <k>] | [-linkurl url] | [-linkmd5 md5] | 
[-dumplinks] | [-stats]
+ bin/nutch org.apache.nutch.crawl.CrawlDbReader (-local | -ndfs 
<namenode:port>) <db> [-pageurl url] | [-pagemd5 md5] | [-dumppageurl] | 
[-dumppagemd5] | [-toppages <k>] | [-linkurl url] | [-linkmd5 md5] | 
[-dumplinks] | [-stats]
+ }}}
+ 
+ '''(-local | -ndfs <namenode:port>)''':
+ 
+ '''<db>''':
+ 
+ '''[-pageurl url]''':
+ 
+ '''[-pagemd5 md5]''':
+ 
+ '''[-dumppageurl]''':
+ 
+ '''[-dumppagemd5]''':
+ 
+ '''[-toppages <k>]''':
+ 
+ '''[-linkurl url]''':
+ 
+ '''[-linkmd5 md5]''':
+ 
+ '''[-dumplinks]''':
+ 
+ '''[-stats]''':
  
  CommandLineOptions
  
- 
- (Actually this looks out of date. You might be looking for 
org.apache.nutch.crawl.CrawlDbReader instead)
- 

Reply via email to