Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "NutchHadoopTutorial" page has been changed by TejasPatil:
https://wiki.apache.org/nutch/NutchHadoopTutorial?action=diff&rev1=41&rev2=42

  You might want to try some of these commands before doing a search
  
  {{{
- hadoop jar nutch-${version}.jar org.apache.nutch.crawl.LinkDbReader 
crawldb/linkdb -dump /tmp/linksdir
+ hadoop jar nutch-${version}.job org.apache.nutch.crawl.LinkDbReader 
crawldb/linkdb -dump /tmp/linksdir
  mkdir /nutch/search/output/
  bin/hadoop dfs -copyToLocal /tmp/linksdir  /nutch/search/output/linksdir
  less /nutch/search/output/linksdir/*
@@ -407, +407 @@

  Or if we want to look at the whole thing as a text file we might try 
  
  {{{
- hadoop jar nutch-${version}.jar org.apache.nutch.crawl.LinkDbReader 
crawldb/linkdb -dump /tmp/entiredump
+ hadoop jar nutch-${version}.job org.apache.nutch.crawl.LinkDbReader 
crawldb/linkdb -dump /tmp/entiredump
  bin/hadoop dfs -copyToLocal /tmp/entiredump  /nutch/search/output/entiredump
  less /nutch/search/output/entiredump/*
  }}}

Reply via email to