is there any command for that when I use describe 'webpage'
there is not column something like fetchtime? How can I see it from Hbase? 2013/3/21 Tejas Patil [via Lucene] <[email protected] > > yes. If you have configured it to use HBase, then the info will be stored > in HBase. > > On Wed, Mar 20, 2013 at 4:27 PM, kamaci <[hidden > email]<http://user/SendEmail.jtp?type=node&node=4049596&i=0>> > wrote: > > > I use Hbase than where is that crawldb? Is it stored at my Hbase or any > > other special folder at Nutch? > > > > 2013/3/21 Tejas Patil [via Lucene] < > > [hidden email] <http://user/SendEmail.jtp?type=node&node=4049596&i=1> > > > > > > > > "readdb" works for both versions of nutch. In 2.x, its implemented by > > > WebTableReader [0] class. See the usage to get more details of the > > > command. > > > > > > [0] > > > > > > > > > http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/crawl/WebTableReader.java?view=markup > > > > > > > > > > > > On Wed, Mar 20, 2013 at 4:18 PM, kamaci <[hidden email]< > > http://user/SendEmail.jtp?type=node&node=4049588&i=0>> > > > wrote: > > > > > > > Ok that works for me: > > > > > > > > ./bin/nutch readdb -url http://www.generalist.org.uk/blog/ > > > > > > > > > > > > 2013/3/21 kamaci [via Lucene] <[hidden email]< > > http://user/SendEmail.jtp?type=node&node=4049588&i=1>> > > > > > > > > > > > > I use Nutch 2.1 and don't use that crawldb command. I have an > Hbase > > > > > database. Can I see such kind of data still? I think readdb > doesn't > > > work > > > > at > > > > > my situaton? > > > > > > > > > > ------------------------------ > > > > > If you reply to this email, your message will be added to the > > > discussion > > > > > below: > > > > > > > > > > > > > > > > > > > > http://lucene.472066.n3.nabble.com/Does-Nutch-Checks-Whether-A-Page-crawled-before-or-not-tp4049564p4049582.html > > > > > To unsubscribe from Does Nutch Checks Whether A Page crawled > before > > > or > > > > > not, click here< > > > > > > > > > > . > > > > > NAML< > > > > > > > > > > http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=macro_viewer&id=instant_html%21nabble%3Aemail.naml&base=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespace&breadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > -- > > > > View this message in context: > > > > > > > > > > http://lucene.472066.n3.nabble.com/Does-Nutch-Checks-Whether-A-Page-crawled-before-or-not-tp4049564p4049584.html > > > > > > > Sent from the Nutch - User mailing list archive at Nabble.com. > > > > > > > > > > > > > ------------------------------ > > > If you reply to this email, your message will be added to the > discussion > > > below: > > > > > > > > > http://lucene.472066.n3.nabble.com/Does-Nutch-Checks-Whether-A-Page-crawled-before-or-not-tp4049564p4049588.html > > > To unsubscribe from Does Nutch Checks Whether A Page crawled before > or > > > not, click here< > > > > > > . > > > NAML< > > > http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=macro_viewer&id=instant_html%21nabble%3Aemail.naml&base=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespace&breadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml > > > > > > > > > > > > > > > > -- > > View this message in context: > > > http://lucene.472066.n3.nabble.com/Does-Nutch-Checks-Whether-A-Page-crawled-before-or-not-tp4049564p4049590.html > > > Sent from the Nutch - User mailing list archive at Nabble.com. > > > > > ------------------------------ > If you reply to this email, your message will be added to the discussion > below: > > http://lucene.472066.n3.nabble.com/Does-Nutch-Checks-Whether-A-Page-crawled-before-or-not-tp4049564p4049596.html > To unsubscribe from Does Nutch Checks Whether A Page crawled before or > not, click > here<http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=unsubscribe_by_code&node=4049564&code=ZnVya2Fua2FtYWNpQGdtYWlsLmNvbXw0MDQ5NTY0fDEyODM4MDc0Mg==> > . > NAML<http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=macro_viewer&id=instant_html%21nabble%3Aemail.naml&base=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespace&breadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml> > -- View this message in context: http://lucene.472066.n3.nabble.com/Does-Nutch-Checks-Whether-A-Page-crawled-before-or-not-tp4049564p4049597.html Sent from the Nutch - User mailing list archive at Nabble.com.

