I use Hbase than where is that crawldb? Is it stored at my Hbase or any
other special folder at Nutch?

2013/3/21 Tejas Patil [via Lucene] <[email protected]
>

> "readdb" works for both versions of nutch. In 2.x, its implemented by
> WebTableReader [0] class. See the usage to get more details of the
> command.
>
> [0]
>
> http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/crawl/WebTableReader.java?view=markup
>
>
>
> On Wed, Mar 20, 2013 at 4:18 PM, kamaci <[hidden 
> email]<http://user/SendEmail.jtp?type=node&node=4049588&i=0>>
> wrote:
>
> > Ok that works for me:
> >
> > ./bin/nutch readdb -url http://www.generalist.org.uk/blog/
> >
> >
> > 2013/3/21 kamaci [via Lucene] <[hidden 
> > email]<http://user/SendEmail.jtp?type=node&node=4049588&i=1>>
>
> >
> > > I use Nutch 2.1 and don't use that crawldb command. I have an Hbase
> > > database. Can I see such kind of data still? I think readdb doesn't
> work
> > at
> > > my situaton?
> > >
> > > ------------------------------
> > >  If you reply to this email, your message will be added to the
> discussion
> > > below:
> > >
> > >
> >
> http://lucene.472066.n3.nabble.com/Does-Nutch-Checks-Whether-A-Page-crawled-before-or-not-tp4049564p4049582.html
> > >  To unsubscribe from Does Nutch Checks Whether A Page crawled before
> or
> > > not, click here<
> > >
> > > .
> > > NAML<
> >
> http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=macro_viewer&id=instant_html%21nabble%3Aemail.naml&base=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespace&breadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml
> > >
> > >
> >
> >
> >
> >
> > --
> > View this message in context:
> >
> http://lucene.472066.n3.nabble.com/Does-Nutch-Checks-Whether-A-Page-crawled-before-or-not-tp4049564p4049584.html
>
> > Sent from the Nutch - User mailing list archive at Nabble.com.
> >
>
>
> ------------------------------
>  If you reply to this email, your message will be added to the discussion
> below:
>
> http://lucene.472066.n3.nabble.com/Does-Nutch-Checks-Whether-A-Page-crawled-before-or-not-tp4049564p4049588.html
>  To unsubscribe from Does Nutch Checks Whether A Page crawled before or
> not, click 
> here<http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=unsubscribe_by_code&node=4049564&code=ZnVya2Fua2FtYWNpQGdtYWlsLmNvbXw0MDQ5NTY0fDEyODM4MDc0Mg==>
> .
> NAML<http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=macro_viewer&id=instant_html%21nabble%3Aemail.naml&base=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespace&breadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml>
>




--
View this message in context: 
http://lucene.472066.n3.nabble.com/Does-Nutch-Checks-Whether-A-Page-crawled-before-or-not-tp4049564p4049590.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to