Kamaci,
Thanks for using Nutch 2.x.
Could I kindly ask you to review both our guidelines for community
correspondence [0] and the plethora of information linked from the main
Nutch wiki page?
Questions like these are much better answered by taking a bit of time to read
through the documentation (which is now better than it has ever been) and
then approaching the community.
Thank you
Lewis

[0]
http://wiki.apache.org/nutch/Becoming_A_Nutch_Developer#Step_One:_Using_the_Mailing_Lists

On Wed, Mar 20, 2013 at 4:43 PM, kamaci <[email protected]> wrote:

> Is there a command for that? When I use
>
> describe 'webpage'
>
> there is no column like fetchtime. How can I see it from HBase?
>
> 2013/3/21 Tejas Patil [via Lucene] <[email protected]>
>
> > Yes. If you have configured it to use HBase, then the info will be stored
> > in HBase.
> >
> > On Wed, Mar 20, 2013 at 4:27 PM, kamaci <[hidden email]> wrote:
> >
> > > I use HBase, so where is that crawldb? Is it stored in my HBase or in
> > > some other special folder in Nutch?
> > >
> > > 2013/3/21 Tejas Patil [via Lucene] <[hidden email]>
> > >
> > > > "readdb" works for both versions of Nutch. In 2.x, it's implemented by
> > > > the WebTableReader [0] class. See the usage to get more details of the
> > > > command.
> > > >
> > > > [0]
> > > > http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/crawl/WebTableReader.java?view=markup
> > > >
> > > > On Wed, Mar 20, 2013 at 4:18 PM, kamaci <[hidden email]> wrote:
> > > >
> > > > > Ok that works for me:
> > > > >
> > > > > ./bin/nutch readdb -url http://www.generalist.org.uk/blog/
> > > > >
> > > > >
> > > > > > 2013/3/21 kamaci [via Lucene] <[hidden email]>
> > > > > >
> > > > > > > I use Nutch 2.1 and don't use that crawldb command. I have an HBase
> > > > > > > database. Can I still see this kind of data? I think readdb doesn't
> > > > > > > work in my situation?
> > > > > >
> > > > > > ------------------------------
> > > > > >  If you reply to this email, your message will be added to the
> > > > discussion
> > > > > > below:
> > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
> http://lucene.472066.n3.nabble.com/Does-Nutch-Checks-Whether-A-Page-crawled-before-or-not-tp4049564p4049582.html
> > > > > >  To unsubscribe from Does Nutch Checks Whether A Page crawled
> > before
> > > > or
> > > > > > not, click here<
> > > > > >
> > > > > > .
> > > > > > NAML<
> > > > >
> > > >
> > >
> >
> http://lucene.472066.n3.nabble.com/template/NamlServlet.jtp?macro=macro_viewer&id=instant_html%21nabble%3Aemail.naml&base=nabble.naml.namespaces.BasicNamespace-nabble.view.web.template.NabbleNamespace-nabble.view.web.template.NodeNamespace&breadcrumbs=notify_subscribers%21nabble%3Aemail.naml-instant_emails%21nabble%3Aemail.naml-send_instant_email%21nabble%3Aemail.naml
> > > > > >
> > > > > >
> > > > >
> > > > >
> > > > >
> > > > >
> > > > > --
> > > > > > View this message in context:
> > > > > > http://lucene.472066.n3.nabble.com/Does-Nutch-Checks-Whether-A-Page-crawled-before-or-not-tp4049564p4049584.html
> > > > > > Sent from the Nutch - User mailing list archive at Nabble.com.
> > > > >
> > > >
> > >
> > > --
> > > View this message in context:
> > > http://lucene.472066.n3.nabble.com/Does-Nutch-Checks-Whether-A-Page-crawled-before-or-not-tp4049564p4049590.html
> > > Sent from the Nutch - User mailing list archive at Nabble.com.
> > >
> >
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Does-Nutch-Checks-Whether-A-Page-crawled-before-or-not-tp4049564p4049597.html
> Sent from the Nutch - User mailing list archive at Nabble.com.




-- 
*Lewis*
