Michael,
I'm afraid to say but the segread doesn't exists in the 0.8 branch
anymore.
I was knowing both methods but with map reduce the file structures
are different, that is why I was asking.
Thanks, anyway.
Stefan
Am 15.10.2005 um 04:22 schrieb Michael Ji:
or, you can use segread in bin/nutch to dump a new
fetch segment to see what page it fetched,
Michael Ji,
--- Stefan Groschupf <[EMAIL PROTECTED]> wrote:
Which class do you mean?
There is the old webdbadmin tool, but I guess this
will not work for
the new crawl db.
The bin/nutch admin command isn't supported until
more.
Thanks
Stefan
Am 15.10.2005 um 00:21 schrieb Michael Ji:
using DBAdminTool to dump the webdb and you can
get
whole list of Pages in text format,
Michael Ji,
--- Stefan Groschupf <[EMAIL PROTECTED]> wrote:
Hi,
is there any chance to read the statistics of the
nutch 0.8 crawl db
or a trick to get an idea of how many pages are
already crawled?
Thanks for the hints.
Stefan
__________________________________
Start your day with Yahoo! - Make it your home
page!
http://www.yahoo.com/r/hs
__________________________________
Yahoo! Music Unlimited
Access over 1 million songs. Try it free.
http://music.yahoo.com/unlimited/