Re: How come I have so many retries listed in stats?

2010-01-10 Thread Julien Nioche
./description /property Yet after letting things run for some time, if I look at the stats I have the following... Is there some other setting I should have which would prevent such excessive retries? Also, what is up with the negative retry numbers? [Sat Jan 9 08:38:36 2010] Executing

How come I have so many retries listed in stats?

2010-01-09 Thread Jesse Hires
at the stats I have the following... Is there some other setting I should have which would prevent such excessive retries? Also, what is up with the negative retry numbers? [Sat Jan 9 08:38:36 2010] Executing: bin/nutch readdb crawl/crawldb -stats [Sat Jan 9 08:38:37 2010] CrawlDb statistics start: crawl

Stats?

2008-01-31 Thread Paul Stewart
Hi folks... Is there a way to retrieve stats from Nutch - meaning how many webpages are indexed, to be indexed etc?? When I was working with AspSeek and Mnogosearch in the past I could run a command to see stats Thanks again, Paul

Re: Stats?

2008-01-31 Thread Susam Pal
Try this command:- bin/nutch readdb crawl/crawldb -stats To get help, try:- bin/nutch readdb Regards, Susam Pal On Feb 1, 2008 8:21 AM, Paul Stewart [EMAIL PROTECTED] wrote: Hi folks... Is there a way to retrieve stats from Nutch - meaning how many webpages are indexed, to be indexed

Nutch Search stats

2006-04-21 Thread Aled Jones
Hiya all Does nutch save any of the search terms entered for stats purposes? E.g. most commonly used terms and so on. Pity but I can't come to the nutch-user meeting, an 11 hour flight too far! ;-) Cheers Aled ### This message has been scanned by F

Re: Nutch Search stats

2006-04-21 Thread Ravish Bhagdev
No. Not at present (unless somone enlightens me) R On 4/21/06, Aled Jones [EMAIL PROTECTED] wrote: Hiya all Does nutch save any of the search terms entered for stats purposes? E.g. most commonly used terms and so on. Pity but I can't come to the nutch-user meeting, an 11 hour flight too

Re: Nutch Search stats

2006-04-21 Thread Bill Goffe
200 7176 - Bill Ravish Bhagdev said: No. Not at present (unless somone enlightens me) R On 4/21/06, Aled Jones [EMAIL PROTECTED] wrote: Hiya all Does nutch save any of the search terms entered for stats purposes? E.g. most commonly used terms and so on. Pity but I

Re: Nutch Search stats

2006-04-21 Thread Berlin Brown
: No. Not at present (unless somone enlightens me) R On 4/21/06, Aled Jones [EMAIL PROTECTED] wrote: Hiya all Does nutch save any of the search terms entered for stats purposes? E.g. most commonly used terms and so on. Pity but I can't come to the nutch-user meeting, an 11 hour

Stats

2005-11-28 Thread Aled Jones
Hi Are there any tools/plugins to get stats on crawled data e.g pages found, domains searched, common keywords etc? Thanks Aled This e-mail and any attachments are strictly confidential and intended solely

Re: Stats

2005-11-28 Thread Stefan Groschupf
28.11.2005 um 14:45 schrieb Aled Jones: Hi Are there any tools/plugins to get stats on crawled data e.g pages found, domains searched, common keywords etc? Thanks Aled ** ** This e-mail and any attachments are strictly

Re: Stats

2005-11-28 Thread Andrzej Bialecki
Stefan Groschupf wrote: Yes, for 0.7 there is a segement and db tool that gives you this information for 0.8 one is in the pipeline and I think already in the jira. Yes, I'm cleaning it up a little and will commit it shortly. There are already tools in mapred to read the DBs (CrawlDB

ATB: Stats

2005-11-28 Thread Aled Jones
/Subject: Re: Stats Stefan Groschupf wrote: Yes, for 0.7 there is a segement and db tool that gives you this information for 0.8 one is in the pipeline and I think already in the jira. Yes, I'm cleaning it up a little and will commit it shortly. There are already tools in mapred

YML: Stats

2005-11-28 Thread Aled Jones
Nevermind, got it working. Just being a div ;-) -Neges Wreiddiol-/-Original Message- Oddi wrth/From: Aled Jones Anfonwyd/Sent: 28 November 2005 15:07 At/To: nutch-user@lucene.apache.org Pwnc/Subject: ATB: Stats Cool, thanks for that. Do I only need the luke.jar as nutch

nutch readdb db -stats Pages Fetched Not Consistent

2005-11-10 Thread Bill Goffe
Now that I've got my reverse proxy up, one less pressing question. With ~/nutch/bin/nutch readdb db -stats I get Number of pages: 1,399,730 Number of links: 5,369,361 Yet when I search my log (have a perl script that outputs everything to it) with grep fetching log | wc -l I get 313,998