./description
/property
Yet after letting things run for some time, if I look at the stats I have
the following... Is there some other setting I should have which would
prevent such excessive retries? Also, what is up with the negative retry
numbers?
[Sat Jan 9 08:38:36 2010] Executing
at the stats I have
the following... Is there some other setting I should have which would
prevent such excessive retries? Also, what is up with the negative retry
numbers?
[Sat Jan 9 08:38:36 2010] Executing: bin/nutch readdb crawl/crawldb -stats
[Sat Jan 9 08:38:37 2010] CrawlDb statistics start: crawl
Hi folks...
Is there a way to retrieve stats from Nutch - meaning how many webpages
are indexed, to be indexed etc??
When I was working with AspSeek and Mnogosearch in the past I could run
a command to see stats
Thanks again,
Paul
Try this command:-
bin/nutch readdb crawl/crawldb -stats
To get help, try:-
bin/nutch readdb
Regards,
Susam Pal
On Feb 1, 2008 8:21 AM, Paul Stewart [EMAIL PROTECTED] wrote:
Hi folks...
Is there a way to retrieve stats from Nutch - meaning how many webpages
are indexed, to be indexed
Hiya all
Does nutch save any of the search terms entered for stats purposes? E.g.
most commonly used terms and so on.
Pity but I can't come to the nutch-user meeting, an 11 hour flight too
far! ;-)
Cheers
Aled
###
This message has been scanned by F
No. Not at present (unless somone enlightens me)
R
On 4/21/06, Aled Jones [EMAIL PROTECTED] wrote:
Hiya all
Does nutch save any of the search terms entered for stats purposes? E.g.
most commonly used terms and so on.
Pity but I can't come to the nutch-user meeting, an 11 hour flight too
200 7176
- Bill
Ravish Bhagdev said:
No. Not at present (unless somone enlightens me)
R
On 4/21/06, Aled Jones [EMAIL PROTECTED] wrote:
Hiya all
Does nutch save any of the search terms entered for stats purposes? E.g.
most commonly used terms and so on.
Pity but I
:
No. Not at present (unless somone enlightens me)
R
On 4/21/06, Aled Jones [EMAIL PROTECTED] wrote:
Hiya all
Does nutch save any of the search terms entered for stats purposes? E.g.
most commonly used terms and so on.
Pity but I can't come to the nutch-user meeting, an 11 hour
Hi
Are there any tools/plugins to get stats on crawled data e.g pages
found, domains searched, common keywords etc?
Thanks
Aled
This e-mail and any attachments are strictly confidential and intended solely
28.11.2005 um 14:45 schrieb Aled Jones:
Hi
Are there any tools/plugins to get stats on crawled data e.g pages
found, domains searched, common keywords etc?
Thanks
Aled
**
**
This e-mail and any attachments are strictly
Stefan Groschupf wrote:
Yes,
for 0.7 there is a segement and db tool that gives you this
information for 0.8 one is in the pipeline and I think already in the
jira.
Yes, I'm cleaning it up a little and will commit it shortly. There are
already tools in mapred to read the DBs (CrawlDB
/Subject: Re: Stats
Stefan Groschupf wrote:
Yes,
for 0.7 there is a segement and db tool that gives you this
information for 0.8 one is in the pipeline and I think
already in the
jira.
Yes, I'm cleaning it up a little and will commit it shortly.
There are already tools in mapred
Nevermind, got it working. Just being a div ;-)
-Neges Wreiddiol-/-Original Message-
Oddi wrth/From: Aled Jones
Anfonwyd/Sent: 28 November 2005 15:07
At/To: nutch-user@lucene.apache.org
Pwnc/Subject: ATB: Stats
Cool, thanks for that.
Do I only need the luke.jar as nutch
Now that I've got my reverse proxy up, one less pressing question. With
~/nutch/bin/nutch readdb db -stats
I get
Number of pages: 1,399,730
Number of links: 5,369,361
Yet when I search my log (have a perl script that outputs everything to
it) with grep fetching log | wc -l I get
313,998
14 matches
Mail list logo