Re: Problems with DomainStatistics

2015-01-07 Thread Lewis John Mcgibbney
Hi Markus,

On Wed, Jan 7, 2015 at 7:42 PM, user-digest-h...@nutch.apache.org wrote:

 Hi - it is a strange piece indeed. You cannot just tell it where the
 crawldb is, you need to tell it where the directory is, so specifying
 current is ok, but not part-*


Thanks very much. I'll cook a patch up and get it to Jira.
Lewis


RE: Problems with DomainStatistics

2015-01-07 Thread Markus Jelsma
Hi - it is a strange piece indeed. You cannot just tell it where the crawldb 
is, you need to tell it where the directory is, so specifying current is ok, 
but not part-*

M
 
 
-Original message-
 From:Lewis John Mcgibbney lewis.mcgibb...@gmail.com
 Sent: Wednesday 7th January 2015 19:48
 To: user@nutch.apache.org
 Subject: Problems with DomainStatistics
 
 Hi Folks,
 Does anyone else have problems with the DomainStaticstics [0] tool?
 
 I use it as follows
 
 ./bin/nutch domainstats /usr/local/.../crawldb/old/part-0/ output tld
 
 Although it is generated, nothing is written to the output directory
 
 ./bin/nutch domainstats /usr/local/.../crawldb/current/part-0/ output
 tld
 
 Although it is generated, nothing is written to the output directory
 
 The command line is less than helpful however I;ve even looked into the
 code and can quite see where I'm missing things.
 Any help is appreciated.
 Thanks
 Lewis
 
 [0]
 https://github.com/apache/nutch/blob/trunk/src/java/org/apache/nutch/util/domain/DomainStatistics.java
 
 -- 
 *Lewis*
 


Problems with DomainStatistics

2015-01-07 Thread Lewis John Mcgibbney
Hi Folks,
Does anyone else have problems with the DomainStaticstics [0] tool?

I use it as follows

./bin/nutch domainstats /usr/local/.../crawldb/old/part-0/ output tld

Although it is generated, nothing is written to the output directory

./bin/nutch domainstats /usr/local/.../crawldb/current/part-0/ output
tld

Although it is generated, nothing is written to the output directory

The command line is less than helpful however I;ve even looked into the
code and can quite see where I'm missing things.
Any help is appreciated.
Thanks
Lewis

[0]
https://github.com/apache/nutch/blob/trunk/src/java/org/apache/nutch/util/domain/DomainStatistics.java

-- 
*Lewis*