Re: Hadopp Config Exception in Nutch

2009-03-10 Thread Bartosz Gadzimski

Hi,

Which version of nutch are you using?

The wiki has a tutorial on running Nutch in Eclipse (it's important to 
add the conf dir to the classpath and move it to the top of the load order):

http://wiki.apache.org/nutch/RunNutchInEclipse0.9

I installed the Nutch RC in Eclipse on Windows just 2 hours ago and it's 
working great.


I hope you set searcher.dir in your nutch-site.xml properly (full path to 
your crawl dir).
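
One way to check both points above (conf dir on the classpath, searcher.dir set) from inside Eclipse is a small stand-alone check like the sketch below. It uses only the JDK, not the Nutch API; the class name `ConfCheck` and the helper `readProperty` are made up for illustration, and it assumes nutch-site.xml uses the usual property/name/value layout.

```java
import java.io.InputStream;
import java.net.URL;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical helper, not part of Nutch: reports where nutch-site.xml is
// loaded from and what value it gives for searcher.dir.
class ConfCheck {

    /** Returns the value of the named property, or null if it is absent. */
    static String readProperty(InputStream xml, String name) {
        try {
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder().parse(xml);
            NodeList props = doc.getElementsByTagName("property");
            for (int i = 0; i < props.getLength(); i++) {
                Element prop = (Element) props.item(i);
                NodeList names = prop.getElementsByTagName("name");
                NodeList values = prop.getElementsByTagName("value");
                if (names.getLength() > 0 && values.getLength() > 0
                        && name.equals(names.item(0).getTextContent().trim())) {
                    return values.item(0).getTextContent().trim();
                }
            }
            return null;
        } catch (Exception e) {
            throw new IllegalStateException("could not parse config XML", e);
        }
    }

    public static void main(String[] args) throws Exception {
        // Looks the file up the same way a classpath resource is resolved.
        URL site = ConfCheck.class.getClassLoader().getResource("nutch-site.xml");
        if (site == null) {
            System.out.println("nutch-site.xml is NOT on the classpath");
            return;
        }
        System.out.println("nutch-site.xml loaded from: " + site);
        try (InputStream in = site.openStream()) {
            System.out.println("searcher.dir = " + readProperty(in, "searcher.dir"));
        }
    }
}
```

If the printed location is not your own conf dir, or searcher.dir comes back as null, the search side is probably not reading the configuration you edited.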


Thanks,
Bartosz

Lukas, Ray wrote:

Has anyone seen this? Do you know the solution? I will start looking
through the Hadoop code, but if someone has fixed this already I would
appreciate knowing. Thanks guys.

Fri Mar 6 14:48:40 2009 DEBUG main java.io.IOException: config()
    at org.apache.hadoop.conf.Configuration.init(Configuration.java:93)
    at org.apache.nutch.util.NutchConfiguration.create(NutchConfiguration.java:51)
    at IndexCreator.init(IndexCreator.java:66)
    at Driver.createIndex(Driver.java:17)
    at Driver.main(Driver.java:10)

and
 
Fri Mar 6 14:48:40 2009 DEBUG main java.io.IOException: config(config)
    at org.apache.hadoop.conf.Configuration.init(Configuration.java:102)
    at org.apache.hadoop.mapred.JobConf.init(JobConf.java:77)
    at org.apache.hadoop.mapred.JobConf.init(JobConf.java:88)
    at org.apache.nutch.util.NutchJob.init(NutchJob.java:27)
    at IndexCreator.init(IndexCreator.java:68)
    at Driver.createIndex(Driver.java:17)
    at Driver.main(Driver.java:10)

Ray

Some background: I am working in Eclipse, on Windows. Put the jar files
in the lib directory from the Nutch distribution, added the plug-in
directory, and Nutch seems to generate indexes which Luke can open even
though this happens. But NutchBean search always fails to find
anything (returns zero hit objects), even though Luke finds and displays
the indexes without a problem. I am thinking the index dir is okay but the
other things are not, so NutchBean might be relying on those other
things, and so I am seeking to resolve this.
I set up log4j correctly and now I am finally seeing these. Thought I
would ask if anyone else has run across this. Sorry, I am new to Nutch,
but I don't want to stay that way.
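
A note on the two traces above: both are logged at DEBUG level, and the top frames sit in the Configuration constructors. As far as I recall, Hadoop releases of that era created an IOException("config()") in the Configuration constructor only to log where the object was built, without ever throwing it. If that is what is happening here, the traces are diagnostic noise and not the cause of the empty search results. The pattern can be sketched as below; `TracedConfig` and `stringify` are illustrative names, and java.util.logging stands in for the logging library Hadoop actually uses.

```java
import java.io.IOException;
import java.io.PrintWriter;
import java.io.StringWriter;
import java.util.logging.Level;
import java.util.logging.Logger;

// Illustrative stand-in for a class that logs its construction site.
class TracedConfig {
    private static final Logger LOG = Logger.getLogger(TracedConfig.class.getName());

    /** Renders a throwable's stack trace as a string without throwing it. */
    static String stringify(Throwable t) {
        StringWriter out = new StringWriter();
        t.printStackTrace(new PrintWriter(out, true));
        return out.toString();
    }

    TracedConfig() {
        // The exception exists only to capture the call stack; it is never thrown.
        if (LOG.isLoggable(Level.FINE)) {
            LOG.fine(stringify(new IOException("config()")));
        }
    }

    public static void main(String[] args) {
        new TracedConfig(); // construction succeeds whether or not the trace is logged
        System.out.println(stringify(new IOException("config()")));
    }
}
```

Running it prints a trace that starts with "java.io.IOException: config()" even though nothing failed, which is the same shape as the log entries above.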

  




RE: Hadopp Config Exception in Nutch

2009-03-10 Thread Lukas, Ray
Oh rats. Sorry, early morning here. I forgot the version; you need
to know that. I am running Nutch 0.9. I have gone through the
tutorial, but I have not done as well as you have.

Yep, I set up that search.dir setting. I am going to go through everything
from scratch again, I think. I am missing something.
So you have search from your Java app working then, I take it. Hmm,
very interesting. I would sure like to spend 15 minutes with you.
Okay, so there is hope for me. Good. I am really
close, I can taste it.
Thanks man. I am going back through that tutorial. I must have missed
something.


   



Re: Hadopp Config Exception in Nutch

2009-03-10 Thread Bartosz Gadzimski

Hi,

I can use it from Cygwin, from Eclipse, or any way you want. Crawl works, 
and NutchBean also.


You should try Nutch trunk or even the RC: 
http://people.apache.org/~siren/nutch-1.0/rc1/nutch-1.0.tar.gz


There are too many differences to list here; it's just 10 times better than 0.9.



  




  




RE: Hadopp Config Exception in Nutch

2009-03-10 Thread Lukas, Ray
Okay. I am switching over right now. RIGHT NOW!
