RE: nutch-1.0.war deploying error

2009-10-13 Thread nikinch

yeap you were right I edited the JAVA_HOME but not everywhere.
Anyways thanks a bunch it was a jvm version problem.



Arkadi wrote:
 
 Hi,
 
 It looks like you have to upgrade your jvm.
 
 Arkadi 
 
 -Original Message-
 From: nikinch [mailto:maill...@qwamci.com]
 Sent: Tuesday, October 13, 2009 1:20 AM
 To: nutch-user@lucene.apache.org
 Subject: nutch-1.0.war deploying error
 
 
 Hello
 
 I have been playing around with nutch-1.0 recently and upon looking into
 the
 carrot2 feature i have tried to deploy the .war to test.
 I copied the war fil in my apache webapps folder but i get this error
 upon
 trying to start the service.
 I can't seem to get it fixed if anyone has some input as to what is the
 problem i'd appreciate
 
 12 oct. 2009 15:03:35 org.apache.catalina.core.StandardContext
 listenerStart
 GRAVE: Exception lors de l'envoi de l'évènement contexte initialisé
 (context
 initialized) à l'instance de classe d'écoute (listener)
 org.apache.nutch.searcher.NutchBean$NutchBeanConstructor
 java.lang.UnsupportedClassVersionError: Bad version number in .class file
 (unable to load class org.apache.hadoop.io.VersionedWritable) (unable to
 load class org.apache.nutch.parse.ParseData)
 at
 org.apache.catalina.loader.WebappClassLoader.findClassInternal(WebappClass
 Loader.java:1854)
 at
 org.apache.catalina.loader.WebappClassLoader.findClass(WebappClassLoader.j
 ava:890)
 at
 org.apache.catalina.loader.WebappClassLoader.loadClass(WebappClassLoader.j
 ava:1354)
 at
 org.apache.catalina.loader.WebappClassLoader.loadClass(WebappClassLoader.j
 ava:1233)
 at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:319)
 at
 org.apache.nutch.util.NutchConfiguration.clinit(NutchConfiguration.java:
 42)
 at
 org.apache.nutch.searcher.NutchBean$NutchBeanConstructor.contextInitialize
 d(NutchBean.java:420)
 at
 org.apache.catalina.core.StandardContext.listenerStart(StandardContext.jav
 a:3934)
 at
 org.apache.catalina.core.StandardContext.start(StandardContext.java:4429)
 at
 org.apache.catalina.manager.ManagerServlet.start(ManagerServlet.java:1249)
 at
 org.apache.catalina.manager.HTMLManagerServlet.start(HTMLManagerServlet.ja
 va:612)
 at
 org.apache.catalina.manager.HTMLManagerServlet.doGet(HTMLManagerServlet.ja
 va:136)
 at javax.servlet.http.HttpServlet.service(HttpServlet.java:617)
 at javax.servlet.http.HttpServlet.service(HttpServlet.java:717)
 at
 org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(Applicati
 onFilterChain.java:290)
 at
 org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilter
 Chain.java:206)
 at
 org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.
 java:233)
 at
 org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.
 java:191)
 at
 org.apache.catalina.authenticator.AuthenticatorBase.invoke(AuthenticatorBa
 se.java:525)
 at
 org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:1
 28)
 at
 org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:1
 02)
 at
 org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.ja
 va:109)
 at
 org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:293
 )
 at
 org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:849)
 at
 org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Ht
 tp11Protocol.java:583)
 at
 org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:454)
 at java.lang.Thread.run(Thread.java:595)
 
 --
 View this message in context: http://www.nabble.com/nutch-1.0.war-
 deploying-error-tp25856641p25856641.html
 Sent from the Nutch - User mailing list archive at Nabble.com.
 
 
 

-- 
View this message in context: 
http://www.nabble.com/nutch-1.0.war-deploying-error-tp25856641p25869254.html
Sent from the Nutch - User mailing list archive at Nabble.com.



Re: Incremental Whole Web Crawling

2009-10-13 Thread Eric Osgood

Andrzej,

Where do I get the nightly builds from? I tried to use the eclipse  
plugin that supports svn to no avail. Is there a ftp, http server  
where I can download the nutch source fresh?


Thanks,

Eric

On Oct 11, 2009, at 12:40 PM, Andrzej Bialecki wrote:


Eric Osgood wrote:
When I set generate.update.db to true and then run generate, it  
only runs twice and generates 100K for the 1st gen, 62.5K for the  
second gen and 0 for the 3rd gen on a seed list of 1.6M. I don't  
understand this, for a topN of 100K it should run 16 times and  
create 16 distinct lists if I am not mistaken.


There was a bug in this code that I fixed recently - please get a  
new nightly build and try it again.



--
Best regards,
Andrzej Bialecki 
___. ___ ___ ___ _ _   __
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



Eric Osgood
-
Cal Poly - Computer Engineering, Moon Valley Software
-
eosg...@calpoly.edu, e...@lakemeadonline.com
-
www.calpoly.edu/~eosgood, www.lakemeadonline.com



Re: Incremental Whole Web Crawling

2009-10-13 Thread Eric Osgood
Ok, I think I am on the right track now, but just to be sure: the code  
I want is the branch section of svn under nutchbase at http://svn.apache.org/repos/asf/lucene/nutch/branches/nutchbase/ 
 correct?


Thanks,

Eric


On Oct 13, 2009, at 1:38 PM, Andrzej Bialecki wrote:


Eric Osgood wrote:

Andrzej,
Where do I get the nightly builds from? I tried to use the eclipse  
plugin that supports svn to no avail. Is there a ftp, http server  
where I can download the nutch source fresh?


Personally I prefer to use a command-line svn, even though I do  
development in Eclipse - I'm probably old-fashioned but I always  
want to be very clear on what's going on when I do an update.


See the instructions here:

http://lucene.apache.org/nutch/version_control.html


--
Best regards,
Andrzej Bialecki 
___. ___ ___ ___ _ _   __
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



Eric Osgood
-
Cal Poly - Computer Engineering, Moon Valley Software
-
eosg...@calpoly.edu, e...@lakemeadonline.com
-
www.calpoly.edu/~eosgood, www.lakemeadonline.com



Re: Incremental Whole Web Crawling

2009-10-13 Thread Andrzej Bialecki

Eric Osgood wrote:
Ok, I think I am on the right track now, but just to be sure: the code I 
want is the branch section of svn under nutchbase at 
http://svn.apache.org/repos/asf/lucene/nutch/branches/nutchbase/ correct?


No, you need the trunk from here:

http://svn.apache.org/repos/asf/lucene/nutch/trunk


--
Best regards,
Andrzej Bialecki 
 ___. ___ ___ ___ _ _   __
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



Re: Incremental Whole Web Crawling

2009-10-13 Thread Andrzej Bialecki

Eric Osgood wrote:

So the trunk contains the most recent nightly update?


It's the other way around - nightly build is created from a snapshot of 
the trunk. The trunk is always the most recent.



--
Best regards,
Andrzej Bialecki 
 ___. ___ ___ ___ _ _   __
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



Re: Incremental Whole Web Crawling

2009-10-13 Thread Eric Osgood

O ok,

You learn something new everyday! I didn't know that the trunk was the  
most recent build. Good to know! So this current trunk does have a fix  
for the generator bug?



On Oct 13, 2009, at 2:05 PM, Andrzej Bialecki wrote:


Eric Osgood wrote:

So the trunk contains the most recent nightly update?


It's the other way around - nightly build is created from a snapshot  
of the trunk. The trunk is always the most recent.



--
Best regards,
Andrzej Bialecki 
___. ___ ___ ___ _ _   __
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com



Eric Osgood
-
Cal Poly - Computer Engineering, Moon Valley Software
-
eosg...@calpoly.edu, e...@lakemeadonline.com
-
www.calpoly.edu/~eosgood, www.lakemeadonline.com