RE: nutch-1.0.war deploying error
yeap you were right I edited the JAVA_HOME but not everywhere. Anyways thanks a bunch it was a jvm version problem. Arkadi wrote: Hi, It looks like you have to upgrade your jvm. Arkadi -Original Message- From: nikinch [mailto:maill...@qwamci.com] Sent: Tuesday, October 13, 2009 1:20 AM To: nutch-user@lucene.apache.org Subject: nutch-1.0.war deploying error Hello I have been playing around with nutch-1.0 recently and upon looking into the carrot2 feature i have tried to deploy the .war to test. I copied the war fil in my apache webapps folder but i get this error upon trying to start the service. I can't seem to get it fixed if anyone has some input as to what is the problem i'd appreciate 12 oct. 2009 15:03:35 org.apache.catalina.core.StandardContext listenerStart GRAVE: Exception lors de l'envoi de l'évènement contexte initialisé (context initialized) à l'instance de classe d'écoute (listener) org.apache.nutch.searcher.NutchBean$NutchBeanConstructor java.lang.UnsupportedClassVersionError: Bad version number in .class file (unable to load class org.apache.hadoop.io.VersionedWritable) (unable to load class org.apache.nutch.parse.ParseData) at org.apache.catalina.loader.WebappClassLoader.findClassInternal(WebappClass Loader.java:1854) at org.apache.catalina.loader.WebappClassLoader.findClass(WebappClassLoader.j ava:890) at org.apache.catalina.loader.WebappClassLoader.loadClass(WebappClassLoader.j ava:1354) at org.apache.catalina.loader.WebappClassLoader.loadClass(WebappClassLoader.j ava:1233) at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:319) at org.apache.nutch.util.NutchConfiguration.clinit(NutchConfiguration.java: 42) at org.apache.nutch.searcher.NutchBean$NutchBeanConstructor.contextInitialize d(NutchBean.java:420) at org.apache.catalina.core.StandardContext.listenerStart(StandardContext.jav a:3934) at org.apache.catalina.core.StandardContext.start(StandardContext.java:4429) at org.apache.catalina.manager.ManagerServlet.start(ManagerServlet.java:1249) at org.apache.catalina.manager.HTMLManagerServlet.start(HTMLManagerServlet.ja va:612) at org.apache.catalina.manager.HTMLManagerServlet.doGet(HTMLManagerServlet.ja va:136) at javax.servlet.http.HttpServlet.service(HttpServlet.java:617) at javax.servlet.http.HttpServlet.service(HttpServlet.java:717) at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(Applicati onFilterChain.java:290) at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilter Chain.java:206) at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve. java:233) at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve. java:191) at org.apache.catalina.authenticator.AuthenticatorBase.invoke(AuthenticatorBa se.java:525) at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:1 28) at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:1 02) at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.ja va:109) at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:293 ) at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:849) at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Ht tp11Protocol.java:583) at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:454) at java.lang.Thread.run(Thread.java:595) -- View this message in context: http://www.nabble.com/nutch-1.0.war- deploying-error-tp25856641p25856641.html Sent from the Nutch - User mailing list archive at Nabble.com. -- View this message in context: http://www.nabble.com/nutch-1.0.war-deploying-error-tp25856641p25869254.html Sent from the Nutch - User mailing list archive at Nabble.com.
Re: Incremental Whole Web Crawling
Andrzej, Where do I get the nightly builds from? I tried to use the eclipse plugin that supports svn to no avail. Is there a ftp, http server where I can download the nutch source fresh? Thanks, Eric On Oct 11, 2009, at 12:40 PM, Andrzej Bialecki wrote: Eric Osgood wrote: When I set generate.update.db to true and then run generate, it only runs twice and generates 100K for the 1st gen, 62.5K for the second gen and 0 for the 3rd gen on a seed list of 1.6M. I don't understand this, for a topN of 100K it should run 16 times and create 16 distinct lists if I am not mistaken. There was a bug in this code that I fixed recently - please get a new nightly build and try it again. -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com Eric Osgood - Cal Poly - Computer Engineering, Moon Valley Software - eosg...@calpoly.edu, e...@lakemeadonline.com - www.calpoly.edu/~eosgood, www.lakemeadonline.com
Re: Incremental Whole Web Crawling
Ok, I think I am on the right track now, but just to be sure: the code I want is the branch section of svn under nutchbase at http://svn.apache.org/repos/asf/lucene/nutch/branches/nutchbase/ correct? Thanks, Eric On Oct 13, 2009, at 1:38 PM, Andrzej Bialecki wrote: Eric Osgood wrote: Andrzej, Where do I get the nightly builds from? I tried to use the eclipse plugin that supports svn to no avail. Is there a ftp, http server where I can download the nutch source fresh? Personally I prefer to use a command-line svn, even though I do development in Eclipse - I'm probably old-fashioned but I always want to be very clear on what's going on when I do an update. See the instructions here: http://lucene.apache.org/nutch/version_control.html -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com Eric Osgood - Cal Poly - Computer Engineering, Moon Valley Software - eosg...@calpoly.edu, e...@lakemeadonline.com - www.calpoly.edu/~eosgood, www.lakemeadonline.com
Re: Incremental Whole Web Crawling
Eric Osgood wrote: Ok, I think I am on the right track now, but just to be sure: the code I want is the branch section of svn under nutchbase at http://svn.apache.org/repos/asf/lucene/nutch/branches/nutchbase/ correct? No, you need the trunk from here: http://svn.apache.org/repos/asf/lucene/nutch/trunk -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
Re: Incremental Whole Web Crawling
Eric Osgood wrote: So the trunk contains the most recent nightly update? It's the other way around - nightly build is created from a snapshot of the trunk. The trunk is always the most recent. -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com
Re: Incremental Whole Web Crawling
O ok, You learn something new everyday! I didn't know that the trunk was the most recent build. Good to know! So this current trunk does have a fix for the generator bug? On Oct 13, 2009, at 2:05 PM, Andrzej Bialecki wrote: Eric Osgood wrote: So the trunk contains the most recent nightly update? It's the other way around - nightly build is created from a snapshot of the trunk. The trunk is always the most recent. -- Best regards, Andrzej Bialecki ___. ___ ___ ___ _ _ __ [__ || __|__/|__||\/| Information Retrieval, Semantic Web ___|||__|| \| || | Embedded Unix, System Integration http://www.sigram.com Contact: info at sigram dot com Eric Osgood - Cal Poly - Computer Engineering, Moon Valley Software - eosg...@calpoly.edu, e...@lakemeadonline.com - www.calpoly.edu/~eosgood, www.lakemeadonline.com