Got it...the Fedora Core tomcat wasn't liking life. I grabbed Tomcat
6 and away we go.
Next issue :) Need some tools. I had a segment fail a crawl...
[EMAIL PROTECTED] nutch]# bin/nutch invertlinks crawl/linkdb -dir
crawl/segments/20080418185259/
LinkDb: starting
LinkDb: linkdb: crawl/linkdb
LinkDb: URL normalize: true
LinkDb: URL filter: true
LinkDb: adding segment: crawl/segments/20080418185259/parse_data
LinkDb: adding segment: crawl/segments/20080418185259/content
LinkDb: adding segment: crawl/segments/20080418185259/crawl_parse
LinkDb: adding segment: crawl/segments/20080418185259/crawl_generate
LinkDb: adding segment: crawl/segments/20080418185259/crawl_fetch
LinkDb: adding segment: crawl/segments/20080418185259/parse_text
LinkDb: org.apache.hadoop.mapred.InvalidInputException: Input path
doesnt exist : /nutch/crawl/segments/20080418185259/parse_data/parse_data
Input path doesnt exist :
/nutch/crawl/segments/20080418185259/content/parse_data
Input path doesnt exist :
/nutch/crawl/segments/20080418185259/crawl_parse/parse_data
Input path doesnt exist :
/nutch/crawl/segments/20080418185259/crawl_generate/parse_data
Input path doesnt exist :
/nutch/crawl/segments/20080418185259/crawl_fetch/parse_data
Input path doesnt exist :
/nutch/crawl/segments/20080418185259/parse_text/parse_data
at
org.apache.hadoop.mapred.InputFormatBase.validateInput(InputFormatBase.java:138)
at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:326)
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:543)
at org.apache.nutch.crawl.LinkDb.invert(LinkDb.java:232)
at org.apache.nutch.crawl.LinkDb.run(LinkDb.java:377)
at org.apache.hadoop.util.ToolBase.doMain(ToolBase.java:189)
at org.apache.nutch.crawl.LinkDb.main(LinkDb.java:333)
How can I fix what was crawled and get it into the index?
Thanks a ton!!
Jason
On Fri, Apr 18, 2008 at 6:33 PM, <[EMAIL PROTECTED]> wrote:
> Jason,
> It looks like you really have other Tomcat issues, not related to Nutch.
> Can you deploy a simple webapp on your Tomcat instead?
>
> I would also suggest giving Jetty 6.1.9 a try, it's super simple to get it
> going.
>
>
> Otis --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
>
>
>
> ----- Original Message ----
> From: Jason Boss <[EMAIL PROTECTED]>
> To: [email protected]
> Sent: Friday, April 18, 2008 8:36:44 PM
> Subject: Errors with Tomcat
>
> Hey guys,
>
> Trying to get tomcat working and I am getting owned.
>
> Here is what I am seeing.
>
> Restarting tomcat:
>
> [EMAIL PROTECTED] nutch]# /etc/init.d/tomcat5 restart
> Stopping tomcat5: [ OK ]
> Starting tomcat5: /usr/bin/rebuild-jar-repository: error: Could not
> find jta Java extension for this JVM
> /usr/bin/rebuild-jar-repository: error: Some detected jars were not
> found for this jvm
> [ OK ]
> [EMAIL PROTECTED] nutch]#
>
>
> When I try to get to the search page I get either of these 2 errors:
>
> HTTP Status 500 -
>
> type Exception report
>
> message
>
> description The server encountered an internal error () that prevented
> it from fulfilling this request.
>
> exception
>
> org.apache.jasper.JasperException: Unable to load class for JSP
>
> org.apache.jasper.JspCompilationContext.load(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.servlet.JspServletWrapper.getServlet(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.servlet.JspServletWrapper.service(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.servlet.JspServlet.serviceJspFile(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.servlet.JspServlet.service(jasper5-compiler-5.5.23.jar.so)
>
> javax.servlet.http.HttpServlet.service(tomcat5-servlet-2.4-api-5.5.23.jar.so)
>
> root cause
>
> java.lang.ClassNotFoundException: org.apache.jsp.index_jsp not found
> in
> org.apache.jasper.servlet.JasperLoader{urls=[file:/usr/share/tomcat5/work/Catalina/localhost/_/],
> parent=WebappClassLoader
> delegate: false
> repositories:
> /WEB-INF/classes/
> ----------> Parent Classloader:
>
> org.apache.catalina.loader.StandardClassLoader{urls=[file:/var/lib/tomcat5/shared/classes/],
>
> parent=org.apache.catalina.loader.StandardClassLoader{urls=[file:/var/lib/tomcat5/common/classes/,file:/var/lib/tomcat5/common/i18n/tomcat-i18n-en.jar,file:/var/lib/tomcat5/common/i18n/tomcat-i18n-fr.jar,file:/var/lib/tomcat5/common/i18n/tomcat-i18n-ja.jar,file:/var/lib/tomcat5/common/i18n/tomcat-i18n-es.jar,file:/usr/share/java/libgcj-4.1.1.jar,file:/usr/share/java/jakarta-commons-el-1.0.jar,file:/usr/share/java/jakarta-commons-collections-3.1.jar,file:/usr/share/java/jakarta-commons-logging-api-1.0.4.jar,file:/usr/share/java/tomcat5/naming-resources-5.5.23.jar,file:/usr/share/java/tomcat5/naming-factory-5.5.23.jar,file:/usr/share/java/jasper5-runtime-5.5.23.jar,file:/usr/share/java/classpathx-mail-1.3.1-monolithic-1.1.1.jar,file:/usr/share/java/jakarta-commons-dbcp-1.2.1.jar,file:/usr/share/java/ant-1.6.5.jar,file:/usr/share/java/jakarta-commons-pool-1.3.jar,file:/usr/share/java/mx4j/mx4j-3.0.1.jar,file:/usr/share/java/tomcat5-jsp-2.0-api-5.5.23.jar,f
>
> ile:/usr/share/java/tomcat5-servlet-2.4-api-5.5.23.jar,file:/usr/share/java/jasper5-compiler-5.5.23.jar,file:/usr/share/java/classpathx-jaf-1.0.jar,file:/usr/share/java/libgcj-4.1.1.jar],
>
> parent=gnu.gcj.runtime.SystemClassLoader{urls=[file:/usr/lib/jvm/java/lib/tools.jar,file:/usr/share/tomcat5/bin/bootstrap.jar,file:/usr/share/tomcat5/bin/commons-logging-api.jar,file:/usr/share/java/mx4j/mx4j-impl.jar,file:/usr/share/java/mx4j/mx4j-jmx.jar],
> parent=gnu.gcj.runtime.ExtensionClassLoader{urls=[], parent=null}}}}
> }
> java.net.URLClassLoader.findClass(libgcj.so.7rh)
>
> org.apache.jasper.servlet.JasperLoader.loadClass(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.servlet.JasperLoader.loadClass(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.JspCompilationContext.load(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.servlet.JspServletWrapper.getServlet(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.servlet.JspServletWrapper.service(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.servlet.JspServlet.serviceJspFile(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.servlet.JspServlet.service(jasper5-compiler-5.5.23.jar.so)
>
> javax.servlet.http.HttpServlet.service(tomcat5-servlet-2.4-api-5.5.23.jar.so)
>
> note The full stack trace of the root cause is available in the Apache
> Tomcat/5.5.20 logs.
> Apache Tomcat/5.5.20
>
>
> Or this one:
>
> HTTP Status 500 -
>
> type Exception report
>
> message
>
> description The server encountered an internal error () that prevented
> it from fulfilling this request.
>
> exception
>
> javax.servlet.ServletException
>
> org.apache.jasper.servlet.JspServlet.service(jasper5-compiler-5.5.23.jar.so)
>
> javax.servlet.http.HttpServlet.service(tomcat5-servlet-2.4-api-5.5.23.jar.so)
>
> root cause
>
> java.lang.ExceptionInInitializerError
> java.lang.Class.initializeClass(libgcj.so.7rh)
> java.lang.Class.forName(libgcj.so.7rh)
> java.lang.Class.forName(libgcj.so.7rh)
>
> org.apache.tools.ant.taskdefs.compilers.CompilerAdapterFactory.doesModernCompilerExist(ant-1.6.5.jar.so)
>
> org.apache.tools.ant.taskdefs.compilers.CompilerAdapterFactory.getCompiler(ant-1.6.5.jar.so)
> org.apache.tools.ant.taskdefs.Javac.compile(ant-1.6.5.jar.so)
> org.apache.tools.ant.taskdefs.Javac.execute(ant-1.6.5.jar.so)
>
> org.apache.jasper.compiler.AntCompiler.generateClass(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.compiler.Compiler.compile(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.compiler.Compiler.compile(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.compiler.Compiler.compile(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.JspCompilationContext.compile(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.servlet.JspServletWrapper.service(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.servlet.JspServlet.serviceJspFile(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.servlet.JspServlet.service(jasper5-compiler-5.5.23.jar.so)
>
> javax.servlet.http.HttpServlet.service(tomcat5-servlet-2.4-api-5.5.23.jar.so)
>
> root cause
>
> java.lang.NullPointerException
> com.sun.tools.javac.Main.<clinit>(Main.java:95)
> java.lang.Class.initializeClass(libgcj.so.7rh)
> java.lang.Class.forName(libgcj.so.7rh)
> java.lang.Class.forName(libgcj.so.7rh)
>
> org.apache.tools.ant.taskdefs.compilers.CompilerAdapterFactory.doesModernCompilerExist(ant-1.6.5.jar.so)
>
> org.apache.tools.ant.taskdefs.compilers.CompilerAdapterFactory.getCompiler(ant-1.6.5.jar.so)
> org.apache.tools.ant.taskdefs.Javac.compile(ant-1.6.5.jar.so)
> org.apache.tools.ant.taskdefs.Javac.execute(ant-1.6.5.jar.so)
>
> org.apache.jasper.compiler.AntCompiler.generateClass(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.compiler.Compiler.compile(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.compiler.Compiler.compile(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.compiler.Compiler.compile(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.JspCompilationContext.compile(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.servlet.JspServletWrapper.service(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.servlet.JspServlet.serviceJspFile(jasper5-compiler-5.5.23.jar.so)
>
> org.apache.jasper.servlet.JspServlet.service(jasper5-compiler-5.5.23.jar.so)
>
> javax.servlet.http.HttpServlet.service(tomcat5-servlet-2.4-api-5.5.23.jar.so)
>
> note The full stack trace of the root cause is available in the Apache
> Tomcat/5.5.20 logs.
> Apache Tomcat/5.5.20
>
> This is using Nutch .9. What am I doing wrong?
>
> Thanks!!
>
> Jason
>
>