Hi,
I'm running Nutch-0.7.2. I created an Index for my local Lan which
consists of 45.000 Pages.
I can inspect this Index with Luke an everything looks fine. When I try
to start a search Query with Nutch
I can see the following Exception in my JBOSS Logfile (at the End of the
Log).
//Here I'm redploying the Nutch.war Archive....
2007-01-26 15:55:06,611 INFO [org.jboss.web.tomcat.tc5.TomcatDeployer]
deploy, ctxPath=/nutch,
warUrl=file:/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/
2007-01-26 15:55:06,831 DEBUG [tomcat.localhost./nutch.Context] Starting
tomcat.localhost./nutch.Context
2007-01-26 15:55:06,832 DEBUG [tomcat.localhost./nutch.Context]
Configuring default Resources
2007-01-26 15:55:06,836 DEBUG [tomcat.localhost./nutch.Context]
Processing standard container startup
2007-01-26 15:55:06,844 DEBUG [tomcat.localhost./nutch.Context] Setting
deployment descriptor public ID to '-//Sun Microsystems, Inc.//DTD Web
Application 2.3//EN'
2007-01-26 15:55:06,862 DEBUG [tomcat.localhost./nutch.Context] Setting
deployment descriptor public ID to '-//Sun Microsystems, Inc.//DTD Web
Application 2.3//EN'
2007-01-26 15:55:06,866 DEBUG [tomcat.localhost./nutch.Context] Posting
standard context attributes
2007-01-26 15:55:06,866 DEBUG [tomcat.localhost./nutch.Context]
Configuring application event listeners
2007-01-26 15:55:06,866 DEBUG [tomcat.localhost./nutch.Context] Sending
application start events
2007-01-26 15:55:06,866 DEBUG [tomcat.localhost./nutch.Context] Starting
filters
2007-01-26 15:55:06,866 DEBUG [tomcat.localhost./nutch.Context]
Starting filter 'CommonHeadersFilter'
2007-01-26 15:55:06,867 DEBUG [tomcat.localhost./nutch.Context] Starting
completed //Archive successfully loaded...?!?!
2007-01-26 15:55:06,867 DEBUG [tomcat.localhost./nutch.Context] Checking
for
jboss.web:j2eeType=WebModule,name=//localhost/nutch,J2EEApplication=none,J2EEServer=none
//Here I startet a query in my Webbrowser...
2007-01-26 15:55:53,585 INFO [STDOUT] 070126 155553 parsing
file:/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/nutch-default.xml
2007-01-26 15:55:53,591 INFO [STDOUT] 070126 155553 parsing
file:/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/nutch-site.xml
2007-01-26 15:55:53,599 INFO [STDOUT] 070126 155553 Plugins: looking
in:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins
2007-01-26 15:55:53,600 INFO [STDOUT] 070126 155553 not including:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/clustering-carrot2
2007-01-26 15:55:53,600 INFO [STDOUT] 070126 155553 not including:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/creativecommons
2007-01-26 15:55:53,600 INFO [STDOUT] 070126 155553 parsing:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/index-basic/plugin.xml
2007-01-26 15:55:53,607 INFO [STDOUT] 070126 155553 impl:
point=org.apache.nutch.indexer.IndexingFilter
class=org.apache.nutch.indexer.basic.BasicIndexingFilter
2007-01-26 15:55:53,609 INFO [STDOUT] 070126 155553 not including:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/index-more
2007-01-26 15:55:53,609 INFO [STDOUT] 070126 155553 not including:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/language-identifier
2007-01-26 15:55:53,609 INFO [STDOUT] 070126 155553 parsing:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/nutch-extensionpoints/plugin.xml
2007-01-26 15:55:53,612 INFO [STDOUT] 070126 155553 not including:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/ontology
2007-01-26 15:55:53,612 INFO [STDOUT] 070126 155553 not including:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/parse-ext
2007-01-26 15:55:53,613 INFO [STDOUT] 070126 155553 parsing:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/parse-html/plugin.xml
2007-01-26 15:55:53,614 INFO [STDOUT] 070126 155553 impl:
point=org.apache.nutch.parse.Parser
class=org.apache.nutch.parse.html.HtmlParser
2007-01-26 15:55:53,615 INFO [STDOUT] 070126 155553 not including:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/parse-js
2007-01-26 15:55:53,615 INFO [STDOUT] 070126 155553 not including:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/parse-msword
2007-01-26 15:55:53,615 INFO [STDOUT] 070126 155553 not including:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/parse-pdf
2007-01-26 15:55:53,615 INFO [STDOUT] 070126 155553 not including:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/parse-rss
2007-01-26 15:55:53,615 INFO [STDOUT] 070126 155553 parsing:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/parse-text/plugin.xml
2007-01-26 15:55:53,617 INFO [STDOUT] 070126 155553 impl:
point=org.apache.nutch.parse.Parser
class=org.apache.nutch.parse.text.TextParser
2007-01-26 15:55:53,617 INFO [STDOUT] 070126 155553 not including:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/protocol-file
2007-01-26 15:55:53,618 INFO [STDOUT] 070126 155553 not including:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/protocol-ftp
2007-01-26 15:55:53,618 INFO [STDOUT] 070126 155553 parsing:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/protocol-http/plugin.xml
2007-01-26 15:55:53,619 INFO [STDOUT] 070126 155553 impl:
point=org.apache.nutch.protocol.Protocol
class=org.apache.nutch.protocol.http.Http
2007-01-26 15:55:53,620 INFO [STDOUT] 070126 155553 not including:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/protocol-httpclient
2007-01-26 15:55:53,620 INFO [STDOUT] 070126 155553 parsing:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/query-basic/plugin.xml
2007-01-26 15:55:53,622 INFO [STDOUT] 070126 155553 impl:
point=org.apache.nutch.searcher.QueryFilter
class=org.apache.nutch.searcher.basic.BasicQueryFilter
2007-01-26 15:55:53,622 INFO [STDOUT] 070126 155553 not including:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/query-more
2007-01-26 15:55:53,622 INFO [STDOUT] 070126 155553 parsing:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/query-site/plugin.xml
2007-01-26 15:55:53,624 INFO [STDOUT] 070126 155553 impl:
point=org.apache.nutch.searcher.QueryFilter
class=org.apache.nutch.searcher.site.SiteQueryFilter
2007-01-26 15:55:53,624 INFO [STDOUT] 070126 155553 parsing:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/query-url/plugin.xml
2007-01-26 15:55:53,626 INFO [STDOUT] 070126 155553 impl:
point=org.apache.nutch.searcher.QueryFilter
class=org.apache.nutch.searcher.url.URLQueryFilter
2007-01-26 15:55:53,626 INFO [STDOUT] 070126 155553 not including:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/urlfilter-prefix
2007-01-26 15:55:53,626 INFO [STDOUT] 070126 155553 parsing:
/srv/opt/jboss-3.2.6/server/ecs_cs/tmp/deploy/tmp31541nutch.war/WEB-INF/classes/plugins/urlfilter-regex/plugin.xml
2007-01-26 15:55:53,628 INFO [STDOUT] 070126 155553 impl:
point=org.apache.nutch.net.URLFilter
class=org.apache.nutch.net.RegexURLFilter
2007-01-26 15:55:53,639 INFO [STDOUT] 070126 155553 10 creating new bean
2007-01-26 15:55:53,640 INFO [STDOUT] 070126 155553 10 opening segment
indexes in /srv/opt/nutch-0.7.2/crawl.db/segments
2007-01-26 15:55:53,652 ERROR [org.jboss.web.localhost.Engine]
StandardWrapperValve[jsp]: Servlet.service() for servlet jsp threw exception
java.lang.ArrayIndexOutOfBoundsException
In my Browser i got the following Error ...
HTTP Status 500 -
------------------------------------------------------------------------
*type* Exception report
*message*
*description* _The server encountered an internal error () that
prevented it from fulfilling this request._
*exception*
org.apache.jasper.JasperException
org.apache.jasper.servlet.JspServletWrapper.service(JspServletWrapper.java:372)
org.apache.jasper.servlet.JspServlet.serviceJspFile(JspServlet.java:292)
org.apache.jasper.servlet.JspServlet.service(JspServlet.java:236)
javax.servlet.http.HttpServlet.service(HttpServlet.java:810)
org.jboss.web.tomcat.filters.ReplyHeaderFilter.doFilter(ReplyHeaderFilter.java:75)
*root cause*
java.lang.ArrayIndexOutOfBoundsException
*note* _The full stack trace of the root cause is available in the
Apache Tomcat/5.0.28 logs._
------------------------------------------------------------------------
Apache Tomcat/5.0.28
I also tested this Search on a newly created Index ( a small one ) but
got the same error. I Also tried to run Nutch-0.8.1 but still the same.
Also I couldn't find any information about this error and now I don't
know what to do. Maybe you have got a idea...
Thanks in advance...
Yours sincerely,
Erik H.
-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general