Robin Haswell wrote:
> Hey there
>
> I'm having issues searching with my newly (vastly) expanded database.
> Could anyone shed any light on this? Basically, on a newly started
> server, I search for "test", and this appears in catalina.out:
>
> 2006-12-20 10:51:40,710 INFO NutchBean - creating new bean
> 2006-12-20 10:51:40,725 INFO NutchBean - opening merged index in
> crawl/index
> 2006-12-20 10:51:40,871 INFO Configuration - found resource
> common-terms.utf8 at
> file:/nutch/apache-tomcat-5.5/webapps/ROOT/WEB-INF/classes/common-terms.utf8
> 2006-12-20 10:51:40,880 INFO NutchBean - opening segments in
> crawl/segments
> 2006-12-20 10:51:40,898 INFO SummarizerFactory - Using the first
> summarizer extension found: Basic Summarizer
> 2006-12-20 10:51:40,901 INFO NutchBean - opening linkdb in crawl/linkdb
> 2006-12-20 10:51:40,907 INFO NutchBean - query request from
> 195.166.60.2
> 2006-12-20 10:51:40,925 INFO NutchBean - query: test
> 2006-12-20 10:51:40,925 INFO NutchBean - lang: en
> 2006-12-20 10:51:40,974 INFO NutchBean - searching for 20 raw hits
> 2006-12-20 10:52:13,306 ERROR [jsp] - Servlet.service() for servlet jsp
> threw exception
> java.lang.OutOfMemoryError: Java heap space
>
This is the problem: you need to increase the heap space available to your Tomcat. Since you expanded your index, the bigger index no longer fits in the same heap space as before. Searches that touch more of the index need more of it loaded into memory, so this problem may not occur for searches that return only a few results.

--
Best regards,
Andrzej Bialecki <><
Information Retrieval, Semantic Web
Embedded Unix, System Integration
http://www.sigram.com Contact: info at sigram dot com

_______________________________________________
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general
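
To illustrate the advice about heap space: the JVM's maximum heap is set with the standard -Xmx option, which Tomcat's catalina.sh picks up from the JAVA_OPTS or CATALINA_OPTS environment variable. The sketch below is only a way to check which limit a JVM actually ended up with; the class name HeapCheck is hypothetical and not part of Nutch or Tomcat.

```java
// Hypothetical helper, not part of Nutch or Tomcat: reports the JVM heap limit.
public class HeapCheck {

    /** Returns the maximum heap the JVM will attempt to use, in megabytes. */
    static long maxHeapMegabytes() {
        // maxMemory() reflects the -Xmx setting (or the JVM default if unset).
        return Runtime.getRuntime().maxMemory() / (1024L * 1024L);
    }

    public static void main(String[] args) {
        System.out.println("Max heap: " + maxHeapMegabytes() + " MB");
    }
}
```

Running it as "java -Xmx1024m HeapCheck" should report a figure close to the requested size. The value 1024m is purely illustrative; the right size depends on the index and the machine, and the thread does not give one.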
