Re: [Nutch-general] Merging

2004-08-27 Thread Doug Cutting
Jason Boss wrote: Playing with the merge command. After you create the index I am attemting to move just the new index over to another server. Upon stopping and starting Tomcat it can't find any kind of data. The only way it seems to see it is if the full segments are there. Is there a way to m

Re: [Nutch-general] Fast Search II

2004-08-27 Thread Doug Cutting
Jason Boss wrote: Thanks for the reply. Say for instance we want to index about 1/2 billion pages, how many computers using the distributed search method would you need? And to get fast decent results of those 1/2 billion pages what is the recommended hardware needed to make that happen? A single

Re: [Nutch-general] Nutch 0.5 - Failure in indexing some sites

2004-08-27 Thread Doug Cutting
wmelo wrote: I have Nutch 0.5 installed in two computers. The firs with a Pentium III and the other one an Athlon 2.6. Both have Fedora 2. In the pentium machine I can index without any problems sites like http://www.nrc.gov, but in the Athlon I receive an error message indicating something

Re: [Nutch-general] Problems - What could it be?

2004-08-27 Thread Doug Cutting
Jason Boss wrote: root cause javax.servlet.ServletException at org.apache.jasper.runtime.PageContextImpl.handlePageException(PageContextImp l.java:536) at org.apache.jsp.search_jsp._jspService(search_jsp.java:460) What version of Nutch are you running? Have you modified search.jsp?

[Nutch-general] Problems - What could it be?

2004-08-27 Thread Jason Boss
Here is our layout. We have a dual processor Xeon board with 2 gigs of ram. You can crawl and index 1.5 million pages and it works great with or without a merged index. If you take the page count up to 2 million, you can't get the index to run on any computer. This is what happens at any page c