Hello,
I'm moving from 0.5 release to the latest nightly release and I'm
getting the following error:
050415 031240 Starting URL processing
050415 031240 Plugins: directory not found: plugins=
Exception in thread main java.lang.ExceptionInInitializerError
at
Dear Chirag,
Thanks for your answer and your time.
Some other questions to your mail.
In that case a central DB server would be okay, and you can move the
segments around using rsync to the search servers. I've found this to be a
more hassle-free solution that using NDFS, but I'm certain given the
1. Souped-up DB server - Dual CPU, 4 GB Ram (min) RAID 5 or 10, 1-2
NICS
This is the 'fetcher' server?
This is you fetch/crawler/indexer -- create the final segments here, then
move them to the search server. That way if a search server goes down,
simply move the segment to another server.
To add from my experiences:
I've preferred Resin (stability performance)
I always go for more ram than more servers. It's cheaper in the long run
when it comes to man hours and service as well as MTBF for your hardware.
Use Squid to proxy/load balance your java servers. This helped alleviate
Andrzej,
Thanks for the feedback, i've pulled in your
recommendation but still not getting it right. I'm a
newb when it comes to trying this out myself :)
Here is my code:
%@ page
contentType=text/xml; charset=UTF-8
pageEncoding=UTF-8
import=javax.servlet.*
import=javax.servlet.http.*
Is there an archive of the mailing list anymore that is searchable? The
old lists on sourceforge are gone and the one on apache's site is just a
flatfile of recent subjects.
I'm interested in looking up the info on Mapreduce and what it does as
well as stuff i missed while i was out :)
Try:
http://www.mail-archive.com/index.php?hunt=nutch
Matthias
--
http://www.eventax.com - eventax GmbH
http://www.umkreisfinder.de - Die Suchmaschine für Lokales und Events
Bryon,
While I agree with your analysis/feedback I think it can be a little
daunting for a first timer (or someone who has not got their hands dirty
with nutch).
Yes, Resin is more stable -- but tomcat works out of the box and Resin
issues can be hard to fix given that not of whole lot of people
Dough All,
What is the status of map reduce? i just got finished reading your paper
and all of the threads and i'm drooling over the notion of such a system :)