IMHO there are two things:
1. these little marketing and management issues that often have no valid reason but make a big difference:
Programmer / Freelancer : let's use ruby we'll even be able to build a superfast search interface to all your great marketing docs with ferret, rails and ruby
Manager: i think we've got this, it's implemented by something called bluezeneeee
P/F: yes we even might use the indexes of this and perform searches with the old system while we are changing...
M: changing what
P/F: the system to ruby, ferret...
M: WTF?
for these conversations it would be of help to stay in the background as much as possible with changes as possible...
2. Tools around Lucene
I think people will now give marvins patch and luke a try, but luke is not the only thing. Thanks to eric for putting up solr. I think it's a little bit of the old java 90%/10% - thingy. For 90% of webapps all the java, spring, hibernate stuff is damn complex and you'll be faster with ruby. but the 10 or less percent, often the big money stuff of fortune companys, of banks etc. made their management decision to either j2ee or .net. And for these projects the programming teams often need distributed and high volume things, see cnet and solr.
I've heard about solr on this thread for the first time and wonder a little how it does together with nutch / hadoop for the distributed things but will do some googleing on this myself. I think there is definitly need - also in the ruby world - for search engines and crawlers. And nutch has some nifty features about RDig. Discussions about the interchangeability between nutch and ferret are showing that people are interested in using Lucene tools but front end with ruby, rails and ferret. I've for example tried to work with ferret on a nutch index and luckily ferret didn't choke on the index because there were no utf-8 chars in there. So I could extract url, segment, docno but then there came this nfs / hadoop thing to extract content and summaries as well and I gave up.
There also seems to be interest and need in distributed search architectures as the p2p efforts of hyperestraier as well as nfs / hadoop and solr (rsync?) are showing...
Regards
Jan
On 5/17/06, David Balmain <[EMAIL PROTECTED]> wrote:
On 5/17/06, Marvin Humphrey <[EMAIL PROTECTED]> wrote:
> How many users here care about Lucene compatibility, and why?
Great question. Who does care, and why? Performance used to be a very
good reason but that doesn't apply anymore. Is it Java's libraries?
Java does have PDFBox for example. Unfortunately Ruby doesn't yet have
an equivalent but there are ways around this. The only good reason I
can think of is the lack of a Luke port. Anyone care to enlighten us?
Cheers,
Dave
_______________________________________________
Ferret-talk mailing list
[email protected]
http://rubyforge.org/mailman/listinfo/ferret-talk
_______________________________________________ Ferret-talk mailing list [email protected] http://rubyforge.org/mailman/listinfo/ferret-talk

