-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi,

On 7 October 2006 at 17:40, Cristina Belderrain wrote:

> Let me remind you that all this must be done just to provide something
> that's already there: Nutch is built on top of Lucene, after all. If
> it's hard to understand why Lucene's capabilities were simply
> neutralized in Nutch, it's even harder to figure out why no choice was
> left to users by means of some configuration file.

I think this issue is rooted in the underlying philosophy of Nutch:
Nutch was designed with a possible Google-sized (or similar) crawler
and indexer in mind. Regular expressions and wildcard queries do not
seem to fit this philosophy, as such queries would be far less
efficient on a huge data set than simple boolean queries.
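
To illustrate the efficiency argument, here is a minimal toy sketch in
Python. It is not Nutch's or Lucene's actual code; build_index,
boolean_and and wildcard are hypothetical names made up for this
example. A boolean AND query only looks up the terms it names, while a
naive wildcard query has to test its pattern against every term in the
dictionary:

```python
from fnmatch import fnmatchcase


def build_index(docs):
    """Map each lowercase term to the set of document ids containing it."""
    index = {}
    for doc_id, text in docs.items():
        for term in text.lower().split():
            index.setdefault(term, set()).add(doc_id)
    return index


def boolean_and(index, terms):
    """Intersect the posting sets of the named terms.

    Returns (matching doc ids, number of dictionary terms examined).
    """
    postings = [index.get(term, set()) for term in terms]
    result = set.intersection(*postings) if postings else set()
    return result, len(terms)


def wildcard(index, pattern):
    """Naive wildcard query: test the pattern against every indexed term.

    Returns (matching doc ids, number of dictionary terms examined).
    """
    result = set()
    examined = 0
    for term, doc_ids in index.items():
        examined += 1
        if fnmatchcase(term, pattern):
            result |= doc_ids
    return result, examined


# Tiny made-up corpus, purely for illustration.
docs = {
    1: "nutch is built on top of lucene",
    2: "lucene supports wildcard queries",
    3: "nutch crawls the web",
}
index = build_index(docs)

print(boolean_and(index, ["nutch", "lucene"]))
print(wildcard(index, "*ene"))
```

In this toy model the boolean query examines two dictionary entries,
whereas the wildcard query examines all of them. Real engines can often
narrow the scan when the pattern starts with a literal prefix, but the
cost still grows with the size of the term dictionary, which is the
concern on a web-scale index.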

Nevertheless, I agree that there should be an option to choose the
Lucene query engine instead of the Nutch-flavoured one, because Nutch
has proven just as suitable for areas that do not require such
efficient queries (intranet crawling, for instance) as for an all-out
web indexing application.

- --
Best regards,
Björn Wilmsmann


-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.1 (Darwin)

iD8DBQFFJ+75gz0R1bg11MERAgT7AJ4mPRF8Z0BR2yLCm5Pxsz4VvtTI6QCfcS8b
q8gM8LQapjAloNIRwNV+osE=
=v7Lf
-----END PGP SIGNATURE-----

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
