Hi all,

Well I still get a very slow mergesegs:


>050917 043332 - data in segment index/segments/20050916014401 is corrupt, using only 128115 entries.

This is a common and recurring problem. What's worse is that an unfixed segment like this will destroy the performance of the search, too, not just the backend pre-processing.

I propose to modify MapFile.Reader so that it refuses to open such a file and throws an Exception unless a force=true flag is given. Tools that want to ignore the corruption can still do so, but all other tools will be able to make a conscious decision whether to fix the segment first or to use it as is.
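
To illustrate the idea, here is a minimal sketch of the proposed behaviour. It is not the actual MapFile.Reader code: the names CheckedReader, CorruptIndexException and open(), and the way corruption is signalled through a boolean, are all assumptions made for the example.

```java
import java.io.IOException;

// Hypothetical exception type; the real change might reuse a plain IOException.
class CorruptIndexException extends IOException {
    CorruptIndexException(String message) {
        super(message);
    }
}

// Hypothetical stand-in for a reader that checks its index when opened.
class CheckedReader {
    private final int usableEntries;

    private CheckedReader(int usableEntries) {
        this.usableEntries = usableEntries;
    }

    // indexTruncated stands for whatever the real reader detects while
    // loading the index; readableEntries is how many entries it recovered.
    static CheckedReader open(String segment, int readableEntries,
                              boolean indexTruncated, boolean force)
            throws CorruptIndexException {
        if (indexTruncated && !force) {
            // Default: refuse, so the caller has to decide what to do.
            throw new CorruptIndexException("data in segment " + segment
                    + " is corrupt, only " + readableEntries
                    + " entries are readable");
        }
        // Either the index is intact, or the caller explicitly forced it.
        return new CheckedReader(readableEntries);
    }

    int usableEntries() {
        return usableEntries;
    }
}
```

A tool that wants today's behaviour would pass force=true and carry on with the readable entries; every other tool would get the exception and could run a fix first.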

If there are no objections, I will change it in the trunk/ in a couple of days.

--
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


