Reformatted excerpts from Ben Gamari's message of 2009-10-07: > I understand that designing software around a contingency like this > might not be the best practice, but the frequency with which I've > needed to rebuild really does make me think that ruby isn't the best > language for the indexer.
The indexer isn't in Ruby, it's written in C++ in the case of Xapian and C in the case of Ferret. > This is easily the fifth time I've needed to rebuild and each time it > has taken over 30 minutes for 1.5 GB of mail. That's substantially > less than 1MB/second for what should be an I/O bound operation. Ouch. I think this isn't the indexer's fault so much as the mbox parsing, which is Ruby. I'm sorry you've had to rebuild the index so many times. The Xapian side of things is very new, and I think you've had a run of bad luck. But I am personally not motivated to improve index time performance, because that's not a common event. At least, it shouldn't be. -- William <wmorgan-...@masanjin.net> _______________________________________________ sup-talk mailing list sup-talk@rubyforge.org http://rubyforge.org/mailman/listinfo/sup-talk