Re: [Ferret-talk] issues with index for table with over 18 million records

Erik Morton Wed, 08 Aug 2007 14:16:43 -0700

We had to patch it because we were getting seemingly random errors
while searching a 2GB+ index. This is the Trac ticket:
http://ferret.davebalmain.com/trac/ticket/215. The patch I included
changes some ints to off_t's, which solved the problem. As far as I
know, this patch was never applied to the trunk.
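
To illustrate why swapping ints for off_t's could matter at this size:
the sketch below assumes the affected ints were 32-bit signed values
holding byte offsets into the index files (the ticket would confirm the
details). It is plain Ruby that mimics the wraparound, not code from
Ferret or from the patch, and as_int32 is a hypothetical helper name.

```ruby
# Mimic storing a byte offset in a 32-bit signed C int.
# Assumption: `int` is 32 bits wide, as on most common platforms.
def as_int32(offset)
  low = offset & 0xFFFF_FFFF                        # keep the low 32 bits
  low >= 0x8000_0000 ? low - 0x1_0000_0000 : low    # reinterpret as signed
end

one_gb   = 1024**3
three_gb = 3 * 1024**3

puts as_int32(one_gb)    # fits, so the value is unchanged
puts as_int32(three_gb)  # past 2GB, so it wraps to a negative number
```

Any offset past 2GB comes back negative or otherwise wrong, which would
fit errors that only show up once an index grows beyond that size. A
64-bit off_t can hold these offsets without wrapping.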

We build our index using a modified version of RDig. We basically run
up to 80 EC2 servers in parallel to create 80 separate indexes, which
we later combine into a single index. You could follow a similar
route and still have AAF manage the index after it is built. You'd
need to make sure that the documents created by RDig (or whatever
tool you use) have the same fields that AAF expects.
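
The overall shape of that approach (split the documents, build partial
indexes in parallel, check the fields, merge at the end) can be
sketched as follows. This is a toy in-memory inverted index in plain
Ruby, not Ferret's or RDig's API. Threads stand in for the separate
servers, and EXPECTED_FIELDS and all method names are hypothetical.

```ruby
require "set"

# Hypothetical field list; the real one depends on what AAF expects
# for your model.
EXPECTED_FIELDS = %i[id title content].freeze

# Build one partial index (term => set of document ids) from a slice.
def build_partial_index(docs)
  index = Hash.new { |hash, term| hash[term] = Set.new }
  docs.each do |doc|
    missing = EXPECTED_FIELDS - doc.keys
    unless missing.empty?
      raise ArgumentError, "doc #{doc[:id].inspect} is missing #{missing.inspect}"
    end
    text = "#{doc[:title]} #{doc[:content]}".downcase
    text.scan(/\w+/) { |term| index[term] << doc[:id] }
  end
  index
end

# Combine the partial indexes into a single index.
def merge_indexes(partials)
  merged = Hash.new { |hash, term| hash[term] = Set.new }
  partials.each do |partial|
    partial.each { |term, ids| merged[term].merge(ids) }
  end
  merged
end

# Split the documents into slices, index each slice in its own thread,
# then merge the results.
def build_in_parallel(docs, workers)
  slice_size = [(docs.size.to_f / workers).ceil, 1].max
  threads = docs.each_slice(slice_size).map do |slice|
    Thread.new { build_partial_index(slice) }
  end
  merge_indexes(threads.map(&:value))
end
```

Checking the fields while each partial index is built means a
mismatched document fails early, rather than after the merged index
has been handed over to AAF.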

Erik
On Aug 8, 2007, at 4:53 PM, Craig Jolicoeur wrote:

> Erik Morton wrote:
>> We have a 1 million record index that is about 6GB in size. We build
>> it in parallel w/out AAF so it's hard to comment on the speed of your
>> index build. However I will say that I did need to manually patch
>> Ferret to better handle large indexes.
>>
>
>
> Erik,
>
> What issues did you find that caused you to patch the ferret code?
>
> Also, you say you build the index in parallel w/out AAF; how do you do
> that?  Not sure I'm following how to do that so if you can explain,
> I'd appreciate it.
> -- 
> Posted via http://www.ruby-forum.com/.
> _______________________________________________
> Ferret-talk mailing list
> [email protected]
> http://rubyforge.org/mailman/listinfo/ferret-talk

