On Mon, Aug 1, 2011 at 11:16 AM, Mark <static.void....@gmail.com> wrote:
> We have around 10 million documents that are in our index and about 10% of
> them have some extra "statistics" that are calculated on a daily basis which
> are then indexed and used in our function queries. This reindexing comes at
> the expense of doing multiple joins in DIH so I am thinking it may be faster
> to precompute these values and use external files rather than have to
> re-index 10% of our corpus daily. How many external file fields could one
> use before it becomes "too many"? Is this a valid use case or am I trying to
> fit a square into a circular hole?

Each external file field will take up maxDoc*4 bytes of RAM.
The other consideration is the time to load them, a cost that recurs
each time the index changes.
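
As a rough sketch of the maxDoc*4 estimate, the arithmetic can be written
out as below. The function name and the field counts are hypothetical, and
maxDoc is assumed to be about 10 million here; the real maxDoc also counts
deleted documents that have not been merged away, so it may be higher.

```python
# Back-of-the-envelope RAM estimate for external file fields.
# Assumption: 4 bytes per document per field, per the maxDoc*4 figure above.
BYTES_PER_DOC = 4


def external_field_ram_bytes(max_doc: int, num_fields: int = 1) -> int:
    """Return the estimated RAM in bytes for num_fields external file fields."""
    return max_doc * BYTES_PER_DOC * num_fields


if __name__ == "__main__":
    max_doc = 10_000_000  # assumed; check the actual maxDoc of the index
    for n in (1, 5, 20):  # illustrative field counts only
        mb = external_field_ram_bytes(max_doc, n) / 1_000_000
        print(f"{n} field(s): {mb:.0f} MB")
```

At that size each field costs about 40 MB, so the practical limit on the
number of fields is mostly a question of available heap and load time.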

-Yonik
http://www.lucidimagination.com
