I second this use-case.  It is my only concern with Solr faceting: using Solr's
UnInvertedField on the search index to discover frequently used words doesn't
scale well.  Shai, do you think this would scale?  FWIW, one of my indexes with
only 300k docs has ~3.1M terms. That's not a lot, but it's a number to
consider.

~ David

From: "Adrien Grand [via Lucene]" <ml-node+s472066n4026847...@n3.nabble.com>
Hi Shai,

Thanks for your answers!
…

> So I think that if anyone would want to really manage taxonomies of that
> size, we'd need to discuss and maybe get back to the drawing board :).

One use-case I'm thinking of is finding the top terms of documents
that match an arbitrary query. This can be very useful to help you
better understand your data, but in this case the number of distinct
values is the size of your term dictionary.
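
To make the scaling concern concrete, here is a minimal sketch of that
use-case in plain Python rather than Lucene or Solr. The names `top_terms`,
`docs`, and `matches` are hypothetical and exist only for illustration; a real
index would not hold term lists in memory like this.

```python
from collections import Counter
import heapq

def top_terms(docs, matches, k=3):
    """Return the k most frequent terms among documents accepted by `matches`.

    docs    -- iterable of term lists (a stand-in for indexed documents)
    matches -- predicate playing the role of an arbitrary query
    """
    counts = Counter()
    for terms in docs:
        if matches(terms):
            # Count each term once per matching document (document frequency).
            counts.update(set(terms))
    # `counts` can hold one entry per distinct term seen in the matching
    # documents, so in the worst case it is as large as the term dictionary.
    # Ties on count are broken by term, so the result is deterministic.
    return heapq.nlargest(k, counts.items(), key=lambda kv: (kv[1], kv[0]))

# Toy data, assumed for the example only.
docs = [
    ["lucene", "facet", "taxonomy"],
    ["lucene", "solr", "facet"],
    ["solr", "uninverted", "field"],
]
result = top_terms(docs, lambda terms: "lucene" in terms, k=2)
```

The counter is the part that grows with the number of distinct values, which
is the cost being discussed here.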

--
Adrien
