I second this use-case. This is my only concern with Solr faceting: using Solr's UnInvertedField on the search index to discover frequently used words doesn't scale well. Shai, do you think this would scale? FWIW, one of my indexes with only 300k docs has ~3.1M terms; not a lot, but it's a number to consider.

~ David

From: "Adrien Grand [via Lucene]" <ml-node+s472066n4026847...@n3.nabble.com>

Hi Shai,

Thanks for your answers!

…

> So I think that if anyone would want to really manage taxonomies of that
> size, we'd need to discuss and maybe get back to the drawing board :).

One use-case I'm thinking of is finding the top terms of documents that match an arbitrary query. This can be very useful to help you better understand your data, but in this case the number of distinct values is the size of your term dictionary.

--
Adrien
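
To make the "top terms of documents that match an arbitrary query" use-case concrete, here is a minimal in-memory sketch. It is not how Solr or the Lucene facet module implements it; the function name `top_terms`, the `matches` predicate, and the whitespace tokenization are all assumptions made for illustration.

```python
from collections import Counter

def top_terms(docs, matches, k=10):
    """Return the k terms with the highest document frequency among
    the docs accepted by `matches` (a hypothetical query predicate)."""
    counts = Counter()
    for doc in docs:
        if matches(doc):
            # Naive whitespace tokenization (an assumption); count each
            # term once per document.
            counts.update(set(doc.lower().split()))
    return counts.most_common(k)

# Toy data, invented for this example.
docs = [
    "solr faceting on large index",
    "lucene faceting taxonomy index",
    "solr term dictionary size",
]
result = top_terms(docs, lambda d: "faceting" in d, k=2)
```

The counter holds one entry per distinct term seen in the matching documents, so in the worst case it grows to the size of the whole term dictionary, which is the scaling concern raised in this thread.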