Hey guys, I have a question about the mechanics of aggregation and sorting w.r.t. the fielddata cache. I know this has been covered in some detail previously, and I'm caught up on the advice to use doc_values where possible, but we have a use case where we do light analysis on a particular set of fields in our document, but also allow sorting on those fields.
While we'll probably modify our schema to solve the issue, I was first wondering whether it is possible to filter the set of documents that ES aggregates / sorts over *before* pulling them into the fielddata cache? We have extremely high cardinality fields, but very selective queries, and it seems very inefficient to pull multiple gigabytes into the fielddata cache to select relatively few matching documents. Thanks, Lindsey -- You received this message because you are subscribed to the Google Groups "elasticsearch" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/e32cf7c3-e2b3-48e9-bc7c-d7f2e0016835%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.
