I'm doing some overall testing on my cluster, debating whether I should switch to Doc Values. I have about 15 fields per document, with 83 million documents spread across 60 indices. All the fields are dynamically mapped, and all of them can migrate to Doc Values. So I have one copy of the data using the fielddata cache (FDC) and a second copy using Doc Values (DV). Overall, the DV copy consumes 3x the disk space and shows a 98% decrease in FDC size.
My question is, what is that leftover 2%? If everything is on disk, why is it reporting memory usage in the FDC? Some indices report 0 bytes, but others report anywhere between 34 KB and 700 KB. What am I missing here? Are things still loaded into the FDC anyway? Maybe I missed a field type in the dynamic templates? Also, which field types are recommended to move to Doc Values? High-cardinality non-analyzed string fields? High-cardinality fields in general? Everything? Overall performance seems similar, but this is just one of quite a few data sets that would be queried at any given time, and hopefully I'll have fewer memory issues (GC/eviction).
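One way to narrow down where the remaining FDC usage comes from is a per-field breakdown. The sketch below assumes the JSON comes from the index stats API with per-field fielddata enabled (I believe `GET /_stats/fielddata?fields=*` returns this shape, but check it against your version). The function name `nonzero_fielddata` and the index, field names, and byte counts in `sample` are made up for illustration; they are not results from my cluster.

```python
from typing import Dict


def nonzero_fielddata(stats: dict) -> Dict[str, Dict[str, int]]:
    """Return, per index, the fields that still hold fielddata in memory.

    `stats` is assumed to be the parsed JSON body of an index stats
    response that includes a per-field fielddata breakdown.
    """
    result: Dict[str, Dict[str, int]] = {}
    for index_name, index_stats in stats.get("indices", {}).items():
        fields = (
            index_stats.get("total", {})
            .get("fielddata", {})
            .get("fields", {})
        )
        # Keep only fields whose reported in-memory size is above zero.
        used = {
            name: info.get("memory_size_in_bytes", 0)
            for name, info in fields.items()
            if info.get("memory_size_in_bytes", 0) > 0
        }
        if used:
            result[index_name] = used
    return result


# Hypothetical response fragment, for illustration only.
sample = {
    "indices": {
        "index-a": {
            "total": {
                "fielddata": {
                    "memory_size_in_bytes": 34816,
                    "fields": {
                        "field_a": {"memory_size_in_bytes": 34816},
                        "field_b": {"memory_size_in_bytes": 0},
                    },
                }
            }
        },
        "index-b": {
            "total": {
                "fielddata": {"memory_size_in_bytes": 0, "fields": {}}
            }
        },
    }
}

print(nonzero_fielddata(sample))
```

Any field that shows up here on the DV copy would be a candidate for one I missed in the dynamic templates, or for something that is loaded into the FDC regardless of the mapping.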