I'm doing some overall testing on my cluster, debating if I should switch 
to Doc Values.  I have about 15 fields for each document, with 83 million 
documents spread across 60 indices.  All the fields are dynamically mapped, 
and all of them can migrate to Doc Values.  So, I have one copy of the data 
using FDC and a second copy using DV.  Overall it's a 3x increase in 
consumed disk space, and a 98% decrease in FDC size when using DV. 

My question is, what is that last left over 2%?  If everything is on disk, 
why is it reporting memory usage in the FDC?  Some indices report 0 bytes, 
but others report anywhere between 34 KB - 700 KB?  What am I missing here? 
 Are things still loaded into the FDC anyway?  Maybe I missed a field type 
in the dynamic templates?

Also, what field types are recommended to move to Doc Values? 
 High-cardinality non-analyzed string fields?  High-cardinality in general? 
 Everything?

Overall performance seems similar, but this is just one of quite a few 
data-sets that would be interacted with at any given time and hopefully I 
have less memory issues (GC/eviction).

-- 
Please update your bookmarks! We have moved to https://discuss.elastic.co/
--- 
You received this message because you are subscribed to the Google Groups 
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to elasticsearch+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/elasticsearch/2da6e886-5619-4313-a2a3-dcde99c986a2%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to