It does not seem to be memory footprint also ? looks too high for my index.

./zahoor


On 20-May-2013, at 10:55 PM, Jason Hellman <jhell...@innoventsolutions.com> 
wrote:

> Most definitely not the number of unique elements in each segment.  My 32 
> document sample index (built from the default example docs data) has the 
> following:
> 
> entry#0:
> 'StandardDirectoryReader(​segments_b:29 _8(​4.2.1):C32)'=>'manu_exact',class 
> org.apache.lucene.index.SortedDocValues,0.5=>org.apache.lucene.search.FieldCacheImpl$SortedDocValuesImpl#1778857102
> 
> There is no chance for there to be 1.8 billion unique elements in that index.
> 
> On May 20, 2013, at 1:20 PM, Erick Erickson <erickerick...@gmail.com> wrote:
> 
>> Not sure, never had to worry about what they are......
>> 
>> On Mon, May 20, 2013 at 12:28 PM, J Mohamed Zahoor <zah...@indix.com> wrote:
>>> 
>>> What is the number at the end?
>>> is it the no of unique elements in each segment?
>>> 
>>> ./zahoor
>>> 
>>> 
>>> On 20-May-2013, at 7:37 PM, Erick Erickson <erickerick...@gmail.com> wrote:
>>> 
>>>> Because the same field is split amongst a number of segments. If you
>>>> look in the index directory, you should see files like _3fgm.* and
>>>> _3ffm.*. Each such group represents one segment. The number of
>>>> segments changes with merging etc.
>>>> 
>>>> Best
>>>> Erick
>>>> 
>>>> On Mon, May 20, 2013 at 6:43 AM, J Mohamed Zahoor <zah...@indix.com> wrote:
>>>>> Hi
>>>>> 
>>>>> Why is that lucene field cache has multiple entries for the same field 
>>>>> S_24.
>>>>> It is a dynamic field.
>>>>> 
>>>>> 
>>>>> 'SegmentCoreReader(owner=_3fgm(4.2.1):C7681)'=>'S_24',double,org.apache.lucene.search.FieldCache.NUMERIC_UTILS_DOUBLE_PARSER=>org.apache.lucene.search.FieldCacheImpl$DoublesFromArray#1174240382
>>>>> 
>>>>> 'SegmentCoreReader(owner=_3ffm(4.2.1):C1596758)'=>'S_24',double,org.apache.lucene.search.FieldCache.NUMERIC_UTILS_DOUBLE_PARSER=>org.apache.lucene.search.FieldCacheImpl$DoublesFromArray#83384344
>>>>> 
>>>>> 'SegmentCoreReader(owner=_3fgh(4.2.1):C2301)'=>'S_24',double,org.apache.lucene.search.FieldCache.NUMERIC_UTILS_DOUBLE_PARSER=>org.apache.lucene.search.FieldCacheImpl$DoublesFromArray#1281331764
>>>>> 
>>>>> 
>>>>> Also, the number at the end.. does it specified the no of entries in that 
>>>>> cache bucket?
>>>>> 
>>>>> ./zahoor
>>> 
> 

Reply via email to