Duplicates in Facets

2012-04-04 Thread Jamie Johnson
I am currently indexing some information and am wondering why I am
getting duplicates in facets.  From what I can tell the values are the
same, but is there any case that could cause this that I may not be
thinking of?  Could some non-printable character be making its way into
the index?


Sample output from Luke:

<lst name="fields">
  <lst name="organization_umvs">
    <str name="type">string</str>
    <str name="schema">I--M---OFl</str>
    <str name="dynamicBase">*_umvs</str>
    <str name="index">(unstored field)</str>
    <int name="docs">332</int>
    <int name="distinct">-1</int>
    <lst name="topTerms">
      <int name="ORGANIZATION 1">328</int>
      <int name="ORGANIZATION 2">124</int>
      <int name="ORGANIZATION 2">36</int>
      <int name="ORGANIZATION 2">20</int>
      <int name="ORGANIZATION 3">4</int>
    </lst>
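
One way to test the non-printable or whitespace theory outside of Luke is
to group the terms by a normalized form and print the raw variants with
repr(). The following is only a minimal Python sketch: the helper names
canonical and find_lookalikes are made up for illustration, and the sample
terms (with their trailing space, double space and zero-width space) are
invented, not read from this index.

```python
import unicodedata
from collections import defaultdict


def canonical(term):
    """Drop control/format characters, then collapse runs of whitespace."""
    visible = "".join(
        ch for ch in term
        # Keep whitespace such as tabs (category Cc) so split() can collapse it.
        if ch.isspace() or unicodedata.category(ch) not in ("Cc", "Cf")
    )
    return " ".join(visible.split())


def find_lookalikes(terms):
    """Map each canonical form to its raw variants when there is more than one."""
    groups = defaultdict(set)
    for term in terms:
        groups[canonical(term)].add(term)
    return {key: sorted(raw) for key, raw in groups.items() if len(raw) > 1}


# Hypothetical facet terms; the hidden differences are assumptions.
sample_terms = [
    "ORGANIZATION 1",
    "ORGANIZATION 2",
    "ORGANIZATION 2 ",        # trailing space
    "ORGANIZATION  2",        # double space
    "ORGANIZATION\u200b 3",   # zero-width space, the only variant of this term
]

lookalikes = find_lookalikes(sample_terms)
for key, variants in lookalikes.items():
    # repr() makes the otherwise invisible differences visible.
    print(key, "<-", [repr(v) for v in variants])
```

Only terms whose canonical form is shared by two or more raw values are
reported, so a clean field produces no output.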


Re: Duplicates in Facets

2012-04-04 Thread Darren Govoni
Try using Luke to look at your index and see if there are multiple
similar TFVs. You can browse them easily in Luke.

On Wed, 2012-04-04 at 23:35 -0400, Jamie Johnson wrote:
> I am currently indexing some information and am wondering why I am
> getting duplicates in facets.  From what I can tell the values are the
> same, but is there any case that could cause this that I may not be
> thinking of?  Could some non-printable character be making its way into
> the index?
 
 
Re: Duplicates in Facets

2012-04-04 Thread Jamie Johnson
Yes, thanks for the reply.  It turns out there are whitespace
differences in these fields. Thank you for the quick response!
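
For anyone hitting the same thing, one possible fix is to normalize the
values on the client before they are sent to the index. This is only a
sketch under assumptions: the function name clean_values and the sample
document are hypothetical, and it assumes the values can be cleaned in the
indexing code rather than in the schema.

```python
def clean_values(values):
    """Trim and collapse whitespace, then drop repeats while keeping order."""
    seen = set()
    cleaned = []
    for value in values:
        normalized = " ".join(value.split())
        # Skip values that are empty after trimming or already present.
        if normalized and normalized not in seen:
            seen.add(normalized)
            cleaned.append(normalized)
    return cleaned


# Hypothetical document; only the field name comes from the Luke output.
doc = {
    "id": "doc-1",
    "organization_umvs": ["ORGANIZATION 2", "ORGANIZATION 2 ", " ORGANIZATION 1"],
}
doc["organization_umvs"] = clean_values(doc["organization_umvs"])
print(doc["organization_umvs"])  # ['ORGANIZATION 2', 'ORGANIZATION 1']
```

Documents that are already indexed keep their old terms, so they would
need to be reindexed before the duplicate facet values disappear.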

On Wed, Apr 4, 2012 at 11:45 PM, Darren Govoni <dar...@ontrenet.com> wrote:
> Try using Luke to look at your index and see if there are multiple
> similar TFVs. You can browse them easily in Luke.
