Hi,
I have a field in my index called related_ids, indexed and stored, with the
following field type:
!--
A text field that tokenizes on whitespace, removing non-word
characters at the
start and end of each token, but preserving meaningful punctuation
*within*
I'm really reaching here, but lucene only indexes the first 10,000 terms by
default (you can up the limit). Is there a chancethat you're hitting that
limit? That 1cuk is past the 10,000th term
in record 2.40?
For this to be possible, I have to assume that the FieldAnalysis
tool ignores this
That's probably it! It is quite near the end of the field. I'll try upping
it and re-indexing.
Thanks :-)
Erick Erickson wrote:
I'm really reaching here, but lucene only indexes the first 10,000 terms
by
default (you can up the limit). Is there a chancethat you're hitting that
limit?