Alan Woodward created SOLR-13233:
------------------------------------
Summary: SpellCheckCollator ignores stacked tokens
Key: SOLR-13233
URL: https://issues.apache.org/jira/browse/SOLR-13233
Project: Solr
Issue Type: Bug
Security Level: Public (Default Security Level. Issues are Public)
Reporter: Alan Woodward
When building collations, SpellCheckCollator ignores any tokens with a position
increment of 0, assuming that they've been injected and may therefore have
incorrect offsets (injected terms generally keep the offsets of the terms
they're replacing, as they don't themselves appear anywhere in the original
source). However, this assumption is not necessarily correct - for example,
WordDelimiterGraphFilter emits stacked tokens *before* the original token,
because it needs to iterate through all stacked tokens to correctly set the
original token's position length.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]