The Javadoc says "positionLength determines how many positions this token
spans". It's not obvious from the documentation how the full graph structure
for nested multi-word synonyms can be expressed merely using that attribute.
Is this detailed anywhere? (Maybe in Jira... but it is still down.) I mean,
a multi-word synonym is multiple tokens. How does any of the "tokens" span
more than one position?
-- Jack Krupansky
-----Original Message-----
From: Robert Muir
Sent: Friday, August 10, 2012 1:44 PM
To: dev@lucene.apache.org
Subject: Re: Proposal: Full support for multi-word synonyms at query time
On Fri, Aug 10, 2012 at 1:36 PM, Jack Krupansky <j...@basetechnology.com>
wrote:
One of the ongoing potholes of Solr and Lucene is lack of full support for
multi-word synonyms at query time. The root of the problem is twofold:
individual terms are presented for analysis which precludes recognition of
multi-term synonyms, and the output stream from the analyis process is a
single, linear stream without regard to any graph/lattice structure for
multiple synonyms.
But this is not true. PositionLengthAttribute was already added, which
makes it a graph.
--
lucidimagination.com
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org