[
https://issues.apache.org/jira/browse/LUCENE-1077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Grant Ingersoll updated LUCENE-1077:
------------------------------------
Attachment: LUCENE-1077.patch
Adds to the sinks package:
DateRecognizerSinkTokenizer which only adds dates that can be parsed by a
DateFormat object
TokenRangeSinkTokenizer as described earlier
TokenTypeSinkTokenizer only adds to the sink if the token type is a specific
value.
Adds the payloads package, which contains
NumericPayloadTokenFilter -- Assigns a predefined float-based payload to a
Token if the type matches the specified input type of the Token. As a use
case, this could be used to assign a payload for all tokens that are marked as
"bold" or some other value.
> New Analysis Contributions
> ---------------------------
>
> Key: LUCENE-1077
> URL: https://issues.apache.org/jira/browse/LUCENE-1077
> Project: Lucene - Java
> Issue Type: New Feature
> Components: Analysis, contrib/*
> Reporter: Grant Ingersoll
> Assignee: Grant Ingersoll
> Priority: Minor
> Fix For: 2.3
>
> Attachments: LUCENE-1077.patch, LUCENE-1077.patch
>
>
> With the advent of the new TeeTokenFilter and SinkTokenizer, there now exists
> some interesting new things that can be done in the analysis phase of
> indexing. See LUCENE-1058.
> This patch provides some new implementations of SinkTokenizer that may be
> useful.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]