[
https://issues.apache.org/jira/browse/SOLR-1057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12988706#comment-12988706
]
Koji Sekiguchi edited comment on SOLR-1057 at 2/1/11 12:54 AM:
---------------------------------------------------------------
I think this can be used for SOLR-64. I'll take it.
TODO:
- move PathTokenizer to
modules/analysis/common/src/java/org/apache/lucene/analysis/path/ (4.0) or
lucene/src/java/org/apache/lucene/analysis/ (3.1)
- make test cases
- respect the original path delimiter (seems current patch outputs backslash
even if the input uses slash)
- accept an arbitrary delimiter and replacement
- add offset correction
was (Author: koji):
I think this can be used for SOLR-64. I'll take it.
TODO:
- move PathTokenizer to
modules/analysis/common/src/java/org/apache/lucene/analysis/path/ (4.0) or
lucene/src/java/org/apache/lucene/analysis/ (3.1)
- make test cases
- respect the original path delimiter (seems current patch outputs backslash
even if the input uses slash)
- accept an arbitrary delimiter
> PathTokenizerFactory
> --------------------
>
> Key: SOLR-1057
> URL: https://issues.apache.org/jira/browse/SOLR-1057
> Project: Solr
> Issue Type: New Feature
> Components: Schema and Analysis
> Reporter: Ryan McKinley
> Assignee: Koji Sekiguchi
> Priority: Minor
> Fix For: 3.1, 4.0
>
> Attachments: SOLR-1057-PathTokenizerFactory.patch,
> SOLR-1057-PathTokenizerFactory.patch
>
>
> This is a Tokenizer that splits the input string into a series of paths. For
> example:
> {panel}
> /aaa/bbb/ccc
> {panel}
> becomes:
> {panel}
> /aaa/
> /aaa/bbb/
> /aaa/bbb/ccc
> {panel}
--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]