[
https://issues.apache.org/jira/browse/SOLR-1057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12989693#comment-12989693
]
Robert Muir commented on SOLR-1057:
-----------------------------------
I'm a little confused about the use of the tokenizer (i have no problems
technically, its maybe a naming issue?)
Is this intended for tokenizing file pathnames as its name would suggest? In
this case I think the path should have positions, e.g. /foo/bar/whatever.txt is
foo(1), bar(1), whatever.txt(1)?
It seems instead, this one is intended for representing hierarchies, as it
creates synonyms of /foo, /foo/bar, /foo/bar/whatever.txt... with position
increments of zero.
I guess I'm just being picky about naming, but i think this hierarchical case
is more specific than 'tokenizing file pathnames' and maybe a name like
HierarchyTokenizer (this one too probably isn't the best!) would better
represent what it does?
> PathTokenizerFactory
> --------------------
>
> Key: SOLR-1057
> URL: https://issues.apache.org/jira/browse/SOLR-1057
> Project: Solr
> Issue Type: New Feature
> Components: Schema and Analysis
> Reporter: Ryan McKinley
> Assignee: Koji Sekiguchi
> Priority: Minor
> Fix For: 3.1, 4.0
>
> Attachments: SOLR-1057-PathTokenizerFactory.patch,
> SOLR-1057-PathTokenizerFactory.patch, SOLR-1057.patch
>
>
> This is a Tokenizer that splits the input string into a series of paths. For
> example:
> {panel}
> /aaa/bbb/ccc
> {panel}
> becomes:
> {panel}
> /aaa/
> /aaa/bbb/
> /aaa/bbb/ccc
> {panel}
--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]