[ 
https://issues.apache.org/jira/browse/SOLR-1057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12989693#comment-12989693
 ] 

Robert Muir commented on SOLR-1057:
-----------------------------------

I'm a little confused about the use of the tokenizer (i have no problems 
technically, its maybe a naming issue?)

Is this intended for tokenizing file pathnames as its name would suggest? In 
this case I think the path should have positions, e.g. /foo/bar/whatever.txt is 
foo(1), bar(1), whatever.txt(1)?

It seems instead, this one is intended for representing hierarchies, as it 
creates synonyms of /foo, /foo/bar, /foo/bar/whatever.txt... with position 
increments of zero.

I guess I'm just being picky about naming, but i think this hierarchical case 
is more specific than 'tokenizing file pathnames' and maybe a name like 
HierarchyTokenizer (this one too probably isn't the best!) would better 
represent what it does?


> PathTokenizerFactory
> --------------------
>
>                 Key: SOLR-1057
>                 URL: https://issues.apache.org/jira/browse/SOLR-1057
>             Project: Solr
>          Issue Type: New Feature
>          Components: Schema and Analysis
>            Reporter: Ryan McKinley
>            Assignee: Koji Sekiguchi
>            Priority: Minor
>             Fix For: 3.1, 4.0
>
>         Attachments: SOLR-1057-PathTokenizerFactory.patch, 
> SOLR-1057-PathTokenizerFactory.patch, SOLR-1057.patch
>
>
> This is a Tokenizer that splits the input string into a series of paths.  For 
> example:
> {panel}
>  /aaa/bbb/ccc
> {panel}
> becomes:
> {panel}
>  /aaa/
>  /aaa/bbb/
>  /aaa/bbb/ccc
> {panel}

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to