[ 
https://issues.apache.org/jira/browse/SOLR-1674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12793646#action_12793646
 ] 

Yonik Seeley commented on SOLR-1674:
------------------------------------

Hmmm, isn't it a bug that this passes?
    assertTokenizesTo(map, "a b", new String[] { "ab", "ab", "ab"  });

w/o the 1670 fix, we get "ab/ab/ab" (repeated tokens all at the same position). 
 That's not the same as three "ab" tokens in a row.

Also, we seem to have lost matching flexibility with overlapping tokens.  
"a/aa" should be the same as "aa/a", but if you change the order of overlapping 
tokens now, the tests fail.

Didn't ya guys like my a/aa syntax to indicate overlapping tokens?  It 
certainly made it faster for me to write the original testcases :-)

> improve analysis tests, cut over to new API
> -------------------------------------------
>
>                 Key: SOLR-1674
>                 URL: https://issues.apache.org/jira/browse/SOLR-1674
>             Project: Solr
>          Issue Type: Test
>          Components: Schema and Analysis
>            Reporter: Robert Muir
>            Assignee: Mark Miller
>         Attachments: SOLR-1674.patch, SOLR-1674.patch, SOLR-1674_speedup.patch
>
>
> This patch
> * converts all analysis tests to use the new tokenstream api
> * converts most tests to use the more stringent assertion mechanisms from 
> lucene
> * adds new tests to improve coverage
> Most bugs found by more stringent testing have been fixed, with the exception 
> of SynonymFilter.
> The problems with this filter are more serious, the previous tests were 
> essentially a no-op.
> The new tests for SynonymFilter test the current behavior, but have FIXMEs 
> with what I think the old test wanted to expect in the comments.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to