[ https://issues.apache.org/jira/browse/SOLR-1674?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12793646#action_12793646 ]
Yonik Seeley commented on SOLR-1674: ------------------------------------ Hmmm, isn't it a bug that this passes? assertTokenizesTo(map, "a b", new String[] { "ab", "ab", "ab" }); w/o the 1670 fix, we get "ab/ab/ab" (repeated tokens all at the same position). That's not the same as three "ab" tokens in a row. Also, we seem to have lost matching flexibility with overlapping tokens. "a/aa" should be the same as "aa/a", but if you change the order of overlapping tokens now, the tests fail. Didn't ya guys like my a/aa syntax to indicate overlapping tokens? It certainly made it faster for me to write the original testcases :-) > improve analysis tests, cut over to new API > ------------------------------------------- > > Key: SOLR-1674 > URL: https://issues.apache.org/jira/browse/SOLR-1674 > Project: Solr > Issue Type: Test > Components: Schema and Analysis > Reporter: Robert Muir > Assignee: Mark Miller > Attachments: SOLR-1674.patch, SOLR-1674.patch, SOLR-1674_speedup.patch > > > This patch > * converts all analysis tests to use the new tokenstream api > * converts most tests to use the more stringent assertion mechanisms from > lucene > * adds new tests to improve coverage > Most bugs found by more stringent testing have been fixed, with the exception > of SynonymFilter. > The problems with this filter are more serious, the previous tests were > essentially a no-op. > The new tests for SynonymFilter test the current behavior, but have FIXMEs > with what I think the old test wanted to expect in the comments. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.