[ https://issues.apache.org/jira/browse/LUCENE-3873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13238422#comment-13238422 ]
Michael McCandless commented on LUCENE-3873: -------------------------------------------- I agree we can use it in specific places for starters... The patch on LUCENE-3848 mixes in "TokenStream to Automaton" and MockGraphTokenFilter; I'll split that apart and only commit MockGraphTokenFilter here. One problem is... MockGraphTokenFilter isn't setting offsets currently.... I think to do this "correctly" it needs to buffer up pending input tokens, until it's reached the posLength it wants to output for a random token, and then set the offset accordingly. > tie MockGraphTokenFilter into all analyzers tests > ------------------------------------------------- > > Key: LUCENE-3873 > URL: https://issues.apache.org/jira/browse/LUCENE-3873 > Project: Lucene - Java > Issue Type: Task > Components: modules/analysis > Reporter: Robert Muir > Assignee: Michael McCandless > > Mike made a MockGraphTokenFilter on LUCENE-3848. > Many filters currently arent tested with anything but a simple tokenstream. > we should test them with this, too, it might find bugs (zero-length terms, > stacked terms/synonyms, etc) -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org