[ 
https://issues.apache.org/jira/browse/LUCENE-3873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13238422#comment-13238422
 ] 

Michael McCandless commented on LUCENE-3873:
--------------------------------------------

I agree we can use it in specific places for starters...

The patch on LUCENE-3848 mixes in "TokenStream to Automaton" and 
MockGraphTokenFilter; I'll split that apart and only commit 
MockGraphTokenFilter here.

One problem is that MockGraphTokenFilter isn't setting offsets currently. I 
think to do this "correctly" it needs to buffer up pending input tokens until 
it has reached the posLength it wants to output for a random token, and then 
set the offsets of that token accordingly.
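
The buffering idea can be sketched roughly as follows. This is not the
MockGraphTokenFilter code and it does not use Lucene's TokenStream API: the
Tok class, the addGraphToken method and its parameters are hypothetical names
invented for illustration. It also assumes every input token has a position
increment of 1 and a posLength of 1, so that buffering N tokens covers N
positions.

```java
import java.util.ArrayList;
import java.util.List;

final class GraphOffsetSketch {

  /** Hypothetical stand-in for a token's attributes; not a Lucene class. */
  static final class Tok {
    final String term;
    final int startOffset, endOffset, posInc, posLength;

    Tok(String term, int startOffset, int endOffset, int posInc, int posLength) {
      this.term = term;
      this.startOffset = startOffset;
      this.endOffset = endOffset;
      this.posInc = posInc;
      this.posLength = posLength;
    }
  }

  /**
   * Inserts one graph token starting at input index {@code at} (0 <= at <= input.size()).
   * Assumes each input token has posInc == 1 and posLength == 1.
   */
  static List<Tok> addGraphToken(List<Tok> input, int at, int posLength, String term) {
    List<Tok> out = new ArrayList<>(input.subList(0, at));

    // Buffer pending input tokens until we hold as many positions as the
    // graph token is supposed to span (or the input runs out).
    List<Tok> pending = new ArrayList<>();
    int i = at;
    while (i < input.size() && pending.size() < posLength) {
      pending.add(input.get(i++));
    }
    if (pending.isEmpty()) {
      return out; // nothing left to span, so no graph token is emitted
    }

    Tok first = pending.get(0);
    Tok last = pending.get(pending.size() - 1);

    // The graph token starts where the first buffered token starts and ends
    // where the last buffered token ends; it spans pending.size() positions.
    out.add(new Tok(term, first.startOffset, last.endOffset, first.posInc, pending.size()));

    // The first buffered token is stacked at the same position (posInc == 0);
    // the remaining buffered tokens and the rest of the input follow unchanged.
    out.add(new Tok(first.term, first.startOffset, first.endOffset, 0, first.posLength));
    out.addAll(pending.subList(1, pending.size()));
    out.addAll(input.subList(i, input.size()));
    return out;
  }
}
```

For example, given input tokens "wi" (offsets 0-2) and "fi" (offsets 3-5), a
graph token requested at index 0 with posLength 2 would get offsets 0-5. If
the input ends before posLength tokens are buffered, the sketch shortens the
graph token's posLength to what was actually buffered.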
                
> tie MockGraphTokenFilter into all analyzers tests
> -------------------------------------------------
>
>                 Key: LUCENE-3873
>                 URL: https://issues.apache.org/jira/browse/LUCENE-3873
>             Project: Lucene - Java
>          Issue Type: Task
>          Components: modules/analysis
>            Reporter: Robert Muir
>            Assignee: Michael McCandless
>
> Mike made a MockGraphTokenFilter on LUCENE-3848.
> Many filters currently aren't tested with anything but a simple tokenstream.
> We should test them with this, too; it might find bugs (zero-length terms,
> stacked terms/synonyms, etc.).


        
