The easiest way would be to pre-process your input and join those two tokens before splitting on whitespace.
But from the given context I might be missing some details... still worth a shot.

On Fri, Dec 21, 2012 at 9:50 AM, Xi Shen <davidshe...@gmail.com> wrote:

> Hi,
>
> I am looking for a token filter that can combine 2 terms into 1? E.g.
>
> the input has been tokenized by white space:
>
> t1 t2 t2a t3
>
> I want a filter that output:
>
> t1 t2t2a t3
>
> I know it is a very special case, and I am thinking about develop a filter
> of my own. But I cannot figure out which API I should use to look for terms
> in a Token Stream.
>
> --
> Regards,
> David Shen
>
> http://about.me/davidshen
> https://twitter.com/#!/davidshen84
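
A rough sketch of the pre-processing idea suggested at the top of this reply, in case it helps. It assumes the pair of tokens to join is known in advance and that the raw text can be rewritten before it reaches the whitespace tokenizer. The function names `join_pair` and `tokenize` are made up for illustration and are not part of any Lucene or Solr API.

```python
import re


def join_pair(text, first, second):
    """Merge each `first` token that is directly followed by `second`.

    Both tokens must be whole whitespace-delimited tokens, so a string
    like "xt2 t2a" is left untouched when joining "t2" and "t2a".
    """
    pattern = re.compile(
        r"(?<!\S)"            # start of a token (no non-space char before it)
        + re.escape(first)
        + r"\s+"              # the whitespace that would otherwise split them
        + re.escape(second)
        + r"(?!\S)"           # end of a token (no non-space char after it)
    )
    # A function replacement avoids backslash handling in the joined text.
    return pattern.sub(lambda match: first + second, text)


def tokenize(text, first, second):
    """Join the pair first, then split on whitespace (hypothetical helper)."""
    return join_pair(text, first, second).split()


if __name__ == "__main__":
    print(tokenize("t1 t2 t2a t3", "t2", "t2a"))
```

With the input from the question, `tokenize("t1 t2 t2a t3", "t2", "t2a")` yields the three tokens `t1`, `t2t2a`, `t3`. Inside an analysis chain the same substitution would have to run ahead of the tokenizer (for example as a pattern-based character filter), but whether that fits the original setup is an assumption here.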