Hi, I have a requirement to write a custom tokenizer using Lucene framework.
My requirement is it should have capabilities to match multiple words as one token. for example. When user passes String as International Business machine logo or IBM logo it should return International Business Machine as one token and logo as one token. Please help me as how can I approach this ... Regards Geet