Andrzej,

Thank you for the explanation.

> No, in this case, if "web" and "services" were added to
> common-grams.utf8, the result would look like:
>
> web|web-services, services|services-is, cool
>
> where | marks tokens indexed at the same position in the index.

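To check that I am reading the quoted example correctly, here is a toy Python sketch of the behavior as I understand it. It is not the actual implementation: the function names are hypothetical, and I am assuming the input was "web services is cool" with "is" dropped as a stop word, which is only my guess at why it does not appear on its own in the example.

```python
def common_grams(tokens, common, stop_words=frozenset()):
    """Toy model: return a list of positions, each a list of tokens sharing it.

    Assumption: a common term is kept, and the pair it forms with the
    following word is added at the same position.
    """
    positions = []
    for i, tok in enumerate(tokens):
        here = []
        if tok not in stop_words:
            here.append(tok)  # the plain term
        if tok in common and i + 1 < len(tokens):
            here.append(tok + "-" + tokens[i + 1])  # pair with the next word
        if here:
            positions.append(here)
    return positions


def render(positions):
    # "|" joins tokens at the same position, ", " separates positions.
    return ", ".join("|".join(p) for p in positions)


result = render(
    common_grams(["web", "services", "is", "cool"], {"web", "services"}, {"is"})
)
print(result)  # web|web-services, services|services-is, cool
```
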
I guess you meant common-terms.utf8 instead? If so, does Lucene index pairs of words that include "a", "and", "for", etc., which are usually regarded as stop words and simply thrown away by many search engines? That is amazing.

-kuro
