Hi all, I am using Lucene and OpenNLP for POS tagging. I would like to support biGrams with POS tags as well. For example, I would like something like that:
Input: (I[PRP], am[VBP], using[VBG], Lucene[NNP]) Output: (I[PRP] am[VBP], am[VBP] using[VBG], using[VBG] Lucene[NNP]) The problem above is that I do not have "pure" tokens, like "I", "am" etc., so the analysis could be wrong if I add the POS tags as an input in Lucene. Is there a way to solve this, apart from creating my custome Lucene analyser? Thank you in advance. Regards, Niki.