I did as you mentioned and the problem still the same, I think the problem in the highFrequentTerm part. There I see duplicate words in the produced high frequent list. The comparison itself ok because I can see only terms belong to document type "A" is added to the TermInfoQueue. However, the frequency is not correctly counted for each term and also with some duplicate words in the list. Does something wrong with TermDocs dok and dok.freq()?
-- View this message in context: http://lucene.472066.n3.nabble.com/conditional-High-Freq-Terms-in-Lucene-index-tp3868066p3873567.html Sent from the Lucene - Java Developer mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
