Hello,
I’m wondering if anyone can help with a question about the dictionary format when feeding lucene.vector output into cvb.

I’ve previously used the pathway from text files, seqdirectory > seq2sparse > rowid > cvb, and it works fine. The dictionary created by seq2sparse is in sequence file format, and cvb accepts it.

But with the pathway from a Lucene index, lucene.vector > cvb, cvb throws the error “dict.out not a SequenceFile”. lucene.vector appears to generate its dictionary in plain text format, whereas cvb requires it in sequence file format.

Does anyone know how to use lucene.vector with cvb? I assume this means obtaining the dictionary from lucene.vector as a sequence file.

Thanks for your help.
James
