Hi Heshan
one approach could be something like this:
1- vectorize each ngram of each sentence. One vectorization strategy is to
use word2vec (the deep learning package). i believe someone has ported
word2vec (originally in C) to Lucene. do google search
2- aggregate each word vector (i.e some clu
Hi Heshan,
I think you can achieve what you are looking for. You may read "lucene in
Action 2nd edition" about lucene scoring system and FuzzyQuery. Hope this
may help. May be someone can suggest much better approach.
On Wed, Apr 1, 2015 at 8:14 AM, hesh jay wrote:
> hi,
> I am second year under
hi,
I am second year undergraduate of University of Moratuwa,SriLanka.My second
year project I am doing Question answering system(Knowledge base).In this
project i have to suggest similar question perviously asked by other users.
I should find similarity of two Sentences in my application to sugges