Hi,

We are trying to incorporate a synonym filter during indexing with Nutch. As
far as I understand, Nutch doesn’t provide a synonym indexing plug-in by default.
Can we extend IndexingFilter in Nutch to incorporate the synonym filter
available in Lucene (using WordNet) or a custom synonym plug-in, without any
negative impact on existing Nutch indexing (e.g., the bigram handling)?
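
To make the question concrete, here is a minimal sketch of the index-time
expansion we have in mind. It is plain Java and does not use the Nutch or
Lucene APIs; the class name SynonymExpander and the synonym map are
hypothetical and only for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper for illustration; not part of Nutch or Lucene.
public class SynonymExpander {

    // Returns the original tokens, with each token's synonyms
    // inserted directly after it. Lookup is case-insensitive.
    public static List<String> expand(List<String> tokens,
                                      Map<String, List<String>> synonyms) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            out.add(token);
            out.addAll(synonyms.getOrDefault(
                    token.toLowerCase(Locale.ROOT), List.of()));
        }
        return out;
    }

    public static void main(String[] args) {
        // Assumed sample data; a real map would come from WordNet or a custom list.
        Map<String, List<String>> synonyms =
                Map.of("car", List.of("automobile", "auto"));
        System.out.println(expand(List.of("fast", "car"), synonyms));
    }
}
```

The indexed field would then contain the extra terms, so the cost is paid
once at indexing time and the index grows accordingly.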


Another option we are considering is to look up synonyms at query time.
But this might cause performance issues as we scale the system to, say, more
than 100M pages.
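
For comparison, a rough sketch of the query-time variant. Again this is
plain Java with hypothetical names (QueryTimeSynonyms, alternatives,
lookupCount), not Nutch code. It only shows why we expect extra work per
query: every query term turns into a group of alternatives, and each
alternative needs its own term lookup in the index.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper for illustration; not part of Nutch or Lucene.
public class QueryTimeSynonyms {

    // For each query term, returns a group holding the term itself
    // followed by its synonyms (case-insensitive lookup).
    public static List<List<String>> alternatives(
            List<String> terms, Map<String, List<String>> synonyms) {
        List<List<String>> groups = new ArrayList<>();
        for (String term : terms) {
            List<String> group = new ArrayList<>();
            group.add(term);
            group.addAll(synonyms.getOrDefault(
                    term.toLowerCase(Locale.ROOT), List.of()));
            groups.add(group);
        }
        return groups;
    }

    // Total number of terms the expanded query would have to look up.
    public static int lookupCount(List<List<String>> groups) {
        return groups.stream().mapToInt(List::size).sum();
    }
}
```

With this approach the index stays unchanged, but a two-word query with two
synonyms per word would look up six terms instead of two.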



Can someone please suggest the best way to incorporate a synonym filter in
Nutch?



Thank you.



Regards,

Dharan
