Hi Koji,

I got your idea. It's awesome.
But my problem is the dictionary corpus itself. If I use WordNet, I can
create a dictionary index only for English. I need to create a dictionary
index for all languages. Is there anything like WordNet that I can readily
plug into my application?

Please kindly guide me.

Thanks and Regards
Vignesh Srinivasan.

On Wed, Oct 9, 2013 at 5:56 PM, Koji Sekiguchi <[email protected]> wrote:

> Hi VIGNESH,
>
> The heart of my idea in the article is that if you have a dictionary
> (corpus) in a Lucene index, my program can extract synonym data from
> the index. Wikipedia was a concrete example I used for the description.
> Please see the figure in the article for the system architecture.
>
> koji
>
> (13/10/09 20:46), VIGNESH S wrote:
>
>> Hi Koji,
>>
>> Thanks for your reply and guidance.
>>
>> I have read the article below and it is really helpful in getting the
>> relevant synonyms.
>>
>> But how are you getting the synonyms from Wikipedia? Does Wikipedia
>> expose any API, or is there any ready-made dictionary file that
>> Wikipedia provides for all languages?
>>
>> Please kindly help.
>>
>> On Mon, Oct 7, 2013 at 8:06 PM, Koji Sekiguchi <[email protected]> wrote:
>>
>>> (13/10/07 18:33), VIGNESH S wrote:
>>>
>>>> Hi,
>>>>
>>>> How do I implement synonym search for all languages?
>>>>
>>>> As far as I know, WordNet supports only English. Is there anything
>>>> else we can use to get support for all languages?
>>>
>>> I think most people make synonym data manually...
>>> I've never explored WordNet, but I think it is too general to adopt
>>> for your business field?
>>>
>>> I've developed a program that extracts synonym knowledge from Wikipedia
>>> (see my signature below). The outcome is useful for general purposes.
>>> But I think that by using a subset of Wikipedia instead of the whole
>>> of it, the program could extract more useful synonym knowledge for
>>> a specific business field.
>>>
>>> To extract such a subset of Wikipedia, the existing Lucene index
>>> (which includes the interesting words of the specific field) can be used.
>>>
>>> koji
>>> --
>>> http://soleami.com/blog/automatically-acquiring-synonym-knowledge-from-wikipedia.html
>>>
>>> ---------------------------------------------------------------------
>>> To unsubscribe, e-mail: java-user-unsubscribe@lucene.apache.org
>>> For additional commands, e-mail: java-user-help@lucene.apache.org

--
Thanks and Regards
Vignesh Srinivasan
9739135640
