Dump WordNet to SOLR Synonym format
-----------------------------------

                 Key: LUCENE-2347
                 URL: https://issues.apache.org/jira/browse/LUCENE-2347
             Project: Lucene - Java
          Issue Type: New Feature
          Components: contrib/*
    Affects Versions: 3.0.1
            Reporter: Bill Bell


This enhancement allows you to dump v2 of WordNet to SOLR synonym format! Get 
all your syns loaded easily.

1. You can load all synonyms from http://wordnetcode.princeton.edu/2.0/ WordNet 
V2 to SOLR by first using the Sys2Index program
http://lucene.apache.org/java/2_2_0/api/org/apache/lucene/wordnet/Syns2Index.html

Get WNprolog from http://wordnetcode.princeton.edu/2.0/

2. We modified this program to work with SOLR (See attached) on 
amidev.kaango.com in /vol/src/lucene/contrib/wordnet
vi 
/vol/src/lucene/contrib/wordnet/src/java/org/apache/lucene/wordnet/Syns2Solr.java

3. Run ant

4. java -classpath 
/vol/src/lucene/build/contrib/wordnet/lucene-wordnet-3.1-dev.jar 
org.apache.lucene.wordnet.Syns2Solr prolog/wn_s.pl solr > index_synonyms.txt

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to