Dear Wiki user, You have subscribed to a wiki page or wiki category on "Tika Wiki" for change notification.
The "TikaAndNLTK" page has been changed by manalishah: https://wiki.apache.org/tika/TikaAndNLTK?action=diff&rev1=12&rev2=13 $ pip install --process-dependency-links nltkrest }}} - 2. '''Setuptools and/or Distribute''': The module can be downloaded from [[https://github.com/manalishah/NLTKRest|this site]] and then installed with the following commands: + 2. '''Setuptools and/or Distribute''': The module can be downloaded from [[https://github.com/manalishah/NLTKRest|github]] and then installed with the following commands: {{{ $ cd NLTKRest/nltkrest $ python setup.py install nltkrest @@ -69, +69 @@ #config file must be stored in this directory pwd - export PATH_PREFIX="$NER_RES/org/apache/tika/parser/ner/nltk" + export PATH_PREFIX="$NLTK_RES/org/apache/tika/parser/ner/nltk" mkdir -p $PATH_PREFIX #create and edit the properties file vim $PATH_PREFIX/NLTKServer.properties @@ -78, +78 @@ 3. '''Download NLTKRest-resources''' - Better yet, you could skip the previous two steps completely and save the hassle. Simply download the project [[https://github.com/manalishah/NLTKRest|NLTKRest-resources]] and edit the properties file + Better yet, you could skip the previous two steps completely and save the hassle. Simply download the project [[https://github.com/manalishah/NLTKRest-resources|NLTKRest-resources]] and edit the properties file {{{ - cd $HOME && git clone https://github.com/manalishah/NLTKRest|NLTKRest-resources + cd $HOME && git clone https://github.com/manalishah/NLTKRest-resources export NLTK_RES=$HOME/NLTKRest-resources - edit $HOME/NLTKRest/org/apache/tika/parser/ner/nltk/NLTKServer.properties + vim $NLTK_RES/org/apache/tika/parser/ner/nltk/NLTKServer.properties }}} == Running NLTK with Tika == @@ -93, +93 @@ export TIKA_APP={your/path/to/tika-app}/target/tika-app-1.13-SNAPSHOT.jar #set the system property to use NLTKNERecogniser class - java -Dner.impl.class=org.apache.tika.parser.ner.nltk.NLTKNERecogniser -classpath $NLTK_RES:$TIKA_APP org.apache.tika.cli.TikaCLI --config=tika-config.xml -m http://www.hawking.org.uk/ + java -Dner.impl.class=org.apache.tika.parser.ner.nltk.NLTKNERecogniser -classpath $NLTK_RES:$TIKA_APP org.apache.tika.cli.TikaCLI --config=NLTK_RES/tika-config.xml -m http://www.hawking.org.uk/ }}}
