[ 
https://issues.apache.org/jira/browse/TIKA-2016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15970216#comment-15970216
 ] 

ASF GitHub Bot commented on TIKA-2016:
--------------------------------------

thammegowda commented on issue #169: TIKA-2016  Sentiment Analysis Parser 
Contributed by amensiko and thammegowda
URL: https://github.com/apache/tika/pull/169#issuecomment-294326796
 
 
   1. The categorical dataset links on the Datasets page are broken; we have to fix 
them too. http://irds.usc.edu/SentimentAnalysisParser/datasets.html 
   2. The Models page has no models linked, and the paths mentioned on the web page 
do not exist in the repo. See 
http://irds.usc.edu/SentimentAnalysisParser/models.html 
I expect the categorical analysis (in simple words, a multi-class classifier) to 
work simply by changing the `modelPath` parameter in the Tika config XML (please 
give me a link to a model and I will test it :+1: ). By default, it uses the 
binary-label model trained on the Netflix review dataset, but changing it is as 
easy as editing the Tika config XML.
   3. The OpenNLP API has changed. I upgraded from 1.5.x to 1.6.0 and it went 
fine. However, the latest version is 1.7.2, in which `SentimentME` was renamed 
to `DocumentCategorizerME`, so the existing _sentiment analysis_ models 
won't work, and we are stuck here until we regenerate the models. 
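 
For illustration, switching models as described in item 2 might look roughly 
like the Tika config fragment below. This is only a sketch under assumptions: 
the parser class name and the model path are placeholders that are not taken 
from this comment; only the `modelPath` parameter name comes from the text above. 

```xml
<?xml version="1.0" encoding="UTF-8"?>
<properties>
  <parsers>
    <!-- Class name is an assumption; substitute the parser class added by the PR -->
    <parser class="org.apache.tika.parser.sentiment.analysis.SentimentParser">
      <params>
        <!-- Hypothetical path; point this at the categorical (multi-class) model -->
        <param name="modelPath" type="string">/path/to/categorical-model.bin</param>
      </params>
    </parser>
  </parsers>
</properties>
```

If this matches the parser's actual configuration, moving between the binary 
and the categorical model would only require changing the `modelPath` value. 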
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


> A parser that combines Apache OpenNLP and Apache Tika and provides facilities 
> for automatically deriving sentiment from text.
> -----------------------------------------------------------------------------------------------------------------------------
>
>                 Key: TIKA-2016
>                 URL: https://issues.apache.org/jira/browse/TIKA-2016
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Anastasija Mensikova
>            Assignee: Chris A. Mattmann
>              Labels: analysis, gsoc2016, memex, parser, sentiment
>             Fix For: 1.15
>
>
> A new project that implements a parser that uses Apache OpenNLP and Apache 
> Tika to perform Sentiment Analysis.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
