Thamme Gowda created TIKA-2263:
----------------------------------

             Summary: Sentiment Analysis - Standardize the interface and 
integrate OpenNLP, CoreNLP with pluggable models
                 Key: TIKA-2263
                 URL: https://issues.apache.org/jira/browse/TIKA-2263
             Project: Tika
          Issue Type: Improvement
            Reporter: Thamme Gowda


h2. Background
[As per Wikipedia|https://en.wikipedia.org/wiki/Sentiment_analysis]: Sentiment 
analysis (also known as opinion mining) refers to the use of natural language 
processing, text analysis and computational linguistics to identify and extract 
subjective information in source materials. Sentiment analysis is widely 
applied to reviews and social media for a variety of applications, ranging from 
marketing to customer service.


h2. Goals
* Create an interface for SentimentAnalyser
* Create a parser for sentiment analysis with pluggable implementations of 
SentimentAnalyser and its model files
* Provide an implementation from Apache OpenNLP and its default models for 
english. Make this as default
** Converge the efforts of [~chrismattmann] and 
[[email protected]] in [TIKA-2016| 
https://issues.apache.org/jira/browse/TIKA-2016]
* Provide an implementation from Stanford CoreNLP and make this as optional 
(due to its license incompatibility wit AL2.0). Make it attachable at runtime 
similar to already existing [CoreNLP Named Entity 
Recogniser|https://wiki.apache.org/tika/TikaAndNER#Using_Stanford_CoreNLP_NER].
** Leave room for plugging other/custom implementations at the runtime

h2. References
* https://issues.apache.org/jira/browse/TIKA-2016
* https://issues.apache.org/jira/browse/TIKA-1787
* http://irds.usc.edu/SentimentAnalysisParser/index.html
* http://nlp.stanford.edu/sentiment/







--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to