Thamme Gowda created TIKA-2263:
----------------------------------
Summary: Sentiment Analysis - Standardize the interface and
integrate OpenNLP, CoreNLP with pluggable models
Key: TIKA-2263
URL: https://issues.apache.org/jira/browse/TIKA-2263
Project: Tika
Issue Type: Improvement
Reporter: Thamme Gowda
h2. Background
[As per Wikipedia|https://en.wikipedia.org/wiki/Sentiment_analysis]: Sentiment
analysis (also known as opinion mining) refers to the use of natural language
processing, text analysis and computational linguistics to identify and extract
subjective information in source materials. Sentiment analysis is widely
applied to reviews and social media for a variety of applications, ranging from
marketing to customer service.
h2. Goals
* Create an interface for SentimentAnalyser
* Create a parser for sentiment analysis with pluggable implementations of
SentimentAnalyser and its model files
* Provide an implementation from Apache OpenNLP and its default models for
english. Make this as default
** Converge the efforts of [~chrismattmann] and
[[email protected]] in [TIKA-2016|
https://issues.apache.org/jira/browse/TIKA-2016]
* Provide an implementation from Stanford CoreNLP and make this as optional
(due to its license incompatibility wit AL2.0). Make it attachable at runtime
similar to already existing [CoreNLP Named Entity
Recogniser|https://wiki.apache.org/tika/TikaAndNER#Using_Stanford_CoreNLP_NER].
** Leave room for plugging other/custom implementations at the runtime
h2. References
* https://issues.apache.org/jira/browse/TIKA-2016
* https://issues.apache.org/jira/browse/TIKA-1787
* http://irds.usc.edu/SentimentAnalysisParser/index.html
* http://nlp.stanford.edu/sentiment/
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)