[
https://issues.apache.org/jira/browse/MAPREDUCE-7400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=18035698#comment-18035698
]
ASF GitHub Bot commented on MAPREDUCE-7400:
-------------------------------------------
github-actions[bot] closed pull request #4808: MAPREDUCE-7400. New MapReduce
example - Sentiment Analysis
URL: https://github.com/apache/hadoop/pull/4808
> New Map Reduce Example - Simple Sentiment Analysis
> --------------------------------------------------
>
> Key: MAPREDUCE-7400
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-7400
> Project: Hadoop Map/Reduce
> Issue Type: Improvement
> Components: examples
> Affects Versions: 3.4.0
> Reporter: Meetu Patel
> Priority: Minor
> Labels: pull-request-available
> Attachments: MAPREDUCE-7400.patch, sample_data.txt, sample_words.txt
>
>
> I am looking to add a new map reduce example, i.e, sentiment analysis.
> Sentiment analysis map reduce job helps in determining the sentiment score
> for a user. It takes each tweet made by an user and assigns a sentiment score
> for that tweet/sentence for a particular user and then aggregates the
> sentiment scores for all tweets made by all users.
> This example takes the twitter dataset which contains users and the tweets
> made by users and gives the output as <username, sentiment score>. For each
> user, the sentiment score is calculated for all the tweets made by that
> particular user.
> This mapreduce examples takes in two input files - input twitter dataset and
> a file containing list of words.
> The word list file contains positive, negative and negation words which are
> used to give a sentiment score to the words in tweets.
> You can use command:
> bin/hadoop jar /HADOOP_PATH/share/hadoop/mapreduce/mapreduce-examples.jar
> sentimentanalysis <input file/dir path> <output dir path> <word list file
> path/dir path>
> For example, you can use the sample files and run the above command as:
> bin/hadoop jar /HADOOP_PATH/share/hadoop/mapreduce/mapreduce-examples.jar
> sentimentanalysis sample_data.txt <output dir path> sample_words.txt
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]