[ https://issues.apache.org/jira/browse/MAPREDUCE-7400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17802800#comment-17802800 ]
Shilun Fan commented on MAPREDUCE-7400: --------------------------------------- Bulk update: moved all 3.4.0 non-blocker issues, please move back if it is a blocker. Retarget 3.5.0. > New Map Reduce Example - Simple Sentiment Analysis > -------------------------------------------------- > > Key: MAPREDUCE-7400 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-7400 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Components: examples > Affects Versions: 3.4.0 > Reporter: Meetu Patel > Priority: Minor > Labels: pull-request-available > Fix For: 3.4.0 > > Attachments: MAPREDUCE-7400.patch, sample_data.txt, sample_words.txt > > > I am looking to add a new map reduce example, i.e, sentiment analysis. > Sentiment analysis map reduce job helps in determining the sentiment score > for a user. It takes each tweet made by an user and assigns a sentiment score > for that tweet/sentence for a particular user and then aggregates the > sentiment scores for all tweets made by all users. > This example takes the twitter dataset which contains users and the tweets > made by users and gives the output as <username, sentiment score>. For each > user, the sentiment score is calculated for all the tweets made by that > particular user. > This mapreduce examples takes in two input files - input twitter dataset and > a file containing list of words. > The word list file contains positive, negative and negation words which are > used to give a sentiment score to the words in tweets. > You can use command: > bin/hadoop jar /HADOOP_PATH/share/hadoop/mapreduce/mapreduce-examples.jar > sentimentanalysis <input file/dir path> <output dir path> <word list file > path/dir path> > For example, you can use the sample files and run the above command as: > bin/hadoop jar /HADOOP_PATH/share/hadoop/mapreduce/mapreduce-examples.jar > sentimentanalysis sample_data.txt <output dir path> sample_words.txt -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org