[jira] [Commented] (MAPREDUCE-7400) New Map Reduce Example - Simple Sentiment Analysis

ASF GitHub Bot (Jira) Thu, 25 Aug 2022 12:52:06 -0700


    [ 
https://issues.apache.org/jira/browse/MAPREDUCE-7400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17585008#comment-17585008
 ]


ASF GitHub Bot commented on MAPREDUCE-7400:
-------------------------------------------

MeetuPatel opened a new pull request, #4808:
URL: https://github.com/apache/hadoop/pull/4808

   
   ### Description of PR
   This PR adds a new MapReduce example: sentiment analysis. The job 
determines a sentiment score for each user. It assigns a score to each tweet 
a user has made, then aggregates the scores of that user's tweets, and does 
so for every user in the dataset.
   
   The example takes a Twitter dataset containing users and their tweets and 
outputs <username, sentiment score> pairs, where each user's score is 
computed over all the tweets made by that user.
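
The per-tweet scoring and per-user aggregation described above could be sketched in plain Java, without the Hadoop API, roughly as follows. The class name `UserSentimentSketch`, the tab-separated `username<TAB>tweet` input layout, and the small built-in word sets are assumptions made for illustration; they are not taken from the patch.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;

public class UserSentimentSketch {
    // Hypothetical word sets; the job described here reads its words from a file instead.
    static final Set<String> POSITIVE = new HashSet<>(Arrays.asList("good", "great", "love"));
    static final Set<String> NEGATIVE = new HashSet<>(Arrays.asList("bad", "awful", "hate"));

    // "Map" step: score one tweet, +1 per positive word and -1 per negative word.
    static int scoreTweet(String tweet) {
        int score = 0;
        for (String token : tweet.toLowerCase().split("\\W+")) {
            if (POSITIVE.contains(token)) {
                score++;
            } else if (NEGATIVE.contains(token)) {
                score--;
            }
        }
        return score;
    }

    // "Reduce" step: sum the tweet scores for each username.
    static Map<String, Integer> aggregate(List<String> lines) {
        Map<String, Integer> totals = new TreeMap<>();
        for (String line : lines) {
            // Assumed layout: username, a tab, then the tweet text.
            String[] parts = line.split("\t", 2);
            if (parts.length < 2) {
                continue; // skip lines that do not match the assumed layout
            }
            totals.merge(parts[0], scoreTweet(parts[1]), Integer::sum);
        }
        return totals;
    }

    public static void main(String[] args) {
        List<String> lines = Arrays.asList(
            "alice\tI love this great phone",
            "bob\tawful service",
            "alice\tbad battery");
        // Prints one total per user, ordered by username.
        System.out.println(aggregate(lines));
    }
}
```

In an actual MapReduce job the `scoreTweet` logic would sit in the mapper, emitting one (username, score) pair per tweet, and the summation would sit in the reducer.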
   
   
   ### How was this patch tested?
   This patch was tested with the Twitter dataset on a single-node Hadoop 
cluster in pseudo-distributed mode.
   
   ### For code changes:
   
   - [x] Does the title of this PR start with the corresponding JIRA issue ID 
(e.g. 'HADOOP-17799. Your PR title ...')?
   - [ ] Object storage: have the integration tests been executed and the 
endpoint declared according to the connector-specific documentation?
   - [ ] If adding new dependencies to the code, are these dependencies 
licensed in a way that is compatible for inclusion under [ASF 
2.0](http://www.apache.org/legal/resolved.html#category-a)?
   - [ ] If applicable, have you updated the `LICENSE`, `LICENSE-binary`, 
`NOTICE-binary` files?
   
   




> New Map Reduce Example - Simple Sentiment Analysis
> --------------------------------------------------
>
>                 Key: MAPREDUCE-7400
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7400
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: examples
>    Affects Versions: 3.4.0
>            Reporter: Meetu Patel
>            Priority: Minor
>             Fix For: 3.4.0
>
>         Attachments: MAPREDUCE-7400.patch, sample_data.txt, sample_words.txt
>
>
> This issue proposes a new MapReduce example: sentiment analysis. The job 
> determines a sentiment score for each user. It assigns a score to each tweet 
> a user has made, then aggregates the scores of that user's tweets, and does 
> so for every user in the dataset.
> The example takes a Twitter dataset containing users and their tweets and 
> outputs <username, sentiment score> pairs, where each user's score is 
> computed over all the tweets made by that user.
> The example takes two input files: the Twitter dataset and a file 
> containing a list of words.
> The word list file contains positive, negative and negation words, which are 
> used to assign sentiment scores to the words in the tweets.
> Run the job with the following command:
> bin/hadoop jar /HADOOP_PATH/share/hadoop/mapreduce/mapreduce-examples.jar 
> sentimentanalysis <input file/dir path> <output dir path> <word list file 
> path/dir path>
> For example, using the attached sample files:
> bin/hadoop jar /HADOOP_PATH/share/hadoop/mapreduce/mapreduce-examples.jar 
> sentimentanalysis sample_data.txt <output dir path> sample_words.txt
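
One way a word list with positive, negative and negation words might be used to score a sentence is sketched below. The rule that a negation word flips the sign of the next sentiment word, the class name `NegationScoreSketch`, and the in-code word sets are all assumptions for illustration; the format of sample_words.txt and the scoring rule used by the patch are not described in this message.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.Set;

public class NegationScoreSketch {
    // Hypothetical word sets standing in for the contents of a word list file.
    static final Set<String> POSITIVE = new HashSet<>(Arrays.asList("good", "happy"));
    static final Set<String> NEGATIVE = new HashSet<>(Arrays.asList("bad", "sad"));
    static final Set<String> NEGATION = new HashSet<>(Arrays.asList("not", "never"));

    // Assumed rule: a negation word flips the sign of the next sentiment word after it.
    static int score(String sentence) {
        int score = 0;
        boolean negated = false;
        for (String token : sentence.toLowerCase().split("\\W+")) {
            if (NEGATION.contains(token)) {
                negated = true;
                continue;
            }
            int value = POSITIVE.contains(token) ? 1 : NEGATIVE.contains(token) ? -1 : 0;
            if (value != 0) {
                score += negated ? -value : value;
                negated = false; // the pending negation has been consumed
            }
        }
        return score;
    }

    public static void main(String[] args) {
        System.out.println(score("this is good"));     // positive word, no negation
        System.out.println(score("this is not good")); // negation flips the positive word
        System.out.println(score("never sad"));        // negation flips the negative word
    }
}
```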



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
