[jira] [Work logged] (MAPREDUCE-7376) AggregateWordCount fetches wrong results

ASF GitHub Bot (Jira) Mon, 02 May 2022 12:04:05 -0700


     [ 
https://issues.apache.org/jira/browse/MAPREDUCE-7376?focusedWorklogId=765105&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-765105
 ]


ASF GitHub Bot logged work on MAPREDUCE-7376:
---------------------------------------------

                Author: ASF GitHub Bot
            Created on: 02/May/22 19:03
            Start Date: 02/May/22 19:03
    Worklog Time Spent: 10m 
      Work Description: ayushtkn opened a new pull request, #4257:
URL: https://github.com/apache/hadoop/pull/4257

   ### Description of PR
   
   Fixes AggregateWordCount
   
   ### How was this patch tested?
   ```
   hadoop-3.4.0-SNAPSHOT % bin/hadoop jar 
share/hadoop/mapreduce/hadoop-mapreduce-examples-3.4.0-SNAPSHOT.jar  
aggregatewordcount /testData /testOut 1 textinputformat
   
   hadoop-3.4.0-SNAPSHOT % bin/hdfs dfs -cat /testOut/part-r-00000              
                                                                                
     
   Bye  1
   Goodbye      1
   Hadoop       2
   Hello        2
   World        2
   ```
   ``/testData`` had two files:
   `wc01.txt:`
   Hello World Bye World
   
   `wc02.txt:`
   Hello Hadoop Goodbye Hadoop
   
   ### For code changes:
   
   - [ ] Does the title or this PR starts with the corresponding JIRA issue id 
(e.g. 'HADOOP-17799. Your PR title ...')?




Issue Time Tracking
-------------------

            Worklog Id:     (was: 765105)
    Remaining Estimate: 0h
            Time Spent: 10m

> AggregateWordCount fetches wrong results
> ----------------------------------------
>
>                 Key: MAPREDUCE-7376
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7376
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>            Reporter: Ayush Saxena
>            Assignee: Ayush Saxena
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> AggregateWordCount rather than counting  the words, gives a single line 
> output counting the number of rows
> Wrong Result Looks Like:
> {noformat}
> hadoop-3.4.0-SNAPSHOT % bin/hdfs dfs -cat /testOut1/part-r-00000    
> record_count 2
> {noformat}
> Correct Should Look Like:
> {noformat}
> hadoop-3.4.0-SNAPSHOT % bin/hdfs dfs -cat /testOut1/part-r-00000              
>                                                                              
> Bye   1
> Goodbye       1
> Hadoop        2
> Hello 2
> World 2
> {noformat}



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Work logged] (MAPREDUCE-7376) AggregateWordCount fetches wrong results

Reply via email to