[jira] [Created] (MAPREDUCE-7158) Inefficient Flush Logic in JobHistory EventWriter

Zichen Sun (JIRA) Mon, 12 Nov 2018 11:26:30 -0800

Zichen Sun created MAPREDUCE-7158:
-------------------------------------

             Summary: Inefficient Flush Logic in JobHistory EventWriter
                 Key: MAPREDUCE-7158
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-7158
             Project: Hadoop Map/Reduce
          Issue Type: Bug
            Reporter: Zichen Sun



In HDFS, if flush is implemented to send a server request that actually 
commits the pending writes on the storage service side, benchmark runs show 
MR jobs taking much longer. From our investigation, the current 
implementation for writing events doesn't look right:
EventWriter#write()
The flush call in this method is redundant and the statement should be 
removed. It defeats the purpose of having a separate flush function.
Encoder.flush calls flush on the underlying output stream, so every event 
written also flushes that stream.
After applying the fix, the MR jobs complete normally. Please find the patch 
attached.
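
To illustrate the cost, here is a minimal, self-contained sketch. It is not 
the Hadoop EventWriter code and not the attached patch. CountingStream, 
SketchEventWriter and countFlushes are hypothetical names made up for this 
example, and the counter stands in for a stream whose flush would be a 
server request. It only compares how many stream flushes happen when write() 
flushes on every event versus when flushing is left to the separate flush().

```java
import java.io.ByteArrayOutputStream;
import java.io.FilterOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;

public class FlushSketch {

    /** Counts flush() calls; stands in for a stream whose flush is costly. */
    static final class CountingStream extends FilterOutputStream {
        int flushCount = 0;

        CountingStream(OutputStream out) {
            super(out);
        }

        @Override
        public void write(byte[] b, int off, int len) throws IOException {
            out.write(b, off, len);
        }

        @Override
        public void flush() throws IOException {
            flushCount++;
            super.flush();
        }
    }

    /** Hypothetical event writer (assumption: NOT the real Hadoop class). */
    static final class SketchEventWriter {
        private final OutputStream out;
        private final boolean flushOnEveryWrite;

        SketchEventWriter(OutputStream out, boolean flushOnEveryWrite) {
            this.out = out;
            this.flushOnEveryWrite = flushOnEveryWrite;
        }

        void write(String event) throws IOException {
            out.write((event + "\n").getBytes(StandardCharsets.UTF_8));
            if (flushOnEveryWrite) {
                out.flush(); // the per-event flush the report calls redundant
            }
        }

        void flush() throws IOException {
            out.flush(); // the separate, explicit flush
        }
    }

    /** Writes the given number of events, flushes once, returns flush count. */
    static int countFlushes(boolean flushOnEveryWrite, int events) {
        try {
            CountingStream stream = new CountingStream(new ByteArrayOutputStream());
            SketchEventWriter writer = new SketchEventWriter(stream, flushOnEveryWrite);
            for (int i = 0; i < events; i++) {
                writer.write("event-" + i);
            }
            writer.flush();
            return stream.flushCount;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println("flush in write(): " + countFlushes(true, 1000));
        System.out.println("explicit flush only: " + countFlushes(false, 1000));
    }
}
```

With 1000 events, the first variant flushes the stream 1001 times and the 
second flushes it once. If each flush is a round trip to the storage service, 
that difference is what would show up as longer job times.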



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

