[ 
https://issues.apache.org/jira/browse/HADOOP-2863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Raghu Angadi updated HADOOP-2863:
---------------------------------

      Description: 
Why does FSDataOutputStream.close() call flush()? This stream itself does not 
store any data that it needs to flush. It is a wrapper and it should just 
invoke its outputstream's close().

For. e.g one bad side effect is that, in the case of DFSOutputStream which 
extends FSOutputSummer, flush() inside close sends the current data even though 
FSOutputSummer might have some data.. this left over data will be sent in side 
close() (so it sends data in two different packets instead of one). Other 
filesystems might have similar side effects.

I will submit a patch.



  was:

Why does FSDataOutputStream.close() call flush()? This stream itself does not 
store any data that it needs to flush. It is a wrapper and it should just 
invoke its outputstream's close().

For. e.g one bad side effect is that, in the case of DFSOutputStream which 
extends FSOutputSummer, flush() inside close sends the current data even though 
FSOutputSummer might have some data.. this left over data will be sent in side 
close() (so it sends data in two different packets instead of one). Other 
filesystems might have similar side effects.

I will submit a patch.



    Fix Version/s:     (was: 0.17.0)

bq. For. e.g one bad side effect is that, in the case of DFSOutputStream which 
extends FSOutputSummer, flush() inside close sends the current data even though 
FSOutputSummer might have some data.. this left over data will be sent in side 
close() (so it sends data in two different packets instead of one). Other 
filesystems might have similar side effects.

I am fixing the above as part of HADOOP-1702 by renaming 
{{DFSOutputStream.flush()}} to {{flushInternal()}} (because it is not a normal 
flush() and needs to be called only inside close()).

> FSDataOutputStream should not flush() inside close().
> -----------------------------------------------------
>
>                 Key: HADOOP-2863
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2863
>             Project: Hadoop Core
>          Issue Type: Bug
>          Components: fs
>            Reporter: Raghu Angadi
>            Assignee: Raghu Angadi
>         Attachments: HADOOP-2863.patch, HADOOP-2863.patch
>
>
> Why does FSDataOutputStream.close() call flush()? This stream itself does not 
> store any data that it needs to flush. It is a wrapper and it should just 
> invoke its outputstream's close().
> For. e.g one bad side effect is that, in the case of DFSOutputStream which 
> extends FSOutputSummer, flush() inside close sends the current data even 
> though FSOutputSummer might have some data.. this left over data will be sent 
> in side close() (so it sends data in two different packets instead of one). 
> Other filesystems might have similar side effects.
> I will submit a patch.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to