[ 
https://issues.apache.org/jira/browse/HDFS-15972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17319870#comment-17319870
 ] 

Felix N commented on HDFS-15972:
--------------------------------

Hi [~LiJinglun], is this the expected behavior? During heavy write period, this 
might lead to data loss.

> Fedbalance only copies data partially when there's existing opened file
> -----------------------------------------------------------------------
>
>                 Key: HDFS-15972
>                 URL: https://issues.apache.org/jira/browse/HDFS-15972
>             Project: Hadoop HDFS
>          Issue Type: Bug
>            Reporter: Felix N
>            Priority: Major
>
> If there are opened files when fedbalance is run and data is being written to 
> these files, fedbalance might skip the newly written data.
> Steps to recreate the issue:
>  # Create a dummy file /test/file with some data: {{echo "start" | hdfs dfs 
> -appendToFile /test/file}}
>  # Start writing to the file: {{hdfs dfs -appendToFile /test/file}} but do 
> not stop writing
>  # Run fedbalance: {{hadoop fedbalance submit hdfs://ns1/test 
> hdfs://ns2/test}}
>  # Write something to the file while fedbalance is running, "end" for 
> example, then stop writing
>  # After fedbalance is done, {{hdfs://ns2/test/file}} should only contain 
> "start" while {{hdfs://ns1/user/hadoop/.Trash/Current/test/file}} contains 
> "start\nend"
> Fedbalance is run with default configs and arguments so no diff should happen.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

Reply via email to