[ 
https://issues.apache.org/jira/browse/HDFS-17134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shuyan Zhang updated HDFS-17134:
--------------------------------
    Description: 
The result of `getListing` in NameNode are sorted based on `byte[]`, while the 
Router side is based on `String`. If there are special characters in path, the 
sorting result of the router will be inconsistent with the namenode. This may 
result in duplicate `getListing` results obtained by the client due to wrong 
`startAfter` parameter.
For exemple, namenode returns [path1, path2, path3] for a `getListing` request, 
while router returns [path1, path3, path2] to client. Then client will pass 
`path2` as `startAfter`  at the next iteration, so it will receive `path3` 
again.
We need to fix the Router code so that the order of its result is the same as 
NameNode.

  was:
The result of `getListing` in NameNode are sorted based on `byte[]`, while the 
Router side is based on `String`. If there are special characters in path, the 
sorting result of the router is inconsistent with the namenode. This may result 
in duplicate `getListing` results obtained by the client due to wrong 
`startAfter` parameter.

For exemple, namenode returns [path1, path2, path3], while router returns 
[path1, path3, path2] to client. Then client will pass `startAfter` as `path2` 
at the next iteration, so it will receive `path3` again.

We need to fix the Router code so that the order of its results is the same as 
NameNode.


> RBF: Fix duplicate results of getListing through Router.
> --------------------------------------------------------
>
>                 Key: HDFS-17134
>                 URL: https://issues.apache.org/jira/browse/HDFS-17134
>             Project: Hadoop HDFS
>          Issue Type: Bug
>            Reporter: Shuyan Zhang
>            Assignee: Shuyan Zhang
>            Priority: Major
>              Labels: pull-request-available
>
> The result of `getListing` in NameNode are sorted based on `byte[]`, while 
> the Router side is based on `String`. If there are special characters in 
> path, the sorting result of the router will be inconsistent with the 
> namenode. This may result in duplicate `getListing` results obtained by the 
> client due to wrong `startAfter` parameter.
> For exemple, namenode returns [path1, path2, path3] for a `getListing` 
> request, while router returns [path1, path3, path2] to client. Then client 
> will pass `path2` as `startAfter`  at the next iteration, so it will receive 
> `path3` again.
> We need to fix the Router code so that the order of its result is the same as 
> NameNode.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to