[ https://issues.apache.org/jira/browse/KNOX-971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Larry McCay updated KNOX-971:
-----------------------------
    Description: 
When issuing a CREATE for a filename that contains special characters, using a 
command like the following:

{code}
curl -i -L -u guest:guest-password \
  "https://localhost:8443/webhdfs/v1/user/shfs3453/WORK/datasets/test_électronique_embarqué.pdf?op=CREATE"
{code}

The file is created and written successfully, and it can be retrieved.
However, the resulting filename within HDFS is actually 
"test_�lectronique_embarqu�.pdf". When the same filename is used from the dfs 
CLI, the name is stored correctly in HDFS.

Moreover, trying to retrieve the properly named file through the gateway 
results in a 404.
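
For comparison, here is a minimal sketch of what a UTF-8 percent-encoded form of the same path would look like if the client encoded it before sending. This is an illustration only: the helper name encode_webhdfs_path is hypothetical, the /user/admin/ prefix is taken from the audit log, and nothing in this report establishes that sending the name this way avoids the mangling.

```python
from urllib.parse import quote


def encode_webhdfs_path(path: str) -> str:
    """Percent-encode a path as UTF-8, leaving '/' separators intact.

    Hypothetical helper for illustration; not part of Knox or WebHDFS.
    """
    return quote(path, safe="/", encoding="utf-8")


# The accented characters are written as escapes so the source file's own
# encoding cannot affect the result (U+00E9 is "e with acute accent").
name = "test_\u00e9lectronique_embarqu\u00e9.pdf"
encoded = encode_webhdfs_path("/user/admin/" + name)
print(encoded)
```

Each accented character becomes the two-byte sequence %C3%A9 in this form, which is what a correctly encoded request for the properly named file would be expected to carry.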

{code}
17/06/17 10:28:41 ||349b43e9-f449-4c5f-9dfb-f84e7ef943a2|audit|127.0.0.1|WEBHDFS||||access|uri|/gateway/sandbox/webhdfs/v1//user/admin/test_�lectronique_embarqu�.pdf?op=CREATE|unavailable|Request method: PUT
17/06/17 10:28:41 ||349b43e9-f449-4c5f-9dfb-f84e7ef943a2|audit|127.0.0.1|WEBHDFS|guest|||authentication|uri|/gateway/sandbox/webhdfs/v1//user/admin/test_�lectronique_embarqu�.pdf?op=CREATE|success|
17/06/17 10:28:41 ||349b43e9-f449-4c5f-9dfb-f84e7ef943a2|audit|127.0.0.1|WEBHDFS|guest|||authentication|uri|/gateway/sandbox/webhdfs/v1//user/admin/test_�lectronique_embarqu�.pdf?op=CREATE|success|Groups: []
17/06/17 10:28:41 ||349b43e9-f449-4c5f-9dfb-f84e7ef943a2|audit|127.0.0.1|WEBHDFS|guest|hdfs||identity-mapping|principal|guest|success|Effective User: hdfs
17/06/17 10:28:41 ||349b43e9-f449-4c5f-9dfb-f84e7ef943a2|audit|127.0.0.1|WEBHDFS|guest|hdfs||dispatch|uri|http://c6401.ambari.apache.org:50070/webhdfs/v1/user/admin/test_�lectronique_embarqu�.pdf?op=CREATE&user.name=hdfs|unavailable|Request method: PUT
17/06/17 10:28:41 ||349b43e9-f449-4c5f-9dfb-f84e7ef943a2|audit|127.0.0.1|WEBHDFS|guest|hdfs||dispatch|uri|http://c6401.ambari.apache.org:50070/webhdfs/v1/user/admin/test_�lectronique_embarqu�.pdf?op=CREATE&user.name=hdfs|success|Response status: 307
17/06/17 10:28:41 ||349b43e9-f449-4c5f-9dfb-f84e7ef943a2|audit|127.0.0.1|WEBHDFS|guest|hdfs||access|uri|/gateway/sandbox/webhdfs/v1//user/admin/test_�lectronique_embarqu�.pdf?op=CREATE|success|Response status: 307
17/06/17 10:28:41 ||82c97677-3b7b-4c19-bfea-e82fc149fc30|audit|127.0.0.1|WEBHDFS||||access|uri|/gateway/sandbox/webhdfs/data/v1/webhdfs/v1/user/admin/test_�lectronique_embarqu�.pdf?_=AAAACAAAABAAAACwKFV5ruVkBa7y6-HR3hqRqWFrapQYx523sBG1Vkvfg88gfaoAs4u2AZcbpm-KRYVqgQanuBkZFPyA4lxqwzptXGis5FNuQjk3fTHxEfGvsqGP2TbVQL24MT59dxszeVqwGxLrPS8SCLruYA5XmCEYt4Zhbty5IPdZFXikUc0aqolHSeafnc9j0gkrBBzfbUexrTSMMY26-Su8622oG5bFPlTWH1klTZ7vx_gRIwb6IdHhUnRwGgamJ3CpwArtXAJ9a9J8JHClvvk|unavailable|Request method: PUT
17/06/17 10:28:41 ||82c97677-3b7b-4c19-bfea-e82fc149fc30|audit|127.0.0.1|WEBHDFS|guest|||authentication|uri|/gateway/sandbox/webhdfs/data/v1/webhdfs/v1/user/admin/test_�lectronique_embarqu�.pdf?_=AAAACAAAABAAAACwKFV5ruVkBa7y6-HR3hqRqWFrapQYx523sBG1Vkvfg88gfaoAs4u2AZcbpm-KRYVqgQanuBkZFPyA4lxqwzptXGis5FNuQjk3fTHxEfGvsqGP2TbVQL24MT59dxszeVqwGxLrPS8SCLruYA5XmCEYt4Zhbty5IPdZFXikUc0aqolHSeafnc9j0gkrBBzfbUexrTSMMY26-Su8622oG5bFPlTWH1klTZ7vx_gRIwb6IdHhUnRwGgamJ3CpwArtXAJ9a9J8JHClvvk|success|
17/06/17 10:28:41 ||82c97677-3b7b-4c19-bfea-e82fc149fc30|audit|127.0.0.1|WEBHDFS|guest|||authentication|uri|/gateway/sandbox/webhdfs/data/v1/webhdfs/v1/user/admin/test_�lectronique_embarqu�.pdf?_=AAAACAAAABAAAACwKFV5ruVkBa7y6-HR3hqRqWFrapQYx523sBG1Vkvfg88gfaoAs4u2AZcbpm-KRYVqgQanuBkZFPyA4lxqwzptXGis5FNuQjk3fTHxEfGvsqGP2TbVQL24MT59dxszeVqwGxLrPS8SCLruYA5XmCEYt4Zhbty5IPdZFXikUc0aqolHSeafnc9j0gkrBBzfbUexrTSMMY26-Su8622oG5bFPlTWH1klTZ7vx_gRIwb6IdHhUnRwGgamJ3CpwArtXAJ9a9J8JHClvvk|success|Groups: []
17/06/17 10:28:41 ||82c97677-3b7b-4c19-bfea-e82fc149fc30|audit|127.0.0.1|WEBHDFS|guest|hdfs||identity-mapping|principal|guest|success|Effective User: hdfs
17/06/17 10:28:41 ||82c97677-3b7b-4c19-bfea-e82fc149fc30|audit|127.0.0.1|WEBHDFS|guest|hdfs||dispatch|uri|http://c6401.ambari.apache.org:50075/webhdfs/v1/user/admin/test_%EF%BF%BDlectronique_embarqu%EF%BF%BD.pdf?op=CREATE&namenoderpcaddress=c6401.ambari.apache.org%3A8020&user.name=hdfs&createflag&createparent=true&overwrite=false|unavailable|Request method: PUT
17/06/17 10:28:41 ||82c97677-3b7b-4c19-bfea-e82fc149fc30|audit|127.0.0.1|WEBHDFS|guest|hdfs||dispatch|uri|http://c6401.ambari.apache.org:50075/webhdfs/v1/user/admin/test_%EF%BF%BDlectronique_embarqu%EF%BF%BD.pdf?op=CREATE&namenoderpcaddress=c6401.ambari.apache.org%3A8020&user.name=hdfs&createflag&createparent=true&overwrite=false|success|Response status: 201
17/06/17 10:28:41 ||82c97677-3b7b-4c19-bfea-e82fc149fc30|audit|127.0.0.1|WEBHDFS|guest|hdfs||access|uri|/gateway/sandbox/webhdfs/data/v1/webhdfs/v1/user/admin/test_�lectronique_embarqu�.pdf?_=AAAACAAAABAAAACwKFV5ruVkBa7y6-HR3hqRqWFrapQYx523sBG1Vkvfg88gfaoAs4u2AZcbpm-KRYVqgQanuBkZFPyA4lxqwzptXGis5FNuQjk3fTHxEfGvsqGP2TbVQL24MT59dxszeVqwGxLrPS8SCLruYA5XmCEYt4Zhbty5IPdZFXikUc0aqolHSeafnc9j0gkrBBzfbUexrTSMMY26-Su8622oG5bFPlTWH1klTZ7vx_gRIwb6IdHhUnRwGgamJ3CpwArtXAJ9a9J8JHClvvk|success|Response status: 201
17/06/17 10:28:47 ||78217ce2-1302-4dcd-a1a9-9e5a7548e552|audit|127.0.0.1|WEBHDFS||||access|uri|/gateway/sandbox/webhdfs/v1/user/admin/?op=LISTSTATUS|unavailable|Request method: GET
17/06/17 10:28:47 ||78217ce2-1302-4dcd-a1a9-9e5a7548e552|audit|127.0.0.1|WEBHDFS|guest|||authentication|uri|/gateway/sandbox/webhdfs/v1/user/admin/?op=LISTSTATUS|success|
17/06/17 10:28:47 ||78217ce2-1302-4dcd-a1a9-9e5a7548e552|audit|127.0.0.1|WEBHDFS|guest|||authentication|uri|/gateway/sandbox/webhdfs/v1/user/admin/?op=LISTSTATUS|success|Groups: []
17/06/17 10:28:47 ||78217ce2-1302-4dcd-a1a9-9e5a7548e552|audit|127.0.0.1|WEBHDFS|guest|hdfs||identity-mapping|principal|guest|success|Effective User: hdfs
17/06/17 10:28:47 ||78217ce2-1302-4dcd-a1a9-9e5a7548e552|audit|127.0.0.1|WEBHDFS|guest|hdfs||dispatch|uri|http://c6401.ambari.apache.org:50070/webhdfs/v1/user/admin?op=LISTSTATUS&user.name=hdfs|unavailable|Request method: GET
17/06/17 10:28:47 ||78217ce2-1302-4dcd-a1a9-9e5a7548e552|audit|127.0.0.1|WEBHDFS|guest|hdfs||dispatch|uri|http://c6401.ambari.apache.org:50070/webhdfs/v1/user/admin?op=LISTSTATUS&user.name=hdfs|success|Response status: 200
17/06/17 10:28:47 ||78217ce2-1302-4dcd-a1a9-9e5a7548e552|audit|127.0.0.1|WEBHDFS|guest|hdfs||access|uri|/gateway/sandbox/webhdfs/v1/user/admin/?op=LISTSTATUS|success|Response status: 200
{code}
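
The dispatch URI sent to port 50075 in the audit log carries %EF%BF%BD where each accented character should be. The sketch below decodes that logged name to show it is the Unicode replacement character (U+FFFD), then reproduces the same string under one assumed mechanism: the accented characters arriving as single Latin-1 bytes and being decoded as UTF-8 with replacement. That mechanism is a guess for illustration, not a diagnosed root cause, and the variable names are arbitrary.

```python
from urllib.parse import quote, unquote

# File name exactly as it appears in the logged dispatch URI.
logged = "test_%EF%BF%BDlectronique_embarqu%EF%BF%BD.pdf"

# unquote() decodes percent-escapes as UTF-8 by default.
decoded = unquote(logged)
print([hex(ord(ch)) for ch in decoded if ord(ch) > 127])

# Assumption: the intended name reaches some component as Latin-1 bytes
# (0xE9 for each accented character) and is then decoded as UTF-8.
intended = "test_\u00e9lectronique_embarqu\u00e9.pdf"
raw = intended.encode("latin-1")

# 0xE9 followed by an ASCII byte is not valid UTF-8, so each occurrence
# is replaced by U+FFFD and the original character is lost.
lossy = raw.decode("utf-8", errors="replace")

# Re-encoding the lossy string yields the name seen in the log.
print(quote(lossy) == logged)
```

Once a character has been replaced by U+FFFD the original byte cannot be recovered, which would be consistent with the properly named file not being reachable through the gateway.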

> Putting files with special characters in the name mangles the name in HDFS
> --------------------------------------------------------------------------
>
>                 Key: KNOX-971
>                 URL: https://issues.apache.org/jira/browse/KNOX-971
>             Project: Apache Knox
>          Issue Type: Bug
>          Components: Server
>            Reporter: Larry McCay
>            Assignee: Larry McCay
>             Fix For: 0.13.0



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
