[jira] [Created] (HADOOP-14450) ADLS Python client inconsistent when used in tandem with AdlFileSystem

2017-05-23 Thread Sailesh Mukil (JIRA)
Sailesh Mukil created HADOOP-14450:
--

 Summary: ADLS Python client inconsistent when used in tandem with 
AdlFileSystem
 Key: HADOOP-14450
 URL: https://issues.apache.org/jira/browse/HADOOP-14450
 Project: Hadoop Common
  Issue Type: Bug
  Components: fs/adl
Reporter: Sailesh Mukil


Impala uses the AdlFileSystem connector to talk to ADLS. As a part of the 
Impala tests, we drop tables and verify that the files belonging to that table 
have been dropped for all filesystems that Impala supports. These tests 
however, fail with ADLS.

If I use the Hadoop ADLS connector to delete a file, and then list the parent 
directory of that file using the above Python client within the second, the 
client still says that the file is available in ADLS.

This is the Python client from Microsoft that we're using in our testing:
https://github.com/Azure/azure-data-lake-store-python

Their release notes say that it's still a "pre-release preview":
https://github.com/Azure/azure-data-lake-store-python/releases

Questions for the ADLS folks:

Is this a known issue? If so, will it be fixed soon?
Or is this expected behavior?

I'm able to deterministically reproduce it in my tests, with Impala on ADLS.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org



[jira] [Created] (HADOOP-14437) AdlFileSystem.getAclStatus() returns unexpected ACL information

2017-05-18 Thread Sailesh Mukil (JIRA)
Sailesh Mukil created HADOOP-14437:
--

 Summary: AdlFileSystem.getAclStatus() returns unexpected ACL 
information
 Key: HADOOP-14437
 URL: https://issues.apache.org/jira/browse/HADOOP-14437
 Project: Hadoop Common
  Issue Type: Bug
  Components: fs/adl
Reporter: Sailesh Mukil


The HDFS getAclStatus() returns ACL information of a file and how that pertains 
to a Hadoop user/group.

However, the ADLS getAclStatus() returns ACL information of a file but does not 
map them to Hadoop users/groups and instead maps them to the client ID of the 
SPI.

The components built around Hadoop use this API with the expectation that the 
ACL information returned will be mapped to Hadoop users/groups, i.e. the 
components expect it to have the same behavior as other filesystems that 
support this API.
If not, the components need to have logic to handle this case where if they're 
talking to ADLS, they won't look for matching the Hadoop user/group, but 
instead the client ID. And once AdlFileSystem changes its API to be able to map 
to Hadoop users/groups, it will be a breaking change for all the components.

Until such functionality is able to be provided, I would suggest that the 
getAclStatus() be unsupported, just as it is in S3AFileSystem.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org