NikitaMatskevich opened a new pull request, #2683:
URL: https://github.com/apache/iceberg-python/pull/2683
# Rationale for this change
Starting with version 20, PyArrow supports Azure filesystems.
Azure table locations typically have this format:
`abfss://<bucket_name>@<account_name>.<dfs|blob>.core.windows.net/<namespace>/<table>/<file_path>`.
When creating a PyArrowFile, we simply take the table location and append
the table-relative path to it. The resulting path contains the
`@<account_name>.<dfs|blob>.core.windows.net` part, which the PyArrow
library cannot read or write. This part has to be stripped from Azure URIs.
The proposed fix is only meant to start a conversation about the issue. I am not
100% sure how and where this should be fixed.
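
To make the discussion concrete, here is a minimal sketch of the kind of path rewriting described above. It is not the code in this PR: `strip_azure_account_host` is a hypothetical helper name, and the assumption that PyArrow wants a `<bucket_name>/<path>` style path is mine and should be verified against the PyArrow Azure filesystem documentation.

```python
from urllib.parse import urlparse

# Schemes treated as Azure here; this set is an assumption for illustration.
AZURE_SCHEMES = {"abfs", "abfss", "wasb", "wasbs"}


def strip_azure_account_host(location: str) -> str:
    """Hypothetical helper: drop '@<account_name>.<dfs|blob>.core.windows.net'.

    Returns '<bucket_name>/<rest of path>' for Azure URIs and leaves
    every other location untouched.
    """
    parsed = urlparse(location)
    if parsed.scheme not in AZURE_SCHEMES:
        return location
    # netloc is '<bucket_name>@<account_name>...' on real Azure and just
    # '<bucket_name>' for Azurite-style URIs; keep only the part before '@'.
    bucket = parsed.netloc.partition("@")[0]
    return f"{bucket}{parsed.path}"


if __name__ == "__main__":
    uri = "abfss://warehouse@myaccount.dfs.core.windows.net/ns/tbl/data/f.parquet"
    print(strip_azure_account_host(uri))
```

Azurite-style locations without the `@...` part pass through the same code path, so both cases end up with the same path shape.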
## Are these changes tested?
This is hard to test, because it works fine with Azurite (unlike "real" Azure,
Azurite does not have this part in its URIs). Do you have any ideas for an
integration test?
We know this issue does not occur with fsspec.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]