[ 
https://issues.apache.org/jira/browse/ARROW-8766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joris Van den Bossche updated ARROW-8766:
-----------------------------------------
    Labels: dataset-dask-integration filesystem  (was: filesystem)

> [Python] A FileSystem implementation based on Python callbacks
> --------------------------------------------------------------
>
>                 Key: ARROW-8766
>                 URL: https://issues.apache.org/jira/browse/ARROW-8766
>             Project: Apache Arrow
>          Issue Type: Sub-task
>          Components: Python
>            Reporter: Joris Van den Bossche
>            Priority: Major
>              Labels: dataset-dask-integration, filesystem
>
> The new {{pyarrow.fs}} filesystems are now actual C++ objects, and no longer 
> "just" a python interface. So they can't easily be expanded from the Python 
> side, and the existing integration with {{fsspec}} filesystems is therefore 
> also not working anymore. 
> One possible solution is  to have a C++ filesystem that calls back into a 
> python object for each of its methods (possibly similar to how you can 
> implement a flight server in Python, I suppose). 
> Such a FileSystem implementation would allow to make a {{pyarrow.fs}} wrapper 
> for {{fsspec}} filesystems, and thus allow such filesystems to be used in 
> pyarrow where new filesystems are expected.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to