[ 
https://issues.apache.org/jira/browse/ARROW-8998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney updated ARROW-8998:
--------------------------------
    Description: 
Since in the relatively near future, one will be able to do non-trivial 
analytical operations and query processing natively on Arrow data structures 
through pyarrow, it does not make sense to require users to always install 
NumPy when they install pyarrow. I propose to split the NumPy-depending parts 
of libarrow_python into a libarrow_numpy (which also must be bundled) and 
moving this part of the codebase into a separate Cython module.

This refactoring should be relatively painless though there may be a number of 
packaging details to chase up since this would introduce a new shared library 
to be installed in various packaging targets. 

  was:
Since in the relatively near future, one will be able to do non-trivial 
analytical operations and query processing natively on Arrow data structures 
through pyarrow, it does not make sense to require users to always install 
NumPy when that install pyarrow. I propose to split the NumPy-depending parts 
of libarrow_python into a libarrow_numpy (which also must be bundled) and 
moving this part of the codebase into a separate Cython module.

This refactoring should be relatively painless though there may be a number of 
packaging details to chase up since this would introduce a new shared library 
to be installed in various packaging targets. 


> [Python] Make NumPy an optional runtime dependency
> --------------------------------------------------
>
>                 Key: ARROW-8998
>                 URL: https://issues.apache.org/jira/browse/ARROW-8998
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: Python
>            Reporter: Wes McKinney
>            Priority: Major
>
> Since in the relatively near future, one will be able to do non-trivial 
> analytical operations and query processing natively on Arrow data structures 
> through pyarrow, it does not make sense to require users to always install 
> NumPy when they install pyarrow. I propose to split the NumPy-depending parts 
> of libarrow_python into a libarrow_numpy (which also must be bundled) and 
> moving this part of the codebase into a separate Cython module.
> This refactoring should be relatively painless though there may be a number 
> of packaging details to chase up since this would introduce a new shared 
> library to be installed in various packaging targets. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to