Alessandro Molina created ARROW-12506:
-----------------------------------------
Summary: [Python] Improve modularity of pyarrow codebase to
speedup compile time
Key: ARROW-12506
URL: https://issues.apache.org/jira/browse/ARROW-12506
Project: Apache Arrow
Issue Type: Improvement
Components: Python
Reporter: Alessandro Molina
There are some modules in pyarrow that end up being fairly big to compile
because they are mostly based on including other `pxi` / `pxd` files.
That means that when a change to those files is done a big module has to be
recompiled slowing down the development process when experimenting (seems it's
not uncommon that when a change is done it takes less time to recompile
`libarrow` than `pyarrow` )
It would be convenient to divide those into separate modules that can lead to
separate object files which would allow the compiler to recompile smaller
chunks at the time, so that when a change is done we don't have to recompile
the whole `lib.pyx` but can just recompile the module where the change is
isolated to.
The goal is to allow faster iteration over pyarrow by reducing time spent on
waiting for cython compilation on each change.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)