Alessandro Molina created ARROW-12506:
-----------------------------------------

             Summary: [Python] Improve modularity of pyarrow codebase to 
speedup compile time
                 Key: ARROW-12506
                 URL: https://issues.apache.org/jira/browse/ARROW-12506
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Python
            Reporter: Alessandro Molina


There are some modules in pyarrow that end up being fairly big to compile 
because they are mostly based on including other `pxi` / `pxd` files.

That means that when a change to those files is done a big module has to be 
recompiled slowing down the development process when experimenting (seems it's 
not uncommon that when a change is done it takes less time to recompile 
`libarrow` than `pyarrow` )

It would be convenient to divide those into separate modules that can lead to 
separate object files which would allow the compiler to recompile smaller 
chunks at the time, so that when a change is done we don't have to recompile 
the whole `lib.pyx` but can just recompile the module where the change is 
isolated to.

The goal is to allow faster iteration over pyarrow by reducing time spent on 
waiting for cython compilation on each change.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to