[
https://issues.apache.org/jira/browse/ARROW-12506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated ARROW-12506:
-----------------------------------
Labels: pull-request-available (was: )
> [Python] Improve modularity of pyarrow codebase to speedup compile time
> -----------------------------------------------------------------------
>
> Key: ARROW-12506
> URL: https://issues.apache.org/jira/browse/ARROW-12506
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Python
> Reporter: Alessandro Molina
> Assignee: Alessandro Molina
> Priority: Minor
> Labels: pull-request-available
> Time Spent: 10m
> Remaining Estimate: 0h
>
> There are some modules in pyarrow that end up being fairly big to compile
> because they are mostly based on including other {{pxi}} / {{pxd}} files.
> That means that when a change to those files is done a big module has to be
> recompiled slowing down the development process when experimenting (seems
> it's not uncommon that when a change is done it takes less time to recompile
> {{libarrow}} than {{pyarrow}})
> It would be convenient to divide those into separate modules that can lead to
> separate object files which would allow the compiler to recompile smaller
> chunks at the time, so that when a change is done we don't have to recompile
> the whole `lib.pyx` but can just recompile the module where the change is
> isolated to.
> The goal is to allow faster iteration over pyarrow by reducing time spent on
> waiting for cython compilation on each change.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)