Antoine Pitrou created ARROW-4637: ------------------------------------- Summary: [Python] Avoid importing Pandas unless necessary Key: ARROW-4637 URL: https://issues.apache.org/jira/browse/ARROW-4637 Project: Apache Arrow Issue Type: Improvement Components: Python Affects Versions: 0.12.0 Reporter: Antoine Pitrou
Importing PyArrow is more than twice slower when Pandas is installed: {code} $ time python -c "import pyarrow" real 0m0,360s user 0m0,305s sys 0m0,037s $ time python -c "import sys; sys.modules['pandas'] = None; import pyarrow" real 0m0,144s user 0m0,124s sys 0m0,020s {code} We should only import Pandas when necessary, e.g. when asked to ingest or create Pandas data. -- This message was sent by Atlassian JIRA (v7.6.3#76005)