[jira] [Created] (ARROW-8984) [R] Revise install guides now that Windows conda package exists

2020-05-29 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-8984: -- Summary: [R] Revise install guides now that Windows conda package exists Key: ARROW-8984 URL: https://issues.apache.org/jira/browse/ARROW-8984 Project: Apache Arr

Re: Why downloading sources of pyarrow and its requirements takes several minutes?

2020-05-29 Thread Valentyn Tymofieiev
Thanks for the input. Opened https://issues.apache.org/jira/browse/ARROW-8983, we can continue the conversation there. On Thu, May 28, 2020 at 2:46 PM Valentyn Tymofieiev wrote: > Hi Arrow dev community, > > Do you have any insight why > > python -m pip download --dest /tmp pyarrow==0.

[jira] [Created] (ARROW-8983) Downloading sources of pyarrow and its requirements from pypi takes several minutes starting from 0.16.0

2020-05-29 Thread Valentyn Tymofieiev (Jira)
Valentyn Tymofieiev created ARROW-8983: -- Summary: Downloading sources of pyarrow and its requirements from pypi takes several minutes starting from 0.16.0 Key: ARROW-8983 URL: https://issues.apache.org/jira/b

Re: Why downloading sources of pyarrow and its requirements takes several minutes?

2020-05-29 Thread Antoine Pitrou
PyArrow has always required Numpy, so this sounds like a red herring. If Numpy wasn't downloaded as part of source dependencies before, it was certainly a bug. Regards Antoine. Le 29/05/2020 à 18:29, Wes McKinney a écrit : > It's possible it's related to > > https://github.com/apache/arrow/c

Re: Why downloading sources of pyarrow and its requirements takes several minutes?

2020-05-29 Thread Wes McKinney
It's possible it's related to https://github.com/apache/arrow/commit/6a583e553de28e3341987911bb63fc19f99a6fb0#diff-23eeeb4347bdd26bfc6b7ee9a3b755dd Is the issue still present with 0.17.0 or 0.17.1? In any case please do open an issue if it is not resolved in master and/or the latest releases. On

Re: Why downloading sources of pyarrow and its requirements takes several minutes?

2020-05-29 Thread Brian Hulette
+1 fo a jira to track this. I looked into it a little bit just out of curiosity. I passed --verbose to pip to get insight into what's going on in in the "Installing build dependencies..." step. I did this for both 0.15.1 and 0.16. They took 4:10 and 5:57 respectively. It looks like 0.16.0 spent 2

[jira] [Created] (ARROW-8982) [CI] Remove allow_failures for s390x in TravisCI

2020-05-29 Thread Kazuaki Ishizaki (Jira)
Kazuaki Ishizaki created ARROW-8982: --- Summary: [CI] Remove allow_failures for s390x in TravisCI Key: ARROW-8982 URL: https://issues.apache.org/jira/browse/ARROW-8982 Project: Apache Arrow I

[jira] [Created] (ARROW-8981) [C++][Dataset] Add support for compressed FileSources

2020-05-29 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-8981: --- Summary: [C++][Dataset] Add support for compressed FileSources Key: ARROW-8981 URL: https://issues.apache.org/jira/browse/ARROW-8981 Project: Apache Arrow Issu

Re: Why downloading sources of pyarrow and its requirements takes several minutes?

2020-05-29 Thread Wes McKinney
hi Valentyn, This is the first I've ever heard of anyone doing what you are doing, so safe to say that we've given little to no consideration to this use case. We have been focused on providing binary packages for pip and conda. Could you please open a JIRA and provide more detailed information ab

[jira] [Created] (ARROW-8980) [Python] Metadata grows exponentially when using schema from disk

2020-05-29 Thread Kevin Glasson (Jira)
Kevin Glasson created ARROW-8980: Summary: [Python] Metadata grows exponentially when using schema from disk Key: ARROW-8980 URL: https://issues.apache.org/jira/browse/ARROW-8980 Project: Apache Arrow

[NIGHTLY] Arrow Build Report for Job nightly-2020-05-29-0

2020-05-29 Thread Crossbow
Arrow Build Report for Job nightly-2020-05-29-0 All tasks: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-05-29-0 Failed Tasks: - conda-linux-gcc-py36: URL: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-05-29-0-azure-conda-linux-gcc-py36 - cond