Re: [jira] [Created] (ARROW-6793) [R] Arrow C++ binary packaging for Linux

2019-10-05 Thread Wes McKinney
hi Thomas -- can you reply on the JIRA (ARROW-6793) or start a new thread? Thanks On Fri, Oct 4, 2019 at 4:53 PM Thomas S wrote: > > Very recently I had the pleasure to install arrow on Linux. At this stage > let me first remark that without the help of @xhochy and @kou I certainly > would have f

[DISCUSS] Dictionary Encoding Clarifications/Future Proofing

2019-10-05 Thread Micah Kornfield
I've opened a pull request [1] to clarify some recent conversations about semantics/edge cases for dictionary encoding [2][3] around interleaved batches and when isDelta=False. Specifically, it proposes isDelta=False indicates dictionary replacement. For the file format, only one isDelta=False bat
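To illustrate the distinction the snippet above draws, here is a minimal sketch of how a reader might track dictionary state, assuming that isDelta=False replaces the dictionary for a given id and isDelta=True appends to it. The class names (DictionaryBatch, DictionaryState) are hypothetical and are not part of any Arrow API; raising an error for a delta that arrives before any dictionary is also an assumption, not something stated in the thread.

```python
from dataclasses import dataclass, field


@dataclass
class DictionaryBatch:
    """Hypothetical stand-in for an IPC dictionary batch message."""
    dictionary_id: int
    values: list
    is_delta: bool = False


@dataclass
class DictionaryState:
    """Tracks the current dictionary for each id, as a reader might."""
    dictionaries: dict = field(default_factory=dict)

    def apply(self, batch: DictionaryBatch) -> None:
        if batch.is_delta:
            # Delta: append new values to the existing dictionary.
            # Assumption: a delta with no prior dictionary is an error.
            if batch.dictionary_id not in self.dictionaries:
                raise ValueError("delta batch for unknown dictionary id")
            self.dictionaries[batch.dictionary_id].extend(batch.values)
        else:
            # Non-delta: replace whatever was there before.
            self.dictionaries[batch.dictionary_id] = list(batch.values)

    def decode(self, dictionary_id: int, indices: list) -> list:
        # Map dictionary indices back to their values.
        current = self.dictionaries[dictionary_id]
        return [current[i] for i in indices]


state = DictionaryState()
state.apply(DictionaryBatch(0, ["a", "b"]))            # initial dictionary
state.apply(DictionaryBatch(0, ["c"], is_delta=True))  # append "c"
after_delta = state.decode(0, [0, 2])

state.apply(DictionaryBatch(0, ["x", "y"]))            # replacement
after_replace = state.decode(0, [0, 1])
```

Under these assumptions, indices seen after a replacement refer only to the new dictionary, so a reader has to apply dictionary batches in stream order before decoding the record batches that follow them.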

Re: [VOTE] Release Apache Arrow 0.15.0 - RC2

2019-10-05 Thread Neal Richardson
R release steps per https://cwiki.apache.org/confluence/display/ARROW/Release+Management+Guide#ReleaseManagementGuide-UpdatingRpackages are underway. Neal On Sat, Oct 5, 2019 at 8:40 AM Krisztián Szűcs wrote: > > - website updated with the release notes > - conda-forge updates are merged > > Rem

Re: [Proposal]: Expose Flight gRPC for Dremio use case (Java)

2019-10-05 Thread Jacques Nadeau
> > Is it possible for a single gRPC server to expose multiple services > through the same port (it sounds like it is)? It would be a good idea > to do similar refactoring in C++ so that Flight RPC endpoints can be > provided alongside some other non-Flight endpoints in the same gRPC > server > It

Re: [VOTE] Release Apache Arrow 0.15.0 - RC2

2019-10-05 Thread Krisztián Szűcs
- website updated with the release notes - conda-forge updates are merged Remaining: - Javascript - Rust - R - Homebrew - Apidocs On Sat, Oct 5, 2019 at 2:58 PM Sutou Kouhei wrote: > - uploaded C# packages > > In > "Re: [VOTE] Release Apache Arrow 0.15.0 - RC2" on Sat, 5 Oct 2019 > 14:50:51

[ANNOUNCE] Apache Arrow 0.15.0 released

2019-10-05 Thread Krisztián Szűcs
The Apache Arrow community is pleased to announce the 0.15.0 release. The release includes 711 resolved issues ([1]) since the 0.14.0 release. The release is available now from our website, [2] and [3]: http://arrow.apache.org/install/ Release notes are available at: https://arrow.apache.

[jira] [Created] (ARROW-6797) [Release] Use a separately cloned arrow-site repository in the website post release script

2019-10-05 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-6797: -- Summary: [Release] Use a separately cloned arrow-site repository in the website post release script Key: ARROW-6797 URL: https://issues.apache.org/jira/browse/ARROW-6797

Re: [VOTE] Release Apache Arrow 0.15.0 - RC2

2019-10-05 Thread Sutou Kouhei
- uploaded C# packages In "Re: [VOTE] Release Apache Arrow 0.15.0 - RC2" on Sat, 5 Oct 2019 14:50:51 +0200, Krisztián Szűcs wrote: > - uploaded python wheels to pypi > - uploaded java artifacts to maven central > > I'm going to update the conda recipes. > > Remaining: > - Javascript > -

Re: [VOTE] Release Apache Arrow 0.15.0 - RC2

2019-10-05 Thread Krisztián Szűcs
- uploaded python wheels to pypi - uploaded java artifacts to maven central I'm going to update the conda recipes. Remaining: - Javascript - Rust - C# - R - Homebrew - Site On Sat, Oct 5, 2019 at 2:29 PM Krisztián Szűcs wrote: > - rebased master > - rebased the pull requests > - released the

Re: [VOTE] Release Apache Arrow 0.15.0 - RC2

2019-10-05 Thread Sutou Kouhei
- released RubyGems - currently creating a pull request to MSYS2 In "Re: [VOTE] Release Apache Arrow 0.15.0 - RC2" on Sat, 5 Oct 2019 14:29:17 +0200, Krisztián Szűcs wrote: > - rebased master > - rebased the pull requests > - released the jira version > - started the new jira version > - u

Re: [DISCUSS] raw pointers and FFI (C-level in-process array protocol)

2019-10-05 Thread Sutou Kouhei
Hi, I think that FFI use is misleading. Normally, language bindings for this API are useless for processing Apache Arrow data. Because these bindings of this API can only import/export Apache Arrow data. Target language may not have useful/fast API for processing imported Apache Arrow data. For ex

Re: [VOTE] Release Apache Arrow 0.15.0 - RC2

2019-10-05 Thread Krisztián Szűcs
- rebased master - rebased the pull requests - released the jira version - started the new jira version - uploaded source artifacts to svn - uploaded binary artifacts to bintray - currently uploading python wheels to pypi On Sat, Oct 5, 2019 at 2:04 PM Sutou Kouhei wrote: > I'll release RubyGem

[NIGHTLY] Arrow Build Report for Job nightly-2019-10-05-0

2019-10-05 Thread Crossbow
Arrow Build Report for Job nightly-2019-10-05-0 All tasks: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2019-10-05-0 Failed Tasks: - wheel-manylinux2010-cp35m: URL: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2019-10-05-0-travis-wheel-manylinux2010-c

Re: [VOTE] Release Apache Arrow 0.15.0 - RC2

2019-10-05 Thread Sutou Kouhei
I'll release RubyGems. In "Re: [VOTE] Release Apache Arrow 0.15.0 - RC2" on Sat, 5 Oct 2019 11:46:16 +0200, Krisztián Szűcs wrote: > On Sat, Oct 5, 2019 at 11:40 AM Krisztián Szűcs > wrote: > >> The VOTE carries with 5 binding +1 votes and 2 non-binding +1 votes. >> >> On Fri, Oct 4, 201

Re: [DISCUSS] Result vs Status

2019-10-05 Thread Sutou Kouhei
Hi, In <21183bb1-813f-b35b-b391-3f52ef2e6...@python.org> "Re: [DISCUSS] Result vs Status" on Sat, 5 Oct 2019 12:23:05 +0200, Antoine Pitrou wrote: >> OK, so what could more context be provided on: >> >>> From the discussion in the sync call, it seems reasonable to require that: >>> Public

Re: [DISCUSS] Result vs Status

2019-10-05 Thread Antoine Pitrou
On 05/10/2019 at 01:40, Micah Kornfield wrote: >> >>> It was my impression that we had workable solutions for using Result in >> at >>> least Python and Glib/Ruby (I don't know about R). >> >> In Python we do (though it needed a C++-side helper). >> > OK, so what could more context be provi

Re: [VOTE] Release Apache Arrow 0.15.0 - RC2

2019-10-05 Thread Krisztián Szűcs
On Sat, Oct 5, 2019 at 11:40 AM Krisztián Szűcs wrote: > The VOTE carries with 5 binding +1 votes and 2 non-binding +1 votes. > > On Fri, Oct 4, 2019 at 10:04 PM Wes McKinney wrote: > >> The commits from your local RC branch aren't available so I cannot >> rebase master yet, I'll just wait for y

Re: [VOTE] Release Apache Arrow 0.15.0 - RC2

2019-10-05 Thread Krisztián Szűcs
The VOTE carries with 5 binding +1 votes and 2 non-binding +1 votes. On Fri, Oct 4, 2019 at 10:04 PM Wes McKinney wrote: > The commits from your local RC branch aren't available so I cannot > rebase master yet, I'll just wait for you to be available again. Actually those commits are available u

[jira] [Created] (ARROW-6796) Certain moderately-sized (~100MB) default-Snappy-compressed Parquet files take enormous memory and long time to load by pyarrow.parquet.read_table

2019-10-05 Thread V Luong (Jira)
V Luong created ARROW-6796: -- Summary: Certain moderately-sized (~100MB) default-Snappy-compressed Parquet files take enormous memory and long time to load by pyarrow.parquet.read_table Key: ARROW-6796 URL: https://issue

Re: [DISCUSS][Java] Design of the algorithm module

2019-10-05 Thread fan_li_ya
Hi Micah and Praveen, Thanks a lot for your valuable feedback. My thoughts on the problems: 1. About audience of the algorithms: I think the algorithms should be better termed "micro-algorithms". They are termed "micro" in the sense that they do not directly compose a query engine, because th