Re: Human-readable version of Arrow Schema?

2020-01-08 Thread Kohei KaiGai
Hello, pg2arrow [*1] has '--dump' mode to print out schema definition of the given Apache Arrow file. Does it make sense for you? $ ./pg2arrow --dump ~/hoge.arrow [Footer] {Footer: version=V4, schema={Schema: endianness=little, fields=[{Field: name="id", nullable=true, type={Int32}, children=[],

[jira] [Created] (ARROW-7525) [Python][CI] Build PyArrow on VS2019

2020-01-08 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-7525: -- Summary: [Python][CI] Build PyArrow on VS2019 Key: ARROW-7525 URL: https://issues.apache.org/jira/browse/ARROW-7525 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-7524) [C++][CI] Build parquet support in the VS2019 GitHub Actions job

2020-01-08 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-7524: -- Summary: [C++][CI] Build parquet support in the VS2019 GitHub Actions job Key: ARROW-7524 URL: https://issues.apache.org/jira/browse/ARROW-7524 Project: Apache

Re: Human-readable version of Arrow Schema?

2020-01-08 Thread Micah Kornfield
The C-interface representation is probably slightly less readable then the JSON implementation if I understand the flatbuffer to JSON conversion properly. But as Antoine pointed out it depends on the use-case. FWIW, flatbuffers maintainers indicated forward/backward compatibility is intended to

Re: [C++] "nonexistent" or "non-existent"

2020-01-08 Thread Micah Kornfield
I also think we have "KeyError" which I think might model the same concept? On Mon, Jan 6, 2020 at 7:35 AM Wes McKinney wrote: > I agree using a different terminology than "nonexistent" like > "NotFound" would be good. If we use "nonexistent" then the hyphen-free > spelling seems preferred > >

[jira] [Created] (ARROW-7522) Broken Record Batch returned from a function call

2020-01-08 Thread Chengxin Ma (Jira)
Chengxin Ma created ARROW-7522: -- Summary: Broken Record Batch returned from a function call Key: ARROW-7522 URL: https://issues.apache.org/jira/browse/ARROW-7522 Project: Apache Arrow Issue

[jira] [Created] (ARROW-7521) [Rust] Remove tuple on FixedSizeList datatype

2020-01-08 Thread Neville Dipale (Jira)
Neville Dipale created ARROW-7521: - Summary: [Rust] Remove tuple on FixedSizeList datatype Key: ARROW-7521 URL: https://issues.apache.org/jira/browse/ARROW-7521 Project: Apache Arrow Issue

[jira] [Created] (ARROW-7520) Arrow / R - too many batches causes a crash

2020-01-08 Thread Christian (Jira)
Christian created ARROW-7520: Summary: Arrow / R - too many batches causes a crash Key: ARROW-7520 URL: https://issues.apache.org/jira/browse/ARROW-7520 Project: Apache Arrow Issue Type: Bug

Re: Arrow / R - too many batches causes a crash

2020-01-08 Thread Wes McKinney
Can you please open a JIRA issue? On Wed, Jan 8, 2020 at 12:37 PM Christian Klar wrote: > Hi, > > > > At the bottom please find the session_info. > > > > When creating north of 200-300 batches, the writing to the arrow file > crashes R – it doesn’t even show an error message. Rstudio just

[jira] [Created] (ARROW-7519) [Python] Build wheels, conda packages with PYARROW_WITH_DATASET=1

2020-01-08 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-7519: --- Summary: [Python] Build wheels, conda packages with PYARROW_WITH_DATASET=1 Key: ARROW-7519 URL: https://issues.apache.org/jira/browse/ARROW-7519 Project: Apache Arrow

[jira] [Created] (ARROW-7518) [Python] Use PYARROW_WITH_HDFS when building wheels, conda packages

2020-01-08 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-7518: --- Summary: [Python] Use PYARROW_WITH_HDFS when building wheels, conda packages Key: ARROW-7518 URL: https://issues.apache.org/jira/browse/ARROW-7518 Project: Apache

Arrow / R - too many batches causes a crash

2020-01-08 Thread Christian Klar
Hi, At the bottom please find the session_info. When creating north of 200-300 batches, the writing to the arrow file crashes R – it doesn’t even show an error message. Rstudio just aborts. I have the feeling that maybe each batch becomes a stream and R has issues with the connections, but

Re: [DRAFT] Apache Arrow Board Report January 2020

2020-01-08 Thread Wes McKinney
Not sure what happened there. The two words after "grow" can be removed ## Description: The mission of Apache Arrow is the creation and maintenance of software related to columnar in-memory processing and data interchange ## Issues: There are no issues requiring board attention at this time.

Re: [DRAFT] Apache Arrow Board Report January 2020

2020-01-08 Thread Todd Hendricks
Hi Wes, Looks like there is a cutoff sentence at the end of the Community Health section. On Wed, Jan 8, 2020 at 10:01 AM Wes McKinney wrote: > Here is an updated draft. If there is no more feedback, this can be > submitted to the board > > ## Description: > > The mission of Apache Arrow is

Re: [DRAFT] Apache Arrow Board Report January 2020

2020-01-08 Thread Wes McKinney
Here is an updated draft. If there is no more feedback, this can be submitted to the board ## Description: The mission of Apache Arrow is the creation and maintenance of software related to columnar in-memory processing and data interchange ## Issues: There are no issues requiring board

[jira] [Created] (ARROW-7517) [C++] Builder does not honour dictionary type provided during initialization

2020-01-08 Thread Wamsi Viswanath (Jira)
Wamsi Viswanath created ARROW-7517: -- Summary: [C++] Builder does not honour dictionary type provided during initialization Key: ARROW-7517 URL: https://issues.apache.org/jira/browse/ARROW-7517

[jira] [Created] (ARROW-7516) [C#] .NET Benchmarks are broken

2020-01-08 Thread Eric Erhardt (Jira)
Eric Erhardt created ARROW-7516: --- Summary: [C#] .NET Benchmarks are broken Key: ARROW-7516 URL: https://issues.apache.org/jira/browse/ARROW-7516 Project: Apache Arrow Issue Type: Bug

Re: Arrow sync call January 8 at 12:00 US/Eastern, 17:00 UTC

2020-01-08 Thread Neal Richardson
Attendees: * Ben Kietzman * Wes McKinney * Prudhvi Porandla * Neal Richardson * François Saint-Jacques Discussion: * Blockers for 1.0 release, how to get them done, what is required * 0.16 backlog triage On Tue, Jan 7, 2020 at 9:01 AM Neal Richardson wrote: > Hi all, > Happy 2020! Reminder

[jira] [Created] (ARROW-7515) [C++] Rename nonexistent and non_existent to not_found

2020-01-08 Thread Kenta Murata (Jira)
Kenta Murata created ARROW-7515: --- Summary: [C++] Rename nonexistent and non_existent to not_found Key: ARROW-7515 URL: https://issues.apache.org/jira/browse/ARROW-7515 Project: Apache Arrow