new user question about cross-language use

2018-04-12 Thread Chris Withers
Hi All, Apologies if I'm on the wrong list or struggle to get my question across, I'm very new to Arrow, so please point me to the best place if there's somewhere better to ask these kinds of questions... So, in my mind, Arrow provides a single in-memory model that supports access from a bun

[jira] [Created] (ARROW-2450) [Python] Saving to parquet fails for empty lists

2018-04-12 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-2450: -- Summary: [Python] Saving to parquet fails for empty lists Key: ARROW-2450 URL: https://issues.apache.org/jira/browse/ARROW-2450 Project: Apache Arrow Issue Type:

RE: Correct way to set NULL values in VarCharVector (Java API)?

2018-04-12 Thread Atul Dambalkar
Hi Sid, Emilio, It was a mistake on my part. I was not setting the holder.start and holder.end values inside the NullableVarCharHolder, which was causing the issue. It works now. Regards, -Atul -Original Message- From: Atul Dambalkar Sent: Wednesday, April 11, 2018 5:18 PM To: dev@arr

Pickle data from python

2018-04-12 Thread ALBERTO Bocchinfuso
Hello, I cannot pickle RecordBatches, Buffers etc. I found Issue 1654 in the issue tracker, that has been solved with pull request 1238. But this looks to apply only to the types listed (schemas, DataTypes, etc.). When I try to Pickle Buffers etc. I get exactly the same error reported in the i

[jira] [Created] (ARROW-2451) Handle more dtypes efficiently in custom numpy array serializer.

2018-04-12 Thread Robert Nishihara (JIRA)
Robert Nishihara created ARROW-2451: --- Summary: Handle more dtypes efficiently in custom numpy array serializer. Key: ARROW-2451 URL: https://issues.apache.org/jira/browse/ARROW-2451 Project: Apache

Re: Continuous benchmarking setup

2018-04-12 Thread Wes McKinney
hi Antoine, I have a bare metal machine at home (affectionately known as the "pandabox") that's available via SSH that we've been using for continuous benchmarking for other projects. Arrow is welcome to use it. I can give you access to the machine if you would like. Hopefully, we can suitably the

Re: Buffer slices are unsafe

2018-04-12 Thread Wes McKinney
My feeling is that we should advise users of the library that any slices of a ResizableBuffer become invalid after a call to Resize. > I was thinking about something like this [0]. The point is, that the slice > user has no way of knowing if the slice can still be safely used and who > owns the me

Re: Continuous benchmarking setup

2018-04-12 Thread Tom Augspurger
https://github.com/TomAugspurger/asv-runner/ is the setup for the projects currently running. Adding arrow to https://github.com/TomAugspurger/asv-runner/blob/master/tests/full.yml might work. I'll have to redeploy with the update. From: Wes McKinney Sent: Thu

Re: Pickle data from python

2018-04-12 Thread Wes McKinney
hi Alberto, If you cannot find a JIRA about pickling RecordBatch objects, could you please create one? A patch would be welcome for this; it is certainly in scope for the project. If you encounter any new problems, please open a bug report. Thanks! Wes On Thu, Apr 12, 2018 at 3:13 PM, ALBERTO B