Cast from Array to Array and Array

2018-11-14 Thread Romain Francois
I'm implementing Array$cast() in the R package, i.e. https://issues.apache.org/jira/browse/ARROW-3741?filter=12344983 I'm seeing some weird results when casting from int32 to uint32 and uint64 (I'm expecting errors here as it's supposed to be safe

Re: [DISCUSS] Joda Time -> Java8 Time

2018-11-14 Thread Bryan Cutler
Hi all, I'm picking this back up again and have WIP pr at https://github.com/apache/arrow/pull/2966. Please take a look at the new APIs and see if they impact you downstream. In addition to the API changes mentioned before by Li, there is also (4) IntervalDayVector now uses java.time.Duration,

[jira] [Created] (ARROW-3795) [R] Support for retrieving NAs from INT64 arrays

2018-11-14 Thread Javier Luraschi (JIRA)
Javier Luraschi created ARROW-3795: -- Summary: [R] Support for retrieving NAs from INT64 arrays Key: ARROW-3795 URL: https://issues.apache.org/jira/browse/ARROW-3795 Project: Apache Arrow

[jira] [Created] (ARROW-3793) TestScalarAppendUnsafe is not testing unsafe appends

2018-11-14 Thread Benjamin Kietzman (JIRA)
Benjamin Kietzman created ARROW-3793: Summary: TestScalarAppendUnsafe is not testing unsafe appends Key: ARROW-3793 URL: https://issues.apache.org/jira/browse/ARROW-3793 Project: Apache Arrow

[jira] [Created] (ARROW-3794) [R] Consider mapping INT8 to integer() not raw()

2018-11-14 Thread Javier Luraschi (JIRA)
Javier Luraschi created ARROW-3794: -- Summary: [R] Consider mapping INT8 to integer() not raw() Key: ARROW-3794 URL: https://issues.apache.org/jira/browse/ARROW-3794 Project: Apache Arrow

Re: Cast from Array to Array and Array

2018-11-14 Thread Wes McKinney
There's a logical error here https://github.com/apache/arrow/blob/master/cpp/src/arrow/compute/kernels/cast.cc#L239 The cast constexpr in_type kMax = static_cast(std::numeric_limits::max()); is overflowing int32_t This code path was only written for the unsigned-to-signed case. The functors

Re: Cast from Array to Array and Array

2018-11-14 Thread Francois Saint-Jacques
Seems like the type combination you're using (int32 -> uint32) and (int32 -> uint64) don't match the following pattern-matching https://github.com/apache/arrow/blob/master/cpp/src/arrow/compute/kernels/cast.cc#L191-L192 which avoid using "safe" cast and revert to the following cast

[jira] [Created] (ARROW-3799) Improve `make_in_expression`

2018-11-14 Thread Siyuan Zhuang (JIRA)
Siyuan Zhuang created ARROW-3799: Summary: Improve `make_in_expression` Key: ARROW-3799 URL: https://issues.apache.org/jira/browse/ARROW-3799 Project: Apache Arrow Issue Type: Improvement

[jira] [Created] (ARROW-3797) [Rust] BinaryArray::value_offset incorrect in offset case

2018-11-14 Thread Brent Kerby (JIRA)
Brent Kerby created ARROW-3797: -- Summary: [Rust] BinaryArray::value_offset incorrect in offset case Key: ARROW-3797 URL: https://issues.apache.org/jira/browse/ARROW-3797 Project: Apache Arrow

[jira] [Created] (ARROW-3796) [Rust] Add Example for PrimitiveArrayBuilder

2018-11-14 Thread Paddy Horan (JIRA)
Paddy Horan created ARROW-3796: -- Summary: [Rust] Add Example for PrimitiveArrayBuilder Key: ARROW-3796 URL: https://issues.apache.org/jira/browse/ARROW-3796 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-3798) [GLib] Add support for column type CSV read options

2018-11-14 Thread Kouhei Sutou (JIRA)
Kouhei Sutou created ARROW-3798: --- Summary: [GLib] Add support for column type CSV read options Key: ARROW-3798 URL: https://issues.apache.org/jira/browse/ARROW-3798 Project: Apache Arrow Issue

[jira] [Created] (ARROW-3790) [C++] Signed to unsigned integer cast yields incorrect results when type sizes are the same

2018-11-14 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-3790: --- Summary: [C++] Signed to unsigned integer cast yields incorrect results when type sizes are the same Key: ARROW-3790 URL: https://issues.apache.org/jira/browse/ARROW-3790

Re: Cast from Array to Array and Array

2018-11-14 Thread Wes McKinney
https://issues.apache.org/jira/browse/ARROW-3790 On Wed, Nov 14, 2018 at 9:03 AM Wes McKinney wrote: > > There's a logical error here > > https://github.com/apache/arrow/blob/master/cpp/src/arrow/compute/kernels/cast.cc#L239 > > The cast > > constexpr in_type kMax = >

[jira] [Created] (ARROW-3789) [Python] Enable calling object in Table.to_pandas to "self-destruct" for improved memory use

2018-11-14 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-3789: --- Summary: [Python] Enable calling object in Table.to_pandas to "self-destruct" for improved memory use Key: ARROW-3789 URL: https://issues.apache.org/jira/browse/ARROW-3789

[jira] [Created] (ARROW-3791) [C++] Add CSV conversion option to parse booleans

2018-11-14 Thread Kouhei Sutou (JIRA)
Kouhei Sutou created ARROW-3791: --- Summary: [C++] Add CSV conversion option to parse booleans Key: ARROW-3791 URL: https://issues.apache.org/jira/browse/ARROW-3791 Project: Apache Arrow Issue

[jira] [Created] (ARROW-3788) [Ruby] Add support for CSV parser writtin in C++

2018-11-14 Thread Kouhei Sutou (JIRA)
Kouhei Sutou created ARROW-3788: --- Summary: [Ruby] Add support for CSV parser writtin in C++ Key: ARROW-3788 URL: https://issues.apache.org/jira/browse/ARROW-3788 Project: Apache Arrow Issue

Arrow development sync call today 12pm Eastern / 17:00 UTC

2018-11-14 Thread Wes McKinney
All are welcome https://meet.google.com/vtm-teks-phx

Re: Arrow development sync call today 12pm Eastern / 17:00 UTC

2018-11-14 Thread Siddharth Teotia
Notes: Attendees: Sidd Wes Ravindra Arvind Shyam Bryan Francois Bryan: 1. Switching over to Java time from Joda time. At Dremio we need to assess the impact of these changes. Bryan will put a WIP PR soon. There has been a discussion about this on mailing list 2. The Gandiva microbenchmark test

Re: Cast from Array to Array and Array

2018-11-14 Thread Wes McKinney
Yes, it looks buggy to me, see In [12]: arr = pa.array([-1, -2, -3], type='int32') In [13]: arr.cast('uint8') --- ArrowInvalid Traceback (most recent call last) in () > 1 arr.cast('uint8')

[jira] [Created] (ARROW-3792) [PARQUET] Segmentation fault when writing empty RecordBatches

2018-11-14 Thread Suvayu Ali (JIRA)
Suvayu Ali created ARROW-3792: - Summary: [PARQUET] Segmentation fault when writing empty RecordBatches Key: ARROW-3792 URL: https://issues.apache.org/jira/browse/ARROW-3792 Project: Apache Arrow