[GitHub] [arrow] romainfrancois edited a comment on pull request #8365: ARROW-6582: [R] Arrow to R fails with embedded nuls in strings

2020-11-12 Thread GitBox
romainfrancois edited a comment on pull request #8365: URL: https://github.com/apache/arrow/pull/8365#issuecomment-726577844 It does look like `Rf_mkCharLenCE()` is generating the error: ``` r cpp11::cpp_eval('Rf_mkCharLenCE("camer\\0a", 7, CE_UTF8)') #> Error in f(): embedded

[GitHub] [arrow] romainfrancois commented on pull request #8365: ARROW-6582: [R] Arrow to R fails with embedded nuls in strings

2020-11-12 Thread GitBox
romainfrancois commented on pull request #8365: URL: https://github.com/apache/arrow/pull/8365#issuecomment-726577844 It does look like `Rf_mkCharLenCE()` is generating the error: ``` r cpp11::cpp_eval('Rf_mkCharLenCE("camer\\0a", 6, CE_UTF8)') #> Error in f(): embedded nul in

[GitHub] [arrow] emkornfield commented on pull request #7030: ARROW-7808: [Java][Dataset] Implement Dataset Java API by JNI to C++

2020-11-12 Thread GitBox
emkornfield commented on pull request #7030: URL: https://github.com/apache/arrow/pull/7030#issuecomment-726557997 @zhztheplayer I think I saw you requested a re-review is that accurate? Is the CI failure related to your changes?

[GitHub] [arrow] github-actions[bot] commented on pull request #8657: ARROW-7363: [Python] add combine_chunks method to ChunkedArray

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8657: URL: https://github.com/apache/arrow/pull/8657#issuecomment-726557932 https://issues.apache.org/jira/browse/ARROW-7363 This is an automated message from the Apache Git

[GitHub] [arrow] arw2019 opened a new pull request #8657: ARROW-7363: [Python] add combine_chunks method to ChunkedArray

2020-11-12 Thread GitBox
arw2019 opened a new pull request #8657: URL: https://github.com/apache/arrow/pull/8657 As discussed on JIRA we currently don't have a convenience method that converts `ChunkedArray` into a non-chunked `Array`. `flatten` doesn't do this when called on a `ChunkedArray` instance so this PR

[GitHub] [arrow] kiszk commented on a change in pull request #8210: ARROW-10031: [CI][Java] Support Java benchmark in Ursabot

2020-11-12 Thread GitBox
kiszk commented on a change in pull request #8210: URL: https://github.com/apache/arrow/pull/8210#discussion_r522675532 ## File path: java/performance/pom.xml ## @@ -169,10 +173,17 @@ ${benchmark.filter} -f

[GitHub] [arrow] emkornfield commented on a change in pull request #8632: ARROW-10426: [C++] Allow writing large strings to Parquet

2020-11-12 Thread GitBox
emkornfield commented on a change in pull request #8632: URL: https://github.com/apache/arrow/pull/8632#discussion_r522673324 ## File path: cpp/src/parquet/encoding.cc ## @@ -127,6 +129,21 @@ class PlainEncoder : public EncoderImpl, virtual public TypedEncoder { }

[GitHub] [arrow] kiszk commented on pull request #8210: ARROW-10031: [CI][Java] Support Java benchmark in Ursabot

2020-11-12 Thread GitBox
kiszk commented on pull request #8210: URL: https://github.com/apache/arrow/pull/8210#issuecomment-726544148 The following commands should work: ``` archery benchmark list --langauge=java archery benchmark run --langauge=java archery benchmark diff --langauge=java ```

[GitHub] [arrow] kiszk commented on pull request #8210: ARROW-10031: [CI][Java] Support Java benchmark in Ursabot

2020-11-12 Thread GitBox
kiszk commented on pull request #8210: URL: https://github.com/apache/arrow/pull/8210#issuecomment-726543626 @liyafan82 @fsaintjacques @kszucs Would it be possible to review this? This is an automated message from the Apache

[GitHub] [arrow] github-actions[bot] commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726538652 Revision: fcb561bf960f23d3c1b85c791bb2acff8e2575a6 Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] kou commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
kou commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726531918 @github-actions crossbow submit wheel-win-cp39 This is an automated message from the Apache Git Service. To respond

[GitHub] [arrow] Bei-z commented on pull request #8542: ARROW-10407: [C++] Add BasicDecimal256 Division Support

2020-11-12 Thread GitBox
Bei-z commented on pull request #8542: URL: https://github.com/apache/arrow/pull/8542#issuecomment-726531615 > Thanks for the updates. Can you look at the errors on CI and fix them? Thanks! Thank you for reviewing! The errors on CI are fixed now.

[GitHub] [arrow] jorgecarleitao edited a comment on pull request #8645: ARROW-10561: [Rust] Simplified Buffer's `write` and `write_bytes` and fixed undefined behavior

2020-11-12 Thread GitBox
jorgecarleitao edited a comment on pull request #8645: URL: https://github.com/apache/arrow/pull/8645#issuecomment-726510728 Ok, I confirm that this error is due to a wrong pointer offset on the `set_bits_raw` and that this undefined behavior is already present in `master`: ```

[GitHub] [arrow] jorgecarleitao edited a comment on pull request #8645: ARROW-10561: [Rust] Simplified Buffer's `write` and `write_bytes` and fixed undefined behavior

2020-11-12 Thread GitBox
jorgecarleitao edited a comment on pull request #8645: URL: https://github.com/apache/arrow/pull/8645#issuecomment-726510728 Ok, I confirm that this error is due to a wrong pointer offset on the `set_bits_raw` and that this undefined behavior is already present in `master`: ```

[GitHub] [arrow] arw2019 edited a comment on pull request #8474: ARROW-10301: [C++][Compute] Implement "all" reduction kernel for boolean data

2020-11-12 Thread GitBox
arw2019 edited a comment on pull request #8474: URL: https://github.com/apache/arrow/pull/8474#issuecomment-720653983 Ready for re-review. CI all green This is an automated message from the Apache Git Service. To respond to

[GitHub] [arrow] jorgecarleitao commented on pull request #8645: ARROW-10561: [Rust] Simplified Buffer's `write` and `write_bytes` and fixed undefined behavior

2020-11-12 Thread GitBox
jorgecarleitao commented on pull request #8645: URL: https://github.com/apache/arrow/pull/8645#issuecomment-726510728 Ok, I confirm that this error is due to a wrong byte offset on the `set_bits_raw` and that this undefined behavior is already present in `master`: ``` Switched to

[GitHub] [arrow] github-actions[bot] commented on pull request #8656: ARROW-10575: [Rust] Rename union.rs to be cosistent with other arrays

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8656: URL: https://github.com/apache/arrow/pull/8656#issuecomment-726490144 https://issues.apache.org/jira/browse/ARROW-10575 This is an automated message from the Apache Git

[GitHub] [arrow] github-actions[bot] commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726488735 Revision: 5226bf79241b66866ed71e1f89699211843dc0dc Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] kou commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
kou commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726488347 @github-actions crossbow submit wheel-win-cp39 This is an automated message from the Apache Git Service. To respond

[GitHub] [arrow] github-actions[bot] commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726474135 Revision: 59da36b94b931ae1c72b7896f1aa847e904b011c Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] kou commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
kou commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726473740 @github-actions crossbow submit wheel-win-cp39 This is an automated message from the Apache Git Service. To respond

[GitHub] [arrow] liyafan82 commented on pull request #8605: ARROW-10508 [Java] Allow FixedSizeListVector to have empty children

2020-11-12 Thread GitBox
liyafan82 commented on pull request #8605: URL: https://github.com/apache/arrow/pull/8605#issuecomment-726470885 > @liyafan82 good idea, added test case to PR. > Also tested together with C++ in my demo project:

[GitHub] [arrow] github-actions[bot] commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726433072 Revision: 797da2423e1b82473af3140c2c6d8ac048ed7d7b Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] github-actions[bot] commented on pull request #8655: ARROW-10526: [FlightRPC][C++][Python] Client cookie middleware

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8655: URL: https://github.com/apache/arrow/pull/8655#issuecomment-726432824 https://issues.apache.org/jira/browse/ARROW-10526 This is an automated message from the Apache Git

[GitHub] [arrow] lidavidm commented on pull request #8585: ARROW-10475: [C++][FlightRPC] handle IPv6 hosts

2020-11-12 Thread GitBox
lidavidm commented on pull request #8585: URL: https://github.com/apache/arrow/pull/8585#issuecomment-726423866 Ah sorry, I think I got what you mean now - updated. This is an automated message from the Apache Git Service.

[GitHub] [arrow] kou commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
kou commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726419694 @github-actions crossbow submit wheel-win-cp39 This is an automated message from the Apache Git Service. To respond

[GitHub] [arrow] jduo opened a new pull request #8655: ARROW-10526: [FlightRPC][C++][Python] Client cookie middleware

2020-11-12 Thread GitBox
jduo opened a new pull request #8655: URL: https://github.com/apache/arrow/pull/8655 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [arrow] wesm commented on pull request #8644: ARROW-10573: [C++] Align written buffers to specified value

2020-11-12 Thread GitBox
wesm commented on pull request #8644: URL: https://github.com/apache/arrow/pull/8644#issuecomment-726416818 Hm. Unaligned buffers are not compliant with the Arrow specification. This optimization is only useful in the extraordinarily narrow use case where all of the columns in the

[GitHub] [arrow] BryanCutler commented on a change in pull request #8057: ARROW-9862: [Java] Enable UnsafeDirectLittleEndian on a big-endian platform

2020-11-12 Thread GitBox
BryanCutler commented on a change in pull request #8057: URL: https://github.com/apache/arrow/pull/8057#discussion_r522508039 ## File path: java/memory/memory-netty/src/test/java/io/netty/buffer/TestUnsafeDirectLittleEndian.java ## @@ -0,0 +1,98 @@ +/* + * Licensed to the

[GitHub] [arrow] BryanCutler commented on a change in pull request #8057: ARROW-9862: [Java] Enable UnsafeDirectLittleEndian on a big-endian platform

2020-11-12 Thread GitBox
BryanCutler commented on a change in pull request #8057: URL: https://github.com/apache/arrow/pull/8057#discussion_r522507327 ## File path: java/memory/memory-netty/src/main/java/io/netty/buffer/UnsafeDirectLittleEndian.java ## @@ -170,6 +166,66 @@ public ByteBuf

[GitHub] [arrow] github-actions[bot] commented on pull request #8644: ARROW-10573: [C++] Align written buffers to specified value

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8644: URL: https://github.com/apache/arrow/pull/8644#issuecomment-726395783 https://issues.apache.org/jira/browse/ARROW-10573 This is an automated message from the Apache Git

[GitHub] [arrow] github-actions[bot] commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726352589 Revision: 63de89b280184ae5ca7f785b181b7a01634c1d06 Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] kou commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
kou commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726349174 @github-actions crossbow submit wheel-win-cp39 This is an automated message from the Apache Git Service. To respond

[GitHub] [arrow] github-actions[bot] commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726329638 Revision: 3806d8c58ce2158a266cfe54c3029d46d7b6b0c8 Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] bkietz commented on a change in pull request #8652: ARROW-10566: [C++] Allow validating ArrayData directly

2020-11-12 Thread GitBox
bkietz commented on a change in pull request #8652: URL: https://github.com/apache/arrow/pull/8652#discussion_r522384657 ## File path: cpp/src/arrow/array/validate.cc ## @@ -392,96 +376,159 @@ Status ValidateArray(const Array& array) {

[GitHub] [arrow] kou commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
kou commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726306020 @github-actions crossbow submit wheel-win-cp39 This is an automated message from the Apache Git Service. To respond

[GitHub] [arrow] bkietz commented on pull request #8582: ARROW-10483: [C++] Move Executor to future.h

2020-11-12 Thread GitBox
bkietz commented on pull request #8582: URL: https://github.com/apache/arrow/pull/8582#issuecomment-726304419 @pitrou rebased, PTAL This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [arrow] github-actions[bot] commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726295496 Revision: d9abeb78e20ab9a28a5180c2c5001851a53dfb4a Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] github-actions[bot] commented on pull request #8654: ARROW-10572: [Rust][Datafusion] Use aHash instead of FnvHashMap

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8654: URL: https://github.com/apache/arrow/pull/8654#issuecomment-726283784 https://issues.apache.org/jira/browse/ARROW-10572 This is an automated message from the Apache Git

[GitHub] [arrow] kou commented on pull request #8632: ARROW-10426: [C++] Allow writing large strings to Parquet

2020-11-12 Thread GitBox
kou commented on pull request #8632: URL: https://github.com/apache/arrow/pull/8632#issuecomment-726283347 Umm, they may have a problem in finalization... This is an automated message from the Apache Git Service. To respond

[GitHub] [arrow] bkietz closed pull request #8591: ARROW-10484: [C++] Make Future<> more generic

2020-11-12 Thread GitBox
bkietz closed pull request #8591: URL: https://github.com/apache/arrow/pull/8591 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] kou commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
kou commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726278539 @github-actions crossbow submit wheel-win-cp39 This is an automated message from the Apache Git Service. To respond

[GitHub] [arrow] pitrou commented on a change in pull request #8652: ARROW-10566: [C++] Allow validating ArrayData directly

2020-11-12 Thread GitBox
pitrou commented on a change in pull request #8652: URL: https://github.com/apache/arrow/pull/8652#discussion_r522346840 ## File path: cpp/src/arrow/array/validate.cc ## @@ -392,96 +376,159 @@ Status ValidateArray(const Array& array) {

[GitHub] [arrow] pitrou commented on a change in pull request #8652: ARROW-10566: [C++] Allow validating ArrayData directly

2020-11-12 Thread GitBox
pitrou commented on a change in pull request #8652: URL: https://github.com/apache/arrow/pull/8652#discussion_r522345946 ## File path: cpp/src/arrow/array/validate.cc ## @@ -392,96 +376,159 @@ Status ValidateArray(const Array& array) {

[GitHub] [arrow] pitrou commented on a change in pull request #8652: ARROW-10566: [C++] Allow validating ArrayData directly

2020-11-12 Thread GitBox
pitrou commented on a change in pull request #8652: URL: https://github.com/apache/arrow/pull/8652#discussion_r522344042 ## File path: cpp/src/arrow/array/validate.cc ## @@ -38,195 +39,172 @@ namespace internal { namespace { -struct ValidateArrayVisitor { - Status

[GitHub] [arrow] Dandandan opened a new pull request #8654: ARROW-10572: [Rust][Datafusion] Use std::collections::HashMap + aHash instead of FnvHashMap

2020-11-12 Thread GitBox
Dandandan opened a new pull request #8654: URL: https://github.com/apache/arrow/pull/8654 Use ahash https://github.com/tkaitchuck/aHash for hashing algorithm. Difference is mainly visible in group_by benches: ``` aggregate_query_group_by 15 12

[GitHub] [arrow] pitrou commented on a change in pull request #8628: ARROW-9489: [C++] Add fill_null kernel implementation for (array[string], scalar[string])

2020-11-12 Thread GitBox
pitrou commented on a change in pull request #8628: URL: https://github.com/apache/arrow/pull/8628#discussion_r522341768 ## File path: python/pyarrow/tests/test_compute.py ## @@ -860,6 +860,16 @@ def test_fill_null(): expected = pa.array([None, None, None, None])

[GitHub] [arrow] github-actions[bot] commented on pull request #8653: ARROW-10560: [Python] Fix crash when creating array from huge string

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8653: URL: https://github.com/apache/arrow/pull/8653#issuecomment-726273781 https://issues.apache.org/jira/browse/ARROW-10560 This is an automated message from the Apache Git

[GitHub] [arrow] pitrou commented on a change in pull request #8628: ARROW-9489: [C++] Add fill_null kernel implementation for (array[string], scalar[string])

2020-11-12 Thread GitBox
pitrou commented on a change in pull request #8628: URL: https://github.com/apache/arrow/pull/8628#discussion_r522341491 ## File path: cpp/src/arrow/compute/kernels/codegen_internal.h ## @@ -1058,6 +1058,22 @@ ArrayKernelExec GenerateTypeAgnosticPrimitive(detail::GetTypeId

[GitHub] [arrow] pitrou opened a new pull request #8653: ARROW-10560: [Python] Fix crash when creating array from huge string

2020-11-12 Thread GitBox
pitrou opened a new pull request #8653: URL: https://github.com/apache/arrow/pull/8653 Error out cleanly if an individual string is larger than 2**31 bytes when creating a binary or string array. This is an automated

[GitHub] [arrow] bkietz commented on a change in pull request #8652: ARROW-10566: [C++] Allow validating ArrayData directly

2020-11-12 Thread GitBox
bkietz commented on a change in pull request #8652: URL: https://github.com/apache/arrow/pull/8652#discussion_r522302828 ## File path: cpp/src/arrow/array/validate.cc ## @@ -38,195 +39,172 @@ namespace internal { namespace { -struct ValidateArrayVisitor { - Status

[GitHub] [arrow] lidavidm commented on pull request #8649: ARROW-10567: [C++] Add multiple perf runs options for higher precision reporting

2020-11-12 Thread GitBox
lidavidm commented on pull request #8649: URL: https://github.com/apache/arrow/pull/8649#issuecomment-726262837 Thanks! This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [arrow] lidavidm closed pull request #8649: ARROW-10567: [C++] Add multiple perf runs options for higher precision reporting

2020-11-12 Thread GitBox
lidavidm closed pull request #8649: URL: https://github.com/apache/arrow/pull/8649 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] github-actions[bot] commented on pull request #8650: ARROW-10530: [R] Use Converter API to convert SEXP to Array/ChunkedArray

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8650: URL: https://github.com/apache/arrow/pull/8650#issuecomment-726261100 https://issues.apache.org/jira/browse/ARROW-10530 This is an automated message from the Apache Git

[GitHub] [arrow] maartenbreddels commented on a change in pull request #8628: ARROW-9489: [C++] Add fill_null kernel implementation for (array[string], scalar[string])

2020-11-12 Thread GitBox
maartenbreddels commented on a change in pull request #8628: URL: https://github.com/apache/arrow/pull/8628#discussion_r522327950 ## File path: cpp/src/arrow/compute/kernels/codegen_internal.h ## @@ -1058,6 +1058,22 @@ ArrayKernelExec

[GitHub] [arrow] maartenbreddels commented on a change in pull request #8628: ARROW-9489: [C++] Add fill_null kernel implementation for (array[string], scalar[string])

2020-11-12 Thread GitBox
maartenbreddels commented on a change in pull request #8628: URL: https://github.com/apache/arrow/pull/8628#discussion_r522327624 ## File path: cpp/src/arrow/compute/kernels/scalar_fill_null.cc ## @@ -153,6 +153,60 @@ void AddBasicFillNullKernels(ScalarKernel kernel,

[GitHub] [arrow] maartenbreddels commented on a change in pull request #8628: ARROW-9489: [C++] Add fill_null kernel implementation for (array[string], scalar[string])

2020-11-12 Thread GitBox
maartenbreddels commented on a change in pull request #8628: URL: https://github.com/apache/arrow/pull/8628#discussion_r522326005 ## File path: cpp/src/arrow/compute/kernels/codegen_internal.h ## @@ -1058,6 +1058,22 @@ ArrayKernelExec

[GitHub] [arrow] maartenbreddels commented on a change in pull request #8628: ARROW-9489: [C++] Add fill_null kernel implementation for (array[string], scalar[string])

2020-11-12 Thread GitBox
maartenbreddels commented on a change in pull request #8628: URL: https://github.com/apache/arrow/pull/8628#discussion_r522325504 ## File path: python/pyarrow/tests/test_compute.py ## @@ -860,6 +860,16 @@ def test_fill_null(): expected = pa.array([None, None, None, None])

[GitHub] [arrow] pitrou closed pull request #8627: ARROW-10546: [Python] Deprecate DaskFileSystem/S3FSWrapper + stop using it internally

2020-11-12 Thread GitBox
pitrou closed pull request #8627: URL: https://github.com/apache/arrow/pull/8627 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] pitrou commented on a change in pull request #8628: ARROW-9489: [C++] Add fill_null kernel implementation for (array[string], scalar[string])

2020-11-12 Thread GitBox
pitrou commented on a change in pull request #8628: URL: https://github.com/apache/arrow/pull/8628#discussion_r522312822 ## File path: cpp/src/arrow/compute/kernels/scalar_fill_null.cc ## @@ -153,6 +153,60 @@ void AddBasicFillNullKernels(ScalarKernel kernel, ScalarFunction*

[GitHub] [arrow] pitrou closed pull request #8638: ARROW-10558: [Python] Fix python S3 filesystem tests interdependence

2020-11-12 Thread GitBox
pitrou closed pull request #8638: URL: https://github.com/apache/arrow/pull/8638 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] pitrou closed pull request #8651: ARROW-10568: [C++][Parquet] Avoid crashing when OutputStream::Tell fails

2020-11-12 Thread GitBox
pitrou closed pull request #8651: URL: https://github.com/apache/arrow/pull/8651 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] github-actions[bot] commented on pull request #8652: ARROW-10566: [C++] Allow validating ArrayData directly

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8652: URL: https://github.com/apache/arrow/pull/8652#issuecomment-726240920 https://issues.apache.org/jira/browse/ARROW-10566 This is an automated message from the Apache Git

[GitHub] [arrow] pitrou commented on pull request #8591: ARROW-10484: [C++] Make Future<> more generic

2020-11-12 Thread GitBox
pitrou commented on pull request #8591: URL: https://github.com/apache/arrow/pull/8591#issuecomment-726239188 Can you rebase to get a clean ASAN build? This is an automated message from the Apache Git Service. To respond to

[GitHub] [arrow] github-actions[bot] commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726238225 Revision: e4874a84fdb589391713c596e9c16c0ef2b168df Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] nealrichardson commented on pull request #8365: ARROW-6582: [R] Arrow to R fails with embedded nuls in strings

2020-11-12 Thread GitBox
nealrichardson commented on pull request #8365: URL: https://github.com/apache/arrow/pull/8365#issuecomment-726237202 What you describe (including using GetView) is essentially what we now have on master: https://github.com/apache/arrow/blob/master/r/src/array_to_vector.cpp#L290-L321

[GitHub] [arrow] bkietz commented on a change in pull request #8461: ARROW-10197: [python][Gandiva] Execute expression on filtered data

2020-11-12 Thread GitBox
bkietz commented on a change in pull request #8461: URL: https://github.com/apache/arrow/pull/8461#discussion_r522287090 ## File path: python/pyarrow/includes/libgandiva.pxd ## @@ -58,6 +67,31 @@ cdef extern from "gandiva/selection_vector.h" namespace "gandiva" nogil:

[GitHub] [arrow] pitrou opened a new pull request #8652: ARROW-10566: [C++] Allow validating ArrayData directly

2020-11-12 Thread GitBox
pitrou opened a new pull request #8652: URL: https://github.com/apache/arrow/pull/8652 Having to construct an Array from ArrayData entails various assertions which may fail. It is therefore safer to be able to validate the data *before*.

[GitHub] [arrow] kou commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
kou commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726230804 @github-actions crossbow submit wheel-win-cp39 This is an automated message from the Apache Git Service. To respond

[GitHub] [arrow] bkietz commented on pull request #8365: ARROW-6582: [R] Arrow to R fails with embedded nuls in strings

2020-11-12 Thread GitBox
bkietz commented on pull request #8365: URL: https://github.com/apache/arrow/pull/8365#issuecomment-726224318 @romainfrancois that looks good to me. I'd recommend using `GetView` instead of `GetString` to avoid allocating an unnecessary temporary for non-short strings

[GitHub] [arrow] pitrou commented on pull request #8632: ARROW-10426: [C++] Allow writing large strings to Parquet

2020-11-12 Thread GitBox
pitrou commented on pull request #8632: URL: https://github.com/apache/arrow/pull/8632#issuecomment-726224610 @kou Well, the Python tests sometimes seem to time out on MinGW... This is an automated message from the Apache

[GitHub] [arrow] github-actions[bot] commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726218308 Revision: 3488e1c6fc6b6d5393daa26549a3ea023a627512 Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] kou commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
kou commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726217209 @github-actions crossbow submit wheel-win-* This is an automated message from the Apache Git Service. To respond to

[GitHub] [arrow] kou commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
kou commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726217045 Thanks! > Is this newly introduced by github? It seems that it's introduced at 2020-06-03:

[GitHub] [arrow] kou commented on pull request #8632: ARROW-10426: [C++] Allow writing large strings to Parquet

2020-11-12 Thread GitBox
kou commented on pull request #8632: URL: https://github.com/apache/arrow/pull/8632#issuecomment-726215438 Yes. It's normal. This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [arrow] github-actions[bot] commented on pull request #8651: ARROW-10568: [C++][Parquet] Throw exception instead of crashing when …

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8651: URL: https://github.com/apache/arrow/pull/8651#issuecomment-726178310 https://issues.apache.org/jira/browse/ARROW-10568 This is an automated message from the Apache Git

[GitHub] [arrow] Kopilov commented on pull request #8605: ARROW-10508 [Java] Allow FixedSizeListVector to have empty children

2020-11-12 Thread GitBox
Kopilov commented on pull request #8605: URL: https://github.com/apache/arrow/pull/8605#issuecomment-726169585 @liyafan82 good idea, added test case to PR. Also tested together with C++ in my demo project:

[GitHub] [arrow] wesm commented on issue #8646: Predicate pushdown question

2020-11-12 Thread GitBox
wesm commented on issue #8646: URL: https://github.com/apache/arrow/issues/8646#issuecomment-726169236 Would you mind asking this question on the dev@ or user@ mailing list? Thanks! This is an automated message from the

[GitHub] [arrow] kiszk commented on a change in pull request #7507: ARROW-8797: [C++] Read RecordBatch in a different endian

2020-11-12 Thread GitBox
kiszk commented on a change in pull request #7507: URL: https://github.com/apache/arrow/pull/7507#discussion_r522218040 ## File path: cpp/src/arrow/type.h ## @@ -1604,13 +1605,26 @@ class ARROW_EXPORT FieldRef { //

[GitHub] [arrow] josiahyan opened a new pull request #8651: ARROW-10568: [C++][Parquet] Throw exception instead of crashing when …

2020-11-12 Thread GitBox
josiahyan opened a new pull request #8651: URL: https://github.com/apache/arrow/pull/8651 …initializing Parquet writer when arrow::io::OutputStream::Tell() fails This is an automated message from the Apache Git Service. To

[GitHub] [arrow] github-actions[bot] commented on pull request #8649: ARROW-10567: [C++] Add multiple perf runs options for higher precision reporting

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8649: URL: https://github.com/apache/arrow/pull/8649#issuecomment-726156692 https://issues.apache.org/jira/browse/ARROW-10567 This is an automated message from the Apache Git

[GitHub] [arrow] github-actions[bot] commented on pull request #8650: Use Converter api to convert SEXP to Array

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8650: URL: https://github.com/apache/arrow/pull/8650#issuecomment-726156395 Thanks for opening a pull request! Could you open an issue for this pull request on JIRA? https://issues.apache.org/jira/browse/ARROW Then

[GitHub] [arrow] romainfrancois opened a new pull request #8650: Use Converter api to convert SEXP to Array

2020-11-12 Thread GitBox
romainfrancois opened a new pull request #8650: URL: https://github.com/apache/arrow/pull/8650 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[GitHub] [arrow] wesm opened a new pull request #8649: ARROW-10567: [C++] Add multiple perf runs options for higher precision reporting

2020-11-12 Thread GitBox
wesm opened a new pull request #8649: URL: https://github.com/apache/arrow/pull/8649 I've found that these benchmarks have high variance on the Linux platforms (Ubuntu 18.04-based) where I've tested, and increasing the number of iterations of the test produces more precise results

[GitHub] [arrow] pitrou commented on a change in pull request #7507: ARROW-8797: [C++] Read RecordBatch in a different endian

2020-11-12 Thread GitBox
pitrou commented on a change in pull request #7507: URL: https://github.com/apache/arrow/pull/7507#discussion_r522162456 ## File path: cpp/src/arrow/type.h ## @@ -1604,13 +1605,26 @@ class ARROW_EXPORT FieldRef { //

[GitHub] [arrow] pitrou commented on pull request #8585: ARROW-10475: [C++][FlightRPC] handle IPv6 hosts

2020-11-12 Thread GitBox
pitrou commented on pull request #8585: URL: https://github.com/apache/arrow/pull/8585#issuecomment-726122921 I meant a separate function, not a method. The host doesn't necessarily come from a URI. This is an automated

[GitHub] [arrow] pitrou commented on pull request #8542: ARROW-10407: [C++] Add BasicDecimal256 Division Support

2020-11-12 Thread GitBox
pitrou commented on pull request #8542: URL: https://github.com/apache/arrow/pull/8542#issuecomment-726122152 Thanks for the updates. Can you look at the errors on CI and fix them? Thanks! This is an automated message from

[GitHub] [arrow] alamb commented on a change in pull request #8401: ARROW-10109: [Rust] Add support to the C data interface for primitive types and utf8

2020-11-12 Thread GitBox
alamb commented on a change in pull request #8401: URL: https://github.com/apache/arrow/pull/8401#discussion_r522108476 ## File path: rust/arrow-c-integration/README.md ## @@ -0,0 +1,57 @@ + + +# Arrow c integration + +This is a Rust crate that tests compatibility between

[GitHub] [arrow] kszucs commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
kszucs commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726088673 @kou enabled the workflow scope and it works now. This is an automated message from the Apache Git Service. To

[GitHub] [arrow] github-actions[bot] commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726084117 Revision: 3488e1c6fc6b6d5393daa26549a3ea023a627512 Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] kszucs commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
kszucs commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726083122 @github-actions crossbow submit wheel-win-cp38 This is an automated message from the Apache Git Service. To

[GitHub] [arrow] github-actions[bot] commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726078878 Revision: 3488e1c6fc6b6d5393daa26549a3ea023a627512 Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] kszucs commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
kszucs commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726077812 @github-actions crossbow submit wheel-win-cp38 This is an automated message from the Apache Git Service. To

[GitHub] [arrow] alamb commented on pull request #8635: ARROW-10551: [Rust] Fix unreproducible benches by seeding random number generator

2020-11-12 Thread GitBox
alamb commented on pull request #8635: URL: https://github.com/apache/arrow/pull/8635#issuecomment-726041730 Thanks again for this @vertexclique -- it is much appreciated This is an automated message from the Apache Git

[GitHub] [arrow] alamb closed pull request #8635: ARROW-10551: [Rust] Fix unreproducible benches by seeding random number generator

2020-11-12 Thread GitBox
alamb closed pull request #8635: URL: https://github.com/apache/arrow/pull/8635 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] github-actions[bot] commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
github-actions[bot] commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726029641 Revision: 3488e1c6fc6b6d5393daa26549a3ea023a627512 Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] pitrou closed pull request #8637: ARROW-10021: [C++][Compute] Return top-n modes in mode kernel

2020-11-12 Thread GitBox
pitrou closed pull request #8637: URL: https://github.com/apache/arrow/pull/8637 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] kszucs commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
kszucs commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726028859 @github-actions crossbow submit wheel-win-cp38 This is an automated message from the Apache Git Service. To

[GitHub] [arrow] kszucs edited a comment on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
kszucs edited a comment on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726026294 > It's not a problem that `actions-XXX` branch doesn't have GitHub Actions config. > It's a problem that `actions-XXX-github-YYY` branch isn't created. Correct.

[GitHub] [arrow] kszucs commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-12 Thread GitBox
kszucs commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-726026294 > It's not a problem that `actions-XXX` branch doesn't have GitHub Actions config. > It's a problem that `actions-XXX-github-YYY` branch isn't created. Correct. > > It

  1   2   >