[GitHub] [arrow] github-actions[bot] commented on pull request #8673: ARROW-10609: [Rust] Optimize min/max of non null strings

2020-11-16 Thread GitBox
github-actions[bot] commented on pull request #8673: URL: https://github.com/apache/arrow/pull/8673#issuecomment-727810644 https://issues.apache.org/jira/browse/ARROW-10609 This is an automated message from the Apache Git

[GitHub] [arrow] stefan-lange-dataeng commented on issue #8607: Deletion of existing file when write_table fails

2020-11-16 Thread GitBox
stefan-lange-dataeng commented on issue #8607: URL: https://github.com/apache/arrow/issues/8607#issuecomment-727846447 Thanks, I have created https://issues.apache.org/jira/browse/ARROW-10611. This is an automated message

[GitHub] [arrow] github-actions[bot] commented on pull request #8674: ARROW-10610: [C++] Support exponential float nontion on big-endian in fast_float

2020-11-16 Thread GitBox
github-actions[bot] commented on pull request #8674: URL: https://github.com/apache/arrow/pull/8674#issuecomment-727869105 https://issues.apache.org/jira/browse/ARROW-10610 This is an automated message from the Apache Git

[GitHub] [arrow] Dandandan opened a new pull request #8673: ARROW-10609: [Rust] Optimize min/max of non null strings

2020-11-16 Thread GitBox
Dandandan opened a new pull request #8673: URL: https://github.com/apache/arrow/pull/8673 Applies the same optimization as in ARROW-10595. Difference is smaller, but still there: ``` min string 512 time: [3.4096 us 3.4378 us 3.4683 us]

[GitHub] [arrow] pitrou commented on a change in pull request #8632: ARROW-10426: [C++] Allow writing large strings to Parquet

2020-11-16 Thread GitBox
pitrou commented on a change in pull request #8632: URL: https://github.com/apache/arrow/pull/8632#discussion_r524195648 ## File path: cpp/src/parquet/statistics.cc ## @@ -691,9 +634,60 @@ void TypedStatisticsImpl::PlainDecode(const std::string& src, dst->ptr =

[GitHub] [arrow] xhochy opened a new pull request #8675: ARROW-10616: [Developer] Expand PR labeler to all supported languages

2020-11-16 Thread GitBox
xhochy opened a new pull request #8675: URL: https://github.com/apache/arrow/pull/8675 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [arrow] kiszk commented on pull request #8674: ARROW-10610: [C++] Support exponential float nontion on big-endian in fast_float

2020-11-16 Thread GitBox
kiszk commented on pull request #8674: URL: https://github.com/apache/arrow/pull/8674#issuecomment-727958817 Not yet. I will report this issue today. This is an automated message from the Apache Git Service. To respond to

[GitHub] [arrow] github-actions[bot] commented on pull request #8677: ARROW-10617: [Python] Fixes RecordBatchStreamReader's iteraction bug in python 3.8.

2020-11-16 Thread GitBox
github-actions[bot] commented on pull request #8677: URL: https://github.com/apache/arrow/pull/8677#issuecomment-727997218 https://issues.apache.org/jira/browse/ARROW-10617 This is an automated message from the Apache Git

[GitHub] [arrow] pitrou closed pull request #8585: ARROW-10475: [C++][FlightRPC] handle IPv6 hosts

2020-11-16 Thread GitBox
pitrou closed pull request #8585: URL: https://github.com/apache/arrow/pull/8585 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] pitrou commented on a change in pull request #8632: ARROW-10426: [C++] Allow writing large strings to Parquet

2020-11-16 Thread GitBox
pitrou commented on a change in pull request #8632: URL: https://github.com/apache/arrow/pull/8632#discussion_r524192977 ## File path: cpp/src/parquet/encoding.cc ## @@ -127,6 +129,21 @@ class PlainEncoder : public EncoderImpl, virtual public TypedEncoder { }

[GitHub] [arrow] pitrou commented on a change in pull request #8632: ARROW-10426: [C++] Allow writing large strings to Parquet

2020-11-16 Thread GitBox
pitrou commented on a change in pull request #8632: URL: https://github.com/apache/arrow/pull/8632#discussion_r524193403 ## File path: cpp/src/parquet/encoding.cc ## @@ -127,6 +129,21 @@ class PlainEncoder : public EncoderImpl, virtual public TypedEncoder { }

[GitHub] [arrow] alamb commented on pull request #8660: ARROW-10173: [Rust][DataFusion] Implement support for direct comparison to scalar values

2020-11-16 Thread GitBox
alamb commented on pull request #8660: URL: https://github.com/apache/arrow/pull/8660#issuecomment-727983561 > @alamb it might be necessary to fallback to generating an array where the scalar value is repeated, for some operations that do not have a version which accepts a scalar argument

[GitHub] [arrow] xhochy opened a new pull request #8676: WIP: autotune

2020-11-16 Thread GitBox
xhochy opened a new pull request #8676: URL: https://github.com/apache/arrow/pull/8676 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [arrow] xhochy commented on pull request #8676: WIP: autotune

2020-11-16 Thread GitBox
xhochy commented on pull request #8676: URL: https://github.com/apache/arrow/pull/8676#issuecomment-727946765 @github-actions autotune everything This is an automated message from the Apache Git Service. To respond to the

[GitHub] [arrow] xhochy closed pull request #8676: WIP: autotune

2020-11-16 Thread GitBox
xhochy closed pull request #8676: URL: https://github.com/apache/arrow/pull/8676 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] pitrou commented on pull request #8671: ARROW-10598: [C++] Separate out bit-packing in internal::GenerateBitsUnrolled for better performance

2020-11-16 Thread GitBox
pitrou commented on pull request #8671: URL: https://github.com/apache/arrow/pull/8671#issuecomment-727959654 The PR looks reasonable to me. This is an automated message from the Apache Git Service. To respond to the

[GitHub] [arrow] pitrou closed pull request #8653: ARROW-10560: [Python] Fix crash when creating array from huge string

2020-11-16 Thread GitBox
pitrou closed pull request #8653: URL: https://github.com/apache/arrow/pull/8653 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] vertexclique commented on pull request #8665: ARROW-10589: [Rust] Implement AVX-512 bit and operation

2020-11-16 Thread GitBox
vertexclique commented on pull request #8665: URL: https://github.com/apache/arrow/pull/8665#issuecomment-727912338 @andygrove @nevi-me https://issues.apache.org/jira/browse/ARROW-10612 Umbrella issue for AVX-512. Includes CI support follow up subtask. I will create a subtask for every

[GitHub] [arrow] github-actions[bot] commented on pull request #8675: ARROW-10616: [Developer] Expand PR labeler to all supported languages

2020-11-16 Thread GitBox
github-actions[bot] commented on pull request #8675: URL: https://github.com/apache/arrow/pull/8675#issuecomment-727947748 https://issues.apache.org/jira/browse/ARROW-10616 This is an automated message from the Apache Git

[GitHub] [arrow] sighingnow opened a new pull request #8677: ARROW-10617: [Python] Fixes RecordBatchStreamReader's iteraction bug in python 3.8.

2020-11-16 Thread GitBox
sighingnow opened a new pull request #8677: URL: https://github.com/apache/arrow/pull/8677 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [arrow] alamb closed pull request #8658: ARROW-10577: [Rust][DataFusion] HashAggregator stream finishes unexpectedly after going to Pending state - tests

2020-11-16 Thread GitBox
alamb closed pull request #8658: URL: https://github.com/apache/arrow/pull/8658 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] xhochy commented on pull request #8678: WIP: Debug Windows/R issues

2020-11-16 Thread GitBox
xhochy commented on pull request #8678: URL: https://github.com/apache/arrow/pull/8678#issuecomment-728059430 @github-actions crossbow submit conda-win-vs2017-py37 This is an automated message from the Apache Git Service. To

[GitHub] [arrow] vertexclique commented on a change in pull request #8673: ARROW-10609: [Rust] Optimize min/max of non null strings

2020-11-16 Thread GitBox
vertexclique commented on a change in pull request #8673: URL: https://github.com/apache/arrow/pull/8673#discussion_r524238466 ## File path: rust/arrow/benches/aggregate_kernels.rs ## @@ -43,6 +43,25 @@ fn create_array(size: usize, with_nulls: bool) -> ArrayRef {

[GitHub] [arrow] vertexclique commented on pull request #8673: ARROW-10609: [Rust] Optimize min/max of non null strings

2020-11-16 Thread GitBox
vertexclique commented on pull request #8673: URL: https://github.com/apache/arrow/pull/8673#issuecomment-727958046 Liked it, can you address the comments, that will make some improvements on the benches indirectly. This is

[GitHub] [arrow] xhochy opened a new pull request #8678: WIP: Debug Windows/R issues

2020-11-16 Thread GitBox
xhochy opened a new pull request #8678: URL: https://github.com/apache/arrow/pull/8678 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [arrow] github-actions[bot] commented on pull request #8678: WIP: Debug Windows/R issues

2020-11-16 Thread GitBox
github-actions[bot] commented on pull request #8678: URL: https://github.com/apache/arrow/pull/8678#issuecomment-728065382 Revision: 061566200fb4db37ec973539c7f6483b468d1a2e Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] pitrou commented on a change in pull request #8542: ARROW-10407: [C++] Add BasicDecimal256 Division Support

2020-11-16 Thread GitBox
pitrou commented on a change in pull request #8542: URL: https://github.com/apache/arrow/pull/8542#discussion_r524304063 ## File path: cpp/src/arrow/util/basic_decimal.cc ## @@ -490,49 +527,60 @@ static void FixDivisionSigns(BasicDecimal128* result, BasicDecimal128* remainder

[GitHub] [arrow] pitrou commented on pull request #8632: ARROW-10426: [C++] Allow writing large strings to Parquet

2020-11-16 Thread GitBox
pitrou commented on pull request #8632: URL: https://github.com/apache/arrow/pull/8632#issuecomment-728110464 @emkornfield Any other concern? This is an automated message from the Apache Git Service. To respond to the

[GitHub] [arrow] vertexclique commented on a change in pull request #8634: ARROW-10535: [Rust] Implement bit & byte slice iterator for big endian platforms

2020-11-16 Thread GitBox
vertexclique commented on a change in pull request #8634: URL: https://github.com/apache/arrow/pull/8634#discussion_r524313352 ## File path: rust/arrow/src/util/bit_slice_iterator.rs ## @@ -0,0 +1,108 @@ +#[cfg(all(test, target_endian = "big"))] Review comment: I think

[GitHub] [arrow] github-actions[bot] commented on pull request #8679: ARROW-10619: [C++] Fix IPC validation regressions

2020-11-16 Thread GitBox
github-actions[bot] commented on pull request #8679: URL: https://github.com/apache/arrow/pull/8679#issuecomment-728122505 https://issues.apache.org/jira/browse/ARROW-10619 This is an automated message from the Apache Git

[GitHub] [arrow] kiszk commented on pull request #8674: ARROW-10610: [C++] Support exponential float nontion on big-endian in fast_float

2020-11-16 Thread GitBox
kiszk commented on pull request #8674: URL: https://github.com/apache/arrow/pull/8674#issuecomment-728155154 Reported at https://github.com/lemire/fast_float/issues/37 This is an automated message from the Apache Git

[GitHub] [arrow] nealrichardson commented on pull request #8643: ARROW-10522: [R] Allow rename Table and RecordBatch columns with names()

2020-11-16 Thread GitBox
nealrichardson commented on pull request #8643: URL: https://github.com/apache/arrow/pull/8643#issuecomment-728154879 @bkietz ☝️ This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [arrow] pitrou commented on a change in pull request #8542: ARROW-10407: [C++] Add BasicDecimal256 Division Support

2020-11-16 Thread GitBox
pitrou commented on a change in pull request #8542: URL: https://github.com/apache/arrow/pull/8542#discussion_r524305990 ## File path: cpp/src/arrow/util/basic_decimal.cc ## @@ -490,49 +529,64 @@ static void FixDivisionSigns(BasicDecimal128* result, BasicDecimal128* remainder

[GitHub] [arrow] pitrou opened a new pull request #8679: ARROW-10619: [C++] Fix IPC validation regressions

2020-11-16 Thread GitBox
pitrou opened a new pull request #8679: URL: https://github.com/apache/arrow/pull/8679 Some cases of invalid IPC stream were missed following the ARROW-10619 refactor. This is an automated message from the Apache Git

[GitHub] [arrow] wesm commented on a change in pull request #8671: ARROW-10598: [C++] Separate out bit-packing in internal::GenerateBitsUnrolled for better performance

2020-11-16 Thread GitBox
wesm commented on a change in pull request #8671: URL: https://github.com/apache/arrow/pull/8671#discussion_r524359452 ## File path: cpp/src/arrow/util/bitmap_generate.h ## @@ -83,17 +83,19 @@ void GenerateBitsUnrolled(uint8_t* bitmap, int64_t start_offset, int64_t length,

[GitHub] [arrow] pitrou commented on a change in pull request #8542: ARROW-10407: [C++] Add BasicDecimal256 Division Support

2020-11-16 Thread GitBox
pitrou commented on a change in pull request #8542: URL: https://github.com/apache/arrow/pull/8542#discussion_r524305460 ## File path: cpp/src/arrow/util/basic_decimal.cc ## @@ -490,49 +529,64 @@ static void FixDivisionSigns(BasicDecimal128* result, BasicDecimal128* remainder

[GitHub] [arrow] pitrou commented on pull request #8624: ARROW-10532: [Python] Fix metadata in Table.from_pandas conversion with specified schema with different column order

2020-11-16 Thread GitBox
pitrou commented on pull request #8624: URL: https://github.com/apache/arrow/pull/8624#issuecomment-728124091 Rebased, will merge. This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [arrow] nealrichardson commented on pull request #8643: ARROW-10522: [R] Allow rename Table and RecordBatch columns with names()

2020-11-16 Thread GitBox
nealrichardson commented on pull request #8643: URL: https://github.com/apache/arrow/pull/8643#issuecomment-728153475 Are you sure it's not from the PR I merged before this? https://github.com/apache/arrow/pull/8256 seems like a more likely culprit to have broken something. Btw, we

[GitHub] [arrow] lemire commented on pull request #8674: ARROW-10610: [C++] Support exponential float nontion on big-endian in fast_float

2020-11-16 Thread GitBox
lemire commented on pull request #8674: URL: https://github.com/apache/arrow/pull/8674#issuecomment-728178811 @kiszk Thanks for the report. @pitrou How do you test Big Endian support? I have different ARM boards, but none of them run in Big Endian mode. I have not been able to find

[GitHub] [arrow] github-actions[bot] commented on pull request #8680: ARROW-10182: [C++] Add basic continuation support to Future

2020-11-16 Thread GitBox
github-actions[bot] commented on pull request #8680: URL: https://github.com/apache/arrow/pull/8680#issuecomment-728205092 https://issues.apache.org/jira/browse/ARROW-10182 This is an automated message from the Apache Git

[GitHub] [arrow] lemire commented on pull request #8674: ARROW-10610: [C++] Support exponential float nontion on big-endian in fast_float

2020-11-16 Thread GitBox
lemire commented on pull request #8674: URL: https://github.com/apache/arrow/pull/8674#issuecomment-728217863 (The hard part is to add the necessary CI testing.) This is an automated message from the Apache Git Service. To

[GitHub] [arrow] lemire commented on pull request #8674: ARROW-10610: [C++] Support exponential float nontion on big-endian in fast_float

2020-11-16 Thread GitBox
lemire commented on pull request #8674: URL: https://github.com/apache/arrow/pull/8674#issuecomment-728217716 We are looking into fixing this upstream (big endian support). This is an automated message from the Apache Git

[GitHub] [arrow] lemire commented on pull request #8674: ARROW-10610: [C++] Support exponential float nontion on big-endian in fast_float

2020-11-16 Thread GitBox
lemire commented on pull request #8674: URL: https://github.com/apache/arrow/pull/8674#issuecomment-728226679 (I managed to add CI tests now.) This is an automated message from the Apache Git Service. To respond to the

[GitHub] [arrow] Dandandan commented on a change in pull request #8673: ARROW-10609: [Rust] Optimize min/max of non null strings

2020-11-16 Thread GitBox
Dandandan commented on a change in pull request #8673: URL: https://github.com/apache/arrow/pull/8673#discussion_r524387603 ## File path: rust/arrow/benches/aggregate_kernels.rs ## @@ -43,6 +43,25 @@ fn create_array(size: usize, with_nulls: bool) -> ArrayRef {

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #8640: ARROW-4193: [Rust] Add support for decimal data type

2020-11-16 Thread GitBox
jorgecarleitao commented on a change in pull request #8640: URL: https://github.com/apache/arrow/pull/8640#discussion_r524410551 ## File path: rust/arrow/src/array/builder.rs ## @@ -1882,6 +1975,67 @@ impl FixedSizeBinaryBuilder { } } +impl DecimalBuilder { +///

[GitHub] [arrow] bkietz opened a new pull request #8680: ARROW-10182: [C++] Add basic continuation support to Future

2020-11-16 Thread GitBox
bkietz opened a new pull request #8680: URL: https://github.com/apache/arrow/pull/8680 Adds `Future::Then(OnSuccess, OnFailure)` which registers callbacks to be executed on completion of the future and yields a future which wraps the result of those callbacks; if a callback returns:

[GitHub] [arrow] velvia opened a new pull request #8688: ARROW-10330: [Rust][DataFusion] Implement NULLIF() SQL function

2020-11-16 Thread GitBox
velvia opened a new pull request #8688: URL: https://github.com/apache/arrow/pull/8688 This PR implements the NULLIF() SQL function in DataFusion. It is implemented as a BuiltInScalarFunction, with a boolean kernel at the core which creates a new array with a modified null bitmap from

[GitHub] [arrow] velvia commented on pull request #8688: ARROW-10330: [Rust][DataFusion] Implement NULLIF() SQL function

2020-11-16 Thread GitBox
velvia commented on pull request #8688: URL: https://github.com/apache/arrow/pull/8688#issuecomment-728504277 @andygrove @nevi-me would love to hear your feedback on this this addition has been useful to us internally.

[GitHub] [arrow] codecov-io commented on pull request #8660: ARROW-10173: [Rust][DataFusion] Implement support for direct comparison to scalar values

2020-11-16 Thread GitBox
codecov-io commented on pull request #8660: URL: https://github.com/apache/arrow/pull/8660#issuecomment-728556384 # [Codecov](https://codecov.io/gh/apache/arrow/pull/8660?src=pr=h1) Report > Merging [#8660](https://codecov.io/gh/apache/arrow/pull/8660?src=pr=desc) (e5fce7f) into

[GitHub] [arrow] kou commented on pull request #8386: ARROW-10224: [Python] Build, test, and support Python 3.9

2020-11-16 Thread GitBox
kou commented on pull request #8386: URL: https://github.com/apache/arrow/pull/8386#issuecomment-728321361 The current status: * We can't use old packages that depend on `vs2015_runtime<14.16` on conda-forge for Python 3.9 on Windows. Because Python 3.9 for Windows on

[GitHub] [arrow] lemire commented on pull request #8674: ARROW-10610: [C++] Support exponential float nontion on big-endian in fast_float

2020-11-16 Thread GitBox
lemire commented on pull request #8674: URL: https://github.com/apache/arrow/pull/8674#issuecomment-728582428 @wesm Thanks to @kiszk, I figured that out. That we could run CI on IBM mainframe-like systems was a shock to me. :-) I knew about POWER, but not that. So I learned

[GitHub] [arrow] pitrou closed pull request #8659: ARROW-10480: [Python] don't infer compression by extension for Parquet

2020-11-16 Thread GitBox
pitrou closed pull request #8659: URL: https://github.com/apache/arrow/pull/8659 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] github-actions[bot] commented on pull request #8688: ARROW-10330: [Rust][DataFusion] Implement NULLIF() SQL function

2020-11-16 Thread GitBox
github-actions[bot] commented on pull request #8688: URL: https://github.com/apache/arrow/pull/8688#issuecomment-728568531 https://issues.apache.org/jira/browse/ARROW-10330 This is an automated message from the Apache Git

[GitHub] [arrow] pitrou closed pull request #8624: ARROW-10532: [Python] Fix metadata in Table.from_pandas conversion with specified schema with different column order

2020-11-16 Thread GitBox
pitrou closed pull request #8624: URL: https://github.com/apache/arrow/pull/8624 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] alamb closed pull request #8665: ARROW-10589: [Rust] Implement AVX-512 bit and operation

2020-11-16 Thread GitBox
alamb closed pull request #8665: URL: https://github.com/apache/arrow/pull/8665 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] alamb commented on pull request #8673: ARROW-10609: [Rust] Optimize min/max of non null strings

2020-11-16 Thread GitBox
alamb commented on pull request #8673: URL: https://github.com/apache/arrow/pull/8673#issuecomment-728349940 CI failure seems to be infrastructure related: ``` C:\windows\System32\tar.exe -cz -f D:\a\_temp\a71516c7-d573-4e09-b21a-03fde6e0e547\cache.tgz -C

[GitHub] [arrow] github-actions[bot] commented on pull request #8681: ARROW-10622: [R] Nameof should not use "void" as the crib

2020-11-16 Thread GitBox
github-actions[bot] commented on pull request #8681: URL: https://github.com/apache/arrow/pull/8681#issuecomment-728320080 https://issues.apache.org/jira/browse/ARROW-10622 This is an automated message from the Apache Git

[GitHub] [arrow] alamb commented on pull request #8666: ARROW-10269: [Rust] Update to 2020-11-14 nightly

2020-11-16 Thread GitBox
alamb commented on pull request #8666: URL: https://github.com/apache/arrow/pull/8666#issuecomment-728397579 @nevi-me check it out: https://github.com/apache/arrow/pull/8687 -- which I think enables clippy lint in all parts of the project

[GitHub] [arrow] alamb commented on pull request #8640: ARROW-4193: [Rust] Add support for decimal data type

2020-11-16 Thread GitBox
alamb commented on pull request #8640: URL: https://github.com/apache/arrow/pull/8640#issuecomment-728398825 @jorgecarleitao I will put it on my queue for tomorrow. Hopefully the morning This is an automated message from the

[GitHub] [arrow] wesm commented on pull request #8665: ARROW-10589: [Rust] Implement AVX-512 bit and operation

2020-11-16 Thread GitBox
wesm commented on pull request #8665: URL: https://github.com/apache/arrow/pull/8665#issuecomment-728507289 I'm looking at contributing an AVX-512-capable machine to run occasional builds on Buildkite, I'd guess we're looking at 2-3 month time frame for that though. Note that anyone can

[GitHub] [arrow] bkietz commented on pull request #8680: ARROW-10182: [C++] Add basic continuation support to Future

2020-11-16 Thread GitBox
bkietz commented on pull request #8680: URL: https://github.com/apache/arrow/pull/8680#issuecomment-728333688 Added a benchmark to measure the cost of creating a Future in Executor::Submit instead of just using Executor::Spawn. Locally: ```

[GitHub] [arrow] codecov-io commented on pull request #8542: ARROW-10407: [C++] Add BasicDecimal256 Division Support

2020-11-16 Thread GitBox
codecov-io commented on pull request #8542: URL: https://github.com/apache/arrow/pull/8542#issuecomment-728347240 # [Codecov](https://codecov.io/gh/apache/arrow/pull/8542?src=pr=h1) Report > Merging [#8542](https://codecov.io/gh/apache/arrow/pull/8542?src=pr=desc) (1df4d0c) into

[GitHub] [arrow] yordan-pavlov commented on pull request #8660: ARROW-10173: [Rust][DataFusion] Implement support for direct comparison to scalar values

2020-11-16 Thread GitBox
yordan-pavlov commented on pull request #8660: URL: https://github.com/apache/arrow/pull/8660#issuecomment-728337269 @alamb I have now implemented falling back to scalar arrays for operations where scalar arguments are currently not supported; this should now work for operators such as

[GitHub] [arrow] alamb commented on pull request #8673: ARROW-10609: [Rust] Optimize min/max of non null strings

2020-11-16 Thread GitBox
alamb commented on pull request #8673: URL: https://github.com/apache/arrow/pull/8673#issuecomment-728362759 The TravisCI builds are not affected by Rust, so merging this one in This is an automated message from the Apache

[GitHub] [arrow] alamb commented on a change in pull request #8687: ARROW-10636: [Rust] flag clippy warnings as errors

2020-11-16 Thread GitBox
alamb commented on a change in pull request #8687: URL: https://github.com/apache/arrow/pull/8687#discussion_r524703589 ## File path: rust/parquet_derive/src/parquet_field.rs ## @@ -252,6 +252,8 @@ impl Field { } } +#[allow(clippy::enum_variant_names)] Review comment:

[GitHub] [arrow] alamb commented on a change in pull request #8687: ARROW-10636: [Rust] flag clippy warnings as errors

2020-11-16 Thread GitBox
alamb commented on a change in pull request #8687: URL: https://github.com/apache/arrow/pull/8687#discussion_r524702929 ## File path: ci/scripts/rust_lint.sh ## @@ -17,10 +17,14 @@ # specific language governing permissions and limitations # under the License. +# Entrypoint

[GitHub] [arrow] alamb commented on pull request #8660: ARROW-10173: [Rust][DataFusion] Implement support for direct comparison to scalar values

2020-11-16 Thread GitBox
alamb commented on pull request #8660: URL: https://github.com/apache/arrow/pull/8660#issuecomment-728393610 @yordan-pavlov When I ran the benchmark locally again on my laptop: ``` cargo run --release --bin tpch -- --iterations 3 --path

[GitHub] [arrow] alamb edited a comment on pull request #8660: ARROW-10173: [Rust][DataFusion] Implement support for direct comparison to scalar values

2020-11-16 Thread GitBox
alamb edited a comment on pull request #8660: URL: https://github.com/apache/arrow/pull/8660#issuecomment-728393610 @yordan-pavlov When I ran the benchmark locally again on my laptop: ``` cargo run --release --bin tpch -- --iterations 3 --path

[GitHub] [arrow] alamb opened a new pull request #8687: ARROW-10636: flag clippy warnings as errors

2020-11-16 Thread GitBox
alamb opened a new pull request #8687: URL: https://github.com/apache/arrow/pull/8687 We do currently run clippy (aka rust linter) on all PRs. However, the way we invoke clippy means that any clippy warnings introduced in PRs do not cause a CI failure I believe the intended

[GitHub] [arrow] alamb closed pull request #8673: ARROW-10609: [Rust] Optimize min/max of non null strings

2020-11-16 Thread GitBox
alamb closed pull request #8673: URL: https://github.com/apache/arrow/pull/8673 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] wesm commented on pull request #8674: ARROW-10610: [C++] Support exponential float nontion on big-endian in fast_float

2020-11-16 Thread GitBox
wesm commented on pull request #8674: URL: https://github.com/apache/arrow/pull/8674#issuecomment-728528823 Travis CI has s390x architecture (that's how we've been testing on BE) This is an automated message from the Apache

[GitHub] [arrow] jorgecarleitao closed pull request #8565: ARROW-10451: [Rust] Add support for merge-sort

2020-11-16 Thread GitBox
jorgecarleitao closed pull request #8565: URL: https://github.com/apache/arrow/pull/8565 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [arrow] jorgecarleitao edited a comment on pull request #8630: ARROW-10540 [Rust] Improve filtering

2020-11-16 Thread GitBox
jorgecarleitao edited a comment on pull request #8630: URL: https://github.com/apache/arrow/pull/8630#issuecomment-728736301 This code can be re-used to implement `merge-sort` and `join`. Some notes about this here:

[GitHub] [arrow] jorgecarleitao commented on pull request #8630: ARROW-10540 [Rust] Improve filtering

2020-11-16 Thread GitBox
jorgecarleitao commented on pull request #8630: URL: https://github.com/apache/arrow/pull/8630#issuecomment-728736301 This code will be re-used to implement `merge-sort` and `join`. Some notes about this here:

[GitHub] [arrow] jorgecarleitao commented on pull request #8687: ARROW-10628: [Rust] flag clippy warnings as errors

2020-11-16 Thread GitBox
jorgecarleitao commented on pull request #8687: URL: https://github.com/apache/arrow/pull/8687#issuecomment-728703547 @alamb There was no issue 10636 on Jira, so I created one, assigned it to you, and assigned this PR to it.

[GitHub] [arrow] jorgecarleitao closed pull request #8687: ARROW-10628: [Rust] flag clippy warnings as errors

2020-11-16 Thread GitBox
jorgecarleitao closed pull request #8687: URL: https://github.com/apache/arrow/pull/8687 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [arrow] sweb commented on a change in pull request #8640: ARROW-4193: [Rust] Add support for decimal data type

2020-11-16 Thread GitBox
sweb commented on a change in pull request #8640: URL: https://github.com/apache/arrow/pull/8640#discussion_r524925542 ## File path: rust/arrow/src/array/equal/mod.rs ## @@ -604,6 +613,76 @@ mod tests { test_equal(_slice, _slice, true); } +fn

[GitHub] [arrow] github-actions[bot] commented on pull request #8689: ARROW-10591: [Rust] Added support for filter of StructArray

2020-11-16 Thread GitBox
github-actions[bot] commented on pull request #8689: URL: https://github.com/apache/arrow/pull/8689#issuecomment-728715819 https://issues.apache.org/jira/browse/ARROW-10591 This is an automated message from the Apache Git

[GitHub] [arrow] sweb commented on a change in pull request #8640: ARROW-4193: [Rust] Add support for decimal data type

2020-11-16 Thread GitBox
sweb commented on a change in pull request #8640: URL: https://github.com/apache/arrow/pull/8640#discussion_r524922526 ## File path: rust/arrow/src/array/builder.rs ## @@ -1882,6 +1975,67 @@ impl FixedSizeBinaryBuilder { } } +impl DecimalBuilder { +/// Creates a

[GitHub] [arrow] sighingnow commented on pull request #8677: ARROW-10617: [Python] Fix RecordBatchStreamReader iteration with Python 3.8

2020-11-16 Thread GitBox
sighingnow commented on pull request #8677: URL: https://github.com/apache/arrow/pull/8677#issuecomment-728669860 I do think a test is needed, but the case has already been tested in `test_ipc.py` (e.g., `test_stream_simple_roundtrip`) but seems it was skiped in the Python 3.8 case (in

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #8640: ARROW-4193: [Rust] Add support for decimal data type

2020-11-16 Thread GitBox
jorgecarleitao commented on a change in pull request #8640: URL: https://github.com/apache/arrow/pull/8640#discussion_r524923532 ## File path: rust/arrow/src/array/builder.rs ## @@ -1882,6 +1975,67 @@ impl FixedSizeBinaryBuilder { } } +impl DecimalBuilder { +///

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #8688: ARROW-10330: [Rust][DataFusion] Implement NULLIF() SQL function

2020-11-16 Thread GitBox
jorgecarleitao commented on a change in pull request #8688: URL: https://github.com/apache/arrow/pull/8688#discussion_r524889515 ## File path: rust/arrow/src/compute/kernels/boolean.rs ## @@ -149,6 +150,64 @@ pub fn is_not_null(input: ) -> Result {

[GitHub] [arrow] nevi-me commented on pull request #8640: ARROW-4193: [Rust] Add support for decimal data type

2020-11-16 Thread GitBox
nevi-me commented on pull request #8640: URL: https://github.com/apache/arrow/pull/8640#issuecomment-728739069 Hi @sweb, I'm providing general comments, I'll look at this in detail over the days. I see that you're using `i128` as the backing type. One of the reasons that's

[GitHub] [arrow] jhorstmann opened a new pull request #8685: ARROW-10216: [Rust] Simd implementation for primitive min/max kernels

2020-11-16 Thread GitBox
jhorstmann opened a new pull request #8685: URL: https://github.com/apache/arrow/pull/8685 This refactors the simd aggregation to a reusable trait and adds implementations for min and max. Some tests were failing with the simd feature active because the different order of additions

[GitHub] [arrow] github-actions[bot] commented on pull request #8685: ARROW-10216: [Rust] Simd implementation for primitive min/max kernels

2020-11-16 Thread GitBox
github-actions[bot] commented on pull request #8685: URL: https://github.com/apache/arrow/pull/8685#issuecomment-728284606 https://issues.apache.org/jira/browse/ARROW-10216 This is an automated message from the Apache Git

[GitHub] [arrow] jhorstmann opened a new pull request #8686: ARROW-10297: [Rust] Change output of parquet-read binary to json

2020-11-16 Thread GitBox
jhorstmann opened a new pull request #8686: URL: https://github.com/apache/arrow/pull/8686 This makes it much easier to analyze parquet files, for example by processing the output with other command line tools like `jq`. I'm opening this as a draft for now since I'd like some

[GitHub] [arrow] github-actions[bot] commented on pull request #8686: ARROW-10297: [Rust] Change output of parquet-read binary to json

2020-11-16 Thread GitBox
github-actions[bot] commented on pull request #8686: URL: https://github.com/apache/arrow/pull/8686#issuecomment-728294214 https://issues.apache.org/jira/browse/ARROW-10297 This is an automated message from the Apache Git

[GitHub] [arrow] kou closed pull request #8675: ARROW-10616: [Developer] Expand PR labeler to all supported languages

2020-11-16 Thread GitBox
kou closed pull request #8675: URL: https://github.com/apache/arrow/pull/8675 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] github-actions[bot] commented on pull request #8681: [WIP] use 'double' for the crib instead of 'void'

2020-11-16 Thread GitBox
github-actions[bot] commented on pull request #8681: URL: https://github.com/apache/arrow/pull/8681#issuecomment-728236716 Revision: ea40b46b231b0f250a38e28660c25463b997 Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] rdettai opened a new pull request #8682: ARROW-10620: [Rust][Parquet] move column chunk range logic to metadata.rs

2020-11-16 Thread GitBox
rdettai opened a new pull request #8682: URL: https://github.com/apache/arrow/pull/8682 > Getting the range of bytes of a column chunk inside a parquet file can be useful for external crates (for instance if they want to pre-fetch the columns), and is not completely obvious (it is enough

[GitHub] [arrow] github-actions[bot] commented on pull request #8682: ARROW-10620: [Rust][Parquet] move column chunk range logic to metadata.rs

2020-11-16 Thread GitBox
github-actions[bot] commented on pull request #8682: URL: https://github.com/apache/arrow/pull/8682#issuecomment-728241051 https://issues.apache.org/jira/browse/ARROW-10620 This is an automated message from the Apache Git

[GitHub] [arrow] rdettai commented on pull request #8682: ARROW-10620: [Rust][Parquet] move column chunk range logic to metadata.rs

2020-11-16 Thread GitBox
rdettai commented on pull request #8682: URL: https://github.com/apache/arrow/pull/8682#issuecomment-728245070 I did not expect to encounter problems on this tiny PR, but it seems that the file `alltypes_plain.parquet` that is used in datafusion tests has some very weird metadata 

[GitHub] [arrow] bkietz opened a new pull request #8681: [WIP] use 'double' for the crib instead of 'void'

2020-11-16 Thread GitBox
bkietz opened a new pull request #8681: URL: https://github.com/apache/arrow/pull/8681 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [arrow] bkietz commented on pull request #8681: [WIP] use 'double' for the crib instead of 'void'

2020-11-16 Thread GitBox
bkietz commented on pull request #8681: URL: https://github.com/apache/arrow/pull/8681#issuecomment-728235768 @github-actions crossbow submit conda-win-vs2017-py37 This is an automated message from the Apache Git Service. To

[GitHub] [arrow] rdettai removed a comment on pull request #8682: ARROW-10620: [Rust][Parquet] move column chunk range logic to metadata.rs

2020-11-16 Thread GitBox
rdettai removed a comment on pull request #8682: URL: https://github.com/apache/arrow/pull/8682#issuecomment-728245070 I did not expect to encounter problems on this tiny PR, but it seems that the file `alltypes_plain.parquet` that is used in datafusion tests has some very weird metadata

[GitHub] [arrow] lemire commented on pull request #8674: ARROW-10610: [C++] Support exponential float nontion on big-endian in fast_float

2020-11-16 Thread GitBox
lemire commented on pull request #8674: URL: https://github.com/apache/arrow/pull/8674#issuecomment-728252123 The issue has been fixed upstream (big endian support). I am running exhaustive tests and once they complete, I will issue a new release. If you wish, you can wait for this

[GitHub] [arrow] kiszk opened a new pull request #8683: ARROW-10621: [C++] Put required libraries into the common directory

2020-11-16 Thread GitBox
kiszk opened a new pull request #8683: URL: https://github.com/apache/arrow/pull/8683 This PR copies the required shared libraries into the directory (`${ARROW_HOME}/lib`) where the test will refer to thru LD_LIBRARY_PATH.

[GitHub] [arrow] github-actions[bot] commented on pull request #8683: ARROW-10621: [C++] Put required libraries into the common directory

2020-11-16 Thread GitBox
github-actions[bot] commented on pull request #8683: URL: https://github.com/apache/arrow/pull/8683#issuecomment-728253232 https://issues.apache.org/jira/browse/ARROW-10621 This is an automated message from the Apache Git

[GitHub] [arrow] nealrichardson closed pull request #8618: ARROW-10530: [R] Optionally use distro package in linuxlibs.R

2020-11-16 Thread GitBox
nealrichardson closed pull request #8618: URL: https://github.com/apache/arrow/pull/8618 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [arrow] Dandandan commented on a change in pull request #8673: ARROW-10609: [Rust] Optimize min/max of non null strings

2020-11-16 Thread GitBox
Dandandan commented on a change in pull request #8673: URL: https://github.com/apache/arrow/pull/8673#discussion_r52451 ## File path: rust/arrow/src/compute/kernels/aggregate.rs ## @@ -32,19 +32,20 @@ fn min_max_string bool>( if null_count == array.len() {

  1   2   >