[GitHub] [arrow] nevi-me commented on a change in pull request #8664: ARROW-10588: [Rust] Safe and parallel bit operations for Arrow

2020-11-25 Thread GitBox
nevi-me commented on a change in pull request #8664: URL: https://github.com/apache/arrow/pull/8664#discussion_r530293315 ## File path: rust/arrow/Cargo.toml ## @@ -48,11 +49,12 @@ lazy_static = "1.4" packed_simd = { version = "0.3.4", optional = true, package =

[GitHub] [arrow] vertexclique commented on a change in pull request #8664: ARROW-10588: [Rust] Safe and parallel bit operations for Arrow

2020-11-25 Thread GitBox
vertexclique commented on a change in pull request #8664: URL: https://github.com/apache/arrow/pull/8664#discussion_r530326379 ## File path: rust/arrow/Cargo.toml ## @@ -48,11 +49,12 @@ lazy_static = "1.4" packed_simd = { version = "0.3.4", optional = true, package =

[GitHub] [arrow] nevi-me commented on pull request #8664: ARROW-10588: [Rust] Safe and parallel bit operations for Arrow

2020-11-25 Thread GitBox
nevi-me commented on pull request #8664: URL: https://github.com/apache/arrow/pull/8664#issuecomment-733668680 > @nevi-me , can I have 12h to review it? Yup, I more meant that if someone else picks up things they'd like addressed, we could open JIRAs for them instead of trying to

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #8664: ARROW-10588: [Rust] Safe and parallel bit operations for Arrow

2020-11-25 Thread GitBox
jorgecarleitao commented on a change in pull request #8664: URL: https://github.com/apache/arrow/pull/8664#discussion_r530329221 ## File path: rust/arrow/src/compute/kernels/aggregate.rs ## @@ -141,32 +146,41 @@ where match array.data().null_buffer() { None =>

[GitHub] [arrow] alamb commented on pull request #8752: ARROW-10705: [Rust] Loosen restrictions on some lifetime annotations

2020-11-25 Thread GitBox
alamb commented on pull request #8752: URL: https://github.com/apache/arrow/pull/8752#issuecomment-733678504 Thank you @carols10cents -- this was an epic PR description This is an automated message from the Apache Git

[GitHub] [arrow] Dandandan commented on a change in pull request #8769: ARROW-10710: [Rust] Revert tokio upgrade, go back to 0.2

2020-11-25 Thread GitBox
Dandandan commented on a change in pull request #8769: URL: https://github.com/apache/arrow/pull/8769#discussion_r530338275 ## File path: rust/benchmarks/Cargo.toml ## @@ -30,5 +30,5 @@ arrow = { path = "../arrow" } parquet = { path = "../parquet" } datafusion = { path =

[GitHub] [arrow] Dandandan commented on a change in pull request #8769: ARROW-10710: [Rust] Revert tokio upgrade, go back to 0.2

2020-11-25 Thread GitBox
Dandandan commented on a change in pull request #8769: URL: https://github.com/apache/arrow/pull/8769#discussion_r530338275 ## File path: rust/benchmarks/Cargo.toml ## @@ -30,5 +30,5 @@ arrow = { path = "../arrow" } parquet = { path = "../parquet" } datafusion = { path =

[GitHub] [arrow] Dandandan commented on a change in pull request #8769: ARROW-10710: [Rust] Revert tokio upgrade, go back to 0.2

2020-11-25 Thread GitBox
Dandandan commented on a change in pull request #8769: URL: https://github.com/apache/arrow/pull/8769#discussion_r530338379 ## File path: rust/datafusion/Cargo.toml ## @@ -57,7 +57,7 @@ chrono = "0.4" async-trait = "0.1.41" futures = "0.3" pin-project-lite= "^0.2.0" -tokio

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #8401: ARROW-10109: [Rust] Add support to the C data interface for primitive types and utf8

2020-11-25 Thread GitBox
jorgecarleitao commented on a change in pull request #8401: URL: https://github.com/apache/arrow/pull/8401#discussion_r530348245 ## File path: rust/arrow/src/ffi.rs ## @@ -0,0 +1,657 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor

[GitHub] [arrow] alamb commented on pull request #8769: ARROW-10710: [Rust] Revert tokio upgrade, go back to 0.2

2020-11-25 Thread GitBox
alamb commented on pull request #8769: URL: https://github.com/apache/arrow/pull/8769#issuecomment-733686362 FYI @jorgecarleitao This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [arrow] bkietz commented on a change in pull request #8735: ARROW-6781: [C++] Improve and consolidate CHECK macros

2020-11-25 Thread GitBox
bkietz commented on a change in pull request #8735: URL: https://github.com/apache/arrow/pull/8735#discussion_r530348934 ## File path: cpp/src/arrow/util/logging.h ## @@ -86,56 +67,40 @@ enum class ArrowLogLevel : int { #define ARROW_CHECK_GE(val1, val2) ARROW_CHECK((val1) >=

[GitHub] [arrow] pitrou closed pull request #8474: ARROW-10301: [C++][Compute] Implement "all" reduction kernel for boolean data

2020-11-25 Thread GitBox
pitrou closed pull request #8474: URL: https://github.com/apache/arrow/pull/8474 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] maartenbreddels commented on pull request #8621: ARROW-9128: [C++] Implement string space trimming kernels: trim, ltrim, and rtrim

2020-11-25 Thread GitBox
maartenbreddels commented on pull request #8621: URL: https://github.com/apache/arrow/pull/8621#issuecomment-733820473 @pitrou this is ready for review. This is an automated message from the Apache Git Service. To respond to

[GitHub] [arrow] vertexclique commented on a change in pull request #8664: ARROW-10588: [Rust] Safe and parallel bit operations for Arrow

2020-11-25 Thread GitBox
vertexclique commented on a change in pull request #8664: URL: https://github.com/apache/arrow/pull/8664#discussion_r530343892 ## File path: rust/arrow/src/compute/kernels/aggregate.rs ## @@ -141,32 +146,41 @@ where match array.data().null_buffer() { None => {

[GitHub] [arrow] xhochy commented on a change in pull request #8756: ARROW-10541: [C++] Add re2 library to core arrow / ARROW_WITH_RE2

2020-11-25 Thread GitBox
xhochy commented on a change in pull request #8756: URL: https://github.com/apache/arrow/pull/8756#discussion_r530373157 ## File path: cpp/cmake_modules/DefineOptions.cmake ## @@ -363,6 +363,8 @@ if("${CMAKE_SOURCE_DIR}" STREQUAL "${CMAKE_CURRENT_SOURCE_DIR}")

[GitHub] [arrow] alamb commented on a change in pull request #8751: ARROW-10584: [Rust] [DataFusion] SQL support for JOINs [WIP]

2020-11-25 Thread GitBox
alamb commented on a change in pull request #8751: URL: https://github.com/apache/arrow/pull/8751#discussion_r530466598 ## File path: rust/datafusion/src/optimizer/projection_push_down.rs ## @@ -114,6 +114,10 @@ fn optimize_plan( has_projection: bool, ) -> Result {

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #8401: ARROW-10109: [Rust] Add support to the C data interface for primitive types and utf8

2020-11-25 Thread GitBox
jorgecarleitao commented on a change in pull request #8401: URL: https://github.com/apache/arrow/pull/8401#discussion_r530348245 ## File path: rust/arrow/src/ffi.rs ## @@ -0,0 +1,657 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor

[GitHub] [arrow] alamb commented on pull request #8745: ARROW-10701: [Rust] Fix sort_limit_query_sql benchmark

2020-11-25 Thread GitBox
alamb commented on pull request #8745: URL: https://github.com/apache/arrow/pull/8745#issuecomment-733685460 This time @nevi-me beat me to the merge   This is an automated message from the Apache Git Service. To respond

[GitHub] [arrow] jorgecarleitao removed a comment on pull request #8769: ARROW-10710: [Rust] Revert tokio upgrade, go back to 0.2

2020-11-25 Thread GitBox
jorgecarleitao removed a comment on pull request #8769: URL: https://github.com/apache/arrow/pull/8769#issuecomment-733687016 :shipit: I also found a regression related to this update in `datafusion-python` on which the process hangs.

[GitHub] [arrow] alamb commented on pull request #8769: ARROW-10710: [Rust] Revert tokio upgrade, go back to 0.2

2020-11-25 Thread GitBox
alamb commented on pull request #8769: URL: https://github.com/apache/arrow/pull/8769#issuecomment-733698117 I am waiting for CI to finish running and then I will merge this in This is an automated message from the Apache

[GitHub] [arrow] xhochy commented on a change in pull request #8756: ARROW-10541: [C++] Add re2 library to core arrow / ARROW_WITH_RE2

2020-11-25 Thread GitBox
xhochy commented on a change in pull request #8756: URL: https://github.com/apache/arrow/pull/8756#discussion_r530417334 ## File path: cpp/cmake_modules/DefineOptions.cmake ## @@ -363,6 +363,8 @@ if("${CMAKE_SOURCE_DIR}" STREQUAL "${CMAKE_CURRENT_SOURCE_DIR}")

[GitHub] [arrow] xhochy closed pull request #8759: TEST: Arrow-10195

2020-11-25 Thread GitBox
xhochy closed pull request #8759: URL: https://github.com/apache/arrow/pull/8759 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] josiahyan commented on pull request #8757: ARROW-8147: [C++] Add google-cloud-cpp to ThirdpartyToolchain

2020-11-25 Thread GitBox
josiahyan commented on pull request #8757: URL: https://github.com/apache/arrow/pull/8757#issuecomment-733751240 Oh, I know why! The CI queue was full (my test runs were taking hours to go through). By the time the runner got to my PR (Wed, 25 Nov 2020 02:15:47 GMT), my final commit had

[GitHub] [arrow] andygrove commented on a change in pull request #8751: ARROW-10584: [Rust] [DataFusion] Add SQL support for JOIN ON syntax

2020-11-25 Thread GitBox
andygrove commented on a change in pull request #8751: URL: https://github.com/apache/arrow/pull/8751#discussion_r530496041 ## File path: rust/datafusion/src/logical_plan/plan.rs ## @@ -114,6 +114,8 @@ pub enum LogicalPlan { /// The output schema, containing fields

[GitHub] [arrow] jorgecarleitao commented on pull request #8664: ARROW-10588: [Rust] Safe and parallel bit operations for Arrow

2020-11-25 Thread GitBox
jorgecarleitao commented on pull request #8664: URL: https://github.com/apache/arrow/pull/8664#issuecomment-733667373 Really impressive improvement, @vertexclique . @nevi-me , can I have 12h to review it?

[GitHub] [arrow] alamb opened a new pull request #8769: ARROW-10710: [Rust] Revery tokio upgrade, go back to 0.2

2020-11-25 Thread GitBox
alamb opened a new pull request #8769: URL: https://github.com/apache/arrow/pull/8769 ## Changes This PR backs out the version upgrade to tokio done in https://github.com/apache/arrow/pull/8697 (see comment https://github.com/apache/arrow/pull/8697#issuecomment-732936572 by

[GitHub] [arrow] alamb commented on a change in pull request #8769: ARROW-10710: [Rust] Revert tokio upgrade, go back to 0.2

2020-11-25 Thread GitBox
alamb commented on a change in pull request #8769: URL: https://github.com/apache/arrow/pull/8769#discussion_r530342400 ## File path: rust/benchmarks/Cargo.toml ## @@ -30,5 +30,5 @@ arrow = { path = "../arrow" } parquet = { path = "../parquet" } datafusion = { path =

[GitHub] [arrow] jhorstmann commented on pull request #8664: ARROW-10588: [Rust] Safe and parallel bit operations for Arrow

2020-11-25 Thread GitBox
jhorstmann commented on pull request #8664: URL: https://github.com/apache/arrow/pull/8664#issuecomment-733710712 > Introducing thread parallelism at a compute kernel level Conceptually this is not a small change. Personally I think parallelizing on the datafusion level and keeping

[GitHub] [arrow] maartenbreddels commented on pull request #8755: ARROW-10709: [Python] Allow PythonFile.read() to always return a buffer

2020-11-25 Thread GitBox
maartenbreddels commented on pull request #8755: URL: https://github.com/apache/arrow/pull/8755#issuecomment-733734030 @pitrou this is ready for review This is an automated message from the Apache Git Service. To respond to

[GitHub] [arrow] vertexclique commented on pull request #8664: ARROW-10588: [Rust] Safe and parallel bit operations for Arrow

2020-11-25 Thread GitBox
vertexclique commented on pull request #8664: URL: https://github.com/apache/arrow/pull/8664#issuecomment-733737357 > Conceptually this is not a small change. Personally I think parallelizing on the datafusion level and keeping kernels single-threaded is the better model. I don't

[GitHub] [arrow] xhochy commented on pull request #8756: ARROW-10541: [C++] Add re2 library to core arrow / ARROW_WITH_RE2

2020-11-25 Thread GitBox
xhochy commented on pull request #8756: URL: https://github.com/apache/arrow/pull/8756#issuecomment-733745584 Everything failing in https://github.com/apache/arrow/pull/8759 is past the CMake additions I did here, so this should be good to go.

[GitHub] [arrow] jorgecarleitao closed pull request #8749: ARROW-10689: [Rust] [DataFusion] Add SQL support for CASE WHEN

2020-11-25 Thread GitBox
jorgecarleitao closed pull request #8749: URL: https://github.com/apache/arrow/pull/8749 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [arrow] vertexclique commented on a change in pull request #8664: ARROW-10588: [Rust] Safe and parallel bit operations for Arrow

2020-11-25 Thread GitBox
vertexclique commented on a change in pull request #8664: URL: https://github.com/apache/arrow/pull/8664#discussion_r530327927 ## File path: rust/arrow/src/compute/kernels/aggregate.rs ## @@ -141,32 +146,41 @@ where match array.data().null_buffer() { None => {

[GitHub] [arrow] vertexclique commented on a change in pull request #8664: ARROW-10588: [Rust] Safe and parallel bit operations for Arrow

2020-11-25 Thread GitBox
vertexclique commented on a change in pull request #8664: URL: https://github.com/apache/arrow/pull/8664#discussion_r530328033 ## File path: rust/arrow/src/compute/kernels/aggregate.rs ## @@ -180,9 +194,12 @@ where /// /// Returns `None` if the array is empty or only

[GitHub] [arrow] vertexclique commented on a change in pull request #8664: ARROW-10588: [Rust] Safe and parallel bit operations for Arrow

2020-11-25 Thread GitBox
vertexclique commented on a change in pull request #8664: URL: https://github.com/apache/arrow/pull/8664#discussion_r530327927 ## File path: rust/arrow/src/compute/kernels/aggregate.rs ## @@ -141,32 +146,41 @@ where match array.data().null_buffer() { None => {

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #8664: ARROW-10588: [Rust] Safe and parallel bit operations for Arrow

2020-11-25 Thread GitBox
jorgecarleitao commented on a change in pull request #8664: URL: https://github.com/apache/arrow/pull/8664#discussion_r530331929 ## File path: rust/arrow/src/util/bit_ops.rs ## @@ -0,0 +1,588 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more

[GitHub] [arrow] maartenbreddels commented on pull request #8756: ARROW-10541: [C++] Add re2 library to core arrow / ARROW_WITH_RE2

2020-11-25 Thread GitBox
maartenbreddels commented on pull request #8756: URL: https://github.com/apache/arrow/pull/8756#issuecomment-733748270 #8459 builds on Travis, looking good  This is an automated message from the Apache Git Service. To

[GitHub] [arrow] jorgecarleitao commented on pull request #8765: ARROW-10722: [Rust][DataFusion] Reduce overhead of some data types in aggregations / joins, improve benchmarks

2020-11-25 Thread GitBox
jorgecarleitao commented on pull request #8765: URL: https://github.com/apache/arrow/pull/8765#issuecomment-733815809 Genuinely curious: does the key size has such a large impact? Or is there any memory constraints that you are looking for?

[GitHub] [arrow] jorgecarleitao commented on pull request #8769: ARROW-10710: [Rust] Revert tokio upgrade, go back to 0.2

2020-11-25 Thread GitBox
jorgecarleitao commented on pull request #8769: URL: https://github.com/apache/arrow/pull/8769#issuecomment-733687016 :shipit: I also found a regression related to this update in `datafusion-python` on which the process hangs.

[GitHub] [arrow] alamb closed pull request #8750: ARROW-10704: [Rust][DataFusion] Remove Nested from expression enum

2020-11-25 Thread GitBox
alamb closed pull request #8750: URL: https://github.com/apache/arrow/pull/8750 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] vertexclique commented on pull request #8664: ARROW-10588: [Rust] Safe and parallel bit operations for Arrow

2020-11-25 Thread GitBox
vertexclique commented on pull request #8664: URL: https://github.com/apache/arrow/pull/8664#issuecomment-733773942 > Things to do after this pr: > * Other kernels can be improved by different prs I have already mentioned that here I think:

[GitHub] [arrow] alamb commented on pull request #8751: ARROW-10584: [Rust] [DataFusion] SQL support for JOINs [WIP]

2020-11-25 Thread GitBox
alamb commented on pull request #8751: URL: https://github.com/apache/arrow/pull/8751#issuecomment-733672017 @andygrove - I will try and look at it later today This is an automated message from the Apache Git Service. To

[GitHub] [arrow] alamb commented on a change in pull request #8769: ARROW-10710: [Rust] Revert tokio upgrade, go back to 0.2

2020-11-25 Thread GitBox
alamb commented on a change in pull request #8769: URL: https://github.com/apache/arrow/pull/8769#discussion_r530339818 ## File path: rust/benchmarks/Cargo.toml ## @@ -30,5 +30,5 @@ arrow = { path = "../arrow" } parquet = { path = "../parquet" } datafusion = { path =

[GitHub] [arrow] github-actions[bot] commented on pull request #8769: ARROW-10710: [Rust] Revert tokio upgrade, go back to 0.2

2020-11-25 Thread GitBox
github-actions[bot] commented on pull request #8769: URL: https://github.com/apache/arrow/pull/8769#issuecomment-733679895 https://issues.apache.org/jira/browse/ARROW-10710 This is an automated message from the Apache Git

[GitHub] [arrow] jhorstmann commented on a change in pull request #8401: ARROW-10109: [Rust] Add support to the C data interface for primitive types and utf8

2020-11-25 Thread GitBox
jhorstmann commented on a change in pull request #8401: URL: https://github.com/apache/arrow/pull/8401#discussion_r530350708 ## File path: rust/arrow/src/bytes.rs ## @@ -0,0 +1,166 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor

[GitHub] [arrow] bkietz commented on a change in pull request #8735: ARROW-6781: [C++] Improve and consolidate CHECK macros

2020-11-25 Thread GitBox
bkietz commented on a change in pull request #8735: URL: https://github.com/apache/arrow/pull/8735#discussion_r530348934 ## File path: cpp/src/arrow/util/logging.h ## @@ -86,56 +67,40 @@ enum class ArrowLogLevel : int { #define ARROW_CHECK_GE(val1, val2) ARROW_CHECK((val1) >=

[GitHub] [arrow] alamb commented on pull request #8769: ARROW-10710: [Rust] Revert tokio upgrade, go back to 0.2

2020-11-25 Thread GitBox
alamb commented on pull request #8769: URL: https://github.com/apache/arrow/pull/8769#issuecomment-733708535 CI is green -- merging this in! This is an automated message from the Apache Git Service. To respond to the

[GitHub] [arrow] bkietz commented on a change in pull request #8756: ARROW-10541: [C++] Add re2 library to core arrow / ARROW_WITH_RE2

2020-11-25 Thread GitBox
bkietz commented on a change in pull request #8756: URL: https://github.com/apache/arrow/pull/8756#discussion_r530376061 ## File path: cpp/cmake_modules/DefineOptions.cmake ## @@ -363,6 +363,8 @@ if("${CMAKE_SOURCE_DIR}" STREQUAL "${CMAKE_CURRENT_SOURCE_DIR}")

[GitHub] [arrow] vertexclique commented on a change in pull request #8664: ARROW-10588: [Rust] Safe and parallel bit operations for Arrow

2020-11-25 Thread GitBox
vertexclique commented on a change in pull request #8664: URL: https://github.com/apache/arrow/pull/8664#discussion_r530327927 ## File path: rust/arrow/src/compute/kernels/aggregate.rs ## @@ -141,32 +146,41 @@ where match array.data().null_buffer() { None => {

[GitHub] [arrow] jhorstmann commented on a change in pull request #8401: ARROW-10109: [Rust] Add support to the C data interface for primitive types and utf8

2020-11-25 Thread GitBox
jhorstmann commented on a change in pull request #8401: URL: https://github.com/apache/arrow/pull/8401#discussion_r530346547 ## File path: rust/arrow/src/ffi.rs ## @@ -0,0 +1,657 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor license

[GitHub] [arrow] nevi-me closed pull request #8745: ARROW-10701: [Rust] Fix sort_limit_query_sql benchmark

2020-11-25 Thread GitBox
nevi-me closed pull request #8745: URL: https://github.com/apache/arrow/pull/8745 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] bkietz commented on a change in pull request #8735: ARROW-6781: [C++] Improve and consolidate CHECK macros

2020-11-25 Thread GitBox
bkietz commented on a change in pull request #8735: URL: https://github.com/apache/arrow/pull/8735#discussion_r530369496 ## File path: cpp/src/arrow/util/logging.h ## @@ -202,48 +169,81 @@ class ARROW_EXPORT ArrowLog : public ArrowLogBase { private:

[GitHub] [arrow] pitrou commented on a change in pull request #8735: ARROW-6781: [C++] Improve and consolidate CHECK macros

2020-11-25 Thread GitBox
pitrou commented on a change in pull request #8735: URL: https://github.com/apache/arrow/pull/8735#discussion_r530408123 ## File path: cpp/src/arrow/util/logging.h ## @@ -202,48 +169,81 @@ class ARROW_EXPORT ArrowLog : public ArrowLogBase { private:

[GitHub] [arrow] maartenbreddels commented on pull request #8459: ARROW-10195: [C++] Add string struct extract kernel using re2

2020-11-25 Thread GitBox
maartenbreddels commented on pull request #8459: URL: https://github.com/apache/arrow/pull/8459#issuecomment-733739638 There are two issues with this PR: ## empty slices In #8728

[GitHub] [arrow] xhochy commented on pull request #8757: ARROW-8147: [C++] Add google-cloud-cpp to ThirdpartyToolchain

2020-11-25 Thread GitBox
xhochy commented on pull request #8757: URL: https://github.com/apache/arrow/pull/8757#issuecomment-733745201 > @xhochy I can't seem to summon the bot. But I read what the action did, and saw that it required a very specific version of the cmake autoformatter, as you suggested. It works

[GitHub] [arrow] Dandandan commented on a change in pull request #8760: ARROW-10712: [Rust] Add tests to TPC-H benchmarks

2020-11-25 Thread GitBox
Dandandan commented on a change in pull request #8760: URL: https://github.com/apache/arrow/pull/8760#discussion_r530428982 ## File path: rust/benchmarks/src/bin/tpch.rs ## @@ -353,3 +353,36 @@ fn get_schema(table: ) -> Schema { _ => unimplemented!(), } } +

[GitHub] [arrow] josiahyan edited a comment on pull request #8757: ARROW-8147: [C++] Add google-cloud-cpp to ThirdpartyToolchain

2020-11-25 Thread GitBox
josiahyan edited a comment on pull request #8757: URL: https://github.com/apache/arrow/pull/8757#issuecomment-733751240 Oh, I know why! The CI queue was full (my test runs were taking hours to go through). By the time the runner got to my PR (Wed, 25 Nov 2020 02:15:47 GMT), my final

[GitHub] [arrow] maartenbreddels commented on pull request #8468: ARROW-10306: [C++] Add string replacement kernel

2020-11-25 Thread GitBox
maartenbreddels commented on pull request #8468: URL: https://github.com/apache/arrow/pull/8468#issuecomment-733803300 I'd rather keep this 1 PR, looks like #8756 is working This is an automated message from the Apache Git

[GitHub] [arrow] nevi-me closed pull request #8752: ARROW-10705: [Rust] Loosen restrictions on some lifetime annotations

2020-11-25 Thread GitBox
nevi-me closed pull request #8752: URL: https://github.com/apache/arrow/pull/8752 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] alamb commented on a change in pull request #8769: ARROW-10710: [Rust] Revert tokio upgrade, go back to 0.2

2020-11-25 Thread GitBox
alamb commented on a change in pull request #8769: URL: https://github.com/apache/arrow/pull/8769#discussion_r530353896 ## File path: rust/benchmarks/Cargo.toml ## @@ -30,5 +30,5 @@ arrow = { path = "../arrow" } parquet = { path = "../parquet" } datafusion = { path =

[GitHub] [arrow] bkietz commented on a change in pull request #8735: ARROW-6781: [C++] Improve and consolidate CHECK macros

2020-11-25 Thread GitBox
bkietz commented on a change in pull request #8735: URL: https://github.com/apache/arrow/pull/8735#discussion_r530353939 ## File path: cpp/src/arrow/util/logging.h ## @@ -22,22 +22,21 @@ // The LLVM IR code doesn't have an NDEBUG mode. And, it shouldn't include references to

[GitHub] [arrow] alamb closed pull request #8769: ARROW-10710: [Rust] Revert tokio upgrade, go back to 0.2

2020-11-25 Thread GitBox
alamb closed pull request #8769: URL: https://github.com/apache/arrow/pull/8769 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] alamb commented on pull request #8664: ARROW-10588: [Rust] Safe and parallel bit operations for Arrow

2020-11-25 Thread GitBox
alamb commented on pull request #8664: URL: https://github.com/apache/arrow/pull/8664#issuecomment-733811489 > Conceptually this is not a small change. Personally I think parallelizing on the datafusion level and keeping kernels single-threaded is the better model. I agree with

[GitHub] [arrow] Dandandan edited a comment on pull request #8765: ARROW-10722: [Rust][DataFusion] Reduce overhead of some data types in aggregations / joins, improve benchmarks

2020-11-25 Thread GitBox
Dandandan edited a comment on pull request #8765: URL: https://github.com/apache/arrow/pull/8765#issuecomment-733824217 @jorgecarleitao Not really on performance as current benchmarks / queries show, just looking at ways to improve the aggregate / join performance. The main thing I

[GitHub] [arrow] Dandandan edited a comment on pull request #8765: ARROW-10722: [Rust][DataFusion] Reduce overhead of some data types in aggregations / joins, improve benchmarks

2020-11-25 Thread GitBox
Dandandan edited a comment on pull request #8765: URL: https://github.com/apache/arrow/pull/8765#issuecomment-733824217 @jorgecarleitao Not really on performance as current benchmarks / queries show, just looking at ways to improve the aggregate / join performance. The main thing I

[GitHub] [arrow] velvia commented on a change in pull request #8688: ARROW-10330: [Rust][DataFusion] Implement NULLIF() SQL function

2020-11-25 Thread GitBox
velvia commented on a change in pull request #8688: URL: https://github.com/apache/arrow/pull/8688#discussion_r530546823 ## File path: rust/datafusion/tests/sql.rs ## @@ -508,6 +508,26 @@ async fn csv_query_avg_multi_batch() -> Result<()> { Ok(()) } +#[tokio::test]

[GitHub] [arrow] andygrove commented on pull request #8751: ARROW-10584: [Rust] [DataFusion] Add SQL support for JOIN ON syntax

2020-11-25 Thread GitBox
andygrove commented on pull request #8751: URL: https://github.com/apache/arrow/pull/8751#issuecomment-733886717 @alamb I decided to reduce the scope of this and not try and deal with compound identifiers and table aliases for now. Please see the updated description of this PR. I will do

[GitHub] [arrow] pitrou edited a comment on pull request #8770: ARROW-10696: [C++] Add SetBitRunReader

2020-11-25 Thread GitBox
pitrou edited a comment on pull request #8770: URL: https://github.com/apache/arrow/pull/8770#issuecomment-733918851 ArrayRangeEquals benchmarks: ``` benchmarkbaseline contender change %

[GitHub] [arrow] Dandandan edited a comment on pull request #8765: ARROW-10722: [Rust][DataFusion] Reduce overhead of some data types in aggregations / joins, improve benchmarks

2020-11-25 Thread GitBox
Dandandan edited a comment on pull request #8765: URL: https://github.com/apache/arrow/pull/8765#issuecomment-733824217 @jorgecarleitao Not really on performance as current benchmarks / queries show, just looking at ways to improve the aggregate / join performance. The main thing I

[GitHub] [arrow] github-actions[bot] commented on pull request #8771: ARROW-10740: [Rust][DataFusion] Remove redundant clones found by clippy

2020-11-25 Thread GitBox
github-actions[bot] commented on pull request #8771: URL: https://github.com/apache/arrow/pull/8771#issuecomment-733952113 https://issues.apache.org/jira/browse/ARROW-10740 This is an automated message from the Apache Git

[GitHub] [arrow] Dandandan edited a comment on pull request #8765: ARROW-10722: [Rust][DataFusion] Reduce overhead of some data types in aggregations / joins, improve benchmarks

2020-11-25 Thread GitBox
Dandandan edited a comment on pull request #8765: URL: https://github.com/apache/arrow/pull/8765#issuecomment-733824217 @jorgecarleitao Not really on performance as current benchmarks / queries show, just looking at ways to improve the aggregate / join performance. The main thing I

[GitHub] [arrow] Dandandan commented on pull request #8765: ARROW-10722: [Rust][DataFusion] Reduce overhead of some data types in aggregations / joins, improve benchmarks

2020-11-25 Thread GitBox
Dandandan commented on pull request #8765: URL: https://github.com/apache/arrow/pull/8765#issuecomment-733824217 @jorgecarleitao Not really on performance as current benchmarks / queries show, just looking at ways to improve the aggregate / join performance. The main thing I wanted

[GitHub] [arrow] Dandandan edited a comment on pull request #8765: ARROW-10722: [Rust][DataFusion] Reduce overhead of some data types in aggregations / joins, improve benchmarks

2020-11-25 Thread GitBox
Dandandan edited a comment on pull request #8765: URL: https://github.com/apache/arrow/pull/8765#issuecomment-733824217 @jorgecarleitao Not really on performance as current benchmarks / queries show, just looking at ways to improve the aggregate / join performance. The main thing I

[GitHub] [arrow] Dandandan edited a comment on pull request #8765: ARROW-10722: [Rust][DataFusion] Reduce overhead of some data types in aggregations / joins, improve benchmarks

2020-11-25 Thread GitBox
Dandandan edited a comment on pull request #8765: URL: https://github.com/apache/arrow/pull/8765#issuecomment-733824217 @jorgecarleitao Not really on performance as current benchmarks / queries show, just looking at ways to improve the aggregate / join performance. The main thing I

[GitHub] [arrow] lyndonb-bq commented on a change in pull request #8724: ARROW-10487 [FlightRPC][C++][Python] Header-based auth in clients

2020-11-25 Thread GitBox
lyndonb-bq commented on a change in pull request #8724: URL: https://github.com/apache/arrow/pull/8724#discussion_r530544565 ## File path: cpp/src/arrow/flight/flight_test.cc ## @@ -1010,6 +1140,57 @@ class TestErrorMiddleware : public ::testing::Test { std::unique_ptr

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #8688: ARROW-10330: [Rust][DataFusion] Implement NULLIF() SQL function

2020-11-25 Thread GitBox
jorgecarleitao commented on a change in pull request #8688: URL: https://github.com/apache/arrow/pull/8688#discussion_r530549403 ## File path: rust/datafusion/tests/sql.rs ## @@ -508,6 +508,26 @@ async fn csv_query_avg_multi_batch() -> Result<()> { Ok(()) }

[GitHub] [arrow] pitrou opened a new pull request #8770: ARROW-10696: [C++] Add SetBitRunReader

2020-11-25 Thread GitBox
pitrou opened a new pull request #8770: URL: https://github.com/apache/arrow/pull/8770 A specialized bitmap reader that yields runs of set bits, for use cases where reset bits (e.g. null bits) don't need any handling. On some use cases it can be significantly faster than the

[GitHub] [arrow] lyndonb-bq commented on pull request #8724: ARROW-10487 [FlightRPC][C++][Python] Header-based auth in clients

2020-11-25 Thread GitBox
lyndonb-bq commented on pull request #8724: URL: https://github.com/apache/arrow/pull/8724#issuecomment-733923181 @lidavidm Hey David, I have made the requested changes. I got a random Javascript failure in one of the CI teardowns, not sure if this is accepted or if you can restart the CI

[GitHub] [arrow] kou commented on a change in pull request #8756: ARROW-10541: [C++] Add re2 library to core arrow / ARROW_WITH_RE2

2020-11-25 Thread GitBox
kou commented on a change in pull request #8756: URL: https://github.com/apache/arrow/pull/8756#discussion_r530622823 ## File path: cpp/CMakeLists.txt ## @@ -330,6 +330,10 @@ if(ARROW_BUILD_BENCHMARKS set(ARROW_TESTING ON) endif() +if(ARROW_GANDIVA) + set(ARROW_WITH_RE2

[GitHub] [arrow] andygrove commented on pull request #8664: ARROW-10588: [Rust] Safe and parallel bit operations for Arrow

2020-11-25 Thread GitBox
andygrove commented on pull request #8664: URL: https://github.com/apache/arrow/pull/8664#issuecomment-733956565 > Also, some of your comments are giving false information (which I stopped giving feedback). I prefer instead of having counterproductive comments, productive comments from

[GitHub] [arrow] Dandandan edited a comment on pull request #8765: ARROW-10722: [Rust][DataFusion] Reduce overhead of some data types in aggregations / joins, improve benchmarks

2020-11-25 Thread GitBox
Dandandan edited a comment on pull request #8765: URL: https://github.com/apache/arrow/pull/8765#issuecomment-733824217 @jorgecarleitao Not really on performance as current benchmarks / queries show, just looking at ways to improve the aggregate / join performance. The main thing I

[GitHub] [arrow] velvia commented on a change in pull request #8688: ARROW-10330: [Rust][DataFusion] Implement NULLIF() SQL function

2020-11-25 Thread GitBox
velvia commented on a change in pull request #8688: URL: https://github.com/apache/arrow/pull/8688#discussion_r530590352 ## File path: rust/datafusion/tests/sql.rs ## @@ -508,6 +508,26 @@ async fn csv_query_avg_multi_batch() -> Result<()> { Ok(()) } +#[tokio::test]

[GitHub] [arrow] Dandandan opened a new pull request #8771: ARROW-10740: [Rust][DataFusion] Remove redundant clones found by clippy

2020-11-25 Thread GitBox
Dandandan opened a new pull request #8771: URL: https://github.com/apache/arrow/pull/8771 I think this clippy warning is useful to have in CI and reduces noise in the code (and maybe some small performance or compile time wins).

[GitHub] [arrow] Dandandan edited a comment on pull request #8765: ARROW-10722: [Rust][DataFusion] Reduce overhead of some data types in aggregations / joins, improve benchmarks

2020-11-25 Thread GitBox
Dandandan edited a comment on pull request #8765: URL: https://github.com/apache/arrow/pull/8765#issuecomment-733824217 @jorgecarleitao Not really on performance as current benchmarks / queries show, just looking at ways to improve the aggregate / join performance. The main thing I

[GitHub] [arrow] lidavidm commented on a change in pull request #8725: ARROW-10526: [FlightRPC][C++][Python] Client cookie middleware

2020-11-25 Thread GitBox
lidavidm commented on a change in pull request #8725: URL: https://github.com/apache/arrow/pull/8725#discussion_r530544603 ## File path: cpp/src/arrow/flight/client_cookie_middleware.cc ## @@ -0,0 +1,353 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or

[GitHub] [arrow] maartenbreddels commented on pull request #8468: ARROW-10306: [C++] Add string replacement kernel

2020-11-25 Thread GitBox
maartenbreddels commented on pull request #8468: URL: https://github.com/apache/arrow/pull/8468#issuecomment-733870297 @pitrou this is ready for review, failure seems unrelated (minio on windows). This is an automated

[GitHub] [arrow] pitrou edited a comment on pull request #8770: ARROW-10696: [C++] Add SetBitRunReader

2020-11-25 Thread GitBox
pitrou edited a comment on pull request #8770: URL: https://github.com/apache/arrow/pull/8770#issuecomment-733918851 ArrayRangeEquals benchmarks: ``` benchmarkbaseline contender change %

[GitHub] [arrow] pitrou edited a comment on pull request #8770: ARROW-10696: [C++] Add SetBitRunReader

2020-11-25 Thread GitBox
pitrou edited a comment on pull request #8770: URL: https://github.com/apache/arrow/pull/8770#issuecomment-733919445 Parquet benchmarks: ``` benchmarkbaseline contender change %

[GitHub] [arrow] pitrou commented on pull request #8770: ARROW-10696: [C++] Add SetBitRunReader

2020-11-25 Thread GitBox
pitrou commented on pull request #8770: URL: https://github.com/apache/arrow/pull/8770#issuecomment-733919445 Parquet benchmarks: ``` 199 BM_PlainDecodingSpacedFloat/32768/100 5.506 GiB/sec 18.466 GiB/sec 235.349

[GitHub] [arrow] lidavidm closed pull request #8724: ARROW-10487 [FlightRPC][C++] Header-based auth in clients

2020-11-25 Thread GitBox
lidavidm closed pull request #8724: URL: https://github.com/apache/arrow/pull/8724 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] lidavidm commented on pull request #8724: ARROW-10487 [FlightRPC][C++] Header-based auth in clients

2020-11-25 Thread GitBox
lidavidm commented on pull request #8724: URL: https://github.com/apache/arrow/pull/8724#issuecomment-733943642 Thanks @lyndonb-bq! This is an automated message from the Apache Git Service. To respond to the message, please

[GitHub] [arrow] github-actions[bot] commented on pull request #8772: ARROW-10741: [Rust] Apply previously ignored clippy suggestions

2020-11-25 Thread GitBox
github-actions[bot] commented on pull request #8772: URL: https://github.com/apache/arrow/pull/8772#issuecomment-733960357 https://issues.apache.org/jira/browse/ARROW-10741 This is an automated message from the Apache Git

[GitHub] [arrow] lyndonb-bq commented on a change in pull request #8725: ARROW-10526: [FlightRPC][C++][Python] Client cookie middleware

2020-11-25 Thread GitBox
lyndonb-bq commented on a change in pull request #8725: URL: https://github.com/apache/arrow/pull/8725#discussion_r530695895 ## File path: cpp/src/arrow/flight/client_cookie_middleware.cc ## @@ -0,0 +1,353 @@ +// Licensed to the Apache Software Foundation (ASF) under one +//

[GitHub] [arrow] lyndonb-bq commented on pull request #8725: ARROW-10526: [FlightRPC][C++] Client cookie middleware

2020-11-25 Thread GitBox
lyndonb-bq commented on pull request #8725: URL: https://github.com/apache/arrow/pull/8725#issuecomment-734107259 There appears to be a failure in the python build, I think it's caused by something unrelated to this pull request though, I noticed the same here:

[GitHub] [arrow] Dandandan opened a new pull request #8772: ARROW-10741: [Rust] Apply previously ignored clippy suggestions

2020-11-25 Thread GitBox
Dandandan opened a new pull request #8772: URL: https://github.com/apache/arrow/pull/8772 Just remove some from the ignore list and apply them. This is an automated message from the Apache Git Service. To respond to the

[GitHub] [arrow] Dandandan edited a comment on pull request #8765: ARROW-10722: [Rust][DataFusion] Reduce overhead of some data types in aggregations / joins, improve benchmarks

2020-11-25 Thread GitBox
Dandandan edited a comment on pull request #8765: URL: https://github.com/apache/arrow/pull/8765#issuecomment-733824217 @jorgecarleitao Not really on performance as current benchmarks / queries show, just looking at ways to improve the aggregate / join performance. The main thing I

[GitHub] [arrow] lidavidm commented on a change in pull request #8724: ARROW-10487 [FlightRPC][C++][Python] Header-based auth in clients

2020-11-25 Thread GitBox
lidavidm commented on a change in pull request #8724: URL: https://github.com/apache/arrow/pull/8724#discussion_r530540183 ## File path: cpp/src/arrow/flight/client.cc ## @@ -1198,6 +1227,12 @@ Status FlightClient::Authenticate(const FlightCallOptions& options, return

[GitHub] [arrow] velvia commented on a change in pull request #8688: ARROW-10330: [Rust][DataFusion] Implement NULLIF() SQL function

2020-11-25 Thread GitBox
velvia commented on a change in pull request #8688: URL: https://github.com/apache/arrow/pull/8688#discussion_r530547648 ## File path: rust/arrow/src/compute/kernels/boolean.rs ## @@ -223,6 +224,101 @@ pub fn is_not_null(input: ) -> Result {

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #8688: ARROW-10330: [Rust][DataFusion] Implement NULLIF() SQL function

2020-11-25 Thread GitBox
jorgecarleitao commented on a change in pull request #8688: URL: https://github.com/apache/arrow/pull/8688#discussion_r530549403 ## File path: rust/datafusion/tests/sql.rs ## @@ -508,6 +508,26 @@ async fn csv_query_avg_multi_batch() -> Result<()> { Ok(()) }

[GitHub] [arrow] Ulimo commented on pull request #8694: ARROW-10542: [C#][Flight] Add beginning on flight code for net core

2020-11-25 Thread GitBox
Ulimo commented on pull request #8694: URL: https://github.com/apache/arrow/pull/8694#issuecomment-733891188 @eerhardt I just have one question, getting this out as a preview nuget package soonish, is that possible? This is

  1   2   >