[GitHub] [arrow] jorgecarleitao commented on a change in pull request #7971: ARROW-9752: [Rust] [DataFusion] Add support for aggregate UDFs

2020-08-17 Thread GitBox
jorgecarleitao commented on a change in pull request #7971: URL: https://github.com/apache/arrow/pull/7971#discussion_r471899766 ## File path: rust/datafusion/src/execution/physical_plan/udf.rs ## @@ -146,3 +154,99 @@ impl PhysicalExpr for ScalarFunctionExpr { (fun)()

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #7971: ARROW-9752: [Rust] [DataFusion] Add support for aggregate UDFs

2020-08-17 Thread GitBox
jorgecarleitao commented on a change in pull request #7971: URL: https://github.com/apache/arrow/pull/7971#discussion_r471894969 ## File path: rust/datafusion/src/execution/context.rs ## @@ -509,33 +534,76 @@ impl SchemaProvider for ExecutionContextState { } fn

[GitHub] [arrow] github-actions[bot] commented on pull request #7983: Arrow-9733: [Rust] [DataFusion] Added support for COUNT/MIN/MAX on string columns

2020-08-17 Thread GitBox
github-actions[bot] commented on pull request #7983: URL: https://github.com/apache/arrow/pull/7983#issuecomment-675251338 Thanks for opening a pull request! Could you open an issue for this pull request on JIRA? https://issues.apache.org/jira/browse/ARROW Then

[GitHub] [arrow] github-actions[bot] commented on pull request #7983: ARROW-9733: [Rust] [DataFusion] Added support for COUNT/MIN/MAX on string columns

2020-08-17 Thread GitBox
github-actions[bot] commented on pull request #7983: URL: https://github.com/apache/arrow/pull/7983#issuecomment-675256101 https://issues.apache.org/jira/browse/ARROW-9733 This is an automated message from the Apache Git

[GitHub] [arrow] andygrove commented on pull request #7971: ARROW-9752: [Rust] [DataFusion] Add support for aggregate UDFs

2020-08-17 Thread GitBox
andygrove commented on pull request #7971: URL: https://github.com/apache/arrow/pull/7971#issuecomment-675254823 I took a very quick look at Spark just now and here are some observations: - math expressions such as sqrt always return double and don't try to optimize to smaller

[GitHub] [arrow] jorgecarleitao opened a new pull request #7983: Arrow-9733: [Rust] [DataFusion] Added support for COUNT/MIN/MAX on string columns

2020-08-17 Thread GitBox
jorgecarleitao opened a new pull request #7983: URL: https://github.com/apache/arrow/pull/7983

[GitHub] [arrow] houqp commented on pull request #7880: ARROW-9619: [Rust] [DataFusion] Add predicate push-down

2020-08-17 Thread GitBox
houqp commented on pull request #7880: URL: https://github.com/apache/arrow/pull/7880#issuecomment-675259193 One thing I do like about spark's optimizer is all optimization rules share a common plan tree traversal and mutation routine, which made individual optimization rule easier to

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #7880: ARROW-9619: [Rust] [DataFusion] Add predicate push-down

2020-08-17 Thread GitBox
jorgecarleitao commented on a change in pull request #7880: URL: https://github.com/apache/arrow/pull/7880#discussion_r471887798 ## File path: rust/datafusion/src/optimizer/filter_push_down.rs ## @@ -0,0 +1,631 @@ +// regarding copyright ownership. The ASF licenses this file

[GitHub] [arrow] andygrove commented on pull request #7971: ARROW-9752: [Rust] [DataFusion] Add support for aggregate UDFs

2020-08-17 Thread GitBox
andygrove commented on pull request #7971: URL: https://github.com/apache/arrow/pull/7971#issuecomment-675245608 When faced with choices like this, it is often helpful to look at how other projects implement this. Perhaps we could look at calcite or spark to see what choices they

[GitHub] [arrow] sunchao commented on a change in pull request #7917: ARROW-8243: [Rust] [Parquet] Serialize Arrow schema metadata

2020-08-17 Thread GitBox
sunchao commented on a change in pull request #7917: URL: https://github.com/apache/arrow/pull/7917#discussion_r471927282 ## File path: rust/parquet/src/arrow/schema.rs ## @@ -83,12 +90,77 @@ where .map(|fields| Schema::new_with_metadata(fields, metadata)) } +///

[GitHub] [arrow] jorgecarleitao commented on pull request #7967: ARROW-9751: [Rust] [DataFusion] Allow UDFs to accept multiple data types per argument

2020-08-17 Thread GitBox
jorgecarleitao commented on pull request #7967: URL: https://github.com/apache/arrow/pull/7967#issuecomment-675041274 > Currently we only do optimizations on the logical plan, but there is a plan to have physical plan optimizations as well. Won't those be made against a physical plan?

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #7974: ARROW-9756: [Rust] [DataFusion] Added support for scalar UDFs of arbitrary return types

2020-08-17 Thread GitBox
jorgecarleitao commented on a change in pull request #7974: URL: https://github.com/apache/arrow/pull/7974#discussion_r471707581 ## File path: rust/datafusion/src/lib.rs ## @@ -31,6 +31,7 @@ extern crate sqlparser; pub mod dataframe; pub mod datasource; +mod datatyped;

[GitHub] [arrow] nealrichardson commented on pull request #7928: ARROW-6437: [R] Add AWS SDK to system dependencies for macOS and Windows

2020-08-17 Thread GitBox
nealrichardson commented on pull request #7928: URL: https://github.com/apache/arrow/pull/7928#issuecomment-675100389 @github-actions crossbow submit homebrew-r-autobrew

[GitHub] [arrow] github-actions[bot] commented on pull request #7982: ARROW-9744: [Python] Fix build failure on aarch64

2020-08-17 Thread GitBox
github-actions[bot] commented on pull request #7982: URL: https://github.com/apache/arrow/pull/7982#issuecomment-675138080 https://issues.apache.org/jira/browse/ARROW-9744

[GitHub] [arrow] nealrichardson commented on pull request #7928: ARROW-6437: [R] Add AWS SDK to system dependencies for macOS and Windows

2020-08-17 Thread GitBox
nealrichardson commented on pull request #7928: URL: https://github.com/apache/arrow/pull/7928#issuecomment-675065052 @github-actions crossbow submit homebrew-r-autobrew

[GitHub] [arrow] kiszk commented on pull request #7973: ARROW-8493: [C++][Parquet] Start populating repeated ancestor definition

2020-08-17 Thread GitBox
kiszk commented on pull request #7973: URL: https://github.com/apache/arrow/pull/7973#issuecomment-675087462 Is it better to create an issue in PARQUET instead of ARROW?

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #7974: ARROW-9756: [Rust] [DataFusion] Added support for scalar UDFs of arbitrary return types

2020-08-17 Thread GitBox
jorgecarleitao commented on a change in pull request #7974: URL: https://github.com/apache/arrow/pull/7974#discussion_r471097047 ## File path: rust/datafusion/src/logicalplan.rs ## @@ -1087,7 +1101,7 @@ impl LogicalPlanBuilder { /// Apply a projection pub fn

[GitHub] [arrow] jorgecarleitao commented on pull request #7971: ARROW-9752: [Rust] [DataFusion] Add support for aggregate UDFs

2020-08-17 Thread GitBox
jorgecarleitao commented on pull request #7971: URL: https://github.com/apache/arrow/pull/7971#issuecomment-675042360 @andygrove , since I rebased this branch on top of master with the changes of the lock of scalar_functions, the tests consistently halt. Do you or @alamb have any hints on

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #7971: ARROW-9752: [Rust] [DataFusion] Add support for aggregate UDFs

2020-08-17 Thread GitBox
jorgecarleitao commented on a change in pull request #7971: URL: https://github.com/apache/arrow/pull/7971#discussion_r471706745 ## File path: rust/datafusion/src/execution/physical_plan/udf.rs ## @@ -20,25 +20,33 @@ use std::fmt; use arrow::array::ArrayRef; -use

[GitHub] [arrow] sunchao commented on pull request #7917: ARROW-8243: [Rust] [Parquet] Serialize Arrow schema metadata

2020-08-17 Thread GitBox
sunchao commented on pull request #7917: URL: https://github.com/apache/arrow/pull/7917#issuecomment-675178642 @nevi-me ah OK - I'll take a look soon!

[GitHub] [arrow] nealrichardson commented on pull request #7928: ARROW-6437: [R] Add AWS SDK to system dependencies for macOS and Windows

2020-08-17 Thread GitBox
nealrichardson commented on pull request #7928: URL: https://github.com/apache/arrow/pull/7928#issuecomment-675107914 Sigh, https://github.com/apache/arrow/commit/1018a4f50fb3cd9b5f622a25ad0c69b1389c9a96 just merged, and it adds additional aws-sdk-cpp components, so the new builds are

[GitHub] [arrow] nevi-me commented on pull request #7917: ARROW-8243: [Rust] [Parquet] Serialize Arrow schema metadata

2020-08-17 Thread GitBox
nevi-me commented on pull request #7917: URL: https://github.com/apache/arrow/pull/7917#issuecomment-675177564 @sunchao I forgot to mention, I made the changes, so this is ready for review

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #7967: ARROW-9751: [Rust] [DataFusion] Allow UDFs to accept multiple data types per argument

2020-08-17 Thread GitBox
jorgecarleitao commented on a change in pull request #7967: URL: https://github.com/apache/arrow/pull/7967#discussion_r471688534 ## File path: rust/datafusion/tests/sql.rs ## @@ -232,6 +326,55 @@ fn custom_sqrt(args: &[ArrayRef]) -> Result { Ok(Arc::new(builder.finish()))

[GitHub] [arrow] github-actions[bot] commented on pull request #7928: ARROW-6437: [R] Add AWS SDK to system dependencies for macOS and Windows

2020-08-17 Thread GitBox
github-actions[bot] commented on pull request #7928: URL: https://github.com/apache/arrow/pull/7928#issuecomment-675101369 Revision: 3bd693e259c897dc1d95f5cf6e724ffebab4c1bf Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] kou opened a new pull request #7982: ARROW-9744: [Python] Fix build failure on aarch64

2020-08-17 Thread GitBox
kou opened a new pull request #7982: URL: https://github.com/apache/arrow/pull/7982 pyarrow just uses SetupCxxFlags.cmake. It doesn't use DefineOptions.cmake. We need to define ARROW_ARMV8_ARCH in pyarrow too. It's defined in DefineOptions.cmake.

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #7974: ARROW-9756: [Rust] [DataFusion] Added support for scalar UDFs of arbitrary return types

2020-08-17 Thread GitBox
jorgecarleitao commented on a change in pull request #7974: URL: https://github.com/apache/arrow/pull/7974#discussion_r471704229 ## File path: rust/datafusion/src/execution/physical_plan/expressions.rs ## @@ -156,7 +168,7 @@ macro_rules! sum_accumulate { #[derive(Debug)]

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #7974: ARROW-9756: [Rust] [DataFusion] Added support for scalar UDFs of arbitrary return types

2020-08-17 Thread GitBox
jorgecarleitao commented on a change in pull request #7974: URL: https://github.com/apache/arrow/pull/7974#discussion_r471718884 ## File path: rust/datafusion/src/sql/planner.rs ## @@ -476,70 +478,50 @@ impl SqlToRel { } SQLExpr::Function(function)

[GitHub] [arrow] github-actions[bot] commented on pull request #7928: ARROW-6437: [R] Add AWS SDK to system dependencies for macOS and Windows

2020-08-17 Thread GitBox
github-actions[bot] commented on pull request #7928: URL: https://github.com/apache/arrow/pull/7928#issuecomment-675066462 Revision: f4afc43981c2cdae9f1758d521ec0ac0e28921fc Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #7974: ARROW-9756: [Rust] [DataFusion] Added support for scalar UDFs of arbitrary return types

2020-08-17 Thread GitBox
jorgecarleitao commented on a change in pull request #7974: URL: https://github.com/apache/arrow/pull/7974#discussion_r471708914 ## File path: rust/datafusion/src/logicalplan.rs ## @@ -364,22 +364,25 @@ pub enum Expr { /// List of expressions to feed to the functions

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #7974: ARROW-9756: [Rust] [DataFusion] Added support for scalar UDFs of arbitrary return types

2020-08-17 Thread GitBox
jorgecarleitao commented on a change in pull request #7974: URL: https://github.com/apache/arrow/pull/7974#discussion_r471708549 ## File path: rust/datafusion/src/logicalplan.rs ## @@ -355,7 +355,7 @@ pub enum Expr { /// List of expressions to feed to the functions as

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #7967: ARROW-9751: [Rust] [DataFusion] Allow UDFs to accept multiple data types per argument

2020-08-17 Thread GitBox
jorgecarleitao commented on a change in pull request #7967: URL: https://github.com/apache/arrow/pull/7967#discussion_r471652148 ## File path: rust/datafusion/src/sql/planner.rs ## @@ -515,27 +515,29 @@ impl SqlToRel { } _ => match

[GitHub] [arrow] bkietz commented on a change in pull request #7819: ARROW-9405: [R] Switch to cpp11

2020-08-17 Thread GitBox
bkietz commented on a change in pull request #7819: URL: https://github.com/apache/arrow/pull/7819#discussion_r471664878 ## File path: r/src/arrow_cpp11.h ## @@ -0,0 +1,243 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor license

[GitHub] [arrow] pitrou commented on pull request #7928: ARROW-6437: [R] Add AWS SDK to system dependencies for macOS and Windows

2020-08-17 Thread GitBox
pitrou commented on pull request #7928: URL: https://github.com/apache/arrow/pull/7928#issuecomment-675030202 Just retry the build? This could be a sporadic issue.

[GitHub] [arrow] pitrou closed pull request #7945: ARROW-9710: [C++] Improve performance of Decimal128::ToString by 10x, and make the implementation reusable for Decimal256.

2020-08-17 Thread GitBox
pitrou closed pull request #7945: URL: https://github.com/apache/arrow/pull/7945

[GitHub] [arrow] lidavidm commented on pull request #7981: ARROW-9670: [C++][FlightRPC] don't hang if Close and Read called simultaneously

2020-08-17 Thread GitBox
lidavidm commented on pull request #7981: URL: https://github.com/apache/arrow/pull/7981#issuecomment-675035275 I originally added a unit test, but removed it since I realized it's flaky (the error happens when two blocking operations overlap; in a test, this can't be simulated since you

[GitHub] [arrow] andygrove commented on pull request #7967: ARROW-9751: [Rust] [DataFusion] Allow UDFs to accept multiple data types per argument

2020-08-17 Thread GitBox
andygrove commented on pull request #7967: URL: https://github.com/apache/arrow/pull/7967#issuecomment-675035668 Currently we only do optimizations on the logical plan, but there is a plan to have physical plan optimizations as well. On Mon, Aug 17, 2020, 11:37 AM Jorge Leitao

[GitHub] [arrow] tianchen92 commented on a change in pull request #7887: ARROW-9304: [C++] Add "AppendEmpty" builder APIs for use inside StructBuilder::AppendNull

2020-08-17 Thread GitBox
tianchen92 commented on a change in pull request #7887: URL: https://github.com/apache/arrow/pull/7887#discussion_r471248061 ## File path: cpp/src/arrow/buffer_builder.h ## @@ -292,6 +292,11 @@ class TypedBufferBuilder { return Status::OK(); } + void

[GitHub] [arrow] romainfrancois commented on a change in pull request #7819: ARROW-9405: [R] Switch to cpp11

2020-08-17 Thread GitBox
romainfrancois commented on a change in pull request #7819: URL: https://github.com/apache/arrow/pull/7819#discussion_r471335906 ## File path: r/src/arrow_cpp11.h ## @@ -0,0 +1,243 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor

[GitHub] [arrow] liyafan82 commented on a change in pull request #7748: ARROW-9388: [C++] Division kernels

2020-08-17 Thread GitBox
liyafan82 commented on a change in pull request #7748: URL: https://github.com/apache/arrow/pull/7748#discussion_r471346362 ## File path: docs/source/cpp/compute.rst ## @@ -197,6 +197,10 @@ an ``Invalid`` :class:`Status` when overflow is detected.

[GitHub] [arrow] liyafan82 commented on a change in pull request #7748: ARROW-9388: [C++] Division kernels

2020-08-17 Thread GitBox
liyafan82 commented on a change in pull request #7748: URL: https://github.com/apache/arrow/pull/7748#discussion_r471346229 ## File path: cpp/src/arrow/compute/api_scalar.h ## @@ -129,6 +129,20 @@ Result Multiply(const Datum& left, const Datum& right,

[GitHub] [arrow] liyafan82 commented on a change in pull request #7748: ARROW-9388: [C++] Division kernels

2020-08-17 Thread GitBox
liyafan82 commented on a change in pull request #7748: URL: https://github.com/apache/arrow/pull/7748#discussion_r471345721 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic.cc ## @@ -186,6 +187,42 @@ struct MultiplyChecked { } }; +struct Divide { + template

[GitHub] [arrow] liyafan82 commented on a change in pull request #7748: ARROW-9388: [C++] Division kernels

2020-08-17 Thread GitBox
liyafan82 commented on a change in pull request #7748: URL: https://github.com/apache/arrow/pull/7748#discussion_r471346102 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic.cc ## @@ -186,6 +187,42 @@ struct MultiplyChecked { } }; +struct Divide { + template

[GitHub] [arrow] liyafan82 commented on a change in pull request #7748: ARROW-9388: [C++] Division kernels

2020-08-17 Thread GitBox
liyafan82 commented on a change in pull request #7748: URL: https://github.com/apache/arrow/pull/7748#discussion_r471346573 ## File path: cpp/src/arrow/compute/api_scalar.h ## @@ -129,6 +129,20 @@ Result Multiply(const Datum& left, const Datum& right,

[GitHub] [arrow] romainfrancois commented on a change in pull request #7819: ARROW-9405: [R] Switch to cpp11

2020-08-17 Thread GitBox
romainfrancois commented on a change in pull request #7819: URL: https://github.com/apache/arrow/pull/7819#discussion_r471349293 ## File path: r/src/arrow_cpp11.h ## @@ -0,0 +1,243 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor

[GitHub] [arrow] liyafan82 commented on a change in pull request #7748: ARROW-9388: [C++] Division kernels

2020-08-17 Thread GitBox
liyafan82 commented on a change in pull request #7748: URL: https://github.com/apache/arrow/pull/7748#discussion_r471348958 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic_test.cc ## @@ -492,6 +486,81 @@ TYPED_TEST(TestBinaryArithmeticFloating, Add) {

[GitHub] [arrow] kszucs commented on pull request #7803: ARROW-9517: [C++/Python] Add support for temporary credentials to S3Options

2020-08-17 Thread GitBox
kszucs commented on pull request #7803: URL: https://github.com/apache/arrow/pull/7803#issuecomment-674806014 @ursabot build

[GitHub] [arrow] liyafan82 commented on a change in pull request #7748: ARROW-9388: [C++] Division kernels

2020-08-17 Thread GitBox
liyafan82 commented on a change in pull request #7748: URL: https://github.com/apache/arrow/pull/7748#discussion_r471343433 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic.cc ## @@ -186,6 +187,42 @@ struct MultiplyChecked { } }; +struct Divide { + template

[GitHub] [arrow] romainfrancois commented on a change in pull request #7819: ARROW-9405: [R] Switch to cpp11

2020-08-17 Thread GitBox
romainfrancois commented on a change in pull request #7819: URL: https://github.com/apache/arrow/pull/7819#discussion_r471347426 ## File path: r/src/arrow_cpp11.h ## @@ -0,0 +1,243 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor

[GitHub] [arrow] liyafan82 commented on a change in pull request #7748: ARROW-9388: [C++] Division kernels

2020-08-17 Thread GitBox
liyafan82 commented on a change in pull request #7748: URL: https://github.com/apache/arrow/pull/7748#discussion_r471347282 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic_test.cc ## @@ -139,13 +139,7 @@ class TestBinaryArithmetic : public TestBase {

[GitHub] [arrow] romainfrancois commented on a change in pull request #7819: ARROW-9405: [R] Switch to cpp11

2020-08-17 Thread GitBox
romainfrancois commented on a change in pull request #7819: URL: https://github.com/apache/arrow/pull/7819#discussion_r471335034 ## File path: r/src/array_from_vector.cpp ## @@ -406,9 +403,12 @@ std::shared_ptr MakeFactorArray(Rcpp::IntegerVector_ factor, case

[GitHub] [arrow] romainfrancois commented on a change in pull request #7819: ARROW-9405: [R] Switch to cpp11

2020-08-17 Thread GitBox
romainfrancois commented on a change in pull request #7819: URL: https://github.com/apache/arrow/pull/7819#discussion_r471335186 ## File path: r/src/array_from_vector.cpp ## @@ -1064,42 +1063,42 @@ class FixedSizeBinaryVectorConverter : public VectorConverter {

[GitHub] [arrow] liyafan82 commented on a change in pull request #7748: ARROW-9388: [C++] Division kernels

2020-08-17 Thread GitBox
liyafan82 commented on a change in pull request #7748: URL: https://github.com/apache/arrow/pull/7748#discussion_r471348419 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic_test.cc ## @@ -492,6 +486,81 @@ TYPED_TEST(TestBinaryArithmeticFloating, Add) {

[GitHub] [arrow] pitrou commented on a change in pull request #7748: ARROW-9388: [C++] Division kernels

2020-08-17 Thread GitBox
pitrou commented on a change in pull request #7748: URL: https://github.com/apache/arrow/pull/7748#discussion_r471364203 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic_test.cc ## @@ -492,6 +486,81 @@ TYPED_TEST(TestBinaryArithmeticFloating, Add) {

[GitHub] [arrow] liyafan82 commented on a change in pull request #7748: ARROW-9388: [C++] Division kernels

2020-08-17 Thread GitBox
liyafan82 commented on a change in pull request #7748: URL: https://github.com/apache/arrow/pull/7748#discussion_r471379295 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic_test.cc ## @@ -492,6 +486,81 @@ TYPED_TEST(TestBinaryArithmeticFloating, Add) {

[GitHub] [arrow] kszucs commented on pull request #7952: ARROW-9556: [Python][C++] Segfaults in UnionArray with null values

2020-08-17 Thread GitBox
kszucs commented on pull request #7952: URL: https://github.com/apache/arrow/pull/7952#issuecomment-674783758 +1, merging on green.

[GitHub] [arrow] kszucs commented on pull request #7803: ARROW-9517: [C++/Python] Add support for temporary credentials to S3Options

2020-08-17 Thread GitBox
kszucs commented on pull request #7803: URL: https://github.com/apache/arrow/pull/7803#issuecomment-674786830 Indeed, the docker images there need to be rebuilt. I'm updating them.

[GitHub] [arrow] kszucs closed pull request #7952: ARROW-9556: [Python][C++] Segfaults in UnionArray with null values

2020-08-17 Thread GitBox
kszucs closed pull request #7952: URL: https://github.com/apache/arrow/pull/7952

[GitHub] [arrow] liyafan82 commented on a change in pull request #7748: ARROW-9388: [C++] Division kernels

2020-08-17 Thread GitBox
liyafan82 commented on a change in pull request #7748: URL: https://github.com/apache/arrow/pull/7748#discussion_r471345890 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic.cc ## @@ -186,6 +187,42 @@ struct MultiplyChecked { } }; +struct Divide { + template

[GitHub] [arrow] kszucs commented on a change in pull request #7952: ARROW-9556: [Python][C++] Segfaults in UnionArray with null values

2020-08-17 Thread GitBox
kszucs commented on a change in pull request #7952: URL: https://github.com/apache/arrow/pull/7952#discussion_r471360126 ## File path: python/pyarrow/tests/test_array.py ## @@ -975,6 +975,47 @@ def check_result(result, expected_field_names, expected_type_codes, arr =

[GitHub] [arrow] kszucs commented on pull request #7932: ARROW-9703: [Developer][Archery] Restartable cherry-picking process for creating maintenance branches

2020-08-17 Thread GitBox
kszucs commented on pull request #7932: URL: https://github.com/apache/arrow/pull/7932#issuecomment-674782381 The tests are passing locally, but the arrow repository gets fetched differently on github actions, so need to fix that before merging.

[GitHub] [arrow] pitrou commented on pull request #7979: ARROW-9766: [C++][Parquet] Add EngineVersion for toggling between new and old logic

2020-08-17 Thread GitBox
pitrou commented on pull request #7979: URL: https://github.com/apache/arrow/pull/7979#issuecomment-674782797 Is there a reason to introduce this and burden the API with a new setting? If the "new" engine supports a superset of what the current engine supports, then it seems it should

[GitHub] [arrow] pitrou edited a comment on pull request #7803: ARROW-9517: [C++/Python] Add support for temporary credentials to S3Options

2020-08-17 Thread GitBox
pitrou edited a comment on pull request #7803: URL: https://github.com/apache/arrow/pull/7803#issuecomment-674781958 @corleyma Unfortunately, the "ursabot" builders use a different execution scheme. The packages need to be updated manually there, I believe. I pinged @kszucs who should be

[GitHub] [arrow] pitrou commented on pull request #7803: ARROW-9517: [C++/Python] Add support for temporary credentials to S3Options

2020-08-17 Thread GitBox
pitrou commented on pull request #7803: URL: https://github.com/apache/arrow/pull/7803#issuecomment-674781958 @corleyma Unfortunately, the "ursabot" builders use a different execution scheme. The packages need to be updated manually there, I believe. I pinged @kszucs who should be able to

[GitHub] [arrow] github-actions[bot] commented on pull request #7980: ARROW-9768 [Rust] [DataFusion] Rename PhysicalPlannerImpl to DefaultPhysicalPlanner

2020-08-17 Thread GitBox
github-actions[bot] commented on pull request #7980: URL: https://github.com/apache/arrow/pull/7980#issuecomment-674850748 https://issues.apache.org/jira/browse/ARROW-9768

[GitHub] [arrow] alamb commented on a change in pull request #7967: ARROW-9751: [Rust] [DataFusion] Allow UDFs to accept multiple data types per argument

2020-08-17 Thread GitBox
alamb commented on a change in pull request #7967: URL: https://github.com/apache/arrow/pull/7967#discussion_r471445155 ## File path: rust/datafusion/src/optimizer/type_coercion.rs ## @@ -345,4 +345,143 @@ mod tests { assert_eq!(expected, format!("{:?}", expr2));

[GitHub] [arrow] pitrou commented on pull request #7945: ARROW-9710: [C++] Improve performance of Decimal128::ToString by 10x, and make the implementation reusable for Decimal256.

2020-08-17 Thread GitBox
pitrou commented on pull request #7945: URL: https://github.com/apache/arrow/pull/7945#issuecomment-674884766 Will merge if CI passes. Thank you @MingyuZhong !

[GitHub] [arrow] kszucs commented on pull request #7933: [Release] Cherry-pick commits to 1.0.x maintenance branch

2020-08-17 Thread GitBox
kszucs commented on pull request #7933: URL: https://github.com/apache/arrow/pull/7933#issuecomment-674887212 @github-actions crossbow submit -g integration -g conda -g wheel -g linux

[GitHub] [arrow] pitrou commented on pull request #7803: ARROW-9517: [C++/Python] Add support for temporary credentials to S3Options

2020-08-17 Thread GitBox
pitrou commented on pull request #7803: URL: https://github.com/apache/arrow/pull/7803#issuecomment-674891087 Seems like we're fine now, will merge.

[GitHub] [arrow] pitrou commented on pull request #7789: PARQUET-1878: [C++] lz4 codec is not compatible with Hadoop Lz4Codec

2020-08-17 Thread GitBox
pitrou commented on pull request #7789: URL: https://github.com/apache/arrow/pull/7789#issuecomment-674893008 @patrickpai Do you have some time to make the desired changes here?

[GitHub] [arrow] andygrove commented on a change in pull request #7980: ARROW-9768 [Rust] [DataFusion] Rename PhysicalPlannerImpl to DefaultPhysicalPlanner

2020-08-17 Thread GitBox
andygrove commented on a change in pull request #7980: URL: https://github.com/apache/arrow/pull/7980#discussion_r471502392 ## File path: rust/datafusion/src/execution/physical_plan/mod.rs ## @@ -33,7 +33,8 @@ use arrow::{ }; use udf::ScalarFunction; -/// Physical query

[GitHub] [arrow] github-actions[bot] commented on pull request #7933: [Release] Cherry-pick commits to 1.0.x maintenance branch

2020-08-17 Thread GitBox
github-actions[bot] commented on pull request #7933: URL: https://github.com/apache/arrow/pull/7933#issuecomment-674910699 Revision: 2c362d6fcc0e29bd5b9ab2d5c62d0d759b44404d Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] alamb opened a new pull request #7980: ARROW-9768 [Rust] [DataFusion] Rename PhysicalPlannerImpl to DefaultPhysicalPlanner

2020-08-17 Thread GitBox
alamb opened a new pull request #7980: URL: https://github.com/apache/arrow/pull/7980 Proposed follow up to https://github.com/apache/arrow/pull/7975 -- rename `PhysicalPlannerImpl` to `DefaultPhysicalPlanner` to better describe what it is and that the design allows for more than one.

[GitHub] [arrow] alamb commented on a change in pull request #7978: ARROW-9757: [Rust] [DataFusion] Add prelude.rs

2020-08-17 Thread GitBox
alamb commented on a change in pull request #7978: URL: https://github.com/apache/arrow/pull/7978#discussion_r471440516 ## File path: rust/datafusion/src/lib.rs ## @@ -15,17 +15,46 @@ // specific language governing permissions and limitations // under the License.

[GitHub] [arrow] pitrou commented on a change in pull request #7945: ARROW-9710: [C++] Improve performance of Decimal128::ToString by 10x, and make the implementation reusable for Decimal256.

2020-08-17 Thread GitBox
pitrou commented on a change in pull request #7945: URL: https://github.com/apache/arrow/pull/7945#discussion_r471472052 ## File path: cpp/src/arrow/util/decimal.cc ## @@ -241,64 +284,43 @@ Decimal128::operator int64_t() const { return static_cast(low_bits()); } -static

[GitHub] [arrow] pitrou closed pull request #7803: ARROW-9517: [C++/Python] Add support for temporary credentials to S3Options

2020-08-17 Thread GitBox
pitrou closed pull request #7803: URL: https://github.com/apache/arrow/pull/7803

[GitHub] [arrow] kszucs commented on pull request #7933: [Release] Cherry-pick commits to 1.0.x maintenance branch

2020-08-17 Thread GitBox
kszucs commented on pull request #7933: URL: https://github.com/apache/arrow/pull/7933#issuecomment-674909175 @github-actions crossbow submit -g integration -g conda -g wheel -g linux

[GitHub] [arrow] alamb commented on a change in pull request #7975: ARROW-9758: [Rust] [DataFusion] Allow physical planner to be replaced

2020-08-17 Thread GitBox
alamb commented on a change in pull request #7975: URL: https://github.com/apache/arrow/pull/7975#discussion_r471437170 ## File path: rust/datafusion/src/execution/context.rs ## @@ -373,363 +355,12 @@ impl ExecutionContext { pub fn create_physical_plan( &self,

[GitHub] [arrow] andygrove closed pull request #7980: ARROW-9768 [Rust] [DataFusion] Rename PhysicalPlannerImpl to DefaultPhysicalPlanner

2020-08-17 Thread GitBox
andygrove closed pull request #7980: URL: https://github.com/apache/arrow/pull/7980

[GitHub] [arrow] lidavidm opened a new pull request #7981: ARROW-9670: [C++][FlightRPC] don't hang if Close and Read called simultaneously

2020-08-17 Thread GitBox
lidavidm opened a new pull request #7981: URL: https://github.com/apache/arrow/pull/7981 We were accidentally trying to recursively hold a lock.

[GitHub] [arrow] kszucs commented on pull request #7933: [Release] Cherry-pick commits to 1.0.x maintenance branch

2020-08-17 Thread GitBox
kszucs commented on pull request #7933: URL: https://github.com/apache/arrow/pull/7933#issuecomment-674995771 The three integration build failures and the two GHA errors are unrelated, so I'm merging this into the upstream maintenance branch before creating the tag.

[GitHub] [arrow] kszucs commented on pull request #7933: [Release] Cherry-pick commits to 1.0.x maintenance branch

2020-08-17 Thread GitBox
kszucs commented on pull request #7933: URL: https://github.com/apache/arrow/pull/7933#issuecomment-674996246 `Rebase and merge` is disabled, so I'm pushing it directly to the upstream branch.

[GitHub] [arrow] kszucs merged pull request #7933: [Release] Cherry-pick commits to 1.0.x maintenance branch

2020-08-17 Thread GitBox
kszucs merged pull request #7933: URL: https://github.com/apache/arrow/pull/7933

[GitHub] [arrow] github-actions[bot] commented on pull request #7981: ARROW-9670: [C++][FlightRPC] don't hang if Close and Read called simultaneously

2020-08-17 Thread GitBox
github-actions[bot] commented on pull request #7981: URL: https://github.com/apache/arrow/pull/7981#issuecomment-675003254 https://issues.apache.org/jira/browse/ARROW-9670

[GitHub] [arrow] alamb commented on a change in pull request #7975: ARROW-9758: [Rust] [DataFusion] Allow physical planner to be replaced

2020-08-17 Thread GitBox
alamb commented on a change in pull request #7975: URL: https://github.com/apache/arrow/pull/7975#discussion_r471433074 ## File path: rust/datafusion/src/execution/context.rs ## @@ -1452,11 +1109,34 @@ mod tests { Ok(()) } +#[test] +fn

[GitHub] [arrow] alamb commented on a change in pull request #7972: ARROW-9742: [Rust] [DataFusion] Improved DataFrame trait (formerly known as the Table trait)

2020-08-17 Thread GitBox
alamb commented on a change in pull request #7972: URL: https://github.com/apache/arrow/pull/7972#discussion_r471438457 ## File path: rust/datafusion/src/execution/context.rs ## @@ -89,12 +91,18 @@ impl ExecutionConfig { } /// Execution context for registering data sources

[GitHub] [arrow] alamb commented on a change in pull request #7967: ARROW-9751: [Rust] [DataFusion] Allow UDFs to accept multiple data types per argument

2020-08-17 Thread GitBox
alamb commented on a change in pull request #7967: URL: https://github.com/apache/arrow/pull/7967#discussion_r471442333 ## File path: rust/datafusion/src/sql/planner.rs ## @@ -523,10 +523,14 @@ impl SqlToRel { let mut safe_args: Vec<Expr> = vec![];

[GitHub] [arrow] pitrou commented on a change in pull request #7898: ARROW-9642: [C++] Let MakeBuilder refer DictionaryType's index_type for deciding the starting bit width of the indices

2020-08-17 Thread GitBox
pitrou commented on a change in pull request #7898: URL: https://github.com/apache/arrow/pull/7898#discussion_r471486448 ## File path: cpp/src/arrow/array/array_dict_test.cc ## @@ -904,6 +904,67 @@ TEST(TestDecimalDictionaryBuilder, DoubleTableSize) {

[GitHub] [arrow] github-actions[bot] commented on pull request #7933: [Release] Cherry-pick commits to 1.0.x maintenance branch

2020-08-17 Thread GitBox
github-actions[bot] commented on pull request #7933: URL: https://github.com/apache/arrow/pull/7933#issuecomment-674889759 Revision: af3cc932f50f5052b493f48f6d3c24b8d2179a27 Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] pitrou commented on a change in pull request #7748: ARROW-9388: [C++] Division kernels

2020-08-17 Thread GitBox
pitrou commented on a change in pull request #7748: URL: https://github.com/apache/arrow/pull/7748#discussion_r471518131 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic_test.cc ## @@ -492,6 +486,81 @@ TYPED_TEST(TestBinaryArithmeticFloating, Add) {

[GitHub] [arrow] alamb commented on pull request #7880: ARROW-9619: [Rust] [DataFusion] Add predicate push-down

2020-08-17 Thread GitBox
alamb commented on pull request #7880: URL: https://github.com/apache/arrow/pull/7880#issuecomment-674923231 > > Something that can be left for future optimization: we can also go the other direction, i.e. break `And` filters into individual boolean expressions so these filters can

[GitHub] [arrow] alamb commented on a change in pull request #7880: ARROW-9619: [Rust] [DataFusion] Add predicate push-down

2020-08-17 Thread GitBox
alamb commented on a change in pull request #7880: URL: https://github.com/apache/arrow/pull/7880#discussion_r471103102 ## File path: rust/datafusion/src/optimizer/filter_push_down.rs ## @@ -0,0 +1,631 @@ +// regarding copyright ownership. The ASF licenses this file +// to

[GitHub] [arrow] pitrou closed pull request #7826: ARROW-9495: [C++] Equality assertions don't handle Inf / -Inf properly

2020-08-17 Thread GitBox
pitrou closed pull request #7826: URL: https://github.com/apache/arrow/pull/7826