[GitHub] [arrow] github-actions[bot] commented on issue #7014: Go: Minor change to make newBuilder public to aid upstream

2020-04-22 Thread GitBox
github-actions[bot] commented on issue #7014: URL: https://github.com/apache/arrow/pull/7014#issuecomment-617881461 Thanks for opening a pull request! Could you open an issue for this pull request on JIRA? https://issues.apache.org/jira/browse/ARROW Then could you

[GitHub] [arrow] kiszk commented on issue #6954: ARROW-8440: [C++] Refine SIMD header files

2020-04-22 Thread GitBox
kiszk commented on issue #6954: URL: https://github.com/apache/arrow/pull/6954#issuecomment-617854855 Is the function `Armv8CrcHashParallel` uses somewhere? Sorry if I overlook it. This is an automated message from the

[GitHub] [arrow] kiszk edited a comment on issue #6954: ARROW-8440: [C++] Refine SIMD header files

2020-04-22 Thread GitBox
kiszk edited a comment on issue #6954: URL: https://github.com/apache/arrow/pull/6954#issuecomment-617854855 Is the function `Armv8CrcHashParallel` used somewhere? Sorry if I overlook it. This is an automated message from

[GitHub] [arrow] kszucs commented on issue #7000: ARROW-8065: [C++][Dataset] Refactor ScanOptions and Fragment relation

2020-04-22 Thread GitBox
kszucs commented on issue #7000: URL: https://github.com/apache/arrow/pull/7000#issuecomment-617868408 @ursabot build This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [arrow] mindhash opened a new pull request #7014: Go: Minor change to make newBuilder public to aid upstream

2020-04-22 Thread GitBox
mindhash opened a new pull request #7014: URL: https://github.com/apache/arrow/pull/7014 Hello team, This minor change makes newBuilder() public to reduce verbosity in upstream. To give you example, I am working on a parquet read / write into Arrow Record batch where the parquet

[GitHub] [arrow] pitrou commented on a change in pull request #6954: ARROW-8440: [C++] Refine SIMD header files

2020-04-22 Thread GitBox
pitrou commented on a change in pull request #6954: URL: https://github.com/apache/arrow/pull/6954#discussion_r413092039 ## File path: cpp/src/arrow/util/hash_util.h ## @@ -27,39 +27,27 @@ #include "arrow/util/logging.h" #include "arrow/util/macros.h" -#include

[GitHub] [arrow] andygrove commented on a change in pull request #6770: ARROW-7842: [Rust] [Parquet] implement array_reader for list type columns

2020-04-22 Thread GitBox
andygrove commented on a change in pull request #6770: URL: https://github.com/apache/arrow/pull/6770#discussion_r413404215 ## File path: rust/datafusion/src/utils.rs ## @@ -74,6 +74,29 @@ macro_rules! make_string { }}; } +macro_rules! make_string_from_list { +

[GitHub] [arrow] andygrove commented on a change in pull request #6770: ARROW-7842: [Rust] [Parquet] implement array_reader for list type columns

2020-04-22 Thread GitBox
andygrove commented on a change in pull request #6770: URL: https://github.com/apache/arrow/pull/6770#discussion_r413404556 ## File path: rust/datafusion/src/utils.rs ## @@ -120,6 +143,7 @@ pub fn array_value_to_string(column: array::ArrayRef, row: usize) -> Result {

[GitHub] [arrow] andygrove commented on a change in pull request #7004: ARROW-3827: [Rust] Implement UnionArray Updated

2020-04-22 Thread GitBox
andygrove commented on a change in pull request #7004: URL: https://github.com/apache/arrow/pull/7004#discussion_r413408444 ## File path: rust/arrow/src/array/mod.rs ## @@ -85,6 +85,7 @@ mod array; mod builder; mod data; mod equal; +mod union; Review comment: I

[GitHub] [arrow] andygrove commented on a change in pull request #7004: ARROW-3827: [Rust] Implement UnionArray Updated

2020-04-22 Thread GitBox
andygrove commented on a change in pull request #7004: URL: https://github.com/apache/arrow/pull/7004#discussion_r413408241 ## File path: rust/arrow/src/array/equal.rs ## @@ -1046,6 +1062,30 @@ impl PartialEq for Value { } } +impl JsonEqual for UnionArray { +fn

[GitHub] [arrow] andygrove commented on issue #6972: ARROW-8287: [Rust] Add "pretty" util to help with printing tabular output of RecordBatches

2020-04-22 Thread GitBox
andygrove commented on issue #6972: URL: https://github.com/apache/arrow/pull/6972#issuecomment-618100341 Thanks @markhildreth for the detailed write-up in the JIRA! I've started looking through this. I'm not sure I understand all the points you made yet, but if there is a way to

[GitHub] [arrow] wesm commented on issue #5947: ARROW-7300: [C++][Gandiva] Implement functions to cast from strings to integers/floats

2020-04-22 Thread GitBox
wesm commented on issue #5947: URL: https://github.com/apache/arrow/pull/5947#issuecomment-618107435 @praveenbingo @projjal would you be able to take a look now? This is an automated message from the Apache Git Service. To

[GitHub] [arrow] fsaintjacques commented on issue #7011: ARROW-8554 [C++][Benchmark] Fix building error "cannot bind lvalue"

2020-04-22 Thread GitBox
fsaintjacques commented on issue #7011: URL: https://github.com/apache/arrow/pull/7011#issuecomment-61869 I understood that @bkietz added an entry for this, maybe it doesn't have the benchmark enabled? This is an

[GitHub] [arrow] wesm commented on issue #7011: ARROW-8554: [C++][Benchmark] Fix building error "cannot bind lvalue"

2020-04-22 Thread GitBox
wesm commented on issue #7011: URL: https://github.com/apache/arrow/pull/7011#issuecomment-618111873 manylinux1 uses gcc 4.8 but does not build the benchmarks. This is an automated message from the Apache Git Service. To

[GitHub] [arrow] houqp commented on issue #7009: ARROW-8552: [Rust] support iterate parquet row columns

2020-04-22 Thread GitBox
houqp commented on issue #7009: URL: https://github.com/apache/arrow/pull/7009#issuecomment-618158794 @nevi-me rebased and tests are passing now :) This is an automated message from the Apache Git Service. To respond to the

[GitHub] [arrow] andygrove commented on a change in pull request #6770: ARROW-7842: [Rust] [Parquet] implement array_reader for list type columns

2020-04-22 Thread GitBox
andygrove commented on a change in pull request #6770: URL: https://github.com/apache/arrow/pull/6770#discussion_r413403760 ## File path: rust/datafusion/src/utils.rs ## @@ -74,6 +74,29 @@ macro_rules! make_string { }}; } +macro_rules! make_string_from_list { +

[GitHub] [arrow] andygrove commented on a change in pull request #6770: ARROW-7842: [Rust] [Parquet] implement array_reader for list type columns

2020-04-22 Thread GitBox
andygrove commented on a change in pull request #6770: URL: https://github.com/apache/arrow/pull/6770#discussion_r413403453 ## File path: rust/datafusion/src/logicalplan.rs ## @@ -828,8 +828,8 @@ mod tests { .build()?; let expected = "Projection: #id\ -

[GitHub] [arrow] andygrove commented on issue #4140: ARROW-5123: [Rust] Parquet derive for simple structs

2020-04-22 Thread GitBox
andygrove commented on issue #4140: URL: https://github.com/apache/arrow/pull/4140#issuecomment-618098211 @bryantbiggs I will take a look at the release tag issue this weekend. This is an automated message from the Apache

[GitHub] [arrow] fsaintjacques commented on issue #7000: ARROW-8065: [C++][Dataset] Refactor ScanOptions and Fragment relation

2020-04-22 Thread GitBox
fsaintjacques commented on issue #7000: URL: https://github.com/apache/arrow/pull/7000#issuecomment-618110604 Addressed most comments and updated followup ticket with what's missing. PTAL and merge quickly so we can unblock the blocked tickets :)

[GitHub] [arrow] xuancong84 opened a new issue #7017: suggestion: why not serialize complex numbers in a Python list/dict/set

2020-04-22 Thread GitBox
xuancong84 opened a new issue #7017: URL: https://github.com/apache/arrow/issues/7017 Dear developers, I realize that complex numbers in Numpy arrays and Pandas dataframe/series can be serialized, but complex numbers in Python structures (e.g., `[1, 2.5, 3+1.j, np.nan]`) cannot be

[GitHub] [arrow] kou commented on issue #6996: ARROW-8538: [Packaging] Remove boost from homebrew formula

2020-04-22 Thread GitBox
kou commented on issue #6996: URL: https://github.com/apache/arrow/pull/6996#issuecomment-618066972 Could you make a JIRA? This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [arrow] mcassels commented on issue #6770: ARROW-7842 [Parquet][Rust] implement array_reader for list type columns

2020-04-22 Thread GitBox
mcassels commented on issue #6770: URL: https://github.com/apache/arrow/pull/6770#issuecomment-618082896 @andygrove @nevi-me do you have any thoughts on this? This is an automated message from the Apache Git Service. To

[GitHub] [arrow] wesm commented on issue #7011: ARROW-8554 [C++][Benchmark] Fix building error "cannot bind lvalue"

2020-04-22 Thread GitBox
wesm commented on issue #7011: URL: https://github.com/apache/arrow/pull/7011#issuecomment-618102656 Another gcc 4.8 issue here. We may need a more comprehensive build that also builds the benchmark executables This is an

[GitHub] [arrow] fsaintjacques commented on a change in pull request #6985: ARROW-8413: [C++][Parquet][WIP] Refactor Generating validity bitmap for values column

2020-04-22 Thread GitBox
fsaintjacques commented on a change in pull request #6985: URL: https://github.com/apache/arrow/pull/6985#discussion_r413428461 ## File path: cpp/src/arrow/util/bit_util.h ## @@ -43,13 +43,18 @@ #if defined(_MSC_VER) #include +#include #pragma intrinsic(_BitScanReverse)

[GitHub] [arrow] andygrove commented on a change in pull request #7004: ARROW-3827: [Rust] Implement UnionArray Updated

2020-04-22 Thread GitBox
andygrove commented on a change in pull request #7004: URL: https://github.com/apache/arrow/pull/7004#discussion_r413407658 ## File path: rust/arrow/src/array/equal.rs ## @@ -692,6 +692,22 @@ impl ArrayEqual for StructArray { } } +impl ArrayEqual for UnionArray { +

[GitHub] [arrow] andygrove commented on a change in pull request #7004: ARROW-3827: [Rust] Implement UnionArray Updated

2020-04-22 Thread GitBox
andygrove commented on a change in pull request #7004: URL: https://github.com/apache/arrow/pull/7004#discussion_r413407264 ## File path: rust/arrow/src/array/equal.rs ## @@ -692,6 +692,22 @@ impl ArrayEqual for StructArray { } } +impl ArrayEqual for UnionArray { +

[GitHub] [arrow] github-actions[bot] commented on issue #7014: ARROW-8563: GO Minor change to make newBuilder public

2020-04-22 Thread GitBox
github-actions[bot] commented on issue #7014: URL: https://github.com/apache/arrow/pull/7014#issuecomment-618187424 https://issues.apache.org/jira/browse/ARROW-8563 This is an automated message from the Apache Git Service.

[GitHub] [arrow] tustvold commented on a change in pull request #6980: ARROW-8516: [Rust] Improve PrimitiveBuilder::append_slice performance

2020-04-22 Thread GitBox
tustvold commented on a change in pull request #6980: URL: https://github.com/apache/arrow/pull/6980#discussion_r412736972 ## File path: rust/arrow/src/array/builder.rs ## @@ -236,6 +251,14 @@ impl BufferBuilderTrait for BufferBuilder {

[GitHub] [arrow] tustvold commented on a change in pull request #6980: ARROW-8516: [Rust] Improve PrimitiveBuilder::append_slice performance

2020-04-22 Thread GitBox
tustvold commented on a change in pull request #6980: URL: https://github.com/apache/arrow/pull/6980#discussion_r412736972 ## File path: rust/arrow/src/array/builder.rs ## @@ -236,6 +251,14 @@ impl BufferBuilderTrait for BufferBuilder {

[GitHub] [arrow] kou commented on issue #7008: ARROW-8551: [CI][Gandiva] Use LLVM 8 in gandiva linux build

2020-04-22 Thread GitBox
kou commented on issue #7008: URL: https://github.com/apache/arrow/pull/7008#issuecomment-617615843 @github-actions crossbow submit gandiva-jar-xenial This is an automated message from the Apache Git Service. To respond to

[GitHub] [arrow] jianxind commented on a change in pull request #6954: ARROW-8440: [C++] Refine SIMD header files

2020-04-22 Thread GitBox
jianxind commented on a change in pull request #6954: URL: https://github.com/apache/arrow/pull/6954#discussion_r412709153 ## File path: docs/source/developers/benchmarks.rst ## @@ -59,7 +59,7 @@ Sometimes, it is required to pass custom CMake flags, e.g. .. code-block:: shell

[GitHub] [arrow] tustvold commented on a change in pull request #6980: ARROW-8516: [Rust] Improve PrimitiveBuilder::append_slice performance

2020-04-22 Thread GitBox
tustvold commented on a change in pull request #6980: URL: https://github.com/apache/arrow/pull/6980#discussion_r412736972 ## File path: rust/arrow/src/array/builder.rs ## @@ -236,6 +251,14 @@ impl BufferBuilderTrait for BufferBuilder {

[GitHub] [arrow] github-actions[bot] commented on issue #7008: ARROW-8551: [CI][Gandiva] Use LLVM 8 in gandiva linux build

2020-04-22 Thread GitBox
github-actions[bot] commented on issue #7008: URL: https://github.com/apache/arrow/pull/7008#issuecomment-617616494 Revision: 1e235ddc11ff6ee4620b62e3b5f9a318d117512b Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] houqp commented on issue #7009: ARROW-8552: [Rust] support iterate parquet row columns

2020-04-22 Thread GitBox
houqp commented on issue #7009: URL: https://github.com/apache/arrow/pull/7009#issuecomment-617600824 looks like the windows CI is failing with error not related to my change: ``` "error: \'rustfmt.exe\' is not installed for the toolchain

[GitHub] [arrow] kou commented on issue #7008: ARROW-8551: [CI][Gandiva] Use LLVM 8 in gandiva linux build

2020-04-22 Thread GitBox
kou commented on issue #7008: URL: https://github.com/apache/arrow/pull/7008#issuecomment-617618030 Can we move the Docker image for building Gandiva on Linux to our `ci/docker/` like https://github.com/apache/arrow/blob/master/python/manylinux201x/Dockerfile-x86_64_base_2014 ?

[GitHub] [arrow] tustvold commented on a change in pull request #6980: ARROW-8516: [Rust] Improve PrimitiveBuilder::append_slice performance

2020-04-22 Thread GitBox
tustvold commented on a change in pull request #6980: URL: https://github.com/apache/arrow/pull/6980#discussion_r412736972 ## File path: rust/arrow/src/array/builder.rs ## @@ -236,6 +251,14 @@ impl BufferBuilderTrait for BufferBuilder {

[GitHub] [arrow] tustvold commented on a change in pull request #6980: ARROW-8516: [Rust] Improve PrimitiveBuilder::append_slice performance

2020-04-22 Thread GitBox
tustvold commented on a change in pull request #6980: URL: https://github.com/apache/arrow/pull/6980#discussion_r412736972 ## File path: rust/arrow/src/array/builder.rs ## @@ -236,6 +251,14 @@ impl BufferBuilderTrait for BufferBuilder {

[GitHub] [arrow] tustvold commented on a change in pull request #6980: ARROW-8516: [Rust] Improve PrimitiveBuilder::append_slice performance

2020-04-22 Thread GitBox
tustvold commented on a change in pull request #6980: URL: https://github.com/apache/arrow/pull/6980#discussion_r412736972 ## File path: rust/arrow/src/array/builder.rs ## @@ -236,6 +251,14 @@ impl BufferBuilderTrait for BufferBuilder {

[GitHub] [arrow] mrkn commented on issue #6667: ARROW-8162: [Format][Python] Add serialization for CSF sparse tensors to Python

2020-04-22 Thread GitBox
mrkn commented on issue #6667: URL: https://github.com/apache/arrow/pull/6667#issuecomment-618185887 @rok Thank you for working this! I'll merge this. This is an automated message from the Apache Git Service. To respond to

[GitHub] [arrow] pitrou commented on a change in pull request #7004: ARROW-3827: [Rust] Implement UnionArray Updated

2020-04-22 Thread GitBox
pitrou commented on a change in pull request #7004: URL: https://github.com/apache/arrow/pull/7004#discussion_r412926608 ## File path: rust/arrow/src/array/union.rs ## @@ -0,0 +1,1172 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor

[GitHub] [arrow] jianxind commented on a change in pull request #6954: ARROW-8440: [C++] Refine SIMD header files

2020-04-22 Thread GitBox
jianxind commented on a change in pull request #6954: URL: https://github.com/apache/arrow/pull/6954#discussion_r412846457 ## File path: docs/source/developers/benchmarks.rst ## @@ -59,7 +59,7 @@ Sometimes, it is required to pass custom CMake flags, e.g. .. code-block:: shell

[GitHub] [arrow] pitrou commented on a change in pull request #7004: ARROW-3827: [Rust] Implement UnionArray Updated

2020-04-22 Thread GitBox
pitrou commented on a change in pull request #7004: URL: https://github.com/apache/arrow/pull/7004#discussion_r412925276 ## File path: rust/arrow/src/array/union.rs ## @@ -0,0 +1,1172 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor

[GitHub] [arrow] pitrou commented on issue #6981: PARQUET-1845: [C++] Add expected results of Int96 in big-endian

2020-04-22 Thread GitBox
pitrou commented on issue #6981: URL: https://github.com/apache/arrow/pull/6981#issuecomment-617744347 As a sidenote, I think you may want to start with Arrow unittests before trying to make Parquet unittests successful. Parquet relies on many Arrow facilities.

[GitHub] [arrow] nevi-me opened a new pull request #7010: [Rust] [CI]: fix rustfmt failures

2020-04-22 Thread GitBox
nevi-me opened a new pull request #7010: URL: https://github.com/apache/arrow/pull/7010 This adds the `rustfmt` component to the Rust installations in Windows and MacOS, and fixes `rustfmt` related CI failures. @kszucs @paddyhoran

[GitHub] [arrow] pitrou commented on a change in pull request #7004: ARROW-3827: [Rust] Implement UnionArray Updated

2020-04-22 Thread GitBox
pitrou commented on a change in pull request #7004: URL: https://github.com/apache/arrow/pull/7004#discussion_r412925276 ## File path: rust/arrow/src/array/union.rs ## @@ -0,0 +1,1172 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor

[GitHub] [arrow] nevi-me commented on issue #7009: ARROW-8552: [Rust] support iterate parquet row columns

2020-04-22 Thread GitBox
nevi-me commented on issue #7009: URL: https://github.com/apache/arrow/pull/7009#issuecomment-617678597 > looks like the windows CI is failing with error not related to my change: > > ``` > "error: \'rustfmt.exe\' is not installed for the toolchain

[GitHub] [arrow] github-actions[bot] commented on issue #7010: [Rust] [CI]: fix rustfmt failures

2020-04-22 Thread GitBox
github-actions[bot] commented on issue #7010: URL: https://github.com/apache/arrow/pull/7010#issuecomment-617680858 Thanks for opening a pull request! Could you open an issue for this pull request on JIRA? https://issues.apache.org/jira/browse/ARROW Then could you

[GitHub] [arrow] nevi-me commented on issue #7004: ARROW-3827: [Rust] Implement UnionArray Updated

2020-04-22 Thread GitBox
nevi-me commented on issue #7004: URL: https://github.com/apache/arrow/pull/7004#issuecomment-617676072 > @kszucs it's failing due to `rustfmt` not being installed before testing the flight crate, any idea why this would be the case? Sorry, I don't know much about GitHub actions yet...

[GitHub] [arrow] pitrou commented on a change in pull request #6992: ARROW-7950: [Python] Determine + test minimal pandas version + raise error when pandas is too old

2020-04-22 Thread GitBox
pitrou commented on a change in pull request #6992: URL: https://github.com/apache/arrow/pull/6992#discussion_r412929769 ## File path: python/pyarrow/tests/test_pandas.py ## @@ -2685,8 +2685,8 @@ class A: 'a': pd.period_range('2000-01-01', periods=20), })

[GitHub] [arrow] pitrou commented on a change in pull request #6992: ARROW-7950: [Python] Determine + test minimal pandas version + raise error when pandas is too old

2020-04-22 Thread GitBox
pitrou commented on a change in pull request #6992: URL: https://github.com/apache/arrow/pull/6992#discussion_r412929368 ## File path: python/pyarrow/pandas-shim.pxi ## @@ -55,6 +55,16 @@ cdef class _PandasAPIShim(object): from distutils.version import LooseVersion

[GitHub] [arrow] cyb70289 commented on a change in pull request #6954: ARROW-8440: [C++] Refine SIMD header files

2020-04-22 Thread GitBox
cyb70289 commented on a change in pull request #6954: URL: https://github.com/apache/arrow/pull/6954#discussion_r412836834 ## File path: docs/source/developers/benchmarks.rst ## @@ -59,7 +59,7 @@ Sometimes, it is required to pass custom CMake flags, e.g. .. code-block:: shell

[GitHub] [arrow] emkornfield commented on a change in pull request #6985: ARROW-8413: [C++][Parquet][WIP] Refactor Generating validity bitmap for values column

2020-04-22 Thread GitBox
emkornfield commented on a change in pull request #6985: URL: https://github.com/apache/arrow/pull/6985#discussion_r413530683 ## File path: cpp/src/arrow/util/bit_util.h ## @@ -43,13 +43,18 @@ #if defined(_MSC_VER) #include +#include #pragma intrinsic(_BitScanReverse)

[GitHub] [arrow] kiszk commented on a change in pull request #6985: ARROW-8413: [C++][Parquet][WIP] Refactor Generating validity bitmap for values column

2020-04-22 Thread GitBox
kiszk commented on a change in pull request #6985: URL: https://github.com/apache/arrow/pull/6985#discussion_r412990504 ## File path: cpp/src/parquet/column_reader.cc ## @@ -50,6 +51,141 @@ using arrow::internal::checked_cast; namespace parquet { +namespace { + +inline

[GitHub] [arrow] fsaintjacques commented on a change in pull request #7000: ARROW-8065: [C++][Dataset] Refactor ScanOptions and Fragment relation

2020-04-22 Thread GitBox
fsaintjacques commented on a change in pull request #7000: URL: https://github.com/apache/arrow/pull/7000#discussion_r413007995 ## File path: cpp/src/arrow/dataset/dataset.h ## @@ -84,13 +82,12 @@ class ARROW_DS_EXPORT Fragment { class ARROW_DS_EXPORT InMemoryFragment :

[GitHub] [arrow] lidavidm opened a new pull request #7012: ARROW-8555: [FlightRPC][Java] implement DoExchange

2020-04-22 Thread GitBox
lidavidm opened a new pull request #7012: URL: https://github.com/apache/arrow/pull/7012 This is a complete implementation of DoExchange for Java. It is not tested against the C++ implementation yet, however, it still passes integration tests, so the internal refactoring should not have

[GitHub] [arrow] github-actions[bot] commented on issue #7012: ARROW-8555: [FlightRPC][Java] implement DoExchange

2020-04-22 Thread GitBox
github-actions[bot] commented on issue #7012: URL: https://github.com/apache/arrow/pull/7012#issuecomment-617824864 https://issues.apache.org/jira/browse/ARROW-8555 This is an automated message from the Apache Git Service.

[GitHub] [arrow] github-actions[bot] commented on issue #7013: ARROW-8512: [C++] Remove unused expression/operator prototype code

2020-04-22 Thread GitBox
github-actions[bot] commented on issue #7013: URL: https://github.com/apache/arrow/pull/7013#issuecomment-617824867 https://issues.apache.org/jira/browse/ARROW-8512 This is an automated message from the Apache Git Service.

[GitHub] [arrow] wesm commented on issue #6982: ARROW-8514: [Developer][Release] Verify Python 3.5 Windows wheel

2020-04-22 Thread GitBox
wesm commented on issue #6982: URL: https://github.com/apache/arrow/pull/6982#issuecomment-617827466 +1 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

[GitHub] [arrow] paddyhoran commented on a change in pull request #7004: ARROW-3827: [Rust] Implement UnionArray Updated

2020-04-22 Thread GitBox
paddyhoran commented on a change in pull request #7004: URL: https://github.com/apache/arrow/pull/7004#discussion_r412974468 ## File path: rust/arrow/src/array/union.rs ## @@ -0,0 +1,1172 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor

[GitHub] [arrow] kiszk commented on a change in pull request #6985: ARROW-8413: [C++][Parquet][WIP] Refactor Generating validity bitmap for values column

2020-04-22 Thread GitBox
kiszk commented on a change in pull request #6985: URL: https://github.com/apache/arrow/pull/6985#discussion_r412987615 ## File path: cpp/src/parquet/CMakeLists.txt ## @@ -336,6 +336,7 @@ set_source_files_properties(public_api_test.cc add_parquet_test(reader_test

[GitHub] [arrow] github-actions[bot] commented on issue #7011: ARROW-8554 [C++][Benchmark] Fix building error "cannot bind lvalue"

2020-04-22 Thread GitBox
github-actions[bot] commented on issue #7011: URL: https://github.com/apache/arrow/pull/7011#issuecomment-617806628 https://issues.apache.org/jira/browse/ARROW-8554 This is an automated message from the Apache Git Service.

[GitHub] [arrow] kiszk commented on a change in pull request #6954: ARROW-8440: [C++] Refine SIMD header files

2020-04-22 Thread GitBox
kiszk commented on a change in pull request #6954: URL: https://github.com/apache/arrow/pull/6954#discussion_r413030123 ## File path: cpp/src/arrow/util/hash_util.h ## @@ -27,39 +27,27 @@ #include "arrow/util/logging.h" #include "arrow/util/macros.h" -#include

[GitHub] [arrow] wesm opened a new pull request #7013: ARROW-8512: [C++] Remove unused expression/operator prototype code

2020-04-22 Thread GitBox
wesm opened a new pull request #7013: URL: https://github.com/apache/arrow/pull/7013 None of this code was ever used. This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [arrow] wesm commented on issue #7003: from pyarrow import parquet fails with AttributeError: type object 'pyarrow._parquet.Statistics' has no attribute '__reduce_cython__'

2020-04-22 Thread GitBox
wesm commented on issue #7003: URL: https://github.com/apache/arrow/issues/7003#issuecomment-617827061 Can you please open a JIRA issue and provide instructions to reproduce? This is an automated message from the Apache Git

[GitHub] [arrow] fsaintjacques commented on a change in pull request #7000: ARROW-8065: [C++][Dataset] Refactor ScanOptions and Fragment relation

2020-04-22 Thread GitBox
fsaintjacques commented on a change in pull request #7000: URL: https://github.com/apache/arrow/pull/7000#discussion_r413074200 ## File path: cpp/src/arrow/dataset/dataset.cc ## @@ -30,34 +30,40 @@ namespace arrow { namespace dataset { -Fragment::Fragment(std::shared_ptr

[GitHub] [arrow] fsaintjacques commented on a change in pull request #6997: ARROW-8540: [C++] Add memory allocation benchmarks

2020-04-22 Thread GitBox
fsaintjacques commented on a change in pull request #6997: URL: https://github.com/apache/arrow/pull/6997#discussion_r412951352 ## File path: cpp/src/arrow/memory_pool_benchmark.cc ## @@ -0,0 +1,127 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more

[GitHub] [arrow] pitrou commented on issue #6997: ARROW-8540: [C++] Add memory allocation benchmarks

2020-04-22 Thread GitBox
pitrou commented on issue #6997: URL: https://github.com/apache/arrow/pull/6997#issuecomment-617760416 > I tried with multiple threads and the numbers look very similar between SystemAlloc and Jemalloc when I would have expected Jemalloc to do better here. Well, it's a trivial

[GitHub] [arrow] lidavidm commented on a change in pull request #6744: PARQUET-1820: [C++] pre-buffer specified columns of row group

2020-04-22 Thread GitBox
lidavidm commented on a change in pull request #6744: URL: https://github.com/apache/arrow/pull/6744#discussion_r412985177 ## File path: cpp/src/parquet/file_reader.h ## @@ -117,6 +117,15 @@ class PARQUET_EXPORT ParquetFileReader { // Returns the file metadata. Only one

[GitHub] [arrow] lidavidm commented on a change in pull request #6744: PARQUET-1820: [C++] pre-buffer specified columns of row group

2020-04-22 Thread GitBox
lidavidm commented on a change in pull request #6744: URL: https://github.com/apache/arrow/pull/6744#discussion_r412987199 ## File path: cpp/src/parquet/properties.h ## @@ -56,10 +60,32 @@ class PARQUET_EXPORT ReaderProperties { bool is_buffered_stream_enabled() const {

[GitHub] [arrow] lidavidm commented on issue #6656: ARROW-8297: [FlightRPC][C++] Implement Flight DoExchange for C++

2020-04-22 Thread GitBox
lidavidm commented on issue #6656: URL: https://github.com/apache/arrow/pull/6656#issuecomment-617806080 This should be ready now. The only issue I have is I could not figure out how to get GCC 4.8 to use the unique_ptr overload of a method instead of the shared_ptr overload.

[GitHub] [arrow] paddyhoran commented on a change in pull request #7004: ARROW-3827: [Rust] Implement UnionArray Updated

2020-04-22 Thread GitBox
paddyhoran commented on a change in pull request #7004: URL: https://github.com/apache/arrow/pull/7004#discussion_r412972235 ## File path: rust/arrow/src/array/union.rs ## @@ -0,0 +1,1172 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or more contributor

[GitHub] [arrow] lidavidm commented on issue #6744: PARQUET-1820: [C++] pre-buffer specified columns of row group

2020-04-22 Thread GitBox
lidavidm commented on issue #6744: URL: https://github.com/apache/arrow/pull/6744#issuecomment-617784720 Thanks for the review! > It might be nice to have a convenience for prebuffering an entire row group. Something like > auto rg = file_reader->RowGroup(i); >

[GitHub] [arrow] plusplusjiajia opened a new pull request #7011: ARROW-8554 [C++][Benchmark] Fix building error "cannot bind lvalue"

2020-04-22 Thread GitBox
plusplusjiajia opened a new pull request #7011: URL: https://github.com/apache/arrow/pull/7011 https://issues.apache.org/jira/browse/ARROW-8554 This is an automated message from the Apache Git Service. To respond to the

[GitHub] [arrow] fsaintjacques commented on a change in pull request #7000: ARROW-8065: [C++][Dataset] Refactor ScanOptions and Fragment relation

2020-04-22 Thread GitBox
fsaintjacques commented on a change in pull request #7000: URL: https://github.com/apache/arrow/pull/7000#discussion_r413045582 ## File path: cpp/src/arrow/dataset/file_base.cc ## @@ -222,9 +214,8 @@ FragmentIterator FileSystemDataset::GetFragmentsImpl( } Result>

[GitHub] [arrow] fsaintjacques commented on a change in pull request #7000: ARROW-8065: [C++][Dataset] Refactor ScanOptions and Fragment relation

2020-04-22 Thread GitBox
fsaintjacques commented on a change in pull request #7000: URL: https://github.com/apache/arrow/pull/7000#discussion_r413045582 ## File path: cpp/src/arrow/dataset/file_base.cc ## @@ -222,9 +214,8 @@ FragmentIterator FileSystemDataset::GetFragmentsImpl( } Result>

[GitHub] [arrow] fsaintjacques commented on a change in pull request #7000: ARROW-8065: [C++][Dataset] Refactor ScanOptions and Fragment relation

2020-04-22 Thread GitBox
fsaintjacques commented on a change in pull request #7000: URL: https://github.com/apache/arrow/pull/7000#discussion_r413046083 ## File path: cpp/src/arrow/dataset/dataset.h ## @@ -100,16 +97,20 @@ class ARROW_DS_EXPORT InMemoryFragment : public Fragment { RecordBatchVector

[GitHub] [arrow] sunchao commented on a change in pull request #6949: ARROW-7681: [Rust] Explicitly seeking a BufReader will discard the internal buffer (2)

2020-04-22 Thread GitBox
sunchao commented on a change in pull request #6949: URL: https://github.com/apache/arrow/pull/6949#discussion_r413158295 ## File path: rust/parquet/src/util/io.rs ## @@ -31,47 +33,83 @@ pub trait Position { } /// Struct that represents a slice of a file data with

[GitHub] [arrow] fsaintjacques commented on a change in pull request #7000: ARROW-8065: [C++][Dataset] Refactor ScanOptions and Fragment relation

2020-04-22 Thread GitBox
fsaintjacques commented on a change in pull request #7000: URL: https://github.com/apache/arrow/pull/7000#discussion_r413164872 ## File path: python/pyarrow/_dataset.pyx ## @@ -519,30 +500,69 @@ cdef class Fragment: """ return

[GitHub] [arrow] houqp commented on issue #7009: ARROW-8552: [Rust] support iterate parquet row columns

2020-04-22 Thread GitBox
houqp commented on issue #7009: URL: https://github.com/apache/arrow/pull/7009#issuecomment-617914513 Thanks @nevi-me for the fix, looks like your PR has been approved. I will wait for the merge. This is an automated

[GitHub] [arrow] nealrichardson commented on issue #6995: ARROW-8549: [R] Assorted post-0.17 release cleanups

2020-04-22 Thread GitBox
nealrichardson commented on issue #6995: URL: https://github.com/apache/arrow/pull/6995#issuecomment-617926040 @github-actions crossbow submit test-conda-r-3.6 test-ubuntu-18.04-r-3.6 This is an automated message from the

[GitHub] [arrow] kiszk commented on issue #6981: PARQUET-1845: [C++] Add expected results of Int96 in big-endian

2020-04-22 Thread GitBox
kiszk commented on issue #6981: URL: https://github.com/apache/arrow/pull/6981#issuecomment-617935417 @wesm do you have any comment on this change? This is an automated message from the Apache Git Service. To respond to the

[GitHub] [arrow] wesm commented on issue #6981: PARQUET-1845: [C++] Add expected results of Int96 in big-endian

2020-04-22 Thread GitBox
wesm commented on issue #6981: URL: https://github.com/apache/arrow/pull/6981#issuecomment-617947022 I haven't had a chance to look in detail. Perhaps someone on the Parquet mailing list might be able to help This is an

[GitHub] [arrow] github-actions[bot] commented on issue #6995: ARROW-8549: [R] Assorted post-0.17 release cleanups

2020-04-22 Thread GitBox
github-actions[bot] commented on issue #6995: URL: https://github.com/apache/arrow/pull/6995#issuecomment-617926769 Revision: 61e54eb79986e3946e15c373c76e3eee4294dd44 Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] paddyhoran opened a new pull request #7015: ARROW-8560: [Rust] Docs for MutableBuffer resize are incorrect

2020-04-22 Thread GitBox
paddyhoran opened a new pull request #7015: URL: https://github.com/apache/arrow/pull/7015 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [arrow] github-actions[bot] commented on issue #7015: ARROW-8560: [Rust] Docs for MutableBuffer resize are incorrect

2020-04-22 Thread GitBox
github-actions[bot] commented on issue #7015: URL: https://github.com/apache/arrow/pull/7015#issuecomment-618016015 https://issues.apache.org/jira/browse/ARROW-8560 This is an automated message from the Apache Git Service.

[GitHub] [arrow] wesm commented on issue #7001: Use lowercase ws2_32 everywhere

2020-04-22 Thread GitBox
wesm commented on issue #7001: URL: https://github.com/apache/arrow/pull/7001#issuecomment-618021001 @davidanthoff would you mind opening a JIRA issue for this and updating the PR title? This is an automated message from

[GitHub] [arrow] kou opened a new pull request #7016: ARROW-8561: [C++][Gandiva] Stop using deprecated google::protobuf::MessageLite::ByteSize()

2020-04-22 Thread GitBox
kou opened a new pull request #7016: URL: https://github.com/apache/arrow/pull/7016 ByteSize() is deprecated and ByteSizeLong() is added since Protobuf 3.4.0. This is an automated message from the Apache Git Service. To

[GitHub] [arrow] kou commented on issue #7016: ARROW-8561: [C++][Gandiva] Stop using deprecated google::protobuf::MessageLite::ByteSize()

2020-04-22 Thread GitBox
kou commented on issue #7016: URL: https://github.com/apache/arrow/pull/7016#issuecomment-618049071 @github-actions crossbow submit -g gandiva This is an automated message from the Apache Git Service. To respond to the

[GitHub] [arrow] github-actions[bot] commented on issue #7016: ARROW-8561: [C++][Gandiva] Stop using deprecated google::protobuf::MessageLite::ByteSize()

2020-04-22 Thread GitBox
github-actions[bot] commented on issue #7016: URL: https://github.com/apache/arrow/pull/7016#issuecomment-618049644 Revision: e6ed98341efbd0f7bfed30a7aaf12935afb85fa5 Submitted crossbow builds: [ursa-labs/crossbow @

[GitHub] [arrow] github-actions[bot] commented on issue #7016: ARROW-8561: [C++][Gandiva] Stop using deprecated google::protobuf::MessageLite::ByteSize()

2020-04-22 Thread GitBox
github-actions[bot] commented on issue #7016: URL: https://github.com/apache/arrow/pull/7016#issuecomment-618051647 https://issues.apache.org/jira/browse/ARROW-8561 This is an automated message from the Apache Git Service.

[GitHub] [arrow] nealrichardson commented on issue #6996: ARROW-8538: [Packaging] Remove boost from homebrew formula

2020-04-22 Thread GitBox
nealrichardson commented on issue #6996: URL: https://github.com/apache/arrow/pull/6996#issuecomment-618053499 @kou are you adding `osx_image: xcode11.3` somewhere already or should I make a JIRA? This is an automated