[GitHub] [arrow] github-actions[bot] commented on pull request #7415: ARROW-7028: [R] Date roundtrip results in different R storage mode

2020-06-12 Thread GitBox
github-actions[bot] commented on pull request #7415: URL: https://github.com/apache/arrow/pull/7415#issuecomment-643169120 https://issues.apache.org/jira/browse/ARROW-7028 This is an automated message from the Apache Git

[GitHub] [arrow] ursabot commented on pull request #7314: ARROW-8996: [C++] runtime support for aggregate sum dense kernel

2020-06-12 Thread GitBox
ursabot commented on pull request #7314: URL: https://github.com/apache/arrow/pull/7314#issuecomment-643115702 [AMD64 Ubuntu 18.04 C++ Benchmark (#111432)](https://ci.ursalabs.org/#builders/73/builds/73) builder has been succeeded. Revision: 23aee3e5f6fc4c3fac7a859df7c52f79033c1c3c

[GitHub] [arrow] svenwb commented on pull request #7379: ARROW-9062: [Rust] json reader dictionary support

2020-06-12 Thread GitBox
svenwb commented on pull request #7379: URL: https://github.com/apache/arrow/pull/7379#issuecomment-643149155 Hey @nevi-me as @vertexclique approved as well, can you merge this? Or how shall we proceed? Thanks, Sven This is

[GitHub] [arrow] jeroen commented on pull request #7412: ARROW-9112: [R] Update autobrew script location

2020-06-12 Thread GitBox
jeroen commented on pull request #7412: URL: https://github.com/apache/arrow/pull/7412#issuecomment-643113980 Thanks! This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [arrow] sonthonaxrk commented on pull request #6979: ARROW-7800 [Python] implement iter_batches() method for ParquetFile and ParquetReader

2020-06-12 Thread GitBox
sonthonaxrk commented on pull request #6979: URL: https://github.com/apache/arrow/pull/6979#issuecomment-643178439 @wjones1 could I have write access to your branch so I can finish the changes without this going stale again?

[GitHub] [arrow] liyafan82 commented on a change in pull request #7290: ARROW-1692: [Java] UnionArray round trip not working

2020-06-12 Thread GitBox
liyafan82 commented on a change in pull request #7290: URL: https://github.com/apache/arrow/pull/7290#discussion_r439336805 ## File path: java/vector/src/main/codegen/templates/UnionVector.java ## @@ -493,6 +576,19 @@ public void splitAndTransfer(int startIndex, int length) {

[GitHub] [arrow] zhztheplayer commented on a change in pull request #7030: ARROW-7808: [Java][Dataset] Implement Datasets Java API by JNI to C++

2020-06-12 Thread GitBox
zhztheplayer commented on a change in pull request #7030: URL: https://github.com/apache/arrow/pull/7030#discussion_r439266993 ## File path: java/dataset/src/main/java/org/apache/arrow/memory/Ownerships.java ## @@ -0,0 +1,47 @@ +/* + * Licensed to the Apache Software

[GitHub] [arrow] zhztheplayer commented on a change in pull request #7030: ARROW-7808: [Java][Dataset] Implement Datasets Java API by JNI to C++

2020-06-12 Thread GitBox
zhztheplayer commented on a change in pull request #7030: URL: https://github.com/apache/arrow/pull/7030#discussion_r439266993 ## File path: java/dataset/src/main/java/org/apache/arrow/memory/Ownerships.java ## @@ -0,0 +1,47 @@ +/* + * Licensed to the Apache Software

[GitHub] [arrow] liyafan82 commented on a change in pull request #7290: ARROW-1692: [Java] UnionArray round trip not working

2020-06-12 Thread GitBox
liyafan82 commented on a change in pull request #7290: URL: https://github.com/apache/arrow/pull/7290#discussion_r439325184 ## File path: java/vector/src/main/codegen/templates/DenseUnionVector.java ## @@ -104,6 +105,7 @@ * The index is the type id, and the value is the

[GitHub] [arrow] liyafan82 commented on a change in pull request #7290: ARROW-1692: [Java] UnionArray round trip not working

2020-06-12 Thread GitBox
liyafan82 commented on a change in pull request #7290: URL: https://github.com/apache/arrow/pull/7290#discussion_r439334686 ## File path: java/vector/src/main/codegen/templates/UnionVector.java ## @@ -325,12 +361,45 @@ private void allocateTypeBuffer() {

[GitHub] [arrow] zhztheplayer commented on a change in pull request #7030: ARROW-7808: [Java][Dataset] Implement Datasets Java API by JNI to C++

2020-06-12 Thread GitBox
zhztheplayer commented on a change in pull request #7030: URL: https://github.com/apache/arrow/pull/7030#discussion_r439266993 ## File path: java/dataset/src/main/java/org/apache/arrow/memory/Ownerships.java ## @@ -0,0 +1,47 @@ +/* + * Licensed to the Apache Software

[GitHub] [arrow] liyafan82 commented on a change in pull request #7290: ARROW-1692: [Java] UnionArray round trip not working

2020-06-12 Thread GitBox
liyafan82 commented on a change in pull request #7290: URL: https://github.com/apache/arrow/pull/7290#discussion_r439338218 ## File path: java/vector/src/main/codegen/templates/UnionVector.java ## @@ -586,7 +686,9 @@ public ValueVector getVectorByType(int typeId) { }

[GitHub] [arrow] liyafan82 commented on a change in pull request #7290: ARROW-1692: [Java] UnionArray round trip not working

2020-06-12 Thread GitBox
liyafan82 commented on a change in pull request #7290: URL: https://github.com/apache/arrow/pull/7290#discussion_r439336805 ## File path: java/vector/src/main/codegen/templates/UnionVector.java ## @@ -493,6 +576,19 @@ public void splitAndTransfer(int startIndex, int length) {

[GitHub] [arrow] jianxind commented on pull request #7314: ARROW-8996: [C++] runtime support for aggregate sum dense kernel

2020-06-12 Thread GitBox
jianxind commented on pull request #7314: URL: https://github.com/apache/arrow/pull/7314#issuecomment-643106609 @ursabot benchmark --suite-filter=arrow-compute-aggregate-benchmark This is an automated message from the Apache

[GitHub] [arrow] romainfrancois opened a new pull request #7415: ARROW-7028: [R] Date roundtrip results in different R storage mode

2020-06-12 Thread GitBox
romainfrancois opened a new pull request #7415: URL: https://github.com/apache/arrow/pull/7415 Although it's to some extent tolerated that Date vectors are backed by `integer` vectors, it's much more common that they are backed by numeric: ``` r .Internal(inspect(Sys.Date()))

[GitHub] [arrow] cool-RR opened a new pull request #7416: Fix exception causes in cli.py

2020-06-12 Thread GitBox
cool-RR opened a new pull request #7416: URL: https://github.com/apache/arrow/pull/7416 I recently went over [Matplotlib](https://github.com/matplotlib/matplotlib/pull/16706), [Pandas](https://github.com/pandas-dev/pandas/pull/32322) and

[GitHub] [arrow] github-actions[bot] commented on pull request #7416: Fix exception causes in cli.py

2020-06-12 Thread GitBox
github-actions[bot] commented on pull request #7416: URL: https://github.com/apache/arrow/pull/7416#issuecomment-643200828 Thanks for opening a pull request! Could you open an issue for this pull request on JIRA? https://issues.apache.org/jira/browse/ARROW Then

[GitHub] [arrow] rok commented on a change in pull request #7044: ARROW-6485: [Format][C++] Support the format of a COO sparse matrix that has separated row and column indices

2020-06-12 Thread GitBox
rok commented on a change in pull request #7044: URL: https://github.com/apache/arrow/pull/7044#discussion_r439226180 ## File path: cpp/src/arrow/ipc/metadata_internal.cc ## @@ -882,7 +882,7 @@ Status MakeSparseTensorIndexCOO(FBB& fbb, const SparseCOOIndex& sparse_index,

[GitHub] [arrow] eyalleshem commented on a change in pull request #7384: ARROW-9082: [Rust] - Stream reader fail when steam not ended with (opt…

2020-06-12 Thread GitBox
eyalleshem commented on a change in pull request #7384: URL: https://github.com/apache/arrow/pull/7384#discussion_r439304801 ## File path: rust/arrow/src/ipc/reader.rs ## @@ -793,7 +793,21 @@ impl StreamReader { } // determine metadata length let mut

[GitHub] [arrow] liyafan82 commented on a change in pull request #7290: ARROW-1692: [Java] UnionArray round trip not working

2020-06-12 Thread GitBox
liyafan82 commented on a change in pull request #7290: URL: https://github.com/apache/arrow/pull/7290#discussion_r439335068 ## File path: java/vector/src/main/codegen/templates/UnionVector.java ## @@ -325,12 +361,45 @@ private void allocateTypeBuffer() {

[GitHub] [arrow] liyafan82 commented on a change in pull request #7275: ARROW-6110: [Java][Integration] Support LargeList Type and add integration test with C++

2020-06-12 Thread GitBox
liyafan82 commented on a change in pull request #7275: URL: https://github.com/apache/arrow/pull/7275#discussion_r439373604 ## File path: java/vector/src/main/codegen/templates/UnionLargeListWriter.java ## @@ -0,0 +1,232 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] [arrow] liyafan82 commented on a change in pull request #7275: ARROW-6110: [Java][Integration] Support LargeList Type and add integration test with C++

2020-06-12 Thread GitBox
liyafan82 commented on a change in pull request #7275: URL: https://github.com/apache/arrow/pull/7275#discussion_r439373751 ## File path: java/vector/src/main/codegen/templates/UnionLargeListWriter.java ## @@ -0,0 +1,232 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] [arrow] pitrou commented on a change in pull request #7418: ARROW-9115: [C++] Implementation of ascii_lower/ascii_upper by processing input data buffers in batch

2020-06-12 Thread GitBox
pitrou commented on a change in pull request #7418: URL: https://github.com/apache/arrow/pull/7418#discussion_r439415981 ## File path: cpp/src/arrow/compute/kernels/scalar_string.cc ## @@ -37,26 +37,108 @@ struct AsciiLength { } }; -struct AsciiUpper { - // XXX: the

[GitHub] [arrow] pitrou commented on pull request #7418: ARROW-9115: [C++] Implementation of ascii_lower/ascii_upper by processing input data buffers in batch

2020-06-12 Thread GitBox
pitrou commented on pull request #7418: URL: https://github.com/apache/arrow/pull/7418#issuecomment-643268699 I get similar numbers here. It seems to be a 15x speedup over git master. This is an automated message from the

[GitHub] [arrow] wesm commented on pull request #7417: ARROW-9079: [C++] Write benchmark for arithmetic kernels

2020-06-12 Thread GitBox
wesm commented on pull request #7417: URL: https://github.com/apache/arrow/pull/7417#issuecomment-643281405 There don't seem to be issues on MSVC https://gist.github.com/wesm/45be57393b2d9186f87faae228f12380/revisions

[GitHub] [arrow] wesm commented on pull request #7416: ARROW-9113: [Archery] Fix exception causes in cli.py

2020-06-12 Thread GitBox
wesm commented on pull request #7416: URL: https://github.com/apache/arrow/pull/7416#issuecomment-643236620 To be honest I wasn't even aware of this feature, since for the longest time I've been developing 2.7/3.x hybrid codebases where syntactic features like this are off limits. In

[GitHub] [arrow] nevi-me closed pull request #7379: ARROW-9062: [Rust] json reader dictionary support

2020-06-12 Thread GitBox
nevi-me closed pull request #7379: URL: https://github.com/apache/arrow/pull/7379 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] wesm closed pull request #7357: ARROW-9100: [C++] Add ascii_lower kernel

2020-06-12 Thread GitBox
wesm closed pull request #7357: URL: https://github.com/apache/arrow/pull/7357 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] github-actions[bot] commented on pull request #7418: ARROW-9115: [C++] Implementation of ascii_lower/ascii_upper by processing input data buffers in batch

2020-06-12 Thread GitBox
github-actions[bot] commented on pull request #7418: URL: https://github.com/apache/arrow/pull/7418#issuecomment-643265306 https://issues.apache.org/jira/browse/ARROW-9115 This is an automated message from the Apache Git

[GitHub] [arrow] bkietz commented on a change in pull request #7410: ARROW-971: [C++][Compute] IsValid, IsNull kernels

2020-06-12 Thread GitBox
bkietz commented on a change in pull request #7410: URL: https://github.com/apache/arrow/pull/7410#discussion_r439445910 ## File path: cpp/src/arrow/compute/kernels/codegen_internal.h ## @@ -181,8 +180,7 @@ struct GetOutputType> { }; template -struct GetOutputType< -

[GitHub] [arrow] fsaintjacques commented on a change in pull request #7030: ARROW-7808: [Java][Dataset] Implement Datasets Java API by JNI to C++

2020-06-12 Thread GitBox
fsaintjacques commented on a change in pull request #7030: URL: https://github.com/apache/arrow/pull/7030#discussion_r439413765 ## File path: cpp/src/arrow/dataset/discovery.h ## @@ -216,6 +216,16 @@ class ARROW_DS_EXPORT FileSystemDatasetFactory : public DatasetFactory {

[GitHub] [arrow] nealrichardson closed pull request #7412: ARROW-9112: [R] Update autobrew script location

2020-06-12 Thread GitBox
nealrichardson closed pull request #7412: URL: https://github.com/apache/arrow/pull/7412 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [arrow] kszucs opened a new pull request #7417: ARROW-9079: [C++] Write benchmark for arithmetic kernels

2020-06-12 Thread GitBox
kszucs opened a new pull request #7417: URL: https://github.com/apache/arrow/pull/7417 Quickly wanted to add a benchmark for the `Add` function to verify that no significant regressions were introduced by https://github.com/apache/arrow/pull/7341 Before: ```

[GitHub] [arrow] wesm commented on pull request #7417: ARROW-9079: [C++] Write benchmark for arithmetic kernels

2020-06-12 Thread GitBox
wesm commented on pull request #7417: URL: https://github.com/apache/arrow/pull/7417#issuecomment-643266987 Thanks for working on this. I'll check the benchmarks on MSVC also This is an automated message from the Apache Git

[GitHub] [arrow] wesm commented on a change in pull request #7417: ARROW-9079: [C++] Write benchmark for arithmetic kernels

2020-06-12 Thread GitBox
wesm commented on a change in pull request #7417: URL: https://github.com/apache/arrow/pull/7417#discussion_r439431785 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic_benchmark.cc ## @@ -0,0 +1,92 @@ +// Licensed to the Apache Software Foundation (ASF) under one

[GitHub] [arrow] bkietz commented on a change in pull request #7410: ARROW-971: [C++][Compute] IsValid, IsNull kernels

2020-06-12 Thread GitBox
bkietz commented on a change in pull request #7410: URL: https://github.com/apache/arrow/pull/7410#discussion_r439442386 ## File path: cpp/src/arrow/compute/kernels/test_util.h ## @@ -33,7 +33,9 @@ #include "arrow/testing/random.h" #include "arrow/testing/util.h" #include

[GitHub] [arrow] bkietz commented on a change in pull request #7410: ARROW-971: [C++][Compute] IsValid, IsNull kernels

2020-06-12 Thread GitBox
bkietz commented on a change in pull request #7410: URL: https://github.com/apache/arrow/pull/7410#discussion_r439442116 ## File path: cpp/src/arrow/util/iterator_test.cc ## @@ -27,6 +27,9 @@ #include #include +#include +#include + Review comment: ```suggestion

[GitHub] [arrow] github-actions[bot] commented on pull request #7420: ARROW-9022: [C++] Add/Sub/Mul arithmetic kernels with overflow check

2020-06-12 Thread GitBox
github-actions[bot] commented on pull request #7420: URL: https://github.com/apache/arrow/pull/7420#issuecomment-643329115 https://issues.apache.org/jira/browse/ARROW-9022 This is an automated message from the Apache Git

[GitHub] [arrow] wesm opened a new pull request #7418: ARROW-9115: [C++] Implementation of ascii_lower/ascii_upper by processing input data buffers in batch

2020-06-12 Thread GitBox
wesm opened a new pull request #7418: URL: https://github.com/apache/arrow/pull/7418 Following on discussion in #7357. I added a simple benchmark also. ``` -- Benchmark Time CPU Iterations

[GitHub] [arrow] wesm closed pull request #7418: ARROW-9115: [C++] Implementation of ascii_lower/ascii_upper by processing input data buffers in batch

2020-06-12 Thread GitBox
wesm closed pull request #7418: URL: https://github.com/apache/arrow/pull/7418 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] github-actions[bot] commented on pull request #7419: ARROW-8510: [C++][Datasets] Do not use variant in WritePlan to fix compiler error with VS 2017

2020-06-12 Thread GitBox
github-actions[bot] commented on pull request #7419: URL: https://github.com/apache/arrow/pull/7419#issuecomment-643311125 https://issues.apache.org/jira/browse/ARROW-8510 This is an automated message from the Apache Git

[GitHub] [arrow] fsaintjacques commented on pull request #7419: ARROW-8510: [C++][Datasets] Do not use variant in WritePlan to fix compiler error with VS 2017

2020-06-12 Thread GitBox
fsaintjacques commented on pull request #7419: URL: https://github.com/apache/arrow/pull/7419#issuecomment-643311800 I also agree with the sentiment on variant, I noted this [here](https://github.com/apache/arrow/pull/7156#discussion_r435921179) (which touches the same code path).

[GitHub] [arrow] pitrou commented on a change in pull request #7378: ARROW-8866: [C++] Split UNION into SPARSE_UNION and DENSE_UNION

2020-06-12 Thread GitBox
pitrou commented on a change in pull request #7378: URL: https://github.com/apache/arrow/pull/7378#discussion_r439478696 ## File path: cpp/src/arrow/type_fwd.h ## @@ -351,39 +358,21 @@ std::shared_ptr ARROW_EXPORT time64(TimeUnit::type unit); std::shared_ptr ARROW_EXPORT

[GitHub] [arrow] wesm commented on pull request #7418: ARROW-9115: [C++] Implementation of ascii_lower/ascii_upper by processing input data buffers in batch

2020-06-12 Thread GitBox
wesm commented on pull request #7418: URL: https://github.com/apache/arrow/pull/7418#issuecomment-643263350 cc @maartenbreddels @pitrou This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [arrow] nevi-me closed pull request #7384: ARROW-9082: [Rust] - Stream reader fail when steam not ended with (opt…

2020-06-12 Thread GitBox
nevi-me closed pull request #7384: URL: https://github.com/apache/arrow/pull/7384 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] pitrou commented on a change in pull request #7418: ARROW-9115: [C++] Implementation of ascii_lower/ascii_upper by processing input data buffers in batch

2020-06-12 Thread GitBox
pitrou commented on a change in pull request #7418: URL: https://github.com/apache/arrow/pull/7418#discussion_r439416979 ## File path: cpp/src/arrow/compute/kernels/scalar_string_benchmark.cc ## @@ -0,0 +1,58 @@ +// Licensed to the Apache Software Foundation (ASF) under one

[GitHub] [arrow] kszucs commented on pull request #6512: ARROW-8430: [CI] Configure self-hosted runners for Github Actions

2020-06-12 Thread GitBox
kszucs commented on pull request #6512: URL: https://github.com/apache/arrow/pull/6512#issuecomment-643280068 @kou I'm afraid that CMAKE_BUILD_PARALLEL_LEVEL doesn't propagate well with ninja, so it causes OOM during compilation in the arm64v8 build.

[GitHub] [arrow] bkietz commented on a change in pull request #7410: ARROW-971: [C++][Compute] IsValid, IsNull kernels

2020-06-12 Thread GitBox
bkietz commented on a change in pull request #7410: URL: https://github.com/apache/arrow/pull/7410#discussion_r439440106 ## File path: cpp/src/arrow/compute/exec.cc ## @@ -20,6 +20,7 @@ #include #include #include +#include Review comment: ```suggestion ```

[GitHub] [arrow] wesm commented on a change in pull request #7418: ARROW-9115: [C++] Implementation of ascii_lower/ascii_upper by processing input data buffers in batch

2020-06-12 Thread GitBox
wesm commented on a change in pull request #7418: URL: https://github.com/apache/arrow/pull/7418#discussion_r439451091 ## File path: cpp/src/arrow/compute/kernels/scalar_string.cc ## @@ -37,26 +39,108 @@ struct AsciiLength { } }; -struct AsciiUpper { - // XXX: the

[GitHub] [arrow] kszucs commented on a change in pull request #6512: ARROW-8430: [CI] Configure self-hosted runners for Github Actions [WIP][skip ci]

2020-06-12 Thread GitBox
kszucs commented on a change in pull request #6512: URL: https://github.com/apache/arrow/pull/6512#discussion_r43940 ## File path: .github/workflows/cpp_cron.yml ## @@ -95,12 +95,13 @@ jobs: - name: Setup Python uses: actions/setup-python@v1 with:

[GitHub] [arrow] wesm commented on a change in pull request #7418: ARROW-9115: [C++] Implementation of ascii_lower/ascii_upper by processing input data buffers in batch

2020-06-12 Thread GitBox
wesm commented on a change in pull request #7418: URL: https://github.com/apache/arrow/pull/7418#discussion_r439418114 ## File path: cpp/src/arrow/compute/kernels/scalar_string.cc ## @@ -37,26 +37,108 @@ struct AsciiLength { } }; -struct AsciiUpper { - // XXX: the

[GitHub] [arrow] cyb70289 commented on a change in pull request #7418: ARROW-9115: [C++] Implementation of ascii_lower/ascii_upper by processing input data buffers in batch

2020-06-12 Thread GitBox
cyb70289 commented on a change in pull request #7418: URL: https://github.com/apache/arrow/pull/7418#discussion_r439448175 ## File path: cpp/src/arrow/compute/kernels/scalar_string.cc ## @@ -37,26 +39,108 @@ struct AsciiLength { } }; -struct AsciiUpper { - // XXX: the

[GitHub] [arrow] emkornfield commented on pull request #7290: ARROW-1692: [Java] UnionArray round trip not working

2020-06-12 Thread GitBox
emkornfield commented on pull request #7290: URL: https://github.com/apache/arrow/pull/7290#issuecomment-643320990 @jacques-n I think you had some concerns the last time sparse unions were made. Do these changes raise the same issues?

[GitHub] [arrow] pitrou commented on a change in pull request #7378: ARROW-8866: [C++] Split UNION into SPARSE_UNION and DENSE_UNION

2020-06-12 Thread GitBox
pitrou commented on a change in pull request #7378: URL: https://github.com/apache/arrow/pull/7378#discussion_r439480534 ## File path: cpp/src/arrow/array/array_primitive.h ## @@ -108,11 +108,11 @@ class ARROW_EXPORT DayTimeIntervalArray : public PrimitiveArray { // For

[GitHub] [arrow] pitrou commented on a change in pull request #7378: ARROW-8866: [C++] Split UNION into SPARSE_UNION and DENSE_UNION

2020-06-12 Thread GitBox
pitrou commented on a change in pull request #7378: URL: https://github.com/apache/arrow/pull/7378#discussion_r439480534 ## File path: cpp/src/arrow/array/array_primitive.h ## @@ -108,11 +108,11 @@ class ARROW_EXPORT DayTimeIntervalArray : public PrimitiveArray { // For

[GitHub] [arrow] liyafan82 commented on a change in pull request #7275: ARROW-6110: [Java][Integration] Support LargeList Type and add integration test with C++

2020-06-12 Thread GitBox
liyafan82 commented on a change in pull request #7275: URL: https://github.com/apache/arrow/pull/7275#discussion_r439375810 ## File path: java/vector/src/main/java/org/apache/arrow/vector/BitVectorHelper.java ## @@ -73,6 +87,28 @@ public static void setBit(ArrowBuf

[GitHub] [arrow] bkietz commented on a change in pull request #7378: ARROW-8866: [C++] Split UNION into SPARSE_UNION and DENSE_UNION

2020-06-12 Thread GitBox
bkietz commented on a change in pull request #7378: URL: https://github.com/apache/arrow/pull/7378#discussion_r439478099 ## File path: cpp/src/arrow/type_fwd.h ## @@ -351,39 +358,21 @@ std::shared_ptr ARROW_EXPORT time64(TimeUnit::type unit); std::shared_ptr ARROW_EXPORT

[GitHub] [arrow] bkietz commented on a change in pull request #7378: ARROW-8866: [C++] Split UNION into SPARSE_UNION and DENSE_UNION

2020-06-12 Thread GitBox
bkietz commented on a change in pull request #7378: URL: https://github.com/apache/arrow/pull/7378#discussion_r439477647 ## File path: cpp/src/arrow/type.h ## @@ -1015,25 +1018,12 @@ class ARROW_EXPORT Decimal128Type : public DecimalType { /// \brief Concrete type class for

[GitHub] [arrow] kszucs commented on a change in pull request #7417: ARROW-9079: [C++] Write benchmark for arithmetic kernels

2020-06-12 Thread GitBox
kszucs commented on a change in pull request #7417: URL: https://github.com/apache/arrow/pull/7417#discussion_r439478018 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic_benchmark.cc ## @@ -0,0 +1,92 @@ +// Licensed to the Apache Software Foundation (ASF) under

[GitHub] [arrow] github-actions[bot] commented on pull request #7416: ARROW-9113: [Archery] Fix exception causes in cli.py

2020-06-12 Thread GitBox
github-actions[bot] commented on pull request #7416: URL: https://github.com/apache/arrow/pull/7416#issuecomment-643206492 https://issues.apache.org/jira/browse/ARROW-9113 This is an automated message from the Apache Git

[GitHub] [arrow] github-actions[bot] commented on pull request #7417: ARROW-9079: [C++] Write benchmark for arithmetic kernels

2020-06-12 Thread GitBox
github-actions[bot] commented on pull request #7417: URL: https://github.com/apache/arrow/pull/7417#issuecomment-643228149 https://issues.apache.org/jira/browse/ARROW-9079 This is an automated message from the Apache Git

[GitHub] [arrow] kszucs commented on a change in pull request #7417: ARROW-9079: [C++] Write benchmark for arithmetic kernels

2020-06-12 Thread GitBox
kszucs commented on a change in pull request #7417: URL: https://github.com/apache/arrow/pull/7417#discussion_r439371127 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic_benchmark.cc ## @@ -0,0 +1,92 @@ +// Licensed to the Apache Software Foundation (ASF) under

[GitHub] [arrow] kszucs commented on a change in pull request #7417: ARROW-9079: [C++] Write benchmark for arithmetic kernels

2020-06-12 Thread GitBox
kszucs commented on a change in pull request #7417: URL: https://github.com/apache/arrow/pull/7417#discussion_r439371127 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic_benchmark.cc ## @@ -0,0 +1,92 @@ +// Licensed to the Apache Software Foundation (ASF) under

[GitHub] [arrow] liyafan82 commented on a change in pull request #7275: ARROW-6110: [Java][Integration] Support LargeList Type and add integration test with C++

2020-06-12 Thread GitBox
liyafan82 commented on a change in pull request #7275: URL: https://github.com/apache/arrow/pull/7275#discussion_r439376364 ## File path: java/vector/src/main/java/org/apache/arrow/vector/compare/RangeEqualsVisitor.java ## @@ -494,4 +505,46 @@ protected boolean

[GitHub] [arrow] liyafan82 commented on a change in pull request #7275: ARROW-6110: [Java][Integration] Support LargeList Type and add integration test with C++

2020-06-12 Thread GitBox
liyafan82 commented on a change in pull request #7275: URL: https://github.com/apache/arrow/pull/7275#discussion_r439398445 ## File path: java/vector/src/main/java/org/apache/arrow/vector/complex/LargeListVector.java ## @@ -0,0 +1,991 @@ +/* + * Licensed to the Apache

[GitHub] [arrow] pitrou commented on pull request #7418: ARROW-9115: [C++] Implementation of ascii_lower/ascii_upper by processing input data buffers in batch

2020-06-12 Thread GitBox
pitrou commented on pull request #7418: URL: https://github.com/apache/arrow/pull/7418#issuecomment-643273901 Before: ``` AsciiLower 76218768 ns 76206752 ns 28 bytes_per_second=207.921M/s items_per_second=13.7596M/s AsciiUpper 83254436 ns 83232143 ns

[GitHub] [arrow] romainfrancois commented on pull request #7415: ARROW-7028: [R] Date roundtrip results in different R storage mode

2020-06-12 Thread GitBox
romainfrancois commented on pull request #7415: URL: https://github.com/apache/arrow/pull/7415#issuecomment-643274152 Thanks, I had not checked `funs::if_else()`. Yeah, a Date32 arrow gets converted to a `Date` no matter what, that means data copy as arrow backs it with a int vector

[GitHub] [arrow] kszucs commented on a change in pull request #7417: ARROW-9079: [C++] Write benchmark for arithmetic kernels

2020-06-12 Thread GitBox
kszucs commented on a change in pull request #7417: URL: https://github.com/apache/arrow/pull/7417#discussion_r439371127 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic_benchmark.cc ## @@ -0,0 +1,92 @@ +// Licensed to the Apache Software Foundation (ASF) under

[GitHub] [arrow] wesm opened a new pull request #7419: ARROW-8510: [C++][Datasets] Do not use variant in WritePlan to fix compiler error with VS 2017

2020-06-12 Thread GitBox
wesm opened a new pull request #7419: URL: https://github.com/apache/arrow/pull/7419 Even without the compiler error on VS 2017 I am not a fan of using variant to solve problems like these (so to me this new code is strictly preferable to the old code), nor exposing a variant in public

[GitHub] [arrow] kszucs opened a new pull request #7420: ARROW-9022: [C++] Add/Sub/Mul arithmetic kernels with overflow check

2020-06-12 Thread GitBox
kszucs opened a new pull request #7420: URL: https://github.com/apache/arrow/pull/7420 Quick draft for checked arithmetics. TODOs: - more portable overflow checks - consolidate the tests - add arithmetics options to let the user choose which variant to run (so remove the

[GitHub] [arrow] wesm closed pull request #7419: ARROW-8510: [C++][Datasets] Do not use variant in WritePlan to fix compiler error with VS 2017

2020-06-12 Thread GitBox
wesm closed pull request #7419: URL: https://github.com/apache/arrow/pull/7419 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] wesm closed pull request #7407: ARROW-9101: [Doc][C++] Document encoding expected for CSV data

2020-06-12 Thread GitBox
wesm closed pull request #7407: URL: https://github.com/apache/arrow/pull/7407 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] bkietz commented on pull request #7378: ARROW-8866: [C++] Split UNION into SPARSE_UNION and DENSE_UNION

2020-06-12 Thread GitBox
bkietz commented on pull request #7378: URL: https://github.com/apache/arrow/pull/7378#issuecomment-643386870 @wesm thanks, I've been trying to get those to pass locally. This is an automated message from the Apache Git

[GitHub] [arrow] github-actions[bot] commented on pull request #7421: ARROW-9030: [Python] Remove pyarrow/compat.py, move some oft-used utility functions to pyarrow.lib

2020-06-12 Thread GitBox
github-actions[bot] commented on pull request #7421: URL: https://github.com/apache/arrow/pull/7421#issuecomment-643424778 https://issues.apache.org/jira/browse/ARROW-9030 This is an automated message from the Apache Git

[GitHub] [arrow] nealrichardson commented on pull request #7415: ARROW-7028: [R] Date roundtrip results in different R storage mode

2020-06-12 Thread GitBox
nealrichardson commented on pull request #7415: URL: https://github.com/apache/arrow/pull/7415#issuecomment-643332789 Thanks y'all for looking into this. Given where we are right now, this PR is probably an improvement on the status quo. If we had benchmarks that the conversion to Date

[GitHub] [arrow] wesm commented on pull request #7414: ARROW-6645: [Python] Use common boundschecking function for checking dictionary indices when converting to pandas

2020-06-12 Thread GitBox
wesm commented on pull request #7414: URL: https://github.com/apache/arrow/pull/7414#issuecomment-643370021 +1 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

[GitHub] [arrow] wesm closed pull request #7414: ARROW-6645: [Python] Use common boundschecking function for checking dictionary indices when converting to pandas

2020-06-12 Thread GitBox
wesm closed pull request #7414: URL: https://github.com/apache/arrow/pull/7414 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] wesm commented on pull request #7378: ARROW-8866: [C++] Split UNION into SPARSE_UNION and DENSE_UNION

2020-06-12 Thread GitBox
wesm commented on pull request #7378: URL: https://github.com/apache/arrow/pull/7378#issuecomment-643413102 thanks @bkietz! This is an automated message from the Apache Git Service. To respond to the message, please log on

[GitHub] [arrow] wesm closed pull request #7378: ARROW-8866: [C++] Split UNION into SPARSE_UNION and DENSE_UNION

2020-06-12 Thread GitBox
wesm closed pull request #7378: URL: https://github.com/apache/arrow/pull/7378 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] wesm commented on pull request #6806: ARROW-8312: [Java][Gandiva] support TreeNode in IN expression

2020-06-12 Thread GitBox
wesm commented on pull request #6806: URL: https://github.com/apache/arrow/pull/6806#issuecomment-643416750 Cool. @zhouyuan could you rebase? Thank you This is an automated message from the Apache Git Service. To respond to

[GitHub] [arrow] wesm closed pull request #7337: WIP: ARROW-9030: [Python] Clean up some usages of pyarrow.compat module

2020-06-12 Thread GitBox
wesm closed pull request #7337: URL: https://github.com/apache/arrow/pull/7337 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] wesm commented on pull request #7337: WIP: ARROW-9030: [Python] Clean up some usages of pyarrow.compat module

2020-06-12 Thread GitBox
wesm commented on pull request #7337: URL: https://github.com/apache/arrow/pull/7337#issuecomment-643417028 I'll open a new PR for these changes This is an automated message from the Apache Git Service. To respond to the

[GitHub] [arrow] bkietz commented on a change in pull request #7417: ARROW-9079: [C++] Write benchmark for arithmetic kernels

2020-06-12 Thread GitBox
bkietz commented on a change in pull request #7417: URL: https://github.com/apache/arrow/pull/7417#discussion_r439583288 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic_benchmark.cc ## @@ -0,0 +1,92 @@ +// Licensed to the Apache Software Foundation (ASF) under

[GitHub] [arrow] cool-RR commented on pull request #7416: ARROW-9113: [Archery] Fix exception causes in cli.py

2020-06-12 Thread GitBox
cool-RR commented on pull request #7416: URL: https://github.com/apache/arrow/pull/7416#issuecomment-643355049 I understand. I personally hate `from None`, and I think that when people are debugging they have a very different mindset than when they are writing code. When you're

[GitHub] [arrow] cool-RR closed pull request #7416: ARROW-9113: [Archery] Fix exception causes in cli.py

2020-06-12 Thread GitBox
cool-RR closed pull request #7416: URL: https://github.com/apache/arrow/pull/7416 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] wesm commented on a change in pull request #7420: ARROW-9022: [C++] Add/Sub/Mul arithmetic kernels with overflow check

2020-06-12 Thread GitBox
wesm commented on a change in pull request #7420: URL: https://github.com/apache/arrow/pull/7420#discussion_r439523904 ## File path: cpp/src/arrow/compute/api_scalar.h ## @@ -45,6 +45,12 @@ namespace compute { ARROW_EXPORT Result Add(const Datum& left, const Datum& right,

[GitHub] [arrow] wesm closed pull request #7408: ARROW-9110: [C++] Fix CPU cache size detection on macOS

2020-06-12 Thread GitBox
wesm closed pull request #7408: URL: https://github.com/apache/arrow/pull/7408 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] wesm commented on a change in pull request #7378: ARROW-8866: [C++] Split UNION into SPARSE_UNION and DENSE_UNION

2020-06-12 Thread GitBox
wesm commented on a change in pull request #7378: URL: https://github.com/apache/arrow/pull/7378#discussion_r439539160 ## File path: cpp/src/arrow/array/array_primitive.h ## @@ -108,11 +108,11 @@ class ARROW_EXPORT DayTimeIntervalArray : public PrimitiveArray { // For

[GitHub] [arrow] wesm commented on a change in pull request #7378: ARROW-8866: [C++] Split UNION into SPARSE_UNION and DENSE_UNION

2020-06-12 Thread GitBox
wesm commented on a change in pull request #7378: URL: https://github.com/apache/arrow/pull/7378#discussion_r439539505 ## File path: cpp/src/arrow/type_fwd.h ## @@ -351,39 +358,21 @@ std::shared_ptr ARROW_EXPORT time64(TimeUnit::type unit); std::shared_ptr ARROW_EXPORT

[GitHub] [arrow] wesm closed pull request #7394: Added trim function

2020-06-12 Thread GitBox
wesm closed pull request #7394: URL: https://github.com/apache/arrow/pull/7394 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] wesm commented on pull request #7394: Added trim function

2020-06-12 Thread GitBox
wesm commented on pull request #7394: URL: https://github.com/apache/arrow/pull/7394#issuecomment-643376415 superseded by #7402 This is an automated message from the Apache Git Service. To respond to the message, please log

[GitHub] [arrow] wesm commented on pull request #7378: ARROW-8866: [C++] Split UNION into SPARSE_UNION and DENSE_UNION

2020-06-12 Thread GitBox
wesm commented on pull request #7378: URL: https://github.com/apache/arrow/pull/7378#issuecomment-643390508 Sorry about the noise I think I've got it now. We might need to create an "arrow-deprecated-test" at some point where we verify that deprecated APIs still work as advertised

[GitHub] [arrow] houqp commented on a change in pull request #7324: ARROW-9005: [Rust] [Datafusion] support sort expression

2020-06-12 Thread GitBox
houqp commented on a change in pull request #7324: URL: https://github.com/apache/arrow/pull/7324#discussion_r439567609 ## File path: rust/datafusion/src/execution/physical_plan/sort.rs ## @@ -0,0 +1,211 @@ +// Licensed to the Apache Software Foundation (ASF) under one +// or

[GitHub] [arrow] wesm commented on pull request #7378: ARROW-8866: [C++] Split UNION into SPARSE_UNION and DENSE_UNION

2020-06-12 Thread GitBox
wesm commented on pull request #7378: URL: https://github.com/apache/arrow/pull/7378#issuecomment-643412670 Appveyor was passing two commits ago: https://ci.appveyor.com/project/BenjaminKietzman/arrow/builds/33489614. The Ursabot CI failure looks transient. I'll go ahead and merge this so

[GitHub] [arrow] bkietz commented on a change in pull request #7417: ARROW-9079: [C++] Write benchmark for arithmetic kernels

2020-06-12 Thread GitBox
bkietz commented on a change in pull request #7417: URL: https://github.com/apache/arrow/pull/7417#discussion_r439577349 ## File path: cpp/src/arrow/compute/kernels/scalar_arithmetic_benchmark.cc ## @@ -0,0 +1,92 @@ +// Licensed to the Apache Software Foundation (ASF) under

[GitHub] [arrow] nealrichardson closed pull request #7411: ARROW-8826: [Crossbow] remote URL should always have .git

2020-06-12 Thread GitBox
nealrichardson closed pull request #7411: URL: https://github.com/apache/arrow/pull/7411 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [arrow] bkietz commented on a change in pull request #7156: ARROW-8074: [C++][Dataset][Python] FileFragments from buffers and NativeFiles

2020-06-12 Thread GitBox
bkietz commented on a change in pull request #7156: URL: https://github.com/apache/arrow/pull/7156#discussion_r439514082 ## File path: cpp/src/arrow/util/checked_cast.h ## @@ -39,11 +40,11 @@ inline OutputType checked_cast(InputType&& value) { } template -std::shared_ptr

[GitHub] [arrow] wesm closed pull request #7409: ARROW-8649: [Java] [Website] Java documentation on website is hidden

2020-06-12 Thread GitBox
wesm closed pull request #7409: URL: https://github.com/apache/arrow/pull/7409 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] pitrou commented on pull request #7416: ARROW-9113: [Archery] Fix exception causes in cli.py

2020-06-12 Thread GitBox
pitrou commented on pull request #7416: URL: https://github.com/apache/arrow/pull/7416#issuecomment-643378995 To be honest, I think those are all fine. There isn't much to be gained by forcing ourselves to spell out `raise from ...` explicitly, and we'll inevitably overlook some possible

[GitHub] [arrow] wesm opened a new pull request #7421: ARROW-9030: [Python] Remove pyarrow/compat.py, move some oft-used utility functions to pyarrow.lib

2020-06-12 Thread GitBox
wesm opened a new pull request #7421: URL: https://github.com/apache/arrow/pull/7421 This is some routine scrubbing that deals with some remnants of the prior dual py2/3 codebase. This is an automated message from the

  1   2   >