[GitHub] [arrow] kszucs closed pull request #7855: ARROW-9592: [CI] Update homebrew before calling brew bundle

2020-07-29 Thread GitBox
kszucs closed pull request #7855: URL: https://github.com/apache/arrow/pull/7855 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] kszucs commented on pull request #7820: ARROW-9068: [C++][Dataset] Simplify partitioning interface

2020-07-29 Thread GitBox
kszucs commented on pull request #7820: URL: https://github.com/apache/arrow/pull/7820#issuecomment-665629390 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

[GitHub] [arrow] jimhester commented on a change in pull request #7819: ARROW-9405: [R] Switch to cpp11

2020-07-29 Thread GitBox
jimhester commented on a change in pull request #7819: URL: https://github.com/apache/arrow/pull/7819#discussion_r462254382 ## File path: r/tests/testthat/test-Array.R ## @@ -131,14 +131,14 @@ test_that("Slice() and RangeEquals()", { expect_true(x$RangeEquals(z, 10, 15, 0))

[GitHub] [arrow] wesm commented on issue #2077: pyarrow crash when closing large parquet file

2020-07-29 Thread GitBox
wesm commented on issue #2077: URL: https://github.com/apache/arrow/issues/2077#issuecomment-665733619 Let's discuss on the JIRA issue or on the Arrow mailing list. This is an automated message from the Apache Git Service.

[GitHub] [arrow] kszucs commented on pull request #7852: ARROW-9394: [Python] Support pickling of Scalars

2020-07-29 Thread GitBox
kszucs commented on pull request #7852: URL: https://github.com/apache/arrow/pull/7852#issuecomment-665602816 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

[GitHub] [arrow] kszucs closed pull request #7851: ARROW-9286: [C++] Add function "aliases" to compute::FunctionRegistry

2020-07-29 Thread GitBox
kszucs closed pull request #7851: URL: https://github.com/apache/arrow/pull/7851 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] GPSnoopy commented on pull request #7818: ARROW-9541: [C++] CMakeLists requires UTF8PROC_STATIC when building static library

2020-07-29 Thread GitBox
GPSnoopy commented on pull request #7818: URL: https://github.com/apache/arrow/pull/7818#issuecomment-665668482 @kou I think the latest commits should address your comments. The CI seems to fail on `Python / AMD64 CentOS 5.11 Python 3.6 manylinux1` by failing to find utf8proc, but I'm not

[GitHub] [arrow] kszucs commented on pull request #7855: ARROW-9592: [CI] Update homebrew before calling brew bundle

2020-07-29 Thread GitBox
kszucs commented on pull request #7855: URL: https://github.com/apache/arrow/pull/7855#issuecomment-665565518 The build failure is unrelated, merging. This is an automated message from the Apache Git Service. To respond to

[GitHub] [arrow] kszucs closed pull request #7852: ARROW-9394: [Python] Support pickling of Scalars

2020-07-29 Thread GitBox
kszucs closed pull request #7852: URL: https://github.com/apache/arrow/pull/7852 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] patrickpai commented on pull request #7789: PARQUET-1878: [C++] lz4 codec is not compatible with Hadoop Lz4Codec

2020-07-29 Thread GitBox
patrickpai commented on pull request #7789: URL: https://github.com/apache/arrow/pull/7789#issuecomment-665793225 I triggered a new build but it's now failing one check. I think this is the relevant error, but it seems unrelated to my changes. Would appreciate any thoughts on what the

[GitHub] [arrow] kszucs closed pull request #7820: ARROW-9068: [C++][Dataset] Simplify partitioning interface

2020-07-29 Thread GitBox
kszucs closed pull request #7820: URL: https://github.com/apache/arrow/pull/7820 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] tobim commented on a change in pull request #7842: ARROW-9412: [C++] Add non-bundled dependencies to INTERFACE_LINK_LIBRARIES of static libarrow

2020-07-29 Thread GitBox
tobim commented on a change in pull request #7842: URL: https://github.com/apache/arrow/pull/7842#discussion_r462285704 ## File path: cpp/cmake_modules/FindLz4.cmake ## @@ -24,13 +24,14 @@ set(LZ4_STATIC_LIB_SUFFIX "${LZ4_MSVC_STATIC_LIB_SUFFIX}${CMAKE_STATIC_LIBRARY_S

[GitHub] [arrow] kszucs closed pull request #7758: ARROW-9469: [Python] Make more objects weakrefable

2020-07-29 Thread GitBox
kszucs closed pull request #7758: URL: https://github.com/apache/arrow/pull/7758 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] MilenkoM commented on issue #2077: pyarrow crash when closing large parquet file

2020-07-29 Thread GitBox
MilenkoM commented on issue #2077: URL: https://github.com/apache/arrow/issues/2077#issuecomment-665604656 I have smilar problem. I am trying to write 447 json files(more than 3M rows) to Parquet file. For smaller number works OK. But it seems that still exists these memory limitation.

[GitHub] [arrow] kou commented on pull request #7838: [Release] Bump next snapshot versions to 2.0.0

2020-07-29 Thread GitBox
kou commented on pull request #7838: URL: https://github.com/apache/arrow/pull/7838#issuecomment-664704529 @github-actions crossbow submit -g linux This is an automated message from the Apache Git Service. To respond to the

[GitHub] [arrow] github-actions[bot] commented on pull request #7842: ARROW-9412: [C++] Add non-bundled dependencies to INTERFACE_LINK_LIBRARIES of static libarrow

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7842: URL: https://github.com/apache/arrow/pull/7842#issuecomment-664661594 https://issues.apache.org/jira/browse/ARROW-9412 This is an automated message from the Apache Git

[GitHub] [arrow] Plebejer commented on a change in pull request #7831: ARROW-9544: [R] Fix version argument of write_parquet()

2020-07-29 Thread GitBox
Plebejer commented on a change in pull request #7831: URL: https://github.com/apache/arrow/pull/7831#discussion_r461341555 ## File path: r/NEWS.md ## @@ -50,6 +50,7 @@ * Non-UTF-8 strings (common on Windows) are correctly coerced to UTF-8 when passing to Arrow memory and

[GitHub] [arrow] nealrichardson commented on issue #7857: Underscores at beginning of directory names create problems for open_dataset function

2020-07-29 Thread GitBox
nealrichardson commented on issue #7857: URL: https://github.com/apache/arrow/issues/7857#issuecomment-665256697 Thanks. Please open a JIRA and we can discuss there. This is an automated message from the Apache Git Service.

[GitHub] [arrow] andygrove commented on pull request #7853: ARROW-9582: [Rust] Add memory_size() method to Array [DRAFT]

2020-07-29 Thread GitBox
andygrove commented on pull request #7853: URL: https://github.com/apache/arrow/pull/7853#issuecomment-665038339 @nevi-me @paddyhoran @sunchao does this general approach seem ok to you? This is an automated message from the

[GitHub] [arrow] github-actions[bot] commented on pull request #7855: [CI] Attempt to fix recently failing macOS builds

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7855: URL: https://github.com/apache/arrow/pull/7855#issuecomment-665110206 Thanks for opening a pull request! Could you open an issue for this pull request on JIRA? https://issues.apache.org/jira/browse/ARROW Then

[GitHub] [arrow] kszucs commented on a change in pull request #7803: ARROW-9517: [C++/Python] Add optional session_token to S3Options

2020-07-29 Thread GitBox
kszucs commented on a change in pull request #7803: URL: https://github.com/apache/arrow/pull/7803#discussion_r461493406 ## File path: cpp/src/arrow/filesystem/s3fs.h ## @@ -62,10 +62,12 @@ struct ARROW_EXPORT S3Options { void ConfigureAnonymousCredentials(); ///

[GitHub] [arrow] andygrove opened a new pull request #7848: ARROW-9550: [Rust] [DataFusion] Remove use of Rc> in HashAggregate - DRAFT

2020-07-29 Thread GitBox
andygrove opened a new pull request #7848: URL: https://github.com/apache/arrow/pull/7848 So this actually made things marginally slower ... need to look at this some more tomorrow This is an automated message from the

[GitHub] [arrow] wesm closed issue #7850: Upgrading pyarrow 0.15 -> 1.0 -- deserializing fails

2020-07-29 Thread GitBox
wesm closed issue #7850: URL: https://github.com/apache/arrow/issues/7850 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] mpeleshenko opened a new issue #7843: HDFS client unable to load libjvm on Windows

2020-07-29 Thread GitBox
mpeleshenko opened a new issue #7843: URL: https://github.com/apache/arrow/issues/7843 I've been having trouble connecting to HDFS even with the 1.0.0 pyarrow build as I run into the below error when running: ``` pa.hdfs.connect(host="host", port=port, user="user",

[GitHub] [arrow] liyafan82 commented on a change in pull request #7837: ARROW-9554: [Java] FixedWidthInPlaceVectorSorter sometimes produces wrong result

2020-07-29 Thread GitBox
liyafan82 commented on a change in pull request #7837: URL: https://github.com/apache/arrow/pull/7837#discussion_r461405705 ## File path: java/algorithm/src/test/java/org/apache/arrow/algorithm/sort/TestFixedWidthInPlaceVectorSorter.java ## @@ -209,4 +212,29 @@ public void

[GitHub] [arrow] paddyhoran opened a new pull request #7859: ARROW-9583: [Rust] Offset is mishandled in arithmetic and boolean compute kernels

2020-07-29 Thread GitBox
paddyhoran opened a new pull request #7859: URL: https://github.com/apache/arrow/pull/7859 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [arrow] kou commented on a change in pull request #7818: ARROW-9541: [C++] CMakeLists requires UTF8PROC_STATIC when building static library

2020-07-29 Thread GitBox
kou commented on a change in pull request #7818: URL: https://github.com/apache/arrow/pull/7818#discussion_r461245013 ## File path: cpp/src/arrow/CMakeLists.txt ## @@ -490,6 +490,10 @@ if(ARROW_BUILD_STATIC AND WIN32) target_compile_definitions(arrow_static PUBLIC

[GitHub] [arrow] github-actions[bot] commented on pull request #7856: ARROW-9589: [C++/R] Forward declare structs as structs

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7856: URL: https://github.com/apache/arrow/pull/7856#issuecomment-665261049 https://issues.apache.org/jira/browse/ARROW-9589 This is an automated message from the Apache Git

[GitHub] [arrow] paddyhoran closed pull request #7859: ARROW-9583: [Rust] Offset is mishandled in arithmetic and boolean compute kernels

2020-07-29 Thread GitBox
paddyhoran closed pull request #7859: URL: https://github.com/apache/arrow/pull/7859 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [arrow] domoritz opened a new pull request #7844: [JS][Doc] Fix syntax error in example code

2020-07-29 Thread GitBox
domoritz opened a new pull request #7844: URL: https://github.com/apache/arrow/pull/7844 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [arrow] liyafan82 commented on pull request #7837: ARROW-9554: [Java] FixedWidthInPlaceVectorSorter sometimes produces wrong result

2020-07-29 Thread GitBox
liyafan82 commented on pull request #7837: URL: https://github.com/apache/arrow/pull/7837#issuecomment-664855122 > What do we think about adding tests that use randomly generated data? Sounds reasonable. I have added test cases for random data. Please check.

[GitHub] [arrow] github-actions[bot] commented on pull request #7846: ARROW-8445: [Gandiva][UDF] Add a udf for gandiva to extract all named groups.

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7846: URL: https://github.com/apache/arrow/pull/7846#issuecomment-664743773 https://issues.apache.org/jira/browse/ARROW-8445 This is an automated message from the Apache Git

[GitHub] [arrow] wesm commented on issue #7843: HDFS client unable to load libjvm on Windows

2020-07-29 Thread GitBox
wesm commented on issue #7843: URL: https://github.com/apache/arrow/issues/7843#issuecomment-664667146 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the

[GitHub] [arrow] github-actions[bot] commented on pull request #7859: ARROW-9583: [Rust] Offset is mishandled in arithmetic and boolean compute kernels

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7859: URL: https://github.com/apache/arrow/pull/7859#issuecomment-665371444 https://issues.apache.org/jira/browse/ARROW-9583 This is an automated message from the Apache Git

[GitHub] [arrow] GPSnoopy commented on a change in pull request #7818: ARROW-9541: [C++] CMakeLists requires UTF8PROC_STATIC when building static library

2020-07-29 Thread GitBox
GPSnoopy commented on a change in pull request #7818: URL: https://github.com/apache/arrow/pull/7818#discussion_r461441577 ## File path: cpp/src/arrow/CMakeLists.txt ## @@ -490,6 +490,10 @@ if(ARROW_BUILD_STATIC AND WIN32) target_compile_definitions(arrow_static PUBLIC

[GitHub] [arrow] wesm commented on issue #1336: pyarrow parquet and datasets with metadata files

2020-07-29 Thread GitBox
wesm commented on issue #1336: URL: https://github.com/apache/arrow/issues/1336#issuecomment-664635710 If you're hitting it with 1.0.0 please open a JIRA issue This is an automated message from the Apache Git Service. To

[GitHub] [arrow] kou commented on pull request #7842: ARROW-9412: [C++] Add non-bundled dependencies to INTERFACE_LINK_LIBRARIES of static libarrow

2020-07-29 Thread GitBox
kou commented on pull request #7842: URL: https://github.com/apache/arrow/pull/7842#issuecomment-665350733 Thanks. I can push the branch. I'm still working on this. Please wait for a while. This is an automated

[GitHub] [arrow] kszucs commented on pull request #7799: ARROW-7136: [Rust] Added caching to the docker image [WIP]

2020-07-29 Thread GitBox
kszucs commented on pull request #7799: URL: https://github.com/apache/arrow/pull/7799#issuecomment-665086095 Master - 19m 32s (including docker pull) No Cache - 13m 28s Download Cache - 13m 43s Download and Build Cache - 12m 20s Download and Build Cache on a Volume - 9m 8s

[GitHub] [arrow] kszucs opened a new pull request #7855: [CI] Attempt to fix recently failing macOS builds

2020-07-29 Thread GitBox
kszucs opened a new pull request #7855: URL: https://github.com/apache/arrow/pull/7855 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [arrow] paddyhoran commented on pull request #7859: ARROW-9583: [Rust] Offset is mishandled in arithmetic and boolean compute kernels

2020-07-29 Thread GitBox
paddyhoran commented on pull request #7859: URL: https://github.com/apache/arrow/pull/7859#issuecomment-665373190 Closing in favor of #7854 This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [arrow] nealrichardson opened a new pull request #7858: WIP Rework r-sanitizer tests to build more like CRAN

2020-07-29 Thread GitBox
nealrichardson opened a new pull request #7858: URL: https://github.com/apache/arrow/pull/7858 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL

[GitHub] [arrow] nealrichardson closed issue #7857: Underscores at beginning of directory names create problems for open_dataset function

2020-07-29 Thread GitBox
nealrichardson closed issue #7857: URL: https://github.com/apache/arrow/issues/7857 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] wesm commented on pull request #7819: ARROW-9405: [R] Switch to cpp11

2020-07-29 Thread GitBox
wesm commented on pull request #7819: URL: https://github.com/apache/arrow/pull/7819#issuecomment-665141432 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use

[GitHub] [arrow] andygrove commented on a change in pull request #7798: ARROW-9523 [Rust] Improve filter kernel performance

2020-07-29 Thread GitBox
andygrove commented on a change in pull request #7798: URL: https://github.com/apache/arrow/pull/7798#discussion_r461228050 ## File path: rust/arrow/src/compute/kernels/filter.rs ## @@ -17,139 +17,466 @@ //! Defines miscellaneous array kernels. -use std::sync::Arc; - use

[GitHub] [arrow] github-actions[bot] commented on pull request #7847: Remove duplicated to-do line

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7847: URL: https://github.com/apache/arrow/pull/7847#issuecomment-664749970 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [arrow] github-actions[bot] commented on pull request #7858: WIP Rework r-sanitizer tests to build more like CRAN

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7858: URL: https://github.com/apache/arrow/pull/7858#issuecomment-665287714 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [arrow] fsaintjacques commented on a change in pull request #7820: ARROW-9068: [C++][Dataset] Simplify partitioning interface

2020-07-29 Thread GitBox
fsaintjacques commented on a change in pull request #7820: URL: https://github.com/apache/arrow/pull/7820#discussion_r461724930 ## File path: cpp/src/arrow/dataset/partition.h ## @@ -59,21 +59,10 @@ class ARROW_DS_EXPORT Partitioning { /// \brief The name identifying the

[GitHub] [arrow] jhorstmann opened a new pull request #7854: ARROW-9583 Fix offsets in result of arithmetic kernels

2020-07-29 Thread GitBox
jhorstmann opened a new pull request #7854: URL: https://github.com/apache/arrow/pull/7854 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [arrow] corleyma commented on pull request #7803: ARROW-9517: [C++/Python] Add optional session_token to S3Options

2020-07-29 Thread GitBox
corleyma commented on pull request #7803: URL: https://github.com/apache/arrow/pull/7803#issuecomment-664598308 We're very excited about the 1.0.0 release so no apologies necessary! Out of curiosity, what is the likely timeline between merge of this PR and an official release

[GitHub] [arrow] github-actions[bot] commented on pull request #7854: ARROW-9583 Fix offsets in result of arithmetic kernels

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7854: URL: https://github.com/apache/arrow/pull/7854#issuecomment-665097255 https://issues.apache.org/jira/browse/ARROW-9583 This is an automated message from the Apache Git

[GitHub] [arrow] xhochy opened a new pull request #7856: ARROW-9589: [C++/R] Forward declare structs as structs

2020-07-29 Thread GitBox
xhochy opened a new pull request #7856: URL: https://github.com/apache/arrow/pull/7856 See also https://github.com/conda-forge/r-arrow-feedstock/pull/25 (yes, I'll also add the conda recipe to CI in the next days)

[GitHub] [arrow] jhirshman opened a new issue #7850: Upgrading 0.15 -> 1.0 -- deserializing fails

2020-07-29 Thread GitBox
jhirshman opened a new issue #7850: URL: https://github.com/apache/arrow/issues/7850 I have pandas dataframes that were serialized using pyarrow 0.15.1. I am now trying to upgrade to 1.0. When trying to read the old files with 1.0, I get the following error messages: -

[GitHub] [arrow] github-actions[bot] commented on pull request #7841: ARROW-9570: [Doc] Clean up sphinx sidebar

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7841: URL: https://github.com/apache/arrow/pull/7841#issuecomment-664583285 https://issues.apache.org/jira/browse/ARROW-9570 This is an automated message from the Apache Git

[GitHub] [arrow] paddyhoran commented on pull request #7853: ARROW-9582: [Rust] Add memory_size() method to Array [DRAFT]

2020-07-29 Thread GitBox
paddyhoran commented on pull request #7853: URL: https://github.com/apache/arrow/pull/7853#issuecomment-665190021 Looks fine to me. Is it worth having `memory_used` (`len`) as well as `memory_size` (`capacity`)? This is an

[GitHub] [arrow] github-actions[bot] commented on pull request #7851: ARROW-9286: [C++] Add function "aliases" to compute::FunctionRegistry

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7851: URL: https://github.com/apache/arrow/pull/7851#issuecomment-665019241 https://issues.apache.org/jira/browse/ARROW-9286 This is an automated message from the Apache Git

[GitHub] [arrow] bkietz commented on a change in pull request #7820: ARROW-9068: [C++][Dataset] Simplify partitioning interface

2020-07-29 Thread GitBox
bkietz commented on a change in pull request #7820: URL: https://github.com/apache/arrow/pull/7820#discussion_r461736303 ## File path: cpp/src/arrow/dataset/partition.h ## @@ -59,21 +59,10 @@ class ARROW_DS_EXPORT Partitioning { /// \brief The name identifying the kind of

[GitHub] [arrow] tobim closed pull request #7813: ARROW-9412: [C++] Add non-bundled dependencies to INTERFACE_LINK_LIBRARIES of static libarrow

2020-07-29 Thread GitBox
tobim closed pull request #7813: URL: https://github.com/apache/arrow/pull/7813 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] ZMZ91 opened a new pull request #7846: ARROW-8445: [Gandiva][UDF] Add a udf for gandiva to extract all named groups.

2020-07-29 Thread GitBox
ZMZ91 opened a new pull request #7846: URL: https://github.com/apache/arrow/pull/7846 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [arrow] nealrichardson closed pull request #7832: ARROW-9574: [R] Cleanups for CRAN 1.0.0 release

2020-07-29 Thread GitBox
nealrichardson closed pull request #7832: URL: https://github.com/apache/arrow/pull/7832 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [arrow] kou closed pull request #7845: ARROW-9576: [Python][Doc] Fix error in example code for extension types

2020-07-29 Thread GitBox
kou closed pull request #7845: URL: https://github.com/apache/arrow/pull/7845 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] nealrichardson closed pull request #7856: ARROW-9589: [C++/R] Forward declare structs as structs

2020-07-29 Thread GitBox
nealrichardson closed pull request #7856: URL: https://github.com/apache/arrow/pull/7856 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [arrow] wesm closed issue #7843: HDFS client unable to load libjvm on Windows

2020-07-29 Thread GitBox
wesm closed issue #7843: URL: https://github.com/apache/arrow/issues/7843 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] kou closed pull request #7849: ARROW-9568: [CI][C++] Use msys2/setup-msys2

2020-07-29 Thread GitBox
kou closed pull request #7849: URL: https://github.com/apache/arrow/pull/7849 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] github-actions[bot] commented on pull request #7832: ARROW-9574: [R] Cleanups for CRAN 1.0.0 release

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7832: URL: https://github.com/apache/arrow/pull/7832#issuecomment-664629670 https://issues.apache.org/jira/browse/ARROW-9574 This is an automated message from the Apache Git

[GitHub] [arrow] wesm commented on issue #7850: Upgrading pyarrow 0.15 -> 1.0 -- deserializing fails

2020-07-29 Thread GitBox
wesm commented on issue #7850: URL: https://github.com/apache/arrow/issues/7850#issuecomment-665311158 These serialization functions never contained any backward compatibility guarantees and are going to be deprecated soon. I would recommend reading the files with <= 0.17.1 and writing

[GitHub] [arrow] njaard closed pull request #7839: [Rust] Don't output debug information to stdout

2020-07-29 Thread GitBox
njaard closed pull request #7839: URL: https://github.com/apache/arrow/pull/7839 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] randomgambit commented on pull request #7304: ARROW-8878: [R] try_download is confused when download.file.method isn't default

2020-07-29 Thread GitBox
randomgambit commented on pull request #7304: URL: https://github.com/apache/arrow/pull/7304#issuecomment-665053396 Hi ! sorry to jump in but I still have problems running install_arrow() on linux (with firewalls in place). Should "libcurl" be the right choice (instead of "wget") ? It

[GitHub] [arrow] chairmank commented on a change in pull request #7837: ARROW-9554: [Java] FixedWidthInPlaceVectorSorter sometimes produces wrong result

2020-07-29 Thread GitBox
chairmank commented on a change in pull request #7837: URL: https://github.com/apache/arrow/pull/7837#discussion_r461560501 ## File path: java/algorithm/src/test/java/org/apache/arrow/algorithm/sort/TestSortingUtil.java ## @@ -0,0 +1,150 @@ +/* + * Licensed to the Apache

[GitHub] [arrow] andygrove commented on a change in pull request #7687: ARROW-9382: [Rust][DataFusion] Simplified hash aggregations and added Boolean type

2020-07-29 Thread GitBox
andygrove commented on a change in pull request #7687: URL: https://github.com/apache/arrow/pull/7687#discussion_r461856982 ## File path: rust/datafusion/src/execution/physical_plan/hash_aggregate.rs ## @@ -327,120 +278,47 @@ impl RecordBatchReader for

[GitHub] [arrow] tianchen92 opened a new pull request #7851: ARROW-9286: [C++] Add function "aliases" to compute::FunctionRegistry

2020-07-29 Thread GitBox
tianchen92 opened a new pull request #7851: URL: https://github.com/apache/arrow/pull/7851 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above

[GitHub] [arrow] andygrove opened a new pull request #7853: ARROW-9582: [Rust] Add memory_size() method to Array [DRAFT]

2020-07-29 Thread GitBox
andygrove opened a new pull request #7853: URL: https://github.com/apache/arrow/pull/7853 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [arrow] kszucs commented on pull request #7838: ARROW-9581: [Dev][Release] Bump next snapshot versions to 2.0.0

2020-07-29 Thread GitBox
kszucs commented on pull request #7838: URL: https://github.com/apache/arrow/pull/7838#issuecomment-665081161 +1, the build failures are unrelated. This is an automated message from the Apache Git Service. To respond to the

[GitHub] [arrow] github-actions[bot] commented on pull request #7845: ARROW-9576: [Doc] Fix error in example code for extension types

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7845: URL: https://github.com/apache/arrow/pull/7845#issuecomment-664685956 https://issues.apache.org/jira/browse/ARROW-9576 This is an automated message from the Apache Git

[GitHub] [arrow] github-actions[bot] commented on pull request #7844: [JS][Doc] Fix syntax error in example code

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7844: URL: https://github.com/apache/arrow/pull/7844#issuecomment-664681901 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [arrow] kszucs commented on pull request #7839: [Rust] Don't output debug information to stdout

2020-07-29 Thread GitBox
kszucs commented on pull request #7839: URL: https://github.com/apache/arrow/pull/7839#issuecomment-664967848 @njaard please create a jira ticket as described above. This is an automated message from the Apache Git Service.

[GitHub] [arrow] wesm commented on pull request #7304: ARROW-8878: [R] try_download is confused when download.file.method isn't default

2020-07-29 Thread GitBox
wesm commented on pull request #7304: URL: https://github.com/apache/arrow/pull/7304#issuecomment-665313657 Could you open a JIRA issue and provide more detail about what's going wrong? This is an automated message from the

[GitHub] [arrow] kszucs opened a new pull request #7852: ARROW-9394: [Python] Support pickling of Scalars

2020-07-29 Thread GitBox
kszucs opened a new pull request #7852: URL: https://github.com/apache/arrow/pull/7852 Since there are no sequence converters available for Dictionary and Union types we cannot construct them directly thus `pa.scalar` fail as the reducer function to reconstruct them. We can add

[GitHub] [arrow] offthewall123 commented on a change in pull request #7815: ARROW-9536: [Java] Miss parameters in PlasmaOutOfMemoryException.java

2020-07-29 Thread GitBox
offthewall123 commented on a change in pull request #7815: URL: https://github.com/apache/arrow/pull/7815#discussion_r461977379 ## File path: java/plasma/src/main/java/org/apache/arrow/plasma/exceptions/PlasmaOutOfMemoryException.java ## @@ -22,11 +22,11 @@ */ public class

[GitHub] [arrow] BryanCutler opened a new pull request #7845: ARROW-9576: [Doc] Fix error in example code for extension types

2020-07-29 Thread GitBox
BryanCutler opened a new pull request #7845: URL: https://github.com/apache/arrow/pull/7845 This fixes an error in the documentation example code for pyarrow extension types. This is an automated message from the Apache Git

[GitHub] [arrow] github-actions[bot] commented on pull request #7852: ARROW-9394: [Python] Support pickling of Scalars

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7852: URL: https://github.com/apache/arrow/pull/7852#issuecomment-665033783 https://issues.apache.org/jira/browse/ARROW-9394 This is an automated message from the Apache Git

[GitHub] [arrow] ZMZ91 commented on pull request #7846: ARROW-8445: [Gandiva][UDF][c++] Add a udf for gandiva to extract all named groups.

2020-07-29 Thread GitBox
ZMZ91 commented on pull request #7846: URL: https://github.com/apache/arrow/pull/7846#issuecomment-665403760 Hi, could someone help review this pr and check the failures? Thanks in advance. This is an automated message from

[GitHub] [arrow] JoshADHD opened a new issue #7857: Underscores at beginning of directory names create problems for open_dataset function

2020-07-29 Thread GitBox
JoshADHD opened a new issue #7857: URL: https://github.com/apache/arrow/issues/7857 When attempting to source parquet files for a dataset, I've found that underscores at the beginning of directory names (which I use often for OCD purposes) cause the function to not find any files.

[GitHub] [arrow] github-actions[bot] commented on pull request #7838: [Release] Bump next snapshot versions to 2.0.0

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7838: URL: https://github.com/apache/arrow/pull/7838#issuecomment-664705011 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [arrow] mpeleshenko commented on issue #7843: HDFS client unable to load libjvm on Windows

2020-07-29 Thread GitBox
mpeleshenko commented on issue #7843: URL: https://github.com/apache/arrow/issues/7843#issuecomment-664668096 @wesm This is actually the same issue described in [ARROW-5236](https://issues.apache.org/jira/browse/ARROW-5236). Can we use that one or should I still create a new JIRA issue?

[GitHub] [arrow] andygrove commented on pull request #7848: ARROW-9550: [Rust] [DataFusion] Remove use of Rc> in HashAggregate - DRAFT

2020-07-29 Thread GitBox
andygrove commented on pull request #7848: URL: https://github.com/apache/arrow/pull/7848#issuecomment-664754709 @jorgecarleitao fyi, maybe this wasn't a good idea after all This is an automated message from the Apache Git

[GitHub] [arrow] pgwhalen opened a new pull request #7847: Remove duplicated to-do line

2020-07-29 Thread GitBox
pgwhalen opened a new pull request #7847: URL: https://github.com/apache/arrow/pull/7847 Looks like a copy-paste error perhaps; I can't imagine it's intentional. This is an automated message from the Apache Git Service. To

[GitHub] [arrow] nevi-me commented on pull request #7853: ARROW-9582: [Rust] Add memory_size() method to Array [DRAFT]

2020-07-29 Thread GitBox
nevi-me commented on pull request #7853: URL: https://github.com/apache/arrow/pull/7853#issuecomment-665068771 > @nevi-me @paddyhoran @sunchao does this general approach seem ok to you? Hi Andy, yes it does to me This

[GitHub] [arrow] wesm merged pull request #7847: ARROW-9585: [Rust][DataFusion] Remove duplicated to-do line

2020-07-29 Thread GitBox
wesm merged pull request #7847: URL: https://github.com/apache/arrow/pull/7847 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow] github-actions[bot] commented on pull request #7853: ARROW-9582: [Rust] Add memory_size() method to Array [DRAFT]

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7853: URL: https://github.com/apache/arrow/pull/7853#issuecomment-665042069 https://issues.apache.org/jira/browse/ARROW-9582 This is an automated message from the Apache Git

[GitHub] [arrow] nealrichardson commented on pull request #7858: WIP Rework r-sanitizer tests to build more like CRAN

2020-07-29 Thread GitBox
nealrichardson commented on pull request #7858: URL: https://github.com/apache/arrow/pull/7858#issuecomment-665287085 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[GitHub] [arrow] jorgecarleitao commented on a change in pull request #7687: ARROW-9382: [Rust][DataFusion] Simplified hash aggregations and added Boolean type

2020-07-29 Thread GitBox
jorgecarleitao commented on a change in pull request #7687: URL: https://github.com/apache/arrow/pull/7687#discussion_r461825412 ## File path: rust/datafusion/src/execution/physical_plan/hash_aggregate.rs ## @@ -327,120 +278,49 @@ impl RecordBatchReader for

[GitHub] [arrow] yordan-pavlov commented on a change in pull request #7798: ARROW-9523 [Rust] Improve filter kernel performance

2020-07-29 Thread GitBox
yordan-pavlov commented on a change in pull request #7798: URL: https://github.com/apache/arrow/pull/7798#discussion_r461149342 ## File path: rust/arrow/src/compute/kernels/filter.rs ## @@ -17,139 +17,466 @@ //! Defines miscellaneous array kernels. -use std::sync::Arc; -

[GitHub] [arrow] kou opened a new pull request #7849: ARROW-9568: [CI][C++] Use msys2/setup-msys2

2020-07-29 Thread GitBox
kou opened a new pull request #7849: URL: https://github.com/apache/arrow/pull/7849 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to

[GitHub] [arrow] kou commented on pull request #7849: ARROW-9568: [CI][C++] Use msys2/setup-msys2

2020-07-29 Thread GitBox
kou commented on pull request #7849: URL: https://github.com/apache/arrow/pull/7849#issuecomment-664831093 +1 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

[GitHub] [arrow] nealrichardson closed pull request #7831: ARROW-9544: [R] Fix version argument of write_parquet()

2020-07-29 Thread GitBox
nealrichardson closed pull request #7831: URL: https://github.com/apache/arrow/pull/7831 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [arrow] github-actions[bot] commented on pull request #7849: ARROW-9568: [CI][C++] Use msys2/setup-msys2

2020-07-29 Thread GitBox
github-actions[bot] commented on pull request #7849: URL: https://github.com/apache/arrow/pull/7849#issuecomment-664791092 https://issues.apache.org/jira/browse/ARROW-9568 This is an automated message from the Apache Git

[GitHub] [arrow] wesm commented on issue #7835: ArrowInvalid: straddling object straddles two block boundaries (try to increase block size?)

2020-07-29 Thread GitBox
wesm commented on issue #7835: URL: https://github.com/apache/arrow/issues/7835#issuecomment-664660186 Could you open a JIRA issue and provide a data file that reproduces the issue? This is an automated message from the

[GitHub] [arrow] tobim opened a new pull request #7842: ARROW-9412: [C++] Add non-bundled dependencies to INTERFACE_LINK_LIBRARIES of static libarrow

2020-07-29 Thread GitBox
tobim opened a new pull request #7842: URL: https://github.com/apache/arrow/pull/7842 Continuation of #7813. This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

[GitHub] [arrow] nealrichardson commented on pull request #7803: ARROW-9517: [C++/Python] Add optional session_token to S3Options

2020-07-29 Thread GitBox
nealrichardson commented on pull request #7803: URL: https://github.com/apache/arrow/pull/7803#issuecomment-664613233 > Is the cadence of one release every few months likely to continue? I'd expect so. Depending on your use case and stability requirements, you could try using our

[GitHub] [arrow] kou commented on a change in pull request #7838: [Release] Bump next snapshot versions to 2.0.0

2020-07-29 Thread GitBox
kou commented on a change in pull request #7838: URL: https://github.com/apache/arrow/pull/7838#discussion_r461243929 ## File path: r/NEWS.md ## @@ -19,6 +19,8 @@ # arrow 1.0.0.9000 +# arrow 1.0.0.9000 + Review comment: Could you revert this?

  1   2   >