[GitHub] [arrow-adbc] lidavidm opened a new issue, #267: [CI] "All keyword arguments must be after positional arguments"

2022-12-24 Thread GitBox
lidavidm opened a new issue, #267: URL: https://github.com/apache/arrow-adbc/issues/267 ``` + meson setup build --buildtype=debugoptimized --default-library=both --libdir=/usr/lib64 --prefix=/usr

[GitHub] [arrow-adbc] kou closed issue #267: [CI] "All keyword arguments must be after positional arguments"

2022-12-24 Thread GitBox
kou closed issue #267: [CI] "All keyword arguments must be after positional arguments" URL: https://github.com/apache/arrow-adbc/issues/267 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the

[GitHub] [arrow-adbc] lidavidm opened a new issue, #271: [Release] Verification job immediately fails with 404

2022-12-24 Thread GitBox
lidavidm opened a new issue, #271: URL: https://github.com/apache/arrow-adbc/issues/271 https://github.com/apache/arrow-adbc/actions/runs/3773060884/jobs/6414323228 We should run CI with VERBOSE=1.

[GitHub] [arrow] chaubold opened a new issue, #15068: [Python] IPC serialization roundtrip of extension type with storage type pa.null() reconstructs invalid buffers

2022-12-22 Thread GitBox
chaubold opened a new issue, #15068: URL: https://github.com/apache/arrow/issues/15068 ### Describe the bug, including details regarding any error messages, version, and platform. Hi guys, I wanted to create a JIRA ticket, but apparently that is not allowed anymore for the

[GitHub] [arrow] jorisvandenbossche closed issue #14885: [Docs] Make changes to the New Contrib Guide (Jira -> GitHub)

2022-12-22 Thread GitBox
jorisvandenbossche closed issue #14885: [Docs] Make changes to the New Contrib Guide (Jira -> GitHub) URL: https://github.com/apache/arrow/issues/14885

[GitHub] [arrow-adbc] lidavidm closed issue #104: [RPM] Add packages for Red Hat Enterprise Linux based distributions

2022-12-22 Thread GitBox
lidavidm closed issue #104: [RPM] Add packages for Red Hat Enterprise Linux based distributions URL: https://github.com/apache/arrow-adbc/issues/104

[GitHub] [arrow] jorisvandenbossche opened a new issue, #15064: [Python][CI] Dask nightly tests are failing due to fsspec bug

2022-12-22 Thread GitBox
jorisvandenbossche opened a new issue, #15064: URL: https://github.com/apache/arrow/issues/15064 ### Describe the bug, including details regarding any error messages, version, and platform. Nightly crossbow tests against dask are failing because of

[GitHub] [arrow] jorisvandenbossche closed issue #15043: [Python] [Docs] Change docstring for `decompressed_size` arg in pyarrow.decompress to reflect implementation

2022-12-22 Thread GitBox
jorisvandenbossche closed issue #15043: [Python] [Docs] Change docstring for `decompressed_size` arg in pyarrow.decompress to reflect implementation URL: https://github.com/apache/arrow/issues/15043

[GitHub] [arrow] jorisvandenbossche closed issue #15064: [Python][CI] Dask nightly tests are failing due to fsspec bug

2022-12-22 Thread GitBox
jorisvandenbossche closed issue #15064: [Python][CI] Dask nightly tests are failing due to fsspec bug URL: https://github.com/apache/arrow/issues/15064

[GitHub] [arrow] EpsilonPrime opened a new issue, #15072: Separate round functionality into a separate module

2022-12-22 Thread GitBox
EpsilonPrime opened a new issue, #15072: URL: https://github.com/apache/arrow/issues/15072 ### Describe the enhancement requested 40% of scalar_arithmetic.cc is comprised of round related functionality in roughly 1200 of 3000 lines of code. This is a sufficient bulk of logic to

[GitHub] [arrow] mapleFU opened a new issue, #15074: [Parquet][C++] Using int32_t to store page_ordinal

2022-12-22 Thread GitBox
mapleFU opened a new issue, #15074: URL: https://github.com/apache/arrow/issues/15074 ### Describe the enhancement requested When a Page can be well compressed in `PLAIN` format, if the estimated size is much larger than the compressed size, the Page can be very small. And a 512MB

[GitHub] [arrow-adbc] kou opened a new issue, #291: [Release] Build source archive by GitHub Actions

2022-12-25 Thread GitBox
kou opened a new issue, #291: URL: https://github.com/apache/arrow-adbc/issues/291 All packages use the built source archive (that will be signed and voted later) instead of using Git repository. Note that this is not a blocker of 0.1.0.

[GitHub] [arrow-julia] CarloLucibello opened a new issue, #370: github releases not in sync

2022-12-26 Thread GitBox
CarloLucibello opened a new issue, #370: URL: https://github.com/apache/arrow-julia/issues/370 Last tagged version is v2.4.1, while github releases are stuck to v2.2.1. Something wrong with tagbot here?

[GitHub] [arrow-adbc] lidavidm closed issue #289: [Release] Don't verify APT/Yum on environment that doesn't have Docker

2022-12-25 Thread GitBox
lidavidm closed issue #289: [Release] Don't verify APT/Yum on environment that doesn't have Docker URL: https://github.com/apache/arrow-adbc/issues/289

[GitHub] [arrow] heronshoes opened a new issue, #15092: [CI][C++] "brew update --preinstall" is failed (again)

2022-12-26 Thread GitBox
heronshoes opened a new issue, #15092: URL: https://github.com/apache/arrow/issues/15092 ### Describe the bug, including details regarding any error messages, version, and platform. This is a bug in the same place as #15025. We must remove `2to3-3.11` this time.

[GitHub] [arrow] jorisvandenbossche closed issue #14975: [Python] Add Dataset.sort_by

2022-12-23 Thread GitBox
jorisvandenbossche closed issue #14975: [Python] Add Dataset.sort_by URL: https://github.com/apache/arrow/issues/14975

[GitHub] [arrow] paleolimbot closed issue #14947: [R] Compatibility of arrow with dplyr 1.1.0

2022-12-26 Thread GitBox
paleolimbot closed issue #14947: [R] Compatibility of arrow with dplyr 1.1.0 URL: https://github.com/apache/arrow/issues/14947

[GitHub] [arrow] 0x26res opened a new issue, #15078: add type and mask argument to pa.MapArray.from_array

2022-12-23 Thread GitBox
0x26res opened a new issue, #15078: URL: https://github.com/apache/arrow/issues/15078 ### Describe the enhancement requested I would like to be able to create MapArray, using the from_array function, and: 1. specify missing value (by that I mean the whole map is null), by passing

[GitHub] [arrow-julia] jrevels opened a new issue, #367: inappropriately applied `Arrow.NullVector` optimization?

2022-12-23 Thread GitBox
jrevels opened a new issue, #367: URL: https://github.com/apache/arrow-julia/issues/367 It seems like `Union{ZonedDateTime,Missing}` (de)serialization can bork Arrow.jl in some circumstances, presumably due to a misapplied `Arrow.NullVector` optimization (or code in that optimization assuming

[GitHub] [arrow] jrbourbeau opened a new issue, #15079: PyArrow fails to roundtrip non-ns datetime64 with pandas 2.0

2022-12-23 Thread GitBox
jrbourbeau opened a new issue, #15079: URL: https://github.com/apache/arrow/issues/15079 ### Describe the bug, including details regarding any error messages, version, and platform. While adding support in Dask for the upcoming `pandas=2.0` release, some of our unit tests began

[GitHub] [arrow] arteconceito closed issue #14973: [JS] Using tableFromIPC().toString() results in invalid stringified JSON when containing dates

2022-12-23 Thread GitBox
arteconceito closed issue #14973: [JS] Using tableFromIPC().toString() results in invalid stringified JSON when containing dates URL: https://github.com/apache/arrow/issues/14973

[GitHub] [arrow-adbc] lidavidm opened a new issue, #259: [Release] Linux packaging does not work in RC builds

2022-12-23 Thread GitBox
lidavidm opened a new issue, #259: URL: https://github.com/apache/arrow-adbc/issues/259 ``` Command failed with status (1): [docker pull ghcr.io/lidavidm/arrow-adbc/pa...] /home/runner/work/arrow-adbc/arrow-adbc/arrow/dev/tasks/linux-packages/package-task.rb:209:in `docker_pull'

[GitHub] [arrow] kou closed issue #15085: [Ruby] Add a method to return column names in Arrow::Table

2022-12-26 Thread GitBox
kou closed issue #15085: [Ruby] Add a method to return column names in Arrow::Table URL: https://github.com/apache/arrow/issues/15085

[GitHub] [arrow] kou closed issue #15040: [R][CI] Arrow nightly binaries require aws sdk system dependency

2022-12-26 Thread GitBox
kou closed issue #15040: [R][CI] Arrow nightly binaries require aws sdk system dependency URL: https://github.com/apache/arrow/issues/15040

[GitHub] [arrow] kou opened a new issue, #15094: [CI][Ruby] verify-rc-source-ruby-linux-ubuntu-18.04-amd64 is failed

2022-12-26 Thread GitBox
kou opened a new issue, #15094: URL: https://github.com/apache/arrow/issues/15094 ### Describe the bug, including details regarding any error messages, version, and platform. https://github.com/ursacomputing/crossbow/actions/runs/3781349273/jobs/6428165522#step:5:10549

[GitHub] [arrow-adbc] kou opened a new issue, #297: [Release][Java] Can't detect correct local Maven repository

2022-12-26 Thread GitBox
kou opened a new issue, #297: URL: https://github.com/apache/arrow-adbc/issues/297 I don't know why but `mvn help:evaluate -Dexpression=settings.localRepository -q -DforceStdout` has an ANSI escape sequence at the end on my environment: ```console $ mvn help:evaluate
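
The symptom reported here, a trailing ANSI escape sequence in captured command output, can be worked around by stripping escape sequences before using the value. The following is a minimal Python sketch of that idea, not the fix applied in arrow-adbc (whose verification script is a shell script). The function name `strip_ansi` and the sample path are hypothetical and only illustrate the technique.

```python
import re

# Matches ANSI CSI sequences: ESC, "[", optional parameter bytes (0x30-0x3F),
# optional intermediate bytes (0x20-0x2F), and one final byte (0x40-0x7E).
ANSI_CSI = re.compile(r"\x1b\[[0-?]*[ -/]*[@-~]")


def strip_ansi(text: str) -> str:
    """Remove ANSI CSI escape sequences, then trim surrounding whitespace."""
    return ANSI_CSI.sub("", text).strip()


# Hypothetical captured output; the real path depends on the environment.
raw_output = "/home/user/.m2/repository\x1b[0m\n"
local_repository = strip_ansi(raw_output)
print(local_repository)
```

Sanitizing the captured string keeps the caller independent of whether the tool decides to emit color codes in a given terminal or CI environment.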

[GitHub] [arrow-adbc] kou opened a new issue, #301: [Release][Ruby] Install gem into build directory in verify-release-candidate.sh

2022-12-26 Thread GitBox
kou opened a new issue, #301: URL: https://github.com/apache/arrow-adbc/issues/301 We should not change out of build directory as much as possible.

[GitHub] [arrow] kou closed issue #15092: [CI][C++] "brew update --preinstall" is failed (again)

2022-12-26 Thread GitBox
kou closed issue #15092: [CI][C++] "brew update --preinstall" is failed (again) URL: https://github.com/apache/arrow/issues/15092

[GitHub] [arrow-adbc] kou opened a new issue, #295: [Release][Go] Can't remove temporary directory at the end in verify-release-candidate.sh

2022-12-26 Thread GitBox
kou opened a new issue, #295: URL: https://github.com/apache/arrow-adbc/issues/295 I don't know why, but Go modules are installed with `0444` permission, so `rm -rf "${ARROW_TMPDIR}"` fails.
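
A plausible reading of this report is that the Go module cache is written read-only, so a plain recursive delete cannot unlink entries inside its directories. The Python sketch below shows one general way to remove such a tree: restore owner write permission on the directories first, then delete. It is an illustration under that assumption, not the change made in `verify-release-candidate.sh`. The helper name `force_rmtree` and the demo layout are hypothetical. The Go toolchain also offers `go clean -modcache` for clearing the module cache itself.

```python
import os
import shutil
import stat
import tempfile


def force_rmtree(path: str) -> None:
    """Remove a directory tree even if it contains read-only entries."""
    # Deleting an entry needs write access to its parent directory,
    # so make every directory owner-writable before removing the tree.
    os.chmod(path, stat.S_IRWXU)
    for root, dirs, _files in os.walk(path):
        for name in dirs:
            full = os.path.join(root, name)
            if not os.path.islink(full):  # never chmod through symlinks
                os.chmod(full, stat.S_IRWXU)
    shutil.rmtree(path)


# Demo: build a small read-only tree (hypothetical layout) and remove it.
tmp = tempfile.mkdtemp()
mod_dir = os.path.join(tmp, "pkg", "mod")
os.makedirs(mod_dir)
mod_file = os.path.join(mod_dir, "go.mod")
with open(mod_file, "w") as f:
    f.write("module example\n")
os.chmod(mod_file, 0o444)  # read-only file, as described in the report
os.chmod(mod_dir, 0o555)   # read-only directory blocks unlinking its entries
force_rmtree(tmp)
print(os.path.exists(tmp))
```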

[GitHub] [arrow-adbc] kou opened a new issue, #299: [Release][GLib] Add support for customizing build related environment variables

2022-12-26 Thread GitBox
kou opened a new issue, #299: URL: https://github.com/apache/arrow-adbc/issues/299 For example, `PKG_CONFIG_PATH` and `LD_LIBRARY_PATH`. We can use Apache Arrow C GLib installed in custom directory by this. FYI: I install Apache Arrow C GLib to `/tmp/local/`.

[GitHub] [arrow-adbc] WillAyd opened a new issue, #307: Python adbc_driver_postgresql documentation missing python dependencies

2022-12-30 Thread GitBox
WillAyd opened a new issue, #307: URL: https://github.com/apache/arrow-adbc/issues/307 When going through the docs for the adbc_driver_postgresql everything built fine, but on runtime you get an error if you don't also install the adbc_driver_manager and pyarrow packages separately.

[GitHub] [arrow-cookbook] davisusanibar closed issue #208: [Java] Upgrade documentation to be readable for more than one Arrow Java version (current release + others)

2022-12-30 Thread GitBox
davisusanibar closed issue #208: [Java] Upgrade documentation to be readable for more than one Arrow Java version (current release + others) URL: https://github.com/apache/arrow-cookbook/issues/208

[GitHub] [arrow-adbc] lidavidm closed issue #291: [Release] Build source archive by GitHub Actions

2022-12-30 Thread GitBox
lidavidm closed issue #291: [Release] Build source archive by GitHub Actions URL: https://github.com/apache/arrow-adbc/issues/291

[GitHub] [arrow-adbc] lidavidm closed issue #304: [C] PostgreSQL driver does not build on macOS 10.9

2022-12-30 Thread GitBox
lidavidm closed issue #304: [C] PostgreSQL driver does not build on macOS 10.9 URL: https://github.com/apache/arrow-adbc/issues/304

[GitHub] [arrow-cookbook] davisusanibar opened a new issue, #288: [Java] Document how to read compression files LZ4 / ZSTD

2022-12-30 Thread GitBox
davisusanibar opened a new issue, #288: URL: https://github.com/apache/arrow-cookbook/issues/288 Related to https://issues.apache.org/jira/browse/ARROW-18198

[GitHub] [arrow] assignUser closed issue #15122: [Benchmarking][Python] Benchmark builds on `test-mac-arm` are failing

2022-12-30 Thread GitBox
assignUser closed issue #15122: [Benchmarking][Python] Benchmark builds on `test-mac-arm` are failing URL: https://github.com/apache/arrow/issues/15122

[GitHub] [arrow] lidavidm closed issue #14844: [Java] VectorValueComparator should skip the null test for NonNullable FieldTypes types

2022-12-30 Thread GitBox
lidavidm closed issue #14844: [Java] VectorValueComparator should skip the null test for NonNullable FieldTypes types URL: https://github.com/apache/arrow/issues/14844

[GitHub] [arrow] assignUser opened a new issue, #15126: [R] purrr::rerun was deprecated in purrr 1.0.0

2022-12-30 Thread GitBox
assignUser opened a new issue, #15126: URL: https://github.com/apache/arrow/issues/15126 ### Describe the bug, including details regarding any error messages, version, and platform. ``` `rerun()` was deprecated in purrr 1.0.0. i Please use `map()` instead. # Previously

[GitHub] [arrow] westonpace opened a new issue, #15130: [C++] Simplify ExecNode contract by removing the concept of "node finished"

2022-12-30 Thread GitBox
westonpace opened a new issue, #15130: URL: https://github.com/apache/arrow/issues/15130 ### Describe the enhancement requested We've slowly been migrating to a model where all work done by the exec plan happens via the scheduler. We are at a point now where it is mostly true that

[GitHub] [arrow-adbc] WillAyd opened a new issue, #308: NotSupportedError for postgres CHAR / VARCHAR columns

2022-12-30 Thread GitBox
WillAyd opened a new issue, #308: URL: https://github.com/apache/arrow-adbc/issues/308 With a table such as: ```sql BEGIN; DROP TABLE IF EXISTS test_table; CREATE TABLE test_table ( integer_column INTEGER NOT NULL, char_column CHAR(1) NOT NULL );

[GitHub] [arrow] RNKuhns opened a new issue, #15105: Add compute between comparison

2022-12-28 Thread GitBox
RNKuhns opened a new issue, #15105: URL: https://github.com/apache/arrow/issues/15105 ### Describe the enhancement requested Between is a common comparison operation. It is currently possible to implement this in Arrow using a combination of greater_equal and less_equal comparisons

[GitHub] [arrow] RNKuhns opened a new issue, #15103: Weighted stat aggregations in arrow-compute

2022-12-28 Thread GitBox
RNKuhns opened a new issue, #15103: URL: https://github.com/apache/arrow/issues/15103 ### Describe the enhancement requested Arrow’s compute functions currently include several aggregate statistics (mean, sum, variance, etc). It would be great to offer weighted versions of

[GitHub] [arrow] wjones127 opened a new issue, #15100: Add benchmarks for reading and writing strings

2022-12-27 Thread GitBox
wjones127 opened a new issue, #15100: URL: https://github.com/apache/arrow/issues/15100 ### Describe the enhancement requested To evaluate #14353 we should benchmark reading and writing string data. ### Component(s) Benchmarking, C++, Parquet

[GitHub] [arrow] lanmao-alibaba opened a new issue, #15102: [C++]Could not decompress arrow stream sent from [Java] arrow SDK

2022-12-28 Thread GitBox
lanmao-alibaba opened a new issue, #15102: URL: https://github.com/apache/arrow/issues/15102 ### Describe the bug, including details regarding any error messages, version, and platform. Hi guys, I am using arrow java sdk to send lz4/zstd stream to a server which is running c++

[GitHub] [arrow] cyb70289 closed issue #15072: [C++] Separate round functionality into a separate module

2022-12-27 Thread GitBox
cyb70289 closed issue #15072: [C++] Separate round functionality into a separate module URL: https://github.com/apache/arrow/issues/15072

[GitHub] [arrow] lukester1975 opened a new issue, #15139: [C++] arrow.pc is missing dependencies with Windows static builds

2022-12-31 Thread GitBox
lukester1975 opened a new issue, #15139: URL: https://github.com/apache/arrow/issues/15139 ### Describe the bug, including details regarding any error messages, version, and platform. I have been having to manually edit `arrow.pc` from a vcpkg installation of a static build of arrow

[GitHub] [arrow] wjones127 opened a new issue, #15136: [Python] Python C++ libraries no longer in pyarrow/lib directory

2022-12-30 Thread GitBox
wjones127 opened a new issue, #15136: URL: https://github.com/apache/arrow/issues/15136 ### Describe the bug, including details regarding any error messages, version, and platform. Prior to #14925, libraries such as `arrow_python.dylib` were installed in both `python/pyarrow` and

[GitHub] [arrow] reh183 closed issue #14797: Is it possible to write a parquet file with a bloom filter for a column

2022-12-30 Thread GitBox
reh183 closed issue #14797: Is it possible to write a parquet file with a bloom filter for a column URL: https://github.com/apache/arrow/issues/14797

[GitHub] [arrow] wjones127 closed issue #15069: [C++][Python] PyArrow Flight DoAction does not return results as available

2022-12-30 Thread GitBox
wjones127 closed issue #15069: [C++][Python] PyArrow Flight DoAction does not return results as available URL: https://github.com/apache/arrow/issues/15069

[GitHub] [arrow] neopaf opened a new issue, #15138: Clustered By -- how?

2022-12-31 Thread GitBox
neopaf opened a new issue, #15138: URL: https://github.com/apache/arrow/issues/15138 ### Describe the usage question you have. Please include as many useful details as possible. As I understand https://cloudxlab.com/blog/bucketing-clustered-by-and-cluster-by/ Hive

[GitHub] [arrow] kou opened a new issue, #15134: [Ruby] Specify -mmacosx-version-min=10.14 explicitly for old Xcode

2022-12-30 Thread GitBox
kou opened a new issue, #15134: URL: https://github.com/apache/arrow/issues/15134 ### Describe the enhancement requested If we don't specify `-mmacosx-version-min=10.14`, the following error occurs: ```text /tmp/local/include/arrow/type.h:1745:36: error: 'get>' is

[GitHub] [arrow] westonpace opened a new issue, #15137: [C++][CI] ASAN error in streaming JSON reader tests

2022-12-30 Thread GitBox
westonpace opened a new issue, #15137: URL: https://github.com/apache/arrow/issues/15137 ### Describe the bug, including details regarding any error messages, version, and platform. Example: https://github.com/apache/arrow/actions/runs/3808683807/jobs/6479467470

[GitHub] [arrow] atherkevin opened a new issue, #15133: Pyarrow conversion error

2022-12-30 Thread GitBox
atherkevin opened a new issue, #15133: URL: https://github.com/apache/arrow/issues/15133 ### Describe the bug, including details regarding any error messages, version, and platform. Hello, I'm running into an issue using the pandas to_feather() call with pyarrow. Pyarrow

[GitHub] [arrow] kou closed issue #15105: Add compute between comparison

2022-12-28 Thread GitBox
kou closed issue #15105: Add compute between comparison URL: https://github.com/apache/arrow/issues/15105

[GitHub] [arrow] mapleFU opened a new issue, #15107: Support RLE Encoder for Boolean type

2022-12-28 Thread GitBox
mapleFU opened a new issue, #15107: URL: https://github.com/apache/arrow/issues/15107 ### Describe the enhancement requested Currently, in our spec, boolean type in data page can use RLE to encode itself. And parquet-mr, parquet-go and other versions all supports it. Thanks

[GitHub] [arrow] westonpace opened a new issue, #15144: [C++] Add an option for the order by node to be stable

2022-12-31 Thread GitBox
westonpace opened a new issue, #15144: URL: https://github.com/apache/arrow/issues/15144 ### Describe the enhancement requested This will require support for ARROW-17762 first and only makes sense if there is some kind of existing (even the implicit) ordering. We can resequence as

[GitHub] [arrow] westonpace closed issue #15141: [CI] arrow-compute: ExecPlanExecution.StressSourceOrderBy may failed

2022-12-31 Thread GitBox
westonpace closed issue #15141: [CI] arrow-compute: ExecPlanExecution.StressSourceOrderBy may failed URL: https://github.com/apache/arrow/issues/15141

[GitHub] [arrow] kou closed issue #15119: [Release][Docs][R] Update version information in patch release

2022-12-31 Thread GitBox
kou closed issue #15119: [Release][Docs][R] Update version information in patch release URL: https://github.com/apache/arrow/issues/15119

[GitHub] [arrow] kou closed issue #15134: [Ruby] Specify -mmacosx-version-min=10.14 explicitly for old Xcode

2022-12-31 Thread GitBox
kou closed issue #15134: [Ruby] Specify -mmacosx-version-min=10.14 explicitly for old Xcode URL: https://github.com/apache/arrow/issues/15134

[GitHub] [arrow] mapleFU opened a new issue, #15141: [CI] arrow-compute: ExecPlanExecution.StressSourceOrderBy may failed

2022-12-31 Thread GitBox
mapleFU opened a new issue, #15141: URL: https://github.com/apache/arrow/issues/15141 ### Describe the bug, including details regarding any error messages, version, and platform. 1. https://github.com/apache/arrow/actions/runs/3812471160/jobs/6485697443 2.

[GitHub] [arrow] mapleFU opened a new issue, #15145: [C++][Parquet] The DictEncoder is always PLAIN_DICTIONARY even in parquet_v2 format

2023-01-01 Thread GitBox
mapleFU opened a new issue, #15145: URL: https://github.com/apache/arrow/issues/15145 ### Describe the enhancement requested In `DictEncoderImpl`, the encoding of it is fixed, which is `PLAIN_DICTIONARY`. In our standard, it should be `RLE_DICTIONARY` or `PLAIN_DICTIONARY`, and

[GitHub] [arrow-adbc] lidavidm closed issue #295: [Release][Go] Can't remove temporary directory at the end in verify-release-candidate.sh

2022-12-29 Thread GitBox
lidavidm closed issue #295: [Release][Go] Can't remove temporary directory at the end in verify-release-candidate.sh URL: https://github.com/apache/arrow-adbc/issues/295

[GitHub] [arrow-adbc] lidavidm closed issue #297: [Release][Java] Can't detect correct local Maven repository

2022-12-29 Thread GitBox
lidavidm closed issue #297: [Release][Java] Can't detect correct local Maven repository URL: https://github.com/apache/arrow-adbc/issues/297

[GitHub] [arrow] jonkeane opened a new issue, #15110: [R] [CI] Windows build fails in packaging job

2022-12-29 Thread GitBox
jonkeane opened a new issue, #15110: URL: https://github.com/apache/arrow/issues/15110 ### Describe the bug, including details regarding any error messages, version, and platform. One example: https://github.com/ursacomputing/crossbow/actions/runs/3799348531/jobs/6462070137

[GitHub] [arrow-adbc] lidavidm closed issue #301: [Release][Ruby] Install gem into build directory in verify-release-candidate.sh

2022-12-29 Thread GitBox
lidavidm closed issue #301: [Release][Ruby] Install gem into build directory in verify-release-candidate.sh URL: https://github.com/apache/arrow-adbc/issues/301

[GitHub] [arrow-adbc] lidavidm closed issue #293: [Release][Go] verify-release-candidate.sh uses glib-build/ not go-build/ for build directory

2022-12-29 Thread GitBox
lidavidm closed issue #293: [Release][Go] verify-release-candidate.sh uses glib-build/ not go-build/ for build directory URL: https://github.com/apache/arrow-adbc/issues/293

[GitHub] [arrow-adbc] lidavidm opened a new issue, #304: [C] PostgreSQL driver does not build on macOS 10.9

2022-12-29 Thread GitBox
lidavidm opened a new issue, #304: URL: https://github.com/apache/arrow-adbc/issues/304 ``` In file included from /Users/runner/mambaforge/conda-bld/arrow-adbc-split_1672320127369/work/c/driver/postgresql/connection.cc:26:

[GitHub] [arrow-adbc] lidavidm closed issue #285: [CI] Gemfury upload is failing for JARs

2022-12-29 Thread GitBox
lidavidm closed issue #285: [CI] Gemfury upload is failing for JARs URL: https://github.com/apache/arrow-adbc/issues/285

[GitHub] [arrow] BSalita opened a new issue, #15113: Add pa.tuple_ DataType

2022-12-29 Thread GitBox
BSalita opened a new issue, #15113: URL: https://github.com/apache/arrow/issues/15113 ### Describe the enhancement requested I use tuples in my dataframes in Python. When the dataframe is saved/read using parquet, the tuples turn into lists. I prefer the datatype not change. Some

[GitHub] [arrow-adbc] lidavidm closed issue #299: [Release][GLib] Add support for customizing build related environment variables

2022-12-29 Thread GitBox
lidavidm closed issue #299: [Release][GLib] Add support for customizing build related environment variables URL: https://github.com/apache/arrow-adbc/issues/299

[GitHub] [arrow] assignUser closed issue #15110: [R] [CI] Windows build fails in packaging job

2022-12-29 Thread GitBox
assignUser closed issue #15110: [R] [CI] Windows build fails in packaging job URL: https://github.com/apache/arrow/issues/15110

[GitHub] [arrow] 0x26res opened a new issue, #15109: Can't create a non empty StructArray with no field using `StructArray.from_array`

2022-12-29 Thread GitBox
0x26res opened a new issue, #15109: URL: https://github.com/apache/arrow/issues/15109 ### Describe the bug, including details regarding any error messages, version, and platform. I want to create a StructArray with several rows but no fields/columns: ``` array =

[GitHub] [arrow-adbc] lidavidm closed issue #242: [CI] Clean old nightly packages during upload

2022-12-29 Thread GitBox
lidavidm closed issue #242: [CI] Clean old nightly packages during upload URL: https://github.com/apache/arrow-adbc/issues/242

[GitHub] [arrow] kou closed issue #15146: [GLib] Add `GADatasetFinishOptions`

2023-01-01 Thread GitBox
kou closed issue #15146: [GLib] Add `GADatasetFinishOptions` URL: https://github.com/apache/arrow/issues/15146

[GitHub] [arrow] lwhite1 opened a new issue, #15050: [Java] [Docs] Reorganize and update Java tutorial Memory page

2022-12-20 Thread GitBox
lwhite1 opened a new issue, #15050: URL: https://github.com/apache/arrow/issues/15050 ### Describe the enhancement requested Move selected text from the package JavaDoc to the rst/html documentation to avoid duplication and maintenance issues. Update docs to include mention of

[GitHub] [arrow] ava6969 opened a new issue, #15162: Usage of Hash Aggregation

2023-01-02 Thread GitBox
ava6969 opened a new issue, #15162: URL: https://github.com/apache/arrow/issues/15162 ### Describe the usage question you have. Please include as many useful details as possible. I am still confused on the right way to use the hash aggregations. I found 3 different approaches

[GitHub] [arrow] ava6969 opened a new issue, #15161: Inplace renaming of columns in c++ record batch.

2023-01-02 Thread GitBox
ava6969 opened a new issue, #15161: URL: https://github.com/apache/arrow/issues/15161 ### Describe the usage question you have. Please include as many useful details as possible. I would love to rename the columns of record batch without creation of the record batch. Is that

[GitHub] [arrow] vbfox opened a new issue, #15156: [JS] The library fail to load on browsers without BigInt64Array support

2023-01-02 Thread GitBox
vbfox opened a new issue, #15156: URL: https://github.com/apache/arrow/issues/15156 ### Describe the bug, including details regarding any error messages, version, and platform. When `apache-arrow` 10.0.1 is loaded on a browser that doesn't support `BigInt64Array` (Safari on iOS 14

[GitHub] [arrow] westonpace opened a new issue, #15158: [C++] Document asof join

2023-01-02 Thread GitBox
westonpace opened a new issue, #15158: URL: https://github.com/apache/arrow/issues/15158 ### Describe the enhancement requested We've had an asof join node for a while now. We should document it in streaming_execution.rst and add an example to

[GitHub] [arrow] jonkeane closed issue #15001: [R] Map/list equality tests are failing only on the sanitizer nightlies

2023-01-05 Thread GitBox
jonkeane closed issue #15001: [R] Map/list equality tests are failing only on the sanitizer nightlies URL: https://github.com/apache/arrow/issues/15001

[GitHub] [arrow] Zaharid opened a new issue, #15209: Add abs function to Gandiva

2023-01-05 Thread GitBox
Zaharid opened a new issue, #15209: URL: https://github.com/apache/arrow/issues/15209 ### Describe the enhancement requested Gandiva should be able to compute `abs(number)` for various numeric types. ### Component(s) C++ - Gandiva

[GitHub] [arrow-adbc] lidavidm closed issue #309: [CI] Gemfury package cleaning needs to bucket by date/package version

2023-01-05 Thread GitBox
lidavidm closed issue #309: [CI] Gemfury package cleaning needs to bucket by date/package version URL: https://github.com/apache/arrow-adbc/issues/309

[GitHub] [arrow] jonkeane closed issue #14829: [CI][R][Homebrew] Nightly CI jobs aren't maintained

2023-01-05 Thread GitBox
jonkeane closed issue #14829: [CI][R][Homebrew] Nightly CI jobs aren't maintained URL: https://github.com/apache/arrow/issues/14829

[GitHub] [arrow] westonpace closed issue #15199: [C++][Substrait] Allow AGGREGATION_INVOCATION_UNSPECIFIED as valid invocation in Substrait plan

2023-01-05 Thread GitBox
westonpace closed issue #15199: [C++][Substrait] Allow AGGREGATION_INVOCATION_UNSPECIFIED as valid invocation in Substrait plan URL: https://github.com/apache/arrow/issues/15199

[GitHub] [arrow-adbc] jacobmarble opened a new issue, #315: [Go][FlightSQL] Implementation of `database/sql` driver

2023-01-05 Thread GitBox
jacobmarble opened a new issue, #315: URL: https://github.com/apache/arrow-adbc/issues/315 ### Describe the enhancement requested In Golang, the common way to query with a SQL database is by using the [`database/sql`](https://go.dev/src/database/sql/doc.txt) driver interface. For

[GitHub] [arrow] jonkeane opened a new issue, #15205: [R] Fix a parquet-fixture finding in R tests

2023-01-05 Thread GitBox
jonkeane opened a new issue, #15205: URL: https://github.com/apache/arrow/issues/15205 ### Describe the enhancement requested A follow-on to #15001 to actually ensure that the tests are run in CI (they are still skipped in some of our GitHub Actions tests). ### Component(s)

[GitHub] [arrow] jacobmarble opened a new issue, #15206: Implementation of `database/sql` driver

2023-01-05 Thread GitBox
jacobmarble opened a new issue, #15206: URL: https://github.com/apache/arrow/issues/15206 ### Describe the enhancement requested In Golang, the common way to query with a SQL database is by using the [`database/sql`](https://go.dev/src/database/sql/doc.txt) driver interface. For the

[GitHub] [arrow] assignUser opened a new issue, #15215: [CI] Create suggestions comments in lint job

2023-01-05 Thread GitBox
assignUser opened a new issue, #15215: URL: https://github.com/apache/arrow/issues/15215 ### Describe the enhancement requested The current lint job is not very user-friendly; better visibility for the changes would be nice. Ideally we would create PR review comments with change

[GitHub] [arrow] wgtmac opened a new issue, #15216: [C++][Parquet] Parquet arrow::FileWriter accepts arrow::RecordBatch in the buffered row group mode

2023-01-05 Thread GitBox
wgtmac opened a new issue, #15216: URL: https://github.com/apache/arrow/issues/15216 ### Describe the enhancement requested Current state of the parquet [arrow::FileWriter](https://github.com/apache/arrow/blob/master/cpp/src/parquet/arrow/writer.h#L53) - It does not support

[GitHub] [arrow] wjones127 opened a new issue, #15212: ORC writer doesn't work on sliced list arrays

2023-01-05 Thread GitBox
wjones127 opened a new issue, #15212: URL: https://github.com/apache/arrow/issues/15212 ### Describe the bug, including details regarding any error messages, version, and platform. Follow up on #15210, where there was another misuse of `BaseListArray.values()`. This can be

[GitHub] [arrow] westonpace closed issue #15062: Remove unused kUnspecified enum value and simplify EnumParser.

2023-01-05 Thread GitBox
westonpace closed issue #15062: Remove unused kUnspecified enum value and simplify EnumParser. URL: https://github.com/apache/arrow/issues/15062

[GitHub] [arrow] jonkeane closed issue #15205: [R] Fix a parquet-fixture finding in R tests

2023-01-06 Thread GitBox
jonkeane closed issue #15205: [R] Fix a parquet-fixture finding in R tests URL: https://github.com/apache/arrow/issues/15205

[GitHub] [arrow-testing] lidavidm commented on pull request #85: GH-15203: [C++][Java] Add files with uncompressible buffers

2023-01-06 Thread GitBox
lidavidm commented on PR #85: URL: https://github.com/apache/arrow-testing/pull/85#issuecomment-1373731328 Java passes, C++ fails

[GitHub] [arrow] jonkeane opened a new issue, #15224: [R] [CI] Segfault during python flight tests

2023-01-06 Thread GitBox
jonkeane opened a new issue, #15224: URL: https://github.com/apache/arrow/issues/15224 ### Describe the bug, including details regarding any error messages, version, and platform. This is not consistent (re-runs will sometimes resolve it), but we seem to be getting a segfault when

[GitHub] [arrow] marklit opened a new issue, #15220: Speed up Parquet Writing?

2023-01-06 Thread GitBox
marklit opened a new issue, #15220: URL: https://github.com/apache/arrow/issues/15220 ### Describe the enhancement requested The following was run on Ubuntu 20 on a `e2-highcpu-32` GCP VM with 32 GB of RAM and 32 vCPUs. I downloaded the California dataset from

[GitHub] [arrow] kou opened a new issue, #15218: [Python] Remove auto generated pyarrow_api.h and pyarrow_lib.h

2023-01-06 Thread GitBox
kou opened a new issue, #15218: URL: https://github.com/apache/arrow/issues/15218 ### Describe the enhancement requested We can use auto-generated `lib_api.h` and `lib.h` instead of pre-committed `pyarrow_api.h` and `pyarrow_lib.h` because we unified build system of PyArrow and

[GitHub] [arrow] dylanscott opened a new issue, #15225: Ability to customize JdbcConsumer construction

2023-01-06 Thread GitBox
dylanscott opened a new issue, #15225: URL: https://github.com/apache/arrow/issues/15225 ### Describe the enhancement requested I'm working on a project that heavily uses the arrow-jdbc library to convert results from a number of different JDBC drivers to Arrow. When the integration

[GitHub] [arrow] wjones127 closed issue #15185: Improvement for the documentation of Parquet reader

2023-01-06 Thread GitBox
wjones127 closed issue #15185: Improvement for the documentation of Parquet reader URL: https://github.com/apache/arrow/issues/15185

[GitHub] [arrow] jbrockmendel opened a new issue, #15226: ENH: dictionary_encode support duration types

2023-01-06 Thread GitBox
jbrockmendel opened a new issue, #15226: URL: https://github.com/apache/arrow/issues/15226 ### Describe the enhancement requested In troubleshooting pandas xfails, I'm finding a chunk of them trace back to trying to call dictionary_encode with duration types. @jorisvandenbossche

[GitHub] [arrow] jonkeane opened a new issue, #15228: [R] [CI] Add Github PAT to jobs that are reaching limit

2023-01-06 Thread GitBox
jonkeane opened a new issue, #15228: URL: https://github.com/apache/arrow/issues/15228 ### Describe the enhancement requested This passes the existing GitHub PAT down to where it can be used during installation of R dependencies (which are hitting rate limits) ```
