Re: [PR] GH-41410: [C++][FS][Azure][Docs] Add AzureFileSystem to Filesystems API reference [arrow]

2024-04-26 Thread via GitHub


github-actions[bot] commented on PR #41411:
URL: https://github.com/apache/arrow/pull/41411#issuecomment-2080381656

   Revision: 703137eda155222396a044c6682c82aa451c6493
   
   Submitted crossbow builds: [ursacomputing/crossbow @ actions-a15f1ef9b4](https://github.com/ursacomputing/crossbow/branches/all?query=actions-a15f1ef9b4)
   
   |Task|Status|
   |--|--|
   |preview-docs|[![GitHub Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-a15f1ef9b4-github-preview-docs)](https://github.com/ursacomputing/crossbow/actions/runs/8857848768/job/24325806006)|


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] GH-41410: [C++][FS][Azure][Docs] Add AzureFileSystem to Filesystems API reference [arrow]

2024-04-26 Thread via GitHub


amoeba commented on PR #41411:
URL: https://github.com/apache/arrow/pull/41411#issuecomment-2080381018

   @github-actions crossbow submit preview-docs





Re: [PR] GH-41410: [C++][FS][Azure][Docs] Add AzureFileSystem to Filesystems API reference [arrow]

2024-04-26 Thread via GitHub


github-actions[bot] commented on PR #41411:
URL: https://github.com/apache/arrow/pull/41411#issuecomment-2080367078

   Revision: 0a5cbb335566d882de7fb8949796115d52e1c961
   
   Submitted crossbow builds: [ursacomputing/crossbow @ actions-14581fd73b](https://github.com/ursacomputing/crossbow/branches/all?query=actions-14581fd73b)
   
   |Task|Status|
   |--|--|
   |preview-docs|[![GitHub Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-14581fd73b-github-preview-docs)](https://github.com/ursacomputing/crossbow/actions/runs/8857536742/job/24325148711)|





Re: [PR] GH-41410: [C++][FS][Azure][Docs] Add AzureFileSystem to Filesystems API reference [arrow]

2024-04-26 Thread via GitHub


github-actions[bot] commented on PR #41411:
URL: https://github.com/apache/arrow/pull/41411#issuecomment-2080366509

   :warning: GitHub issue #41410 **has been automatically assigned in GitHub** 
to PR creator.





[PR] GH-41410: [C++][FS][Azure][Docs] Add AzureFileSystem to Filesystems API reference [arrow]

2024-04-26 Thread via GitHub


amoeba opened a new pull request, #41411:
URL: https://github.com/apache/arrow/pull/41411

   ### Rationale for this change
   
   See https://github.com/apache/arrow/issues/41410.
   
   ### What changes are included in this PR?
   
   Just changes to filesystem.rst.
   
   ### Are these changes tested?
   
   Yes, locally.
   
   ### Are there any user-facing changes?
   
   These are those changes.





Re: [PR] GH-41410: [C++][FS][Azure][Docs] Add AzureFileSystem to Filesystems API reference [arrow]

2024-04-26 Thread via GitHub


amoeba commented on PR #41411:
URL: https://github.com/apache/arrow/pull/41411#issuecomment-2080366546

   @github-actions crossbow submit preview-docs





Re: [PR] feat(csharp): Implement support for transactions, isolation level and read-only flag [arrow-adbc]

2024-04-26 Thread via GitHub


CurtHagenlocher commented on PR #1784:
URL: https://github.com/apache/arrow-adbc/pull/1784#issuecomment-2080354058

   @davidhcoe, would appreciate a look at your leisure.





[PR] feat(csharp): Implement support for transactions, isolation level and read-only flag [arrow-adbc]

2024-04-26 Thread via GitHub


CurtHagenlocher opened a new pull request, #1784:
URL: https://github.com/apache/arrow-adbc/pull/1784

   Implements support for transactions, isolation level and read-only flags on 
imported and exported drivers.
   
   Closes #1782





Re: [PR] GH-41316: [CI][Python] Reduce CI time [arrow]

2024-04-26 Thread via GitHub


assignUser commented on code in PR #41378:
URL: https://github.com/apache/arrow/pull/41378#discussion_r1581724489


##
.github/workflows/python.yml:
##
@@ -171,6 +171,23 @@ jobs:
 with:
   fetch-depth: 0
   submodules: recursive
+  - name: Install ccache

Review Comment:
   Hey! So the issue here is that `actions/cache` doesn't work well for 
build caches like ccache: caches are immutable, so if the primary key 
is hit the cache will not be updated, leading to worse and worse hit rates.
   
   In this case you also used the same key for 2 different jobs. This will lead 
to a key clash and prevent one job from caching (looking at the results, arm64 
was likely cached -> 0% hit rate on amd64, as ccache is os/arch specific).
   
   A workaround is to create unique keys (e.g. 
`ccache-python-macos--${{ github.sha }}`) on each run and use 
`restore-keys: ccache-python-macos--` to still get a hit. This is quite 
wasteful and leads to frequent cache evictions, as only 10GB/repo are allowed, 
which we frequently go over (massively).
   
![image](https://github.com/apache/arrow/assets/16141871/8cf7ccb4-8e41-4d60-a74f-d23bb3810d14)
   
   I created an [alternative](https://github.com/assignUser/stash) for this 
case but have to ask infra to allow-list it.
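
A minimal sketch of the behaviour described above, as a toy Python model rather than the real `actions/cache` implementation. The `ImmutableCache` class, `run_job` helper, and the `sha1`/`sha2`/`sha3` key suffixes are hypothetical names used only for illustration. The model assumes an exact key hit skips the save step and that a `restore-keys` prefix falls back to the most recently created matching entry.

```python
class ImmutableCache:
    """Toy model: entries are never overwritten once saved."""

    def __init__(self):
        self._entries = []  # (key, payload) pairs in creation order

    def restore(self, key, restore_keys=()):
        # Exact match on the primary key wins.
        for k, payload in self._entries:
            if k == key:
                return k, payload
        # Otherwise fall back to the newest entry matching a prefix.
        for prefix in restore_keys:
            matches = [(k, p) for k, p in self._entries if k.startswith(prefix)]
            if matches:
                return matches[-1]
        return None, None

    def save(self, key, payload):
        # Immutable: an existing key is never updated.
        if any(k == key for k, _ in self._entries):
            return False
        self._entries.append((key, payload))
        return True


def run_job(cache, key, restore_keys, compiled):
    """Restore, 'compile' some objects, and save unless the primary key hit."""
    hit_key, payload = cache.restore(key, restore_keys)
    objects = set(payload or ()) | set(compiled)
    if hit_key != key:
        cache.save(key, frozenset(objects))
    return hit_key


# Fixed key: the second run hits the primary key, so its new objects are lost.
fixed = ImmutableCache()
run_job(fixed, "ccache-python-macos", (), {"a.o"})
run_job(fixed, "ccache-python-macos", (), {"b.o"})
fixed_payload = fixed.restore("ccache-python-macos")[1]

# Unique key per run plus a prefix fallback: the cache keeps growing.
prefix = "ccache-python-macos--"
unique = ImmutableCache()
run_job(unique, prefix + "sha1", (prefix,), {"a.o"})
run_job(unique, prefix + "sha2", (prefix,), {"b.o"})
unique_payload = unique.restore(prefix + "sha3", (prefix,))[1]

print(sorted(fixed_payload), sorted(unique_payload))
```

In this model the fixed key stays frozen at the first run's contents, while the unique-key variant stores one full entry per run, which is the storage cost the comment calls wasteful.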






Re: [PR] GH-41402: [CI][R] Update our backwards compatibility CI any other R 4.4 cleanups [arrow]

2024-04-26 Thread via GitHub


assignUser commented on code in PR #41403:
URL: https://github.com/apache/arrow/pull/41403#discussion_r1581719580


##
dev/tasks/r/github.linux.versions.yml:
##
@@ -30,7 +30,6 @@ jobs:
 r_version:
   # We test devel, release, and oldrel in regular CI.
   # This is for older versions
-  - "3.6"
   - "4.0"
   - "4.1"

Review Comment:
   Should we add 4.2 as replacement for 3.6?



##
ci/docker/linux-apt-r.dockerfile:
##
@@ -35,7 +35,7 @@ ENV LANG=C.UTF-8
 # Build R
 # [1] 
https://www.digitalocean.com/community/tutorials/how-to-install-r-on-ubuntu-18-04
 # [2] 
https://linuxize.com/post/how-to-install-r-on-ubuntu-18-04/#installing-r-packages-from-cran
-ARG r=3.6

Review Comment:
   oO :D



##
r/R/dplyr-funcs-type.R:
##
@@ -140,7 +140,7 @@ register_bindings_type_cast <- function() {
  fix.empty.names = TRUE,
  stringsAsFactors = FALSE) {
   # we need a specific value of stringsAsFactors because the default was
-  # TRUE in R <= 3.6
+  # TRUE in R <= 3.6 and folks might still be cargoculting to stay in the 
past.

Review Comment:
   :grin: 






Re: [I] `snappy_jll v1.2.0` lead to Arrow_jll failed to build [arrow-julia]

2024-04-26 Thread via GitHub


visr commented on issue #505:
URL: https://github.com/apache/arrow-julia/issues/505#issuecomment-2080341534

   No direct wrapper yet. We built it for GDAL to be able to read geospatial 
data based on Arrow and Parquet; GeoArrow and GeoParquet.





Re: [PR] feat(python): Implement extension type and Schema metadata support [arrow-nanoarrow]

2024-04-26 Thread via GitHub


paleolimbot commented on PR #431:
URL: https://github.com/apache/arrow-nanoarrow/pull/431#issuecomment-2080322076

   Thank you for taking a look!





Re: [PR] feat(python): Implement extension type and Schema metadata support [arrow-nanoarrow]

2024-04-26 Thread via GitHub


paleolimbot merged PR #431:
URL: https://github.com/apache/arrow-nanoarrow/pull/431





Re: [PR] feat(python): Implement extension type and Schema metadata support [arrow-nanoarrow]

2024-04-26 Thread via GitHub


paleolimbot commented on code in PR #431:
URL: https://github.com/apache/arrow-nanoarrow/pull/431#discussion_r1581712390


##
python/src/nanoarrow/_lib.pyx:
##
@@ -967,10 +989,50 @@ cdef class CSchemaBuilder:
 if self._ptr.release == NULL:
 ArrowSchemaInit(self._ptr)
 
+@staticmethod
+def copy_existing(CSchema existing_schema):
+return CSchemaBuilder(existing_schema.__deepcopy__())
+
 @staticmethod
 def allocate():
 return CSchemaBuilder(CSchema.allocate())
 
+def clear_metadata(self):

Review Comment:
   At this level of abstraction it would be hard (since it is basically 
wrapping a single C API call) but I agree that it would be a better interface!






Re: [PR] feat(python): Implement extension type and Schema metadata support [arrow-nanoarrow]

2024-04-26 Thread via GitHub


paleolimbot commented on code in PR #431:
URL: https://github.com/apache/arrow-nanoarrow/pull/431#discussion_r1581712127


##
python/src/nanoarrow/schema.py:
##
@@ -108,10 +115,73 @@ def create(obj):
 return TimeUnit(obj)
 
 
+class ExtensionAccessor:
+"""Accessor for extension type parameters"""
+
+def __init__(self, schema) -> None:
+self._schema = schema
+
+@property
+def name(self) -> str:
+"""Extension name for this extension type"""
+return self._schema._c_schema_view.extension_name
+
+@property
+def metadata(self) -> Union[bytes, None]:
+"""Extension metadata for this extension type if present"""
+extension_metadata = self._schema._c_schema_view.extension_metadata
+return extension_metadata if extension_metadata else None
+
+@property
+def storage(self):
+"""Storage type for this extension type"""
+metadata = dict(self._schema.metadata.items())
+del metadata[b"ARROW:extension:name"]
+if b"ARROW:extension:metadata" in metadata:
+del metadata[b"ARROW:extension:metadata"]

Review Comment:
   I added a comment! (Removing the metadata keys is required, or else the 
schema will still be treated as an extension).
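
A small sketch of why the keys have to go, using plain Python dicts rather than the nanoarrow API. The helper names and the `example.point` extension name are hypothetical. It relies only on the Arrow convention that a field counts as an extension type when its metadata carries the `ARROW:extension:name` key.

```python
EXTENSION_KEYS = (b"ARROW:extension:name", b"ARROW:extension:metadata")


def strip_extension_keys(metadata):
    """Return a copy of the metadata without the Arrow extension keys."""
    return {k: v for k, v in metadata.items() if k not in EXTENSION_KEYS}


def is_extension(metadata):
    """A field is treated as an extension type if the name key is present."""
    return b"ARROW:extension:name" in metadata


field_metadata = {
    b"ARROW:extension:name": b"example.point",  # hypothetical extension name
    b"ARROW:extension:metadata": b"{}",
    b"unit": b"m",  # unrelated user metadata that should survive
}

storage_metadata = strip_extension_keys(field_metadata)
print(is_extension(field_metadata), is_extension(storage_metadata))
```

Other metadata entries are carried over unchanged, so only the extension markers are dropped from the storage schema.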






Re: [PR] GH-41307: [Java] Use org.apache:apache parent pom version 31 [arrow]

2024-04-26 Thread via GitHub


laurentgo commented on PR #41309:
URL: https://github.com/apache/arrow/pull/41309#issuecomment-2080316679

   I believe I found the issue. Recent versions of `maven-jar-plugin` use an 
updated version of `plexus-archiver`, which rewrites the jar content to fix 
modification times if the jar contains a `module-info.class` file. But as the 
Java 11 `jar` command does not support the `--date` option, `plexus-archiver` 
implements a 
[fallback](https://github.com/codehaus-plexus/plexus-archiver/blob/8458ffacecb543c60dae8e618accbba96a6ff689/src/main/java/org/codehaus/plexus/archiver/jar/JarToolModularJarArchiver.java#L201)
 which relies on the `Files#createTempFile()` method. Unfortunately, the file 
created by this method has only user read/write permissions and no 
group/others permissions.
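
The permission behaviour can be observed with an analogous call in Python. This sketch does not touch Maven or `plexus-archiver`; it only assumes that `tempfile.mkstemp` behaves like the Java temp-file API on POSIX systems, creating files readable and writable by the owner only. The `temp_file_mode` helper is a hypothetical name.

```python
import os
import stat
import tempfile


def temp_file_mode():
    """Create a temporary file and return its permission bits."""
    fd, path = tempfile.mkstemp(suffix=".jar")
    try:
        return stat.S_IMODE(os.stat(path).st_mode)
    finally:
        os.close(fd)
        os.remove(path)


mode = temp_file_mode()
# Any bits set here would mean group or others have some access.
group_or_other_bits = mode & (stat.S_IRWXG | stat.S_IRWXO)
print(oct(mode), oct(group_or_other_bits))
```

If a file created this way replaces the original jar, the restrictive mode travels with it unless the permissions are reset explicitly afterwards.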





Re: [PR] GH-41358: [R] Support join "na_matches" argument [arrow]

2024-04-26 Thread via GitHub


conbench-apache-arrow[bot] commented on PR #41372:
URL: https://github.com/apache/arrow/pull/41372#issuecomment-2080315675

   After merging your PR, Conbench analyzed the 7 benchmarking runs that have 
been run so far on merge-commit ea314a3f8d9d4446836aa999b66659c07421f7a4.
   
   There were no benchmark performance regressions. 🎉
   
   The [full Conbench report](https://github.com/apache/arrow/runs/24322775693) 
has more details. It also includes information about 15 possible false 
positives for unstable benchmarks that are known to sometimes produce them.





Re: [I] Change default branch name from master to main [arrow-rs]

2024-04-26 Thread via GitHub


Jefffrey closed issue #3569: Change default branch name from master to main
URL: https://github.com/apache/arrow-rs/issues/3569





Re: [I] Change default branch name from master to main [arrow-rs]

2024-04-26 Thread via GitHub


Jefffrey commented on issue #3569:
URL: https://github.com/apache/arrow-rs/issues/3569#issuecomment-2080315554

   Closed as duplicate





[PR] GH-41397: n [R][CI] Windows job failing after R 4.4 release [arrow]

2024-04-26 Thread via GitHub


assignUser opened a new pull request, #41409:
URL: https://github.com/apache/arrow/pull/41409

   
   ### Rationale for this change
   
   We can't throw warnings on cran.
   
   ### What changes are included in this PR?
   
   Update function to match changes in libarrow
   
   ### Are these changes tested?
   
   CI
   
   ### Are there any user-facing changes?
   
   No
   





Re: [PR] GH-41397: n [R][CI] Windows job failing after R 4.4 release [arrow]

2024-04-26 Thread via GitHub


github-actions[bot] commented on PR #41409:
URL: https://github.com/apache/arrow/pull/41409#issuecomment-2080307237

   :warning: GitHub issue #41397 **has been automatically assigned in GitHub** 
to PR creator.





Re: [I] [R][CI] Windows job failing after R 4.4 release [arrow]

2024-04-26 Thread via GitHub


assignUser commented on issue #41398:
URL: https://github.com/apache/arrow/issues/41398#issuecomment-2080305862

   I'll give it a try





Re: [I] [C++] Add support for mold [arrow]

2024-04-26 Thread via GitHub


amoeba commented on issue #40394:
URL: https://github.com/apache/arrow/issues/40394#issuecomment-2080272148

   Done, see https://github.com/apache/arrow/issues/41408.





Re: [I] [C++] Add support for mold [arrow]

2024-04-26 Thread via GitHub


amoeba commented on issue #40394:
URL: https://github.com/apache/arrow/issues/40394#issuecomment-2080271217

   Yep, will do.





Re: [PR] GH-41367: [C++] Replace [[maybe_unused]] with Arrow macro [arrow]

2024-04-26 Thread via GitHub


github-actions[bot] commented on PR #41359:
URL: https://github.com/apache/arrow/pull/41359#issuecomment-2080269396

   Revision: bdf0bfff1332e4446005d485462e2c9a0088fa9c
   
   Submitted crossbow builds: [ursacomputing/crossbow @ actions-eb949ffd4d](https://github.com/ursacomputing/crossbow/branches/all?query=actions-eb949ffd4d)
   
   |Task|Status|
   |--|--|
   |preview-docs|[![GitHub Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-eb949ffd4d-github-preview-docs)](https://github.com/ursacomputing/crossbow/actions/runs/8855874267/job/24321419381)|





Re: [PR] GH-41367: [C++] Replace [[maybe_unused]] with Arrow macro [arrow]

2024-04-26 Thread via GitHub


kou commented on PR #41359:
URL: https://github.com/apache/arrow/pull/41359#issuecomment-2080268577

   @github-actions crossbow submit preview-docs





Re: [I] [C++] Add support for mold [arrow]

2024-04-26 Thread via GitHub


kou commented on issue #40394:
URL: https://github.com/apache/arrow/issues/40394#issuecomment-2080268067

   You're right.
   We should document this and LLD's one.
   We just forgot them...
   Could you open an issue for them?





Re: [I] `snappy_jll v1.2.0` lead to Arrow_jll failed to build [arrow-julia]

2024-04-26 Thread via GitHub


kongdd commented on issue #505:
URL: https://github.com/apache/arrow-julia/issues/505#issuecomment-2080265483

   
   
   
   
   > Arrow.jl is a Julia implementation of the Arrow format, and does not use 
Arrow_jll, so this issue can be closed.
   
   closed





Re: [I] [C++] Rename `Function::is_impure()` to `is_pure()` [arrow]

2024-04-26 Thread via GitHub


amoeba commented on issue #40607:
URL: https://github.com/apache/arrow/issues/40607#issuecomment-2080244437

   Hi @felipecrv, this looks like a breaking change to me. What do you think?





Re: [PR] fix(csharp/src/Apache.Arrow.Adbc): correct StandardSchemas.ColumnSchema data types [arrow-adbc]

2024-04-26 Thread via GitHub


birschick-bq commented on PR #1731:
URL: https://github.com/apache/arrow-adbc/pull/1731#issuecomment-2080244619

   > On the whole, I'm not excited about exposing two public and 
non-ADBC-specific functions from this assembly -- particularly without a 
stronger specification for them. Can we make them internal and then use 
InternalsVisibleTo? This is already being used in the C# Arrow implementation, 
so the build processes should support it.
   > 
   > It would also be nice to eliminate the large number of nullability 
warnings by either removing the nullability annotations or by putting the code 
into a `#nullable` block.
   
   RE: nullability annotations
   
   I believe we need a separate ticket for this. It will require hundreds of 
small changes to mark current non-nullable types as nullable. I'll create a new 
issue for this.





Re: [PR] fix(csharp/src/Apache.Arrow.Adbc): correct StandardSchemas.ColumnSchema data types [arrow-adbc]

2024-04-26 Thread via GitHub


birschick-bq commented on code in PR #1731:
URL: https://github.com/apache/arrow-adbc/pull/1731#discussion_r1581606785


##
csharp/src/Apache.Arrow.Adbc/Extensions/ListArrayExtensions.cs:
##
@@ -0,0 +1,156 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+using System;
+using System.Collections.Generic;
+using Apache.Arrow.Types;
+
+namespace Apache.Arrow.Adbc.Extensions
+{
+public static class ListArrayExtensions
+{
+/// 
+/// Creates a  from a list of  data for the given datatype .
+/// 
+/// The list of data.
+/// The data type of the contained data.
+/// A  of the data.
+public static ListArray CreateNestedListArray(this 
IReadOnlyList arrayList, IArrowType dataType)

Review Comment:
   Yes. I believe this should likely be included in the Arrow library. Not sure 
of its proper placement in the namespace yet.






Re: [I] [C++] Add support for mold [arrow]

2024-04-26 Thread via GitHub


amoeba commented on issue #40394:
URL: https://github.com/apache/arrow/issues/40394#issuecomment-2080236538

   Hi @kou, @assignUser: It seems like we should document this in the developer 
docs. Any reason not to?





Re: [PR] fix(csharp/src/Apache.Arrow.Adbc): correct StandardSchemas.ColumnSchema data types [arrow-adbc]

2024-04-26 Thread via GitHub


birschick-bq commented on code in PR #1731:
URL: https://github.com/apache/arrow-adbc/pull/1731#discussion_r1581606335


##
csharp/src/Apache.Arrow.Adbc/Extensions/StandardSchemaExtensions.cs:
##
@@ -0,0 +1,93 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+using System;
+using System.Collections.Generic;
+using System.Linq;
+using Apache.Arrow.Types;
+
+namespace Apache.Arrow.Adbc.Extensions
+{
+public static class StandardSchemaExtensions
+{
+
+/// 
+/// Validates a data array that its column number and types match a 
given schema.
+/// 
+/// The schema to validate against.
+/// The data array to validate.
+/// Throws an exception if the 
number of columns or type data types in the data array do not match the schema 
fields.
+public static IReadOnlyList Validate(this Schema schema, 
IReadOnlyList data)

Review Comment:
   I'd be happy to use the existing class, if it were exposed. I could also 
make this better and more rigorous, if you believe it might be used outside of 
its current context.






Re: [PR] fix(csharp/src/Apache.Arrow.Adbc): correct StandardSchemas.ColumnSchema data types [arrow-adbc]

2024-04-26 Thread via GitHub


birschick-bq commented on code in PR #1731:
URL: https://github.com/apache/arrow-adbc/pull/1731#discussion_r1581605494


##
csharp/src/Apache.Arrow.Adbc/Extensions/ListArrayExtensions.cs:
##
@@ -0,0 +1,156 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+using System;
+using System.Collections.Generic;
+using Apache.Arrow.Types;
+
+namespace Apache.Arrow.Adbc.Extensions
+{
+internal static class ListArrayExtensions

Review Comment:
   This extension is marked as `internal`.



##
csharp/src/Apache.Arrow.Adbc/Properties/AssemblyInfo.cs:
##
@@ -0,0 +1,20 @@
+// Licensed to the Apache Software Foundation (ASF) under one or more
+// contributor license agreements. See the NOTICE file distributed with
+// this work for additional information regarding copyright ownership.
+// The ASF licenses this file to You under the Apache License, Version 2.0
+// (the "License"); you may not use this file except in compliance with
+// the License.  You may obtain a copy of the License at
+//
+// http://www.apache.org/licenses/LICENSE-2.0
+//
+// Unless required by applicable law or agreed to in writing, software
+// distributed under the License is distributed on an "AS IS" BASIS,
+// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+// See the License for the specific language governing permissions and
+// limitations under the License.
+
+using System.Runtime.CompilerServices;
+
+[assembly: InternalsVisibleTo("Apache.Arrow.Adbc.Drivers.Apache, 
PublicKey=0024048094000602002452534131000401000100e504183f6d470d6b67b6d19212be3e1f598f70c246a120194bc38130101d0c1853e4a0f2232cb12e37a7a90e707aabd38511dac4f25fcb0d691b2aa265900bf42de7f70468fc997551a40e1e0679b605aa2088a4a69e07c117e988f5b1738c570ee66997fba02485e7856a49eca5fd0706d09899b8312577cbb9034599fc92d4")]

Review Comment:
   Expose internal interfaces to three libraries, assuming the current signing 
key.



##
csharp/src/Apache.Arrow.Adbc/Extensions/StandardSchemaExtensions.cs:
##
@@ -0,0 +1,93 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+using System;
+using System.Collections.Generic;
+using System.Linq;
+using Apache.Arrow.Types;
+
+namespace Apache.Arrow.Adbc.Extensions
+{
+internal static class StandardSchemaExtensions

Review Comment:
   This extension is marked as internal.



##
csharp/src/Apache.Arrow.Adbc/Extensions/ListArrayExtensions.cs:
##
@@ -0,0 +1,156 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+using System;
+using System.Collections.Generic;
+using Apache.Arrow.Types;
+
+namespace Apache.Arrow.Adbc.Extensions
+{
+public static class ListArrayExten

Re: [I] `snappy_jll v1.2.0` lead to Arrow_jll failed to build [arrow-julia]

2024-04-26 Thread via GitHub


Moelf commented on issue #505:
URL: https://github.com/apache/arrow-julia/issues/505#issuecomment-2080229853

   What is using Arrow_jll? Do we have a wrapper?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



Re: [PR] chore: wrap .env values in quotes to prevent parsing errors [arrow-adbc]

2024-04-26 Thread via GitHub


lidavidm commented on PR #1779:
URL: https://github.com/apache/arrow-adbc/pull/1779#issuecomment-2080202473

   The integration test pipeline sets everything by hand; I suppose it could 
source .env instead.
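
As a rough illustration of why quoting matters once a shell sources `.env`, the sketch below uses Python's `shlex` to approximate POSIX word splitting. The variable name `GREETING` and the helper `is_single_assignment` are made up for this example and are not taken from the PR; real `.env` loaders may apply different rules.

```python
import shlex


def is_single_assignment(line: str) -> bool:
    """Return True if the line splits into exactly one KEY=VALUE word."""
    words = shlex.split(line)
    return len(words) == 1 and "=" in words[0]


# Hypothetical .env lines (not from the repository).
unquoted = "GREETING=hello world"
quoted = 'GREETING="hello world"'

# Unquoted: the value is cut at the space, so a shell sourcing this line
# would try to run `world` as a command with GREETING=hello in its environment.
print(shlex.split(unquoted))

# Quoted: the whole value stays attached to the key.
print(shlex.split(quoted))
print(is_single_assignment(unquoted), is_single_assignment(quoted))
```

Under these assumptions, only the quoted form survives being sourced as a plain assignment.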





Re: [PR] fix(go/adbc/driver/flightsql): should use `ctx.Err().Error()` [arrow-adbc]

2024-04-26 Thread via GitHub


lidavidm commented on PR #1769:
URL: https://github.com/apache/arrow-adbc/pull/1769#issuecomment-2080201521

   @zeroshade there's something in the Flight SQL client that is treating 
io.EOF as an actual error, I think





Re: [I] [R] Support join "na_matches" argument [arrow]

2024-04-26 Thread via GitHub


nealrichardson commented on issue #41358:
URL: https://github.com/apache/arrow/issues/41358#issuecomment-2080196949

   Fixed by https://github.com/apache/arrow/pull/41372





Re: [PR] GH-41358: [R] Support join "na_matches" argument [arrow]

2024-04-26 Thread via GitHub


nealrichardson merged PR #41372:
URL: https://github.com/apache/arrow/pull/41372





Re: [I] [R][CI] Windows job failing after R 4.4 release [arrow]

2024-04-26 Thread via GitHub


nealrichardson commented on issue #41398:
URL: https://github.com/apache/arrow/issues/41398#issuecomment-2080195502

   The line in the C++ header in question was introduced here: 
https://github.com/apache/arrow/pull/39888/files#diff-dd3ca7b08e6c9982fcc1be7ea36c93a2d7998ed3ff6bab2ae1e0343ada0358d3R53





Re: [I] [R] CRAN packaging checklist for version 16.0.0 [arrow]

2024-04-26 Thread via GitHub


nealrichardson commented on issue #41404:
URL: https://github.com/apache/arrow/issues/41404#issuecomment-2080195010

   Re: #41398, I suspect we do need to fix it before releasing, unless we're 
not updating the libarrow binaries. It's raising on a line that was changed 
since 15.0: 
https://github.com/apache/arrow/pull/39888/files#diff-dd3ca7b08e6c9982fcc1be7ea36c93a2d7998ed3ff6bab2ae1e0343ada0358d3R53
   
   The reason we have `-Werror` on that CI job is because CRAN will cause 
problems if there are compiler warnings thrown, so we want to be alerted 
earlier.
   
   Re: the so-called non-API functions, I wonder if those are related to 
ALTREP, and if they'll go away once newer R-devel builds cycle through after 
the policy was clarified?





Re: [PR] GH-41358: [R] Support join "na_matches" argument [arrow]

2024-04-26 Thread via GitHub


jonkeane commented on code in PR #41372:
URL: https://github.com/apache/arrow/pull/41372#discussion_r1581582262


##
r/src/compute-exec.cpp:
##
@@ -411,10 +411,17 @@ std::shared_ptr ExecNode_Join(
 const std::shared_ptr& right_data,
 std::vector left_keys, std::vector right_keys,
 std::vector left_output, std::vector 
right_output,
-std::string output_suffix_for_left, std::string output_suffix_for_right) {
+std::string output_suffix_for_left, std::string output_suffix_for_right,
+bool na_matches) {
   std::vector left_refs, right_refs, left_out_refs, 
right_out_refs;
+  std::vector key_cmps;
   for (auto&& name : left_keys) {
 left_refs.emplace_back(std::move(name));
+// Populate key_cmps in this loop, one for each key
+// Note that Acero supports having different values for each key, but dplyr
+// only supports one value for all keys, so we're only going to support 
that
+// for now.
+key_cmps.emplace_back(na_matches ? acero::JoinKeyCmp::IS : 
acero::JoinKeyCmp::EQ);

Review Comment:
   So many ways of swapping and passing and evaluating booleans 😂 






Re: [I] [C++][R] R Sanitizer nightly error [arrow]

2024-04-26 Thread via GitHub


jonkeane commented on issue #41407:
URL: https://github.com/apache/arrow/issues/41407#issuecomment-2080182585

   I suspect this is also what is responsible for the 
test-fedora-r-clang-sanitizer failure:
   
https://github.com/ursacomputing/crossbow/actions/runs/8809030613/job/24179194857





Re: [PR] GH-41179: [Docs] Documentation for Dissociated IPC Protocol [arrow]

2024-04-26 Thread via GitHub


kou commented on code in PR #41180:
URL: https://github.com/apache/arrow/pull/41180#discussion_r1581577699


##
ci/docker/linux-apt-docs.dockerfile:
##
@@ -55,7 +57,10 @@ RUN apt-get update -y && \
 r-base=${r}* \
 rsync \
 ruby-dev \
-wget && \
+wget \
+libasound2-dev && \
+wget -q 
https://freeshell.de/phd/chromium/jammy/pool/latest/chromium_124.0.6367.60~linuxmint1+virginia_amd64.deb
 && \
+apt-get install -y ./chromium_124.0.6367.60~linuxmint1+virginia_amd64.deb 
&& \

Review Comment:
   
https://freeshell.de/phd/chromium/jammy/pool/latest/chromium_124.0.6367.60~linuxmint1+virginia_amd64.deb
 will eventually become invalid. Our CI will fail after that.






Re: [I] csharp/adbc/capi: CAdbcDriverImporter leaks memory [arrow-adbc]

2024-04-26 Thread via GitHub


CurtHagenlocher commented on issue #1303:
URL: https://github.com/apache/arrow-adbc/issues/1303#issuecomment-2080177247

   Retroactive duplicate of #1774 





Re: [I] [R] CRAN packaging checklist for version 16.0.0 [arrow]

2024-04-26 Thread via GitHub


jonkeane commented on issue #41404:
URL: https://github.com/apache/arrow/issues/41404#issuecomment-2080168636

   # Status of [current check 
results](https://cran.r-project.org/web/checks/check_results_arrow.html) 
   
   - [x] There are two errors on macOS, both from libtool. They are resolved 
by https://github.com/apache/arrow/pull/40259, which was not included in the 
last submission.
   - [ ] We have a number of notes about using `SETLENGTH` and `SET_TRUELENGTH`. 
IIUC, this is CRAN attempting to cut down on people using so-called non-API 
functions. I believe these are coming from vctrs or cpp11 (oddly, cpp11 checks 
are clean, even [though they definitely have 
those](https://github.com/r-lib/cpp11/blob/51f4cd5ad9425a491dedf951a3679346d416e51c/inst/include/cpp11/r_vector.hpp#L900-L901)).
   
   Note: we will need to figure out if we need to resolve #41398 before we can 
submit to CRAN. The current Windows check _is_ on 4.4.0, and it doesn't compile 
from source there anyway, so as long as #41398 is truly a warning -> error 
situation, we should be OK to ship without it. 





Re: [PR] GH-41358: [R] Support join "na_matches" argument [arrow]

2024-04-26 Thread via GitHub


nealrichardson commented on code in PR #41372:
URL: https://github.com/apache/arrow/pull/41372#discussion_r1581571725


##
r/src/compute-exec.cpp:
##
@@ -411,10 +411,17 @@ std::shared_ptr ExecNode_Join(
 const std::shared_ptr& right_data,
 std::vector left_keys, std::vector right_keys,
 std::vector left_output, std::vector 
right_output,
-std::string output_suffix_for_left, std::string output_suffix_for_right) {
+std::string output_suffix_for_left, std::string output_suffix_for_right,
+bool na_matches) {
   std::vector left_refs, right_refs, left_out_refs, 
right_out_refs;
+  std::vector key_cmps;
   for (auto&& name : left_keys) {
 left_refs.emplace_back(std::move(name));
+// Populate key_cmps in this loop, one for each key
+// Note that Acero supports having different values for each key, but dplyr
+// only supports one value for all keys, so we're only going to support 
that
+// for now.
+key_cmps.emplace_back(na_matches ? acero::JoinKeyCmp::IS : 
acero::JoinKeyCmp::EQ);

Review Comment:
   Correct






Re: [PR] GH-41405: [Release][Docs][GLib] Use Sphinx based GLib front page [arrow]

2024-04-26 Thread via GitHub


github-actions[bot] commented on PR #41406:
URL: https://github.com/apache/arrow/pull/41406#issuecomment-2080158797

   :warning: GitHub issue #41405 **has been automatically assigned in GitHub** 
to PR creator.





[PR] GH-41405: [Release][Docs][GLib] Use Sphinx based GLib front page [arrow]

2024-04-26 Thread via GitHub


kou opened a new pull request, #41406:
URL: https://github.com/apache/arrow/pull/41406

   ### Rationale for this change
   
   We should use the GLib front page generated by Sphinx.
   
   ### What changes are included in this PR?
   
   Stop reverting the GLib front page change in release script.
   
   ### Are these changes tested?
   
   No.
   
   ### Are there any user-facing changes?
   
   Yes.





Re: [I] [R] CRAN packaging checklist for version 16.0.0 [arrow]

2024-04-26 Thread via GitHub


jonkeane commented on issue #41404:
URL: https://github.com/apache/arrow/issues/41404#issuecomment-2080158134

   This release will be slightly different since it will include the change in 
maintainer. IIUC We'll need to do:
   
   > * Explain any change in the maintainer’s email address and if possible 
send confirmation from the previous address (by a separate email to 
cran-submissi...@r-project.org) or explain why it is not possible.
   > If the package needs special treatment (for example if vignettes can 
only be run or re-built on the maintainer’s machine or take a very long time), 
say so on the submission form.
   
   from https://cran.r-project.org/web/packages/policies.html#Submission-1





Re: [I] [Release][Docs][GLib] Use Sphinx based GLib front page [arrow]

2024-04-26 Thread via GitHub


kou commented on issue #41405:
URL: https://github.com/apache/arrow/issues/41405#issuecomment-2080157124

   The GLib front page: https://arrow.apache.org/docs/c_glib/





Re: [PR] feat(csharp/test): implement DuckDb test fixture [arrow-adbc]

2024-04-26 Thread via GitHub


CurtHagenlocher merged PR #1781:
URL: https://github.com/apache/arrow-adbc/pull/1781





Re: [PR] GH-41390: [CI] Use setup-python GitHub action on csharp macOS job [arrow]

2024-04-26 Thread via GitHub


conbench-apache-arrow[bot] commented on PR #41392:
URL: https://github.com/apache/arrow/pull/41392#issuecomment-2080148925

   After merging your PR, Conbench analyzed the 5 benchmarking runs that have 
been run so far on merge-commit 15986ae5ffef2f274c04cf0d5eec2155fe6523a6.
   
   There were no benchmark performance regressions. 🎉
   
   The [full Conbench report](https://github.com/apache/arrow/runs/24317778328) 
has more details. It also includes information about 16 possible false 
positives for unstable benchmarks that are known to sometimes produce them.





Re: [PR] GH-41358: [R] Support join "na_matches" argument [arrow]

2024-04-26 Thread via GitHub


jonkeane commented on code in PR #41372:
URL: https://github.com/apache/arrow/pull/41372#discussion_r1581558982


##
r/src/compute-exec.cpp:
##
@@ -411,10 +411,17 @@ std::shared_ptr ExecNode_Join(
 const std::shared_ptr& right_data,
 std::vector left_keys, std::vector right_keys,
 std::vector left_output, std::vector 
right_output,
-std::string output_suffix_for_left, std::string output_suffix_for_right) {
+std::string output_suffix_for_left, std::string output_suffix_for_right,
+bool na_matches) {
   std::vector left_refs, right_refs, left_out_refs, 
right_out_refs;
+  std::vector key_cmps;
   for (auto&& name : left_keys) {
 left_refs.emplace_back(std::move(name));
+// Populate key_cmps in this loop, one for each key
+// Note that Acero supports having different values for each key, but dplyr
+// only supports one value for all keys, so we're only going to support 
that
+// for now.
+key_cmps.emplace_back(na_matches ? acero::JoinKeyCmp::IS : 
acero::JoinKeyCmp::EQ);

Review Comment:
   Out of curiosity, and to make sure I'm following all of this: 
`acero::JoinKeyCmp::IS` ends up being used when `na_matches == "na"`, and 
therefore `NA`s will match. `acero::JoinKeyCmp::EQ` is used in all other cases 
(i.e. `na_matches == "never"`), and that uses equality where `NA`s _do not_ match.
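
The distinction described above can be sketched with a toy model in Python. This is not Arrow or Acero code: the `JoinKeyCmp` enum, `keys_match`, and `inner_join` below are hypothetical stand-ins that only mirror the names in the diff, with `None` playing the role of `NA`.

```python
from enum import Enum


class JoinKeyCmp(Enum):
    """Toy stand-in for the two comparison modes discussed in the review."""
    EQ = "eq"  # plain equality: NA never matches, not even another NA
    IS = "is"  # NA-aware comparison: NA matches NA


def keys_match(left, right, cmp):
    """Compare two join keys, treating None as NA."""
    if left is None or right is None:
        return cmp is JoinKeyCmp.IS and left is None and right is None
    return left == right


def inner_join(left_keys, right_keys, na_matches):
    """Return matching key pairs; one comparison mode applies to all keys."""
    cmp = JoinKeyCmp.IS if na_matches else JoinKeyCmp.EQ
    return [(l, r) for l in left_keys for r in right_keys if keys_match(l, r, cmp)]


left = [1, None]
right = [1, None, 2]

with_na = inner_join(left, right, na_matches=True)
without_na = inner_join(left, right, na_matches=False)
print(with_na)
print(without_na)
```

In this model the `na_matches=True` case keeps the `(None, None)` pair and the `na_matches=False` case drops it, which is the behavior the comment is asking to confirm.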






Re: [PR] GH-40339: [Java] StringView Initial Implementation [arrow]

2024-04-26 Thread via GitHub


kou commented on PR #40340:
URL: https://github.com/apache/arrow/pull/40340#issuecomment-2080135497

   https://github.com/apache/arrow/pull/41379





Re: [PR] GH-41402: [CI][R] Update our backwards compatibility CI any other R 4.4 cleanups [arrow]

2024-04-26 Thread via GitHub


github-actions[bot] commented on PR #41403:
URL: https://github.com/apache/arrow/pull/41403#issuecomment-2080133567

   Revision: f31f859ed8e74b02b8b57e39c2107e5f0ff16197
   
   Submitted crossbow builds: [ursacomputing/crossbow @ 
actions-802b06fdc8](https://github.com/ursacomputing/crossbow/branches/all?query=actions-802b06fdc8)
   
   |Task|Status|
   |----|------|
   |r-binary-packages|[![GitHub 
Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-802b06fdc8-github-r-binary-packages)](https://github.com/ursacomputing/crossbow/actions/runs/8854419894/job/24317420516)|
   |test-fedora-r-clang-sanitizer|[![GitHub 
Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-802b06fdc8-github-test-fedora-r-clang-sanitizer)](https://github.com/ursacomputing/crossbow/actions/runs/8854419771/job/24317405716)|
   |test-r-arrow-backwards-compatibility|[![GitHub 
Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-802b06fdc8-github-test-r-arrow-backwards-compatibility)](https://github.com/ursacomputing/crossbow/actions/runs/8854419991/job/24317406573)|
   
|test-r-depsource-bundled|[![Azure](https://dev.azure.com/ursacomputing/crossbow/_apis/build/status/ursacomputing.crossbow?branchName=actions-802b06fdc8-azure-test-r-depsource-bundled)](https://github.com/ursacomputing/crossbow/runs/24317407694)|
   |test-r-depsource-system|[![GitHub 
Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-802b06fdc8-github-test-r-depsource-system)](https://github.com/ursacomputing/crossbow/actions/runs/8854420303/job/24317407415)|
   |test-r-dev-duckdb|[![GitHub 
Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-802b06fdc8-github-test-r-dev-duckdb)](https://github.com/ursacomputing/crossbow/actions/runs/8854420092/job/24317407012)|
   |test-r-devdocs|[![GitHub 
Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-802b06fdc8-github-test-r-devdocs)](https://github.com/ursacomputing/crossbow/actions/runs/8854420026/job/24317407011)|
   |test-r-gcc-11|[![GitHub 
Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-802b06fdc8-github-test-r-gcc-11)](https://github.com/ursacomputing/crossbow/actions/runs/8854419869/job/24317405980)|
   |test-r-gcc-12|[![GitHub 
Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-802b06fdc8-github-test-r-gcc-12)](https://github.com/ursacomputing/crossbow/actions/runs/8854420094/job/24317406777)|
   |test-r-install-local|[![GitHub 
Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-802b06fdc8-github-test-r-install-local)](https://github.com/ursacomputing/crossbow/actions/runs/8854420213/job/24317407542)|
   |test-r-install-local-minsizerel|[![GitHub 
Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-802b06fdc8-github-test-r-install-local-minsizerel)](https://github.com/ursacomputing/crossbow/actions/runs/8854419963/job/24317407000)|
   |test-r-linux-as-cran|[![GitHub 
Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-802b06fdc8-github-test-r-linux-as-cran)](https://github.com/ursacomputing/crossbow/actions/runs/8854420379/job/24317408618)|
   |test-r-linux-rchk|[![GitHub 
Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-802b06fdc8-github-test-r-linux-rchk)](https://github.com/ursacomputing/crossbow/actions/runs/8854419657/job/24317405210)|
   |test-r-linux-valgrind|[![GitHub 
Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-802b06fdc8-github-test-r-linux-valgrind)](https://github.com/ursacomputing/crossbow/actions/runs/8854419701/job/24317405402)|
   
|test-r-minimal-build|[![Azure](https://dev.azure.com/ursacomputing/crossbow/_apis/build/status/ursacomputing.crossbow?branchName=actions-802b06fdc8-azure-test-r-minimal-build)](https://github.com/ursacomputing/crossbow/runs/24317407597)|
   |test-r-offline-maximal|[![GitHub 
Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-802b06fdc8-github-test-r-offline-maximal)](https://github.com/ursacomputing/crossbow/actions/runs/8854419814/job/24317405717)|
   
|test-r-offline-minimal|[![Azure](https://dev.azure.com/ursacomputing/crossbow/_apis/build/status/ursacomputing.crossbow?branchName=actions-802b06fdc8-azure-test-r-offline-minimal)](https://github.com/ursacomputing/crossbow/runs/24317407676)|
   
|test-r-rhub-debian-gcc-devel-lto-latest|[![Azure](https://dev.azure.com/ursacomputing/crossbow/_apis/b

Re: [I] [Ruby] Cannot get gem installed on Ubuntu 20 with arrow 14,15 and/or 16 [arrow]

2024-04-26 Thread via GitHub


kou commented on issue #41396:
URL: https://github.com/apache/arrow/issues/41396#issuecomment-2080132757

   I've released 16.0.0 gems.
   
   @andresbarcenas Could you try again?





Re: [I] `snappy_jll v1.2.0` lead to Arrow_jll failed to build [arrow-julia]

2024-04-26 Thread via GitHub


visr commented on issue #505:
URL: https://github.com/apache/arrow-julia/issues/505#issuecomment-2080131906

   Arrow.jl is a Julia implementation of the Arrow format, and does not use 
Arrow_jll, so this issue can be closed.





Re: [PR] GH-41402: [CI][R] Update our backwards compatibility CI any other R 4.4 cleanups [arrow]

2024-04-26 Thread via GitHub


jonkeane commented on PR #41403:
URL: https://github.com/apache/arrow/pull/41403#issuecomment-2080131215

   @github-actions crossbow submit -g r





Re: [I] [Ruby] Cannot get gem installed on Ubuntu 20 with arrow 14,15 and/or 16 [arrow]

2024-04-26 Thread via GitHub


kou commented on issue #41396:
URL: https://github.com/apache/arrow/issues/41396#issuecomment-2080129904

   @raulcd https://github.com/msys2/MINGW-packages/pull/20692 isn't merged yet, 
but I'll release the 16.0.0 gems anyway because there is a problem with the 
Apache Arrow C++ 16.0.0 and Red Arrow 15.0.x combination on other platforms. 
Releasing the 16.0.0 gems will break `gem install red-arrow` on Windows, but 
that will be fixed once 
https://github.com/msys2/MINGW-packages/pull/20692 is merged.





Re: [PR] move timezone support to extension [arrow-julia]

2024-04-26 Thread via GitHub


visr commented on PR #482:
URL: https://github.com/apache/arrow-julia/pull/482#issuecomment-2080119324

   Perhaps this is no longer needed when these TimeZones load time improvements 
are merged: https://github.com/JuliaTime/TimeZones.jl/pull/457.





Re: [PR] GH-41402: [CI] [R] Update our backwards compatibility CI any other R 4.4 cleanups [arrow]

2024-04-26 Thread via GitHub


github-actions[bot] commented on PR #41403:
URL: https://github.com/apache/arrow/pull/41403#issuecomment-2080115319

   Revision: 6b5dda3a484b12200480ceb601c4fd264f08e928
   
   Submitted crossbow builds: [ursacomputing/crossbow @ 
actions-3ae27aae8b](https://github.com/ursacomputing/crossbow/branches/all?query=actions-3ae27aae8b)
   
   |Task|Status|
   |----|------|
   |test-r-arrow-backwards-compatibility|[![GitHub 
Actions](https://github.com/ursacomputing/crossbow/actions/workflows/crossbow.yml/badge.svg?branch=actions-3ae27aae8b-github-test-r-arrow-backwards-compatibility)](https://github.com/ursacomputing/crossbow/actions/runs/8854226651/job/24316838627)|





Re: [PR] GH-40735: [Packaging][CentOS] Drop support for CentOS 7 [arrow]

2024-04-26 Thread via GitHub


kou commented on PR #41395:
URL: https://github.com/apache/arrow/pull/41395#issuecomment-2080114806

   FYI: manylinux uses AlmaLinux 8 for manylinux_2_28 (CentOS 7 for 
manylinux2014): 
https://github.com/pypa/manylinux/blob/main/README.rst#docker-images





Re: [PR] GH-40735: [Packaging][CentOS] Drop support for CentOS 7 [arrow]

2024-04-26 Thread via GitHub


kou commented on PR #41395:
URL: https://github.com/apache/arrow/pull/41395#issuecomment-2080113123

   > I am happy to migrate the `centos-cpp-static` from CentOS 7 to a newer CentOS on a different PR
   
   Let's work on this on a different PR. We may use AlmaLinux 8 for it.
   
   > but seem to be unused at the moment.
   
   They are used here:
   
https://github.com/apache/arrow/blob/15986ae5ffef2f274c04cf0d5eec2155fe6523a6/dev/tasks/r/github.packages.yml#L120-L147





Re: [PR] GH-41402: [CI] [R] Update our backwards compatibility CI any other R 4.4 cleanups [arrow]

2024-04-26 Thread via GitHub


jonkeane commented on PR #41403:
URL: https://github.com/apache/arrow/pull/41403#issuecomment-2080112825

   @github-actions crossbow submit test-r-arrow-backwards-compatibility





[PR] GH-41402: [CI] [R] Update our backwards compatibility CI any other R 4.4 cleanups [arrow]

2024-04-26 Thread via GitHub


jonkeane opened a new pull request, #41403:
URL: https://github.com/apache/arrow/pull/41403

   ### Rationale for this change
   
   Keep up with the state of the world and ensure we are maintaining backwards compatibility.
   
   ### What changes are included in this PR?
   
   Add missed versions to our backwards compatibility CI and ensure we're running on R 4.4 elsewhere.
   
   ### Are these changes tested?
   
   By definition.
   
   ### Are there any user-facing changes?
   
   No.





Re: [PR] GH-41402: [CI] [R] Update our backwards compatibility CI any other R 4.4 cleanups [arrow]

2024-04-26 Thread via GitHub


github-actions[bot] commented on PR #41403:
URL: https://github.com/apache/arrow/pull/41403#issuecomment-2080112385

   :warning: GitHub issue #41402 **has been automatically assigned in GitHub** 
to PR creator.





Re: [PR] GH-40735: [Packaging][CentOS] Drop support for CentOS 7 [arrow]

2024-04-26 Thread via GitHub


kou commented on PR #41395:
URL: https://github.com/apache/arrow/pull/41395#issuecomment-2080101740

   ```diff
   diff --git a/dev/tasks/linux-packages/apache-arrow/yum/arrow.spec.in 
b/dev/tasks/linux-packages/apache-arrow/yum/arrow.spec.in
   index 3ede1814b8..258759a1ec 100644
   --- a/dev/tasks/linux-packages/apache-arrow/yum/arrow.spec.in
   +++ b/dev/tasks/linux-packages/apache-arrow/yum/arrow.spec.in
   @@ -25,19 +25,8 @@
%define _rhel %{?rhel:%{rhel}}%{!?rhel:0}
%define is_rhel (%{_rhel} != 0)

   -%define is_centos_7 (%{_rhel} == 7 && !%{is_amazon_linux})
   -
%define major_version %(echo @VERSION@ | grep -o '^[0-9]*')

   -%define boost_version %( \
   -  if [ %{_rhel} -eq 7 ]; then \
   -echo 169; \
   -  fi)
   -%define cmake_version %( \
   -  if [ %{_rhel} -eq 7 ]; then \
   -echo 3; \
   -  fi)
   -
%define lz4_requirement %( \
  if [ %{_amzn} -eq 0 ]; then \
echo ">= 1.8.0"; \
   @@ -55,31 +44,17 @@
%define arrow_cmake_install DESTDIR="%{buildroot}" make -C 
%{arrow_cmake_builddir} install
%endif

   -%if %{is_centos_7}
   -%define gcc_package devtoolset-11-gcc
   -%else
   -%define gcc_package gcc
   -%endif
   -
   -%define use_flight (%{_rhel} >= 8 || %{_amzn} >= 2023)
   -%define use_gandiva (%{_rhel} >= 8 || %{_amzn} >= 2023)
   -%define use_gcs (%{_rhel} >= 8)
%define use_gflags (!%{is_amazon_linux})
## TODO: Enable this when glog stopped depending on gflags-devel.
# %%define use_glog (%%{_rhel} <= 8)
%define use_glog 0
   -%define use_mimalloc (%{_rhel} >= 8)
# TODO: Enable this. This works on local but is fragile on GitHub Actions 
and
# Travis CI.
# %%define use_s3 (%%{_rhel} >= 8)
%define use_s3 0
   -%define use_vala (%{_rhel} >= 8 || %{is_amazon_linux})

%define have_grpc (%{_amzn} >= 2023)
   -%define have_lz4_libs (%{_rhel} >= 8 || %{_amzn} >= 2023)
%define have_rapidjson (%{_rhel} != 8)
   -%define have_re2 (%{_rhel} >= 8 || %{_amzn} >= 2023)
   -%define have_thrift (%{_rhel} >= 8)
%define have_utf8proc (%{_rhel} >= 9 || %{_amzn} >= 2023)

%define enable_glib_doc (%{_rhel} >= 9 || %{is_amazon_linux})
   @@ -94,16 +69,14 @@ URL: https://arrow.apache.org/
Source0:
https://dist.apache.org/repos/dist/release/@PACKAGE@/@PACKAGE@-%{version}/apache-@PACKAGE@-%{version}.tar.gz

BuildRequires:  bison
   -BuildRequires:  boost%{boost_version}-devel
   +BuildRequires:  boost-devel
BuildRequires:  brotli-devel
BuildRequires:  bzip2-devel
   -%if %{use_flight}
BuildRequires:  c-ares-devel
   -%endif
   -BuildRequires:  cmake%{cmake_version}
   +BuildRequires:  cmake
BuildRequires:  curl-devel
BuildRequires:  flex
   -BuildRequires:  %{gcc_package}-c++
   +BuildRequires:  gcc-c++
%if %{use_gflags}
BuildRequires:  gflags-devel
%endif
   @@ -115,38 +88,27 @@ BuildRequires:  glog-devel
BuildRequires:  grpc-devel
BuildRequires:  grpc-plugins
%endif
   -%if %{use_gcs}
BuildRequires:  json-devel
   -%endif
BuildRequires:  libzstd-devel
   +BuildRequires:  llvm-devel
BuildRequires:  lz4-devel %{lz4_requirement}
   +BuildRequires:  ncurses-devel
BuildRequires:  ninja-build
BuildRequires:  openssl-devel
BuildRequires:  pkgconfig
%if %{have_rapidjson}
BuildRequires:  rapidjson-devel
%endif
   -%if %{have_re2}
BuildRequires:  re2-devel
   -%endif
BuildRequires:  snappy-devel
   -%if %{have_thrift}
BuildRequires:  thrift-devel
   -%endif
%if %{have_utf8proc}
BuildRequires:  utf8proc-devel
%endif
BuildRequires:  zlib-devel

   -%if %{use_gandiva}
   -BuildRequires:  llvm-devel
   -BuildRequires:  ncurses-devel
   -%endif
   -
BuildRequires:  gobject-introspection-devel
   -%if %{use_vala}
BuildRequires:  vala
   -%endif

%description
Apache Arrow is a data processing library for analysis.
   @@ -161,21 +123,13 @@ cd cpp
  -DARROW_BUILD_UTILITIES=ON \
  -DARROW_CSV=ON \
  -DARROW_DATASET=ON \
   -%if %{use_flight}
  -DARROW_FLIGHT=ON \
  -DARROW_FLIGHT_SQL=ON \
   -%endif
   -%if %{use_gandiva}
  -DARROW_GANDIVA=ON \
   -%endif
   -%if %{use_gcs}
  -DARROW_GCS=ON \
   -%endif
  -DARROW_HDFS=ON \
  -DARROW_JSON=ON \
   -%if %{use_mimalloc}
  -DARROW_MIMALLOC=ON \
   -%endif
  -DARROW_ORC=ON \
  -DARROW_PACKAGE_KIND=rpm \
  -DARROW_PARQUET=ON \
   @@ -200,12 +154,7 @@ cd c_glib
%if %{_amzn} >= 2023
  # Do nothing
%else
   -  %if %{is_centos_7}
   -# Meson 0.62.0 or later requires Python 3.7 or later.
   -pip3 install 'meson<0.62.0'
   -  %else
   -pip3 install meson
   -  %endif
   +  pip3 install meson
%endif
%if %{enable_glib_doc}
  pip3 install gi-docgen
   @@ -219,9 +168,7 @@ meson setup build \
%if %{enable_gli
   ```

Re: [PR] GH-40735: [Packaging][CentOS] Drop support for CentOS 7 [arrow]

2024-04-26 Thread via GitHub


kou commented on code in PR #41395:
URL: https://github.com/apache/arrow/pull/41395#discussion_r1581510792


##
dev/tasks/tasks.yml:
##
@@ -554,12 +554,9 @@ tasks:
   "almalinux-8",
   "amazon-linux-2023",
   "centos-9-stream",
-  "centos-8-stream",
-  "centos-7"] %}
-  {% set is_rhel7_based = (target in ["amazon-linux-2", "centos-7"]) %}
-  {% for architecture
-   in ["amd64", "arm64"]
-   if not (target == "centos-7" and architecture == "arm64") %}
+  "centos-8-stream"] %}
+  {% set is_rhel7_based = (target in ["amazon-linux-2"]) %}

Review Comment:
   Could you also remove `is_rhel7_based` entirely?
   `amazon-linux-2` isn't supported either.



##
docs/source/developers/continuous_integration/crossbow.rst:
##
@@ -222,7 +222,7 @@ Run only ``conda`` package builds and a Linux one:
 
 .. code::
 
-   $ archery crossbow submit --group conda centos-7
+   $ archery crossbow submit --group conda centos-9

Review Comment:
   `centos-9` doesn't exist...
   
   ```suggestion
  $ archery crossbow submit --group conda almalinux-9
   ```






Re: [PR] feat(csharp/test): implement DuckDb test fixture [arrow-adbc]

2024-04-26 Thread via GitHub


CurtHagenlocher commented on code in PR #1781:
URL: https://github.com/apache/arrow-adbc/pull/1781#discussion_r1581508617


##
csharp/test/Apache.Arrow.Adbc.Tests/ImportedDuckDbTests.cs:
##
@@ -0,0 +1,63 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements.  See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License.  You may obtain a copy of the License at
+ *
+ *http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+using Apache.Arrow.Types;
+using Xunit;
+
+namespace Apache.Arrow.Adbc.Tests
+{
+public class ImportedDuckDbTests : IClassFixture<DuckDbFixture>
+{
+readonly DuckDbFixture _duckDb;
+
+public ImportedDuckDbTests(DuckDbFixture duckDb)
+{
+_duckDb = duckDb;
+}
+
+[Fact]
+public void SimpleEndToEndTest()
+{
+using var database = _duckDb.OpenDatabase("test.db");
+using var connection = database.Connect(null);
+using var statement = connection.CreateStatement();
+
+statement.SqlQuery = "CREATE TABLE integers(foo INTEGER, bar INTEGER);";
+statement.ExecuteUpdate();
+
+statement.SqlQuery = "INSERT INTO integers VALUES (3, 4), (5, 6), (7, 8);";
+statement.ExecuteUpdate();
+
+statement.SqlQuery = "SELECT * from integers";
+var results = statement.ExecuteQuery();
+
+var schema = results.Stream.Schema;
+Assert.Equal(2, schema.FieldsList.Count);
+Assert.Equal(ArrowTypeId.Int32, schema.FieldsList[0].DataType.TypeId);
+Assert.Equal(ArrowTypeId.Int32, schema.FieldsList[1].DataType.TypeId);
+
+var firstBatch = results.Stream.ReadNextRecordBatchAsync().Result;
+Assert.Equal(3, firstBatch.Length);
+Assert.Equal(3, (firstBatch.Column(0) as Int32Array).Values[0]);
+Assert.Equal(5, (firstBatch.Column(0) as Int32Array).Values[1]);
+Assert.Equal(7, (firstBatch.Column(0) as Int32Array).Values[2]);
+
+
+var secondBatch = results.Stream.ReadNextRecordBatchAsync().Result;

Review Comment:
   oh, hey, I guess I didn't finish this :/






Re: [I] [CI][Packaging] Nightly test for centos-8-stream-arm64 times out installing arrow-libs [arrow]

2024-04-26 Thread via GitHub


kou commented on issue #33260:
URL: https://github.com/apache/arrow/issues/33260#issuecomment-2080083142

   We don't need to do anything for this.





[PR] feat(csharp/test): implement DuckDb test fixture [arrow-adbc]

2024-04-26 Thread via GitHub


CurtHagenlocher opened a new pull request, #1781:
URL: https://github.com/apache/arrow-adbc/pull/1781

   Adds support for running tests against DuckDb and implements a simple smoke 
test.





Re: [PR] feat(python): Implement extension type and Schema metadata support [arrow-nanoarrow]

2024-04-26 Thread via GitHub


danepitkin commented on code in PR #431:
URL: https://github.com/apache/arrow-nanoarrow/pull/431#discussion_r1581480452


##
python/src/nanoarrow/_lib.pyx:
##
@@ -967,10 +989,50 @@ cdef class CSchemaBuilder:
 if self._ptr.release == NULL:
 ArrowSchemaInit(self._ptr)
 
+@staticmethod
+def copy_existing(CSchema existing_schema):
+return CSchemaBuilder(existing_schema.__deepcopy__())

Review Comment:
   nit: I'd prefer a shorter name. WDYT? 
   ```suggestion
   def copy(CSchema schema):
   return CSchemaBuilder(schema.__deepcopy__())
   ```



##
python/src/nanoarrow/iterator.py:
##
@@ -160,6 +164,17 @@ def get_iterator(cls, obj, schema=None):
 yield from iterator._iter1(0, array.length)
 
 def _iter1(self, offset, length):
+# Check for and extension type first since this isn't reflected by

Review Comment:
   ```suggestion
   # Check for an extension type first since this isn't reflected by
   ```



##
python/src/nanoarrow/schema.py:
##
@@ -108,10 +115,73 @@ def create(obj):
 return TimeUnit(obj)
 
 
+class ExtensionAccessor:
+"""Accessor for extension type parameters"""
+
+def __init__(self, schema) -> None:
+self._schema = schema
+
+@property
+def name(self) -> str:
+"""Extension name for this extension type"""
+return self._schema._c_schema_view.extension_name
+
+@property
+def metadata(self) -> Union[bytes, None]:
+"""Extension metadata for this extension type if present"""
+extension_metadata = self._schema._c_schema_view.extension_metadata
+return extension_metadata if extension_metadata else None
+
+@property
+def storage(self):
+"""Storage type for this extension type"""
+metadata = dict(self._schema.metadata.items())
+del metadata[b"ARROW:extension:name"]
+if b"ARROW:extension:metadata" in metadata:
+del metadata[b"ARROW:extension:metadata"]

Review Comment:
   why do we delete the objects here?
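
One possible reading of the hunk above, offered as a sketch rather than the author's answer: in the Arrow format the `ARROW:extension:name` and `ARROW:extension:metadata` keys are what mark a field as an extension type, so a copy of the metadata without them describes the underlying storage type. The helper name `storage_metadata` is hypothetical and operates on a plain dict, not on nanoarrow objects.

```python
# Metadata keys that the Arrow format reserves for extension types.
EXTENSION_NAME_KEY = b"ARROW:extension:name"
EXTENSION_METADATA_KEY = b"ARROW:extension:metadata"


def storage_metadata(metadata):
    """Return a copy of the metadata without the extension-type keys.

    Hypothetical helper for illustration only; the input mapping is
    left unchanged.
    """
    reserved = (EXTENSION_NAME_KEY, EXTENSION_METADATA_KEY)
    return {key: value for key, value in metadata.items() if key not in reserved}


# Example: a schema tagged as an extension type plus one unrelated key.
tagged = {
    EXTENSION_NAME_KEY: b"arrow.uuid",
    b"some_key": b"some_value",
}
untagged = storage_metadata(tagged)
```

Building a filtered copy instead of calling `del` on a dict also avoids a `KeyError` when the name key is absent, although inside `storage` that key should presumably always be present.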



##
python/src/nanoarrow/schema.py:
##
@@ -108,10 +115,73 @@ def create(obj):
 return TimeUnit(obj)
 
 
+class ExtensionAccessor:
+"""Accessor for extension type parameters"""
+
+def __init__(self, schema) -> None:
+self._schema = schema
+
+@property
+def name(self) -> str:
+"""Extension name for this extension type"""
+return self._schema._c_schema_view.extension_name
+
+@property
+def metadata(self) -> Union[bytes, None]:
+"""Extension metadata for this extension type if present"""
+extension_metadata = self._schema._c_schema_view.extension_metadata
+return extension_metadata if extension_metadata else None
+
+@property
+def storage(self):
+"""Storage type for this extension type"""
+metadata = dict(self._schema.metadata.items())
+del metadata[b"ARROW:extension:name"]
+if b"ARROW:extension:metadata" in metadata:
+del metadata[b"ARROW:extension:metadata"]
+
+return Schema(self._schema, metadata=metadata)
+
+
 class Schema:
-"""The Schema is nanoarrow's high-level data type representation whose scope maps to
-that of the ArrowSchema in the Arrow C Data interface. See :func:`schema` for class
-details.
+"""Create a nanoarrow Schema
+
+The Schema is nanoarrow's high-level data type representation, encompasing

Review Comment:
   ```suggestion
   The Schema is nanoarrow's high-level data type representation, encompassing
   ```



##
python/src/nanoarrow/_lib.pyx:
##
@@ -967,10 +989,50 @@ cdef class CSchemaBuilder:
 if self._ptr.release == NULL:
 ArrowSchemaInit(self._ptr)
 
+@staticmethod
+def copy_existing(CSchema existing_schema):
+return CSchemaBuilder(existing_schema.__deepcopy__())
+
 @staticmethod
 def allocate():
 return CSchemaBuilder(CSchema.allocate())
 
+def clear_metadata(self):

Review Comment:
   Optional: What if we made this more object-oriented? E.g. `CSchemaBuilder.clear_metadata()` vs `CSchemaBuilder.metadata.clear()`. Same with `append_metadata`. Does it make sense to implement it this way? If not (or it's a lot of work), ignore this comment.
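
To make the two spellings concrete, here is a minimal pure-Python sketch of the accessor style. Both class names (`MetadataAccessor`, `SchemaBuilderSketch`) are hypothetical and are not part of nanoarrow; the real `CSchemaBuilder` is a Cython class wrapping an `ArrowSchema`, which this sketch does not model.

```python
class MetadataAccessor:
    """Hypothetical object that owns metadata edits for a builder."""

    def __init__(self):
        self._items = {}

    def clear(self):
        # Drop every key/value pair; return self to allow chaining.
        self._items.clear()
        return self

    def append(self, items):
        # Add or overwrite key/value pairs from a mapping.
        self._items.update(items)
        return self

    def items(self):
        # Return a copy so callers cannot mutate internal state.
        return dict(self._items)


class SchemaBuilderSketch:
    """Hypothetical stand-in for a builder exposing `.metadata`."""

    def __init__(self):
        self.metadata = MetadataAccessor()


builder = SchemaBuilderSketch()
builder.metadata.append({b"some_key": b"some_value"})
snapshot = builder.metadata.items()
builder.metadata.clear()
```

The trade-off is one extra object per builder in exchange for grouping the metadata operations under a single attribute.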






Re: [PR] GH-40078: [C++] Import/Export ArrowDeviceArrayStream [arrow]

2024-04-26 Thread via GitHub


zeroshade commented on PR #40807:
URL: https://github.com/apache/arrow/pull/40807#issuecomment-2080046370

   Anyone have any further comments here?
   
   





Re: [PR] ci: Add pipeline support to bundle Go binaries in NuGet packages [arrow-adbc]

2024-04-26 Thread via GitHub


CurtHagenlocher merged PR #1730:
URL: https://github.com/apache/arrow-adbc/pull/1730





Re: [PR] ci: Add pipeline support to bundle Go binaries in NuGet packages [arrow-adbc]

2024-04-26 Thread via GitHub


CurtHagenlocher commented on PR #1730:
URL: https://github.com/apache/arrow-adbc/pull/1730#issuecomment-2080040894

   Given what @davidhcoe says about switching to v4, I'm going to commit as-is.





Re: [I] Snowflake driver report invalid JWT token for the keypair auth with privateLink accountIdentifer [arrow-adbc]

2024-04-26 Thread via GitHub


zeroshade commented on issue #1777:
URL: https://github.com/apache/arrow-adbc/issues/1777#issuecomment-2080039850

   We are already using the most recent version of gosnowflake. ADBC v0.11.0 went out using gosnowflake v1.8.0, and the main branch is using v1.9.0.
   
   The Snowflake ADBC driver hasn't used gosnowflake v1.6.22 since ADBC v0.9.0.





Re: [I] Snowflake driver report invalid JWT token for the keypair auth with privateLink accountIdentifer [arrow-adbc]

2024-04-26 Thread via GitHub


kiyoungoh commented on issue #1777:
URL: https://github.com/apache/arrow-adbc/issues/1777#issuecomment-2080019213

   I believe the combination mentioned was just a workaround for the issue caused by the old version 1.6.22 of gosnowflake. Can we consider updating the gosnowflake dependency to a more recent version that has already fixed the issue?
   
   I appreciate the suggested workaround, but I believe moving to a more recent version of gosnowflake that already fixes the issue under the hood would be more efficient and beneficial.





Re: [PR] GH-41386: [Docs] Document Arrow Go Avro read support [arrow]

2024-04-26 Thread via GitHub


conbench-apache-arrow[bot] commented on PR #41393:
URL: https://github.com/apache/arrow/pull/41393#issuecomment-2079998546

   After merging your PR, Conbench analyzed the 5 benchmarking runs that have 
been run so far on merge-commit 0f56339ee803858b597418aefdabb993def19f48.
   
   There were no benchmark performance regressions. 🎉
   
   The [full Conbench report](https://github.com/apache/arrow/runs/24313803524) 
has more details. It also includes information about 30 possible false 
positives for unstable benchmarks that are known to sometimes produce them.





Re: [I] Snowflake driver report invalid JWT token for the keypair auth with privateLink accountIdentifer [arrow-adbc]

2024-04-26 Thread via GitHub


zeroshade commented on issue #1777:
URL: https://github.com/apache/arrow-adbc/issues/1777#issuecomment-2079993072

   Since we don't control the Azure ADF UI, you would have to talk to them 
about being able to try the fix that @davlee1972 mentioned if the UI doesn't 
provide the ability to try that combination.





Re: [PR] GH-15058: [C++][Python] Native support for UUID [arrow]

2024-04-26 Thread via GitHub


rok commented on code in PR #37298:
URL: https://github.com/apache/arrow/pull/37298#discussion_r1581448620


##
cpp/src/arrow/extension/CMakeLists.txt:
##
@@ -15,10 +15,16 @@
 # specific language governing permissions and limitations
 # under the License.
 
+set(CANONICAL_EXTENSION_TESTS uuid_test.cc)
+
+if(ARROW_JSON)

Review Comment:
   See my reply below.






Re: [PR] GH-15058: [C++][Python] Native support for UUID [arrow]

2024-04-26 Thread via GitHub


rok commented on code in PR #37298:
URL: https://github.com/apache/arrow/pull/37298#discussion_r1581445564


##
cpp/src/arrow/extension_type.cc:
##
@@ -145,12 +146,17 @@ static void CreateGlobalRegistry() {
   g_registry = std::make_shared<ExtensionTypeRegistryImpl>();
 
 #ifdef ARROW_JSON

Review Comment:
   `fixed_shape_tensor` uses rapidjson to serialize and deserialize the type's metadata (see `FixedShapeTensorType::Serialize` and `FixedShapeTensorType::Deserialize`), so we need `ARROW_JSON` to build it. UUID has no metadata to serialize, so it doesn't require JSON. It's annoying that some builds get all extensions and others only a subset, but I don't have a better idea.
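
The resulting behaviour can be pictured with a small Python sketch. This is only an illustration of the build-time split described above, not Arrow code: `create_registry` and its `json_enabled` flag are hypothetical, and the string values stand in for the real type objects.

```python
def create_registry(json_enabled):
    """Return the extension types a build would register (illustrative only)."""
    registry = {}
    # UUID carries no serialized metadata, so it is registered unconditionally.
    registry["arrow.uuid"] = "UuidType"
    if json_enabled:
        # fixed_shape_tensor (de)serializes its metadata as JSON, so it is
        # only available when JSON support is compiled in.
        registry["arrow.fixed_shape_tensor"] = "FixedShapeTensorType"
    return registry


with_json = create_registry(json_enabled=True)
without_json = create_registry(json_enabled=False)
```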






Re: [PR] GH-41358: [R] Support join "na_matches" argument [arrow]

2024-04-26 Thread via GitHub


nealrichardson commented on PR #41372:
URL: https://github.com/apache/arrow/pull/41372#issuecomment-2079983661

   CI failure appears unrelated: https://github.com/apache/arrow/issues/41398





Re: [I] Snowflake driver report invalid JWT token for the keypair auth with privateLink accountIdentifer [arrow-adbc]

2024-04-26 Thread via GitHub


kiyoungoh commented on issue #1777:
URL: https://github.com/apache/arrow-adbc/issues/1777#issuecomment-2079977533

   Was referring to Azure ADF UI as @ruowan shared above, and it relies on the 
driver to construct the host URL.





Re: [PR] GH-41334: [C++][Acero] Add env var to tune the size of the temp stack [arrow]

2024-04-26 Thread via GitHub


mapleFU commented on code in PR #41335:
URL: https://github.com/apache/arrow/pull/41335#discussion_r1581424142


##
cpp/src/arrow/acero/query_context.cc:
##
@@ -23,6 +23,36 @@ namespace arrow {
 using arrow::internal::CpuInfo;
 namespace acero {
 
+namespace internal {
+
+int64_t GetTempStackSizeFromEnvVar() {
+  auto maybe_env_value = arrow::internal::GetEnvVar(kTempStackSizeEnvVar);
+  if (!maybe_env_value.ok()) {
+return kDefaultTempStackSize;
+  }
+  auto env_value = *std::move(maybe_env_value);
+  if (env_value.empty()) {
+return kDefaultTempStackSize;
+  }
+
+  int64_t temp_stack_size = 0;
+  size_t length = 0;
+  bool exception = false;
+  try {
+temp_stack_size = std::stoll(env_value.c_str(), &length);
+  } catch (const std::exception&) {
+exception = true;
+  }
+  if (length != env_value.length() || exception || temp_stack_size <= 0) {
+ARROW_LOG(WARNING) << "Invalid temp stack size provided in " << kTempStackSizeEnvVar
+   << ". Using default temp stack size: " << kDefaultTempStackSize;
+return kDefaultTempStackSize;
+  }
+  return temp_stack_size;

Review Comment:
   Would that fit better in debug mode only?
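
For readers skimming the hunk above, the fallback logic can be sketched in Python. This is an illustration under stated assumptions, not a port: the environment variable name and default value below are placeholders (the values behind `kTempStackSizeEnvVar` and `kDefaultTempStackSize` are not shown in this thread), and the digit check is stricter than `std::stoll`, which also accepts leading whitespace and a sign.

```python
import logging
import os

# Hypothetical stand-ins for kTempStackSizeEnvVar / kDefaultTempStackSize.
TEMP_STACK_SIZE_ENV_VAR = "EXAMPLE_TEMP_STACK_SIZE"
DEFAULT_TEMP_STACK_SIZE = 64 * 1024


def get_temp_stack_size_from_env(environ=os.environ):
    """Return the configured size, or the default if unset or invalid."""
    raw = environ.get(TEMP_STACK_SIZE_ENV_VAR, "")
    if raw == "":
        # Unset or empty: silently use the default, as in the C++ hunk.
        return DEFAULT_TEMP_STACK_SIZE

    # Accept only a string made entirely of ASCII digits, so trailing
    # garbage such as "12x" is rejected like the length check in the hunk.
    value = int(raw) if (raw.isascii() and raw.isdigit()) else 0
    if value <= 0:
        logging.warning(
            "Invalid temp stack size provided in %s. "
            "Using default temp stack size: %d",
            TEMP_STACK_SIZE_ENV_VAR,
            DEFAULT_TEMP_STACK_SIZE,
        )
        return DEFAULT_TEMP_STACK_SIZE
    return value
```

Passing the environment as a parameter keeps the function easy to exercise without touching the real process environment.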






Re: [I] Snowflake driver report invalid JWT token for the keypair auth with privateLink accountIdentifer [arrow-adbc]

2024-04-26 Thread via GitHub


zeroshade commented on issue #1777:
URL: https://github.com/apache/arrow-adbc/issues/1777#issuecomment-2079963796

   Does our C# binding provide a UI? Or is that something external to the ADBC project?





Re: [I] Snowflake driver report invalid JWT token for the keypair auth with privateLink accountIdentifer [arrow-adbc]

2024-04-26 Thread via GitHub


kiyoungoh commented on issue #1777:
URL: https://github.com/apache/arrow-adbc/issues/1777#issuecomment-2079961188

   @zeroshade Thanks for checking but from the UI, we don't have control to try 
the combination mentioned.





Re: [I] Snowflake driver report invalid JWT token for the keypair auth with privateLink accountIdentifer [arrow-adbc]

2024-04-26 Thread via GitHub


zeroshade commented on issue #1777:
URL: https://github.com/apache/arrow-adbc/issues/1777#issuecomment-2079950427

   @kiyoungoh were you able to try the approach that @davlee1972 mentioned in https://github.com/apache/arrow-adbc/issues/1777#issuecomment-2078612172, which he said worked?





Re: [I] Snowflake driver report invalid JWT token for the keypair auth with privateLink accountIdentifer [arrow-adbc]

2024-04-26 Thread via GitHub


kiyoungoh commented on issue #1777:
URL: https://github.com/apache/arrow-adbc/issues/1777#issuecomment-2079928027

   The team owning the driver was not able to repro the issue with their latest 
version and closed the ticket.
   
   https://github.com/snowflakedb/gosnowflake/issues/1033





Re: [I] Snowflake driver report invalid JWT token for the keypair auth with privateLink accountIdentifer [arrow-adbc]

2024-04-26 Thread via GitHub


kiyoungoh commented on issue #1777:
URL: https://github.com/apache/arrow-adbc/issues/1777#issuecomment-2079919293

   Wasn't the issue fixed with 1.6.25? 
   
   
https://docs.snowflake.com/en/release-notes/clients-drivers/golang-2023#:~:text=Fixed%20an%20issue%20with%20OCSP%20fallback%20requests%20in%20PrivateLink%20environments.
   
   
![image](https://github.com/apache/arrow-adbc/assets/68621713/787c5a9f-8c8d-41f7-b42b-e0dce57c5fc5)
   
   
https://github.com/apache/arrow-adbc/issues/1422#:~:text=Returns%20with%20these%20errors%3A
   
![image](https://github.com/apache/arrow-adbc/assets/68621713/4aa5f7bc-56b5-496a-8517-667b7d20b17b)
   
   Can we test with 1.6.25 or any version above and see if that helps?
   
   





Re: [PR] Support casting `StringArray`/`BinaryArray` --> `StringView` / `BinaryView` [arrow-rs]

2024-04-26 Thread via GitHub


alamb commented on PR #5686:
URL: https://github.com/apache/arrow-rs/pull/5686#issuecomment-2079858373

   🚀 





Re: [PR] Support casting `StringArray`/`BinaryArray` --> `StringView` / `BinaryView` [arrow-rs]

2024-04-26 Thread via GitHub


alamb merged PR #5686:
URL: https://github.com/apache/arrow-rs/pull/5686





Re: [PR] GH-41367: [C++] Replace [[maybe_unused]] with Arrow macro [arrow]

2024-04-26 Thread via GitHub


WillAyd commented on code in PR #41359:
URL: https://github.com/apache/arrow/pull/41359#discussion_r1581355489


##
cpp/src/arrow/device.h:
##
@@ -150,9 +150,10 @@ class ARROW_EXPORT Device : public std::enable_shared_from_this<Device>,
   /// @param release_fn a function to call during destruction, `nullptr` or
   ///   a no-op function can be passed to indicate ownership is maintained
   ///   externally
-  virtual Result<std::shared_ptr<Stream>> WrapStream(
-      [[maybe_unused]] void* device_stream,
-      [[maybe_unused]] Stream::release_fn_t release_fn) {
+  virtual Result<std::shared_ptr<Stream>> WrapStream(void* device_stream,
+                                                     Stream::release_fn_t release_fn) {
+    ARROW_UNUSED(device_stream);
+    ARROW_UNUSED(release_fn);

Review Comment:
   Just pushed up this change






Re: [I] Snowflake driver report invalid JWT token for the keypair auth with privateLink accountIdentifer [arrow-adbc]

2024-04-26 Thread via GitHub


kiyoungoh commented on issue #1777:
URL: https://github.com/apache/arrow-adbc/issues/1777#issuecomment-2079817423

   Hi Team, do we have any ETA to fix the issue? Truly appreciate your time and 
consideration in advance.





Re: [I] [C++] Enable using the GCS+GRPC plugin with Arrow [arrow]

2024-04-26 Thread via GitHub


benibus commented on issue #40254:
URL: https://github.com/apache/arrow/issues/40254#issuecomment-2079807518

   This is tangential, but upgrading gRPC to v1.61.0 or later would also allow 
us to use the new [OpenTelemetry 
plugin](https://github.com/grpc/grpc/blob/b361574a0b647b81129ed33749aaa74585acd779/include/grpcpp/ext/otel_plugin.h).
 I think we could handle the version bump(s) in a standalone PR first - 
assuming we wouldn't need to target a lower version.





Re: [I] csharp: imported drivers can leak transient objects on errors [arrow-adbc]

2024-04-26 Thread via GitHub


CurtHagenlocher commented on issue #1774:
URL: https://github.com/apache/arrow-adbc/issues/1774#issuecomment-2079785707

   Closed by #1776 





Re: [PR] GH-41375: [C#] Move to .NET 8.0 [arrow]

2024-04-26 Thread via GitHub


conbench-apache-arrow[bot] commented on PR #41376:
URL: https://github.com/apache/arrow/pull/41376#issuecomment-2079785458

   After merging your PR, Conbench analyzed the 5 benchmarking runs that have 
been run so far on merge-commit 2710626b234d5e387a3c63988ca5899c70547dcf.
   
   There were no benchmark performance regressions. 🎉
   
   The [full Conbench report](https://github.com/apache/arrow/runs/24309514782) 
has more details. It also includes information about 2 possible false positives 
for unstable benchmarks that are known to sometimes produce them.





Re: [PR] fix(csharp): imported drivers have the potential for a lot of memory leaks [arrow-adbc]

2024-04-26 Thread via GitHub


CurtHagenlocher merged PR #1776:
URL: https://github.com/apache/arrow-adbc/pull/1776





Re: [PR] GH-41282: [Dev] Always prompt next major version on merge script if it exists [arrow]

2024-04-26 Thread via GitHub


conbench-apache-arrow[bot] commented on PR #41305:
URL: https://github.com/apache/arrow/pull/41305#issuecomment-2079780742

   After merging your PR, Conbench analyzed the 7 benchmarking runs that have 
been run so far on merge-commit 6cbdec55b6bbaee3f00a42541bf89998452a3718.
   
   There were no benchmark performance regressions. 🎉
   
   The [full Conbench report](https://github.com/apache/arrow/runs/24309396491) 
has more details. It also includes information about 19 possible false 
positives for unstable benchmarks that are known to sometimes produce them.





Re: [I] [CI] macOS 11 C# job fails to install python dependencies [arrow]

2024-04-26 Thread via GitHub


raulcd commented on issue #41390:
URL: https://github.com/apache/arrow/issues/41390#issuecomment-2079777844

   Issue resolved by pull request 41392
   https://github.com/apache/arrow/pull/41392




