[GitHub] [arrow-adbc] mystic-lama opened a new pull request, #50: Updated relative path for parent pom
mystic-lama opened a new pull request, #50: URL: https://github.com/apache/arrow-adbc/pull/50 Updated relative path for parent pom in validation module. This fixes the following build error [ERROR] Non-resolvable parent POM for org.apache.arrow.adbc:adbc-driver-validation:9.0.0-SNAPSHOT: Could not find artifact org.apache.arrow.adbc:arrow-adbc-java-root:pom:9.0.0-SNAPSHOT and 'parent.relativePath' points at wrong local POM @ line 14, column 11 -> [Help 2] -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Created] (ARROW-17268) [C++] JSON kernels
David Li created ARROW-17268: Summary: [C++] JSON kernels Key: ARROW-17268 URL: https://issues.apache.org/jira/browse/ARROW-17268 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: David Li As discussed on dev@: https://lists.apache.org/thread/onzgogx2c2djxs0wbhmvqp2dbx7kjf6o "[ARROW-17255] Logical JSON type in Arrow" It would be interesting to have JSON parsing/serializing compute functions that operate on columns of (stringified) JSON records. For parsing, the problem is we need to know the output schema without being able to look at the data, so we would probably only be able to decode into a {{map[string, union]}} type at best. And/or we could offer "extraction" functions akin to what things like SQLite and Postgres provide (at the cost of having to reparse the JSON over and over). Also see ARROW-17255 for a logical JSON type. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (ARROW-17267) Failure loading arrow_dataset_jni library on Mac M1
Alexander Cai created ARROW-17267: - Summary: Failure loading arrow_dataset_jni library on Mac M1 Key: ARROW-17267 URL: https://issues.apache.org/jira/browse/ARROW-17267 Project: Apache Arrow Issue Type: Bug Components: Java Affects Versions: 8.0.0 Environment: Mac M1, 16 GB memory Reporter: Alexander Cai On a Mac M1, using Java 11 and Maven, I was initially trying to load a Parquet file using FileSystemDatasetFactory. However, when calling NativeMemoryPool.getDefault(), it throws an UnsatisfiedLinkError: {code:java} java.lang.UnsatisfiedLinkError: Can't load library: /var/folders/c9/h03f8v4s037gvwlfgs98b_y0gp/T/jnilib-18262759266549290103.tmp at java.lang.ClassLoader.loadLibrary (ClassLoader.java:2633) at java.lang.Runtime.load0 (Runtime.java:768) at java.lang.System.load (System.java:1837) at org.apache.arrow.dataset.jni.JniLoader.load (JniLoader.java:88) at org.apache.arrow.dataset.jni.JniLoader.loadRemaining (JniLoader.java:73) at org.apache.arrow.dataset.jni.JniLoader.ensureLoaded (JniLoader.java:60) at org.apache.arrow.dataset.jni.NativeMemoryPool. (NativeMemoryPool.java:27) at com.example.App.main (App.java:8) at jdk.internal.reflect.NativeMethodAccessorImpl.invoke0 (Native Method) at jdk.internal.reflect.NativeMethodAccessorImpl.invoke (NativeMethodAccessorImpl.java:62) at jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke (DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke (Method.java:566) at org.codehaus.mojo.exec.ExecJavaMojo$1.run (ExecJavaMojo.java:282) at java.lang.Thread.run (Thread.java:829){code} I've created a minimum reproducible example [here|https://gist.github.com/adzcai/7cb1b9316d8529cf2c0ffe19f0901adc], where the only line of code is {noformat}NativeMemoryPool.getDefault(){noformat}. At the call to {noformat}org.apache.arrow.dataset.jni.JniLoader.load{noformat} in the stack trace above, "arrow_dataset_jni" is passed as the name, and "libarrow_dataset_jni.dylib" is the library name, which it successfully locates. However, this works fine in Java 8. Running the same code in an EC2 container on Java 11 also works fine, so I'm fairly certain this is an M1 issue. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (ARROW-17266) [Doc] Java nightlies file prefix changed
Rok Mihevc created ARROW-17266: -- Summary: [Doc] Java nightlies file prefix changed Key: ARROW-17266 URL: https://issues.apache.org/jira/browse/ARROW-17266 Project: Apache Arrow Issue Type: Improvement Components: Documentation, Java Reporter: Rok Mihevc As per [Arrow docs|https://arrow.apache.org/docs/dev/java/install.html#installing-manually] Java nightlies are at: [https://github.com/ursacomputing/crossbow/releases/tag/nightly-2022-03-19-0-github-java-jars] However file prefix changed and new url format is: [https://github.com/ursacomputing/crossbow/releases/tag/nightly-packaging-2022-07-30-0-github-java-jars] Since it's hard to search github for old releases it would be good to change the url in the docs. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (ARROW-17265) build python lib failed on both X86 and ARMv8
chendan created ARROW-17265: --- Summary: build python lib failed on both X86 and ARMv8 Key: ARROW-17265 URL: https://issues.apache.org/jira/browse/ARROW-17265 Project: Apache Arrow Issue Type: Bug Components: Python Affects Versions: 8.0.0, 2.0.0 Environment: os: centos 7.9 CPU: X86_64 Reporter: chendan Attachments: CMakeOutput.log I want to build pyarrow lib in ARM platform. I download pyarrow source code version 8.0.0 and run "python setup.py install". An error occur: Using ld linker Configured for RELEASE build (set with cmake -DCMAKE_BUILD_TYPE=\{release,debug,...}) -- Build Type: RELEASE -- Generator: Unix Makefiles -- Build output directory: /root/build/pyarrow-8.0.0/build/temp.linux-x86_64-3.6/release -- Found Python3: /root/anaconda3/envs/py36test/bin/python (found version "3.6.13") found components: Interpreter Development.Module NumPy -- Found Python3Alt: /root/anaconda3/envs/py36test/bin/python -- Found PkgConfig: /usr/bin/pkg-config (found version "0.27.1") -- Could NOT find Arrow (missing: Arrow_DIR) -- Checking for module 'arrow' -- No package 'arrow' found CMake Error at /usr/local/share/cmake-3.23/Modules/FindPackageHandleStandardArgs.cmake:230 (message): Could NOT find Arrow (missing: ARROW_INCLUDE_DIR ARROW_LIB_DIR ARROW_FULL_SO_VERSION ARROW_SO_VERSION) Call Stack (most recent call first): /usr/local/share/cmake-3.23/Modules/FindPackageHandleStandardArgs.cmake:594 (_FPHSA_FAILURE_MESSAGE) cmake_modules/FindArrow.cmake:450 (find_package_handle_standard_args) cmake_modules/FindArrowPython.cmake:46 (find_package) CMakeLists.txt:231 (find_package) -- Configuring incomplete, errors occurred! This error always occur no matter what version I choose(pyarrow 8.0.0 or 2.0.0) and no matter what platform (X86 or ARM c complier) I choose. When I downloaded arrow source code and enter python folder and run "python setup.py install" the same error occur. It seems that it's an bug on cmake files. I could not build python lib for my ARM plarform. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Created] (ARROW-17264) [Go] Function group by on table
Francisco Garcia created ARROW-17264: Summary: [Go] Function group by on table Key: ARROW-17264 URL: https://issues.apache.org/jira/browse/ARROW-17264 Project: Apache Arrow Issue Type: Wish Components: Go Affects Versions: 8.0.1 Reporter: Francisco Garcia I'm trying to find some way to group data in Apache Arrow with golang, but I couldn't do it. There's a way to do this or it is only implemented in cpp and python. Are there plans to implement this on future releases? -- This message was sent by Atlassian Jira (v8.20.10#820010)