[GitHub] [arrow-adbc] mystic-lama opened a new pull request, #50: Updated relative path for parent pom

2022-07-30 Thread GitBox


mystic-lama opened a new pull request, #50:
URL: https://github.com/apache/arrow-adbc/pull/50

   Updated relative path for parent pom in validation module. This fixes the 
following build error
   
   [ERROR] Non-resolvable parent POM for 
org.apache.arrow.adbc:adbc-driver-validation:9.0.0-SNAPSHOT: Could not find 
artifact org.apache.arrow.adbc:arrow-adbc-java-root:pom:9.0.0-SNAPSHOT and 
'parent.relativePath' points at wrong local POM @ line 14, column 11 -> [Help 2]
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



[jira] [Created] (ARROW-17268) [C++] JSON kernels

2022-07-30 Thread David Li (Jira)
David Li created ARROW-17268:


 Summary: [C++] JSON kernels
 Key: ARROW-17268
 URL: https://issues.apache.org/jira/browse/ARROW-17268
 Project: Apache Arrow
  Issue Type: Improvement
  Components: C++
Reporter: David Li


As discussed on dev@: 
https://lists.apache.org/thread/onzgogx2c2djxs0wbhmvqp2dbx7kjf6o "[ARROW-17255] 
Logical JSON type in Arrow"

It would be interesting to have JSON parsing/serializing compute functions that 
operate on columns of (stringified) JSON records. For parsing, the problem is 
we need to know the output schema without being able to look at the data, so we 
would probably only be able to decode into a {{map[string, union]}} type at 
best. And/or we could offer "extraction" functions akin to what things like 
SQLite and Postgres provide (at the cost of having to reparse the JSON over and 
over).

Also see ARROW-17255 for a logical JSON type.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17267) Failure loading arrow_dataset_jni library on Mac M1

2022-07-30 Thread Alexander Cai (Jira)
Alexander Cai created ARROW-17267:
-

 Summary: Failure loading arrow_dataset_jni library on Mac M1
 Key: ARROW-17267
 URL: https://issues.apache.org/jira/browse/ARROW-17267
 Project: Apache Arrow
  Issue Type: Bug
  Components: Java
Affects Versions: 8.0.0
 Environment: Mac M1, 16 GB memory
Reporter: Alexander Cai


On a Mac M1, using Java 11 and Maven, I was initially trying to load a Parquet 
file using FileSystemDatasetFactory. However, when calling 
NativeMemoryPool.getDefault(), it throws an UnsatisfiedLinkError:

{code:java}
java.lang.UnsatisfiedLinkError: Can't load library: 
/var/folders/c9/h03f8v4s037gvwlfgs98b_y0gp/T/jnilib-18262759266549290103.tmp
    at java.lang.ClassLoader.loadLibrary (ClassLoader.java:2633)
    at java.lang.Runtime.load0 (Runtime.java:768)
    at java.lang.System.load (System.java:1837)
    at org.apache.arrow.dataset.jni.JniLoader.load (JniLoader.java:88)
    at org.apache.arrow.dataset.jni.JniLoader.loadRemaining (JniLoader.java:73)
    at org.apache.arrow.dataset.jni.JniLoader.ensureLoaded (JniLoader.java:60)
    at org.apache.arrow.dataset.jni.NativeMemoryPool. 
(NativeMemoryPool.java:27)
    at com.example.App.main (App.java:8)
    at jdk.internal.reflect.NativeMethodAccessorImpl.invoke0 (Native Method)
    at jdk.internal.reflect.NativeMethodAccessorImpl.invoke 
(NativeMethodAccessorImpl.java:62)
    at jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke 
(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke (Method.java:566)
    at org.codehaus.mojo.exec.ExecJavaMojo$1.run (ExecJavaMojo.java:282)
    at java.lang.Thread.run (Thread.java:829){code}

I've created a minimum reproducible example 
[here|https://gist.github.com/adzcai/7cb1b9316d8529cf2c0ffe19f0901adc], where 
the only line of code is {noformat}NativeMemoryPool.getDefault(){noformat}. At 
the call to {noformat}org.apache.arrow.dataset.jni.JniLoader.load{noformat} in 
the stack trace above, "arrow_dataset_jni" is passed as the name, and 
"libarrow_dataset_jni.dylib" is the library name, which it successfully locates.

However, this works fine in Java 8. Running the same code in an EC2 container 
on Java 11 also works fine, so I'm fairly certain this is an M1 issue.


 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17266) [Doc] Java nightlies file prefix changed

2022-07-30 Thread Rok Mihevc (Jira)
Rok Mihevc created ARROW-17266:
--

 Summary: [Doc] Java nightlies file prefix changed
 Key: ARROW-17266
 URL: https://issues.apache.org/jira/browse/ARROW-17266
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Documentation, Java
Reporter: Rok Mihevc


As per [Arrow 
docs|https://arrow.apache.org/docs/dev/java/install.html#installing-manually] 
Java nightlies are at: 
[https://github.com/ursacomputing/crossbow/releases/tag/nightly-2022-03-19-0-github-java-jars]

However file prefix changed and new url format is:
[https://github.com/ursacomputing/crossbow/releases/tag/nightly-packaging-2022-07-30-0-github-java-jars]

Since it's hard to search github for old releases it would be good to change 
the url in the docs.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17265) build python lib failed on both X86 and ARMv8

2022-07-30 Thread chendan (Jira)
chendan created ARROW-17265:
---

 Summary: build python lib failed on both X86 and ARMv8
 Key: ARROW-17265
 URL: https://issues.apache.org/jira/browse/ARROW-17265
 Project: Apache Arrow
  Issue Type: Bug
  Components: Python
Affects Versions: 8.0.0, 2.0.0
 Environment: os: centos 7.9
CPU: X86_64
Reporter: chendan
 Attachments: CMakeOutput.log

I want to build pyarrow lib in ARM platform. I download pyarrow source code 
version 8.0.0 and run "python setup.py install". An error occur:

Using ld linker
Configured for RELEASE build (set with cmake 
-DCMAKE_BUILD_TYPE=\{release,debug,...})
-- Build Type: RELEASE
-- Generator: Unix Makefiles
-- Build output directory: 
/root/build/pyarrow-8.0.0/build/temp.linux-x86_64-3.6/release
-- Found Python3: /root/anaconda3/envs/py36test/bin/python (found version 
"3.6.13") found components: Interpreter Development.Module NumPy 
-- Found Python3Alt: /root/anaconda3/envs/py36test/bin/python  
-- Found PkgConfig: /usr/bin/pkg-config (found version "0.27.1") 
-- Could NOT find Arrow (missing: Arrow_DIR)
-- Checking for module 'arrow'
--   No package 'arrow' found
CMake Error at 
/usr/local/share/cmake-3.23/Modules/FindPackageHandleStandardArgs.cmake:230 
(message):
  Could NOT find Arrow (missing: ARROW_INCLUDE_DIR ARROW_LIB_DIR
  ARROW_FULL_SO_VERSION ARROW_SO_VERSION)
Call Stack (most recent call first):
  /usr/local/share/cmake-3.23/Modules/FindPackageHandleStandardArgs.cmake:594 
(_FPHSA_FAILURE_MESSAGE)
  cmake_modules/FindArrow.cmake:450 (find_package_handle_standard_args)
  cmake_modules/FindArrowPython.cmake:46 (find_package)
  CMakeLists.txt:231 (find_package)


-- Configuring incomplete, errors occurred!

 

This error always occur no matter what version I choose(pyarrow 8.0.0 or 2.0.0) 
and no matter what platform (X86 or ARM c complier) I choose. When I downloaded 
arrow source code and enter python folder and run "python setup.py install" the 
same error occur.

It seems that it's an bug on cmake files. I could not build python lib for my 
ARM plarform.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Created] (ARROW-17264) [Go] Function group by on table

2022-07-30 Thread Francisco Garcia (Jira)
Francisco Garcia created ARROW-17264:


 Summary: [Go] Function group by on table
 Key: ARROW-17264
 URL: https://issues.apache.org/jira/browse/ARROW-17264
 Project: Apache Arrow
  Issue Type: Wish
  Components: Go
Affects Versions: 8.0.1
Reporter: Francisco Garcia


I'm trying to find some way to group data in Apache Arrow with golang, but I 
couldn't do it. There's a way to do this or it is only implemented in cpp and 
python.

Are there plans to implement this on future releases?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)