[
https://issues.apache.org/jira/browse/ARROW-15486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17490586#comment-17490586
]
David Dali Susanibar Arce commented on ARROW-15486:
---------------------------------------------------
Testing jni dataset library: Downloading from snapshots:
[https://repository.apache.org/content/repositories/staging/org/apache/arrow/arrow-memory/7.0.0/org/apache/arrow/arrow-dataset/7.0.0/arrow-dataset-7.0.0.jar]
MacOS Big Sur - 11.5.2 - JDK 8: OK
otool -L libarrow_dataset_jni.dylib
libarrow_dataset_jni.dylib:
@rpath/libarrow_dataset_jni.700.dylib (compatibility version 700.0.0,
current version 700.0.0)
/usr/lib/libz.1.dylib (compatibility version 1.0.0, current version 1.2.11)
/usr/lib/libc++.1.dylib (compatibility version 1.0.0, current version
1200.3.0)
/usr/lib/libSystem.B.dylib (compatibility version 1.0.0, current version
1311.0.0)
Ubuntu 20.04.3 LTS - JDK 11: OK
Evidence:
https://github.com/apache/arrow-cookbook/runs/5149652103?check_suite_focus=true
ldd libarrow_dataset_jni.so
linux-vdso.so.1 (0x00007fff259ac000)
libdl.so.2 => /lib/x86_64-linux-gnu/libdl.so.2 (0x00007f3968d13000)
librt.so.1 => /lib/x86_64-linux-gnu/librt.so.1 (0x00007f3968d08000)
libstdc++.so.6 => /lib/x86_64-linux-gnu/libstdc++.so.6
(0x00007f3968b26000)
libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007f39689d7000)
libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1
(0x00007f39689bc000)
libpthread.so.0 => /lib/x86_64-linux-gnu/libpthread.so.0
(0x00007f3968997000)
libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007f39687a5000)
/lib64/ld-linux-x86-64.so.2 (0x00007f396ac7a000)
Testing Code:
{code:java}
.. testcode::
import org.apache.arrow.dataset.file.FileFormat;
import org.apache.arrow.dataset.file.FileSystemDatasetFactory;
import org.apache.arrow.dataset.jni.NativeMemoryPool;
import org.apache.arrow.dataset.source.DatasetFactory;
import org.apache.arrow.memory.RootAllocator;
import org.apache.arrow.vector.types.pojo.Schema;
import org.apache.arrow.util.AutoCloseables;
String uri = "file:" + System.getProperty("user.dir") +
"/thirdpartydeps/parquetfiles/data1.parquet";
RootAllocator rootAllocator = new RootAllocator(Long.MAX_VALUE);
DatasetFactory datasetFactory = new FileSystemDatasetFactory(rootAllocator,
NativeMemoryPool.getDefault(), FileFormat.PARQUET, uri);
Schema schema = datasetFactory.inspect();
AutoCloseables.close(datasetFactory);
System.out.println(schema);
.. testoutput::
Schema<id: Int(32, true), name: Utf8>(metadata:
{parquet.avro.schema={"type":"record","name":"User","namespace":"org.apache.arrow.dataset","fields":[{"name":"id","type":["int","null"]},{"name":"name","type":["string","null"]}]},
writer.model.name=avro}) {code}
{color:#FF0000}Pending{color}: Testing JNI C Data Interface (I need to learn
more about that library to test that)
Question: Is there some reason to not be able to download arrow flight for
staging
https://repository.apache.org/content/repositories/staging/org/apache/arrow/arrow-flight/
?
> [Relase][Java] Verify staged maven artifacts
> --------------------------------------------
>
> Key: ARROW-15486
> URL: https://issues.apache.org/jira/browse/ARROW-15486
> Project: Apache Arrow
> Issue Type: Improvement
> Reporter: Krisztian Szucs
> Priority: Major
> Labels: pull-request-available
>
> We have two tests right now:
> 1. Execute {{mvn test}} from the source tarball's java directory testing the
> source
> https://github.com/apache/arrow/blob/master/dev/release/verify-release-candidate.sh#L278
> 2. Verify the checksums and signatures of the uploaded maven artifacts
> https://github.com/apache/arrow/blob/master/dev/release/verify-release-candidate.sh#L766
> But we don't actually *test* the packages. We should add that to the
> verification scripts, since 7.0 is going to be the first release shipping the
> jars with bundled JNI libraries.
> cc [~kou] [~anthonylouis]
--
This message was sent by Atlassian Jira
(v8.20.1#820001)