zhangyue19921010 opened a new issue, #13018:
URL: https://github.com/apache/arrow/issues/13018
Hi teams,
I am using arrow 7.0.0 dataset api, trying to reading parquet through arrow,
but I meet follow exception sometimes.
Could any one help me out? really appreciate.
```
val factory = HoodieArrowUtils.getDatasetFactory(file.filePath)
val dataset = factory.finish(ArrowUtils.toArrowSchema(requiredSchema,
HoodieArrowUtils.getLocalTimezoneID()))
val scanOptions = new ScanOptions(requiredSchema.map(f =>
f.name).toArray, batchSize)
// JVM crash during new Scan.
val scanner = dataset.newScan(scanOptions)
```
#
# A fatal error has been detected by the Java Runtime Environment:
#
# SIGSEGV (0xb) at pc=0x000000012b911671, pid=21962, tid=0x0000000000014a03
#
# JRE version: Java(TM) SE Runtime Environment (8.0_221-b11) (build
1.8.0_221-b11)
# Java VM: Java HotSpot(TM) 64-Bit Server VM (25.221-b11 mixed mode
bsd-amd64 compressed oops)
# Problematic frame:
# C [jnilib-6445651763812351716.tmp+0x96671]
arrow::dataset::internal::InitializeDatasetWriter(arrow::compute::ExecFactoryRegistry*)+0x51
#
# Failed to write core dump. Core dumps have been disabled. To enable core
dumping, try "ulimit -c unlimited" before starting Java again
#
# An error report file with more information is saved as:
# /Users/yuezhang/workproject/HudiTest/hs_err_pid21962.log
#
# If you would like to submit a bug report, please visit:
# http://bugreport.java.com/bugreport/crash.jsp
# The crash happened outside the Java Virtual Machine in native code.
# See problematic frame for where to report the bug.
#
Disconnected from the target VM, address: '127.0.0.1:60389', transport:
'socket'
Process finished with exit code 134 (interrupted by signal 6: SIGABRT)
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]