[
https://issues.apache.org/jira/browse/ARROW-12103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yibo Cai resolved ARROW-12103.
------------------------------
Resolution: Fixed
Issue resolved by pull request 9835
[https://github.com/apache/arrow/pull/9835]
> [C++] "load of misaligned address" in Parquet reader
> ----------------------------------------------------
>
> Key: ARROW-12103
> URL: https://issues.apache.org/jira/browse/ARROW-12103
> Project: Apache Arrow
> Issue Type: Bug
> Components: C++
> Reporter: David Li
> Assignee: Antoine Pitrou
> Priority: Major
> Labels: pull-request-available
> Fix For: 4.0.0
>
> Attachments: testthat.out
>
> Time Spent: 50m
> Remaining Estimate: 0h
>
> When running the R sanitizer job, errors like these are reported:
> {noformat}
> /arrow/cpp/src/arrow/util/bpacking_avx2_generated.h:278:41: runtime error:
> load of misaligned address 0x7fb0a9c0e11d for type 'const uint32_t', which
> requires 4 byte alignment
> 0x7fb0a9c0e11d: note: pointer points here
> 40 01 05 09 00 04 31 48 31 e1 a0 a4 16 63 cd 3d 18 e5 9a 87 d6 67 2f 16
> 00 00 00 00 00 00 00 00
> ^
> #0 0x7fb1786fa536 in arrow::internal::unpack32_avx2(unsigned int const*,
> unsigned int*, int, int)
> (/usr/local/RDsan/lib/R/site-library/arrow/libs/arrow.so+0x17fa2536)
> #1 0x7fb1768bfc03 in arrow::internal::unpack32(unsigned int const*,
> unsigned int*, int, int)
> (/usr/local/RDsan/lib/R/site-library/arrow/libs/arrow.so+0x16167c03)
> #2 0x7fb1721b37ae in int arrow::BitUtil::BitReader::GetBatch<int>(int,
> int*, int)
> (/usr/local/RDsan/lib/R/site-library/arrow/libs/arrow.so+0x11a5b7ae)
> #3 0x7fb1722d270f in int
> arrow::util::RleDecoder::GetBatchWithDict<double>(double const*, int,
> double*, int)
> (/usr/local/RDsan/lib/R/site-library/arrow/libs/arrow.so+0x11b7a70f)
> #4 0x7fb17202a35d in virtual thunk to parquet::(anonymous
> namespace)::DictDecoderImpl<parquet::PhysicalType<(parquet::Type::type)5>
> >::DecodeSpaced(double*, int, int, unsigned char const*, long)
> (/usr/local/RDsan/lib/R/site-library/arrow/libs/arrow.so+0x118d235d)
> {noformat}
> The full log:
> [^testthat.out]
> cpuinfo:
> {noformat}
> processor : 15
> vendor_id : GenuineIntel
> cpu family : 6
> model : 165
> model name : Intel(R) Core(TM) i9-10885H CPU @ 2.40GHz
> stepping : 2
> microcode : 0xe2
> cpu MHz : 852.969
> cache size : 16384 KB
> physical id : 0
> siblings : 16
> core id : 7
> cpu cores : 8
> apicid : 15
> initial apicid : 15
> fpu : yes
> fpu_exception : yes
> cpuid level : 22
> wp : yes
> flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov
> pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb
> rdtscp lm constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology
> nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx smx est
> tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt
> tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm 3dnowprefetch
> cpuid_fault epb invpcid_single ssbd ibrs ibpb stibp ibrs_enhanced tpr_shadow
> vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 avx2 smep bmi2
> erms invpcid mpx rdseed adx smap clflushopt intel_pt xsaveopt xsavec xgetbv1
> xsaves dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp pku
> ospke md_clear flush_l1d arch_capabilities
> bugs : spectre_v1 spectre_v2 spec_store_bypass swapgs itlb_multihit
> bogomips : 4800.00
> clflush size : 64
> cache_alignment : 64
> address sizes : 39 bits physical, 48 bits virtual
> power management:
> {noformat}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)