[ https://issues.apache.org/jira/browse/ARROW-4050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16722611#comment-16722611 ]
Antonio Cavallo commented on ARROW-4050:
----------------------------------------
Hmm, could this be related to _generated_version.py (e.g. version =
'0.11.1.dev407+g8c413036.d20181217')?
$ lldb -- python -c "import pyarrow.parquet as pq; pq.read_table('bug.parquet')"
(lldb) process launch
thread #1: tid = 0x1f93bb, 0x000000010bfcc60f libparquet.12.dylib`boost::re_detail_106900::cpp_regex_traits_implementation<char>::transform(char const*, char const*) const + 47, queue = 'com.apple.main-thread', stop reason = EXC_BAD_ACCESS (code=1, address=0x0)
* frame #0: 0x000000010bfcc60f libparquet.12.dylib`boost::re_detail_106900::cpp_regex_traits_implementation<char>::transform(char const*, char const*) const + 47
  frame #1: 0x000000010bfcc047 libparquet.12.dylib`unsigned int boost::re_detail_106900::find_sort_syntax<boost::re_detail_106900::cpp_regex_traits_implementation<char>, char>(boost::re_detail_106900::cpp_regex_traits_implementation<char> const*, char*) + 55
  frame #2: 0x000000010bfcbe06 libparquet.12.dylib`boost::re_detail_106900::cpp_regex_traits_implementation<char>::init() + 1174
  frame #3: 0x000000010bfcb911 libparquet.12.dylib`boost::re_detail_106900::cpp_regex_traits_implementation<char>::cpp_regex_traits_implementation(boost::re_detail_106900::cpp_regex_traits_base<char> const&) + 145
  frame #4: 0x000000010bfcb28f libparquet.12.dylib`boost::object_cache<boost::re_detail_106900::cpp_regex_traits_base<char>, boost::re_detail_106900::cpp_regex_traits_implementation<char> >::do_get(boost::re_detail_106900::cpp_regex_traits_base<char> const&, unsigned long) + 287
  frame #5: 0x000000010bfcb06f libparquet.12.dylib`boost::object_cache<boost::re_detail_106900::cpp_regex_traits_base<char>, boost::re_detail_106900::cpp_regex_traits_implementation<char> >::get(boost::re_detail_106900::cpp_regex_traits_base<char> const&, unsigned long) + 63
  frame #6: 0x000000010bfcaf6e libparquet.12.dylib`boost::re_detail_106900::regex_data<char, boost::regex_traits<char, boost::cpp_regex_traits<char> > >::regex_data() + 110
  frame #7: 0x000000010bfcacd4 libparquet.12.dylib`boost::basic_regex<char, boost::regex_traits<char, boost::cpp_regex_traits<char> > >::do_assign(char const*, char const*, unsigned int) + 180
  frame #8: 0x000000010bfc2de7 libparquet.12.dylib`parquet::ApplicationVersion::ApplicationVersion(std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > const&) + 119
  frame #9: 0x000000010bfc6956 libparquet.12.dylib`parquet::FileMetaData::FileMetaDataImpl::FileMetaDataImpl(unsigned char const*, unsigned int*) + 390
  frame #10: 0x000000010bfc24ef libparquet.12.dylib`parquet::FileMetaData::Make(unsigned char const*, unsigned int*) + 63
  frame #11: 0x000000010bfbdc47 libparquet.12.dylib`parquet::SerializedFile::ParseMetaData() + 391
  frame #12: 0x000000010bfbdec1 libparquet.12.dylib`parquet::ParquetFileReader::Open(std::__1::unique_ptr<parquet::RandomAccessSource, std::__1::default_delete<parquet::RandomAccessSource> >, parquet::ReaderProperties const&, std::__1::shared_ptr<parquet::FileMetaData> const&) + 145
  frame #13: 0x000000010bf48e78 libparquet.12.dylib`parquet::arrow::OpenFile(std::__1::shared_ptr<arrow::io::RandomAccessFile> const&, arrow::MemoryPool*, parquet::ReaderProperties const&, std::__1::shared_ptr<parquet::FileMetaData> const&, std::__1::unique_ptr<parquet::arrow::FileReader, std::__1::default_delete<parquet::arrow::FileReader> >*) + 88
  frame #14: 0x000000010e1ba4d4 _parquet.cpython-37m-darwin.so`__pyx_pw_7pyarrow_8_parquet_13ParquetReader_3open(_object*, _object*, _object*) + 564
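
Frames #8 and #9 show the crash happening while FileMetaDataImpl constructs a parquet::ApplicationVersion from a string, which as far as I can tell is the file's created_by field. To see what that string looks like without going through libparquet (which crashes), here is a rough sketch in plain Python. It assumes only the standard Parquet layout, where the file ends with the Thrift-encoded footer, a 4-byte little-endian footer length and the magic PAR1. The helper names are made up for illustration, and the 'version' filter is just a heuristic similar to running `strings` on the footer, not a real Thrift decoder.

```python
import re
import struct


def read_footer_bytes(path):
    """Return the raw (Thrift-encoded) footer of a Parquet file."""
    with open(path, "rb") as f:
        f.seek(-8, 2)  # last 8 bytes: footer length + magic
        footer_len, magic = struct.unpack("<I4s", f.read(8))
        if magic != b"PAR1":
            raise ValueError("missing PAR1 magic at end of file")
        f.seek(-(8 + footer_len), 2)
        return f.read(footer_len)


def printable_runs(data, min_len=6):
    """List runs of printable ASCII, similar to the `strings` tool."""
    pattern = rb"[ -~]{%d,}" % min_len
    return [m.decode("ascii") for m in re.findall(pattern, data)]


def version_candidates(path):
    """Heuristic: printable footer strings that mention 'version'."""
    footer = read_footer_bytes(path)
    return [s for s in printable_runs(footer) if "version" in s]
```

Calling version_candidates('bug.parquet') should list whatever version-like strings are stored in the footer. A match may carry a stray leading or trailing character from the surrounding Thrift encoding, and a column name containing 'version' would show up as well.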
> core dump on reading parquet file
> ---------------------------------
>
> Key: ARROW-4050
> URL: https://issues.apache.org/jira/browse/ARROW-4050
> Project: Apache Arrow
> Issue Type: Bug
> Components: C++, Python
> Reporter: Antonio Cavallo
> Priority: Blocker
> Attachments: bug.parquet
>
>
> Hi,
> I get a crash when doing this:
> {{import pyarrow.parquet as pq}}
> {{pq.read_table('bug.parquet')}}
> [^bug.parquet]
> (this is the same file as the one generated by
> arrow/python/pyarrow/tests/test_parquet.py(112)test_single_pylist_column_roundtrip())