[ 
https://issues.apache.org/jira/browse/ARROW-4050?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16722611#comment-16722611
 ] 

Antonio Cavallo commented on ARROW-4050:
----------------------------------------

Hmm, might it have to do with _generated_version.py
(e.g. version = '0.11.1.dev407+g8c413036.d20181217')?
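
One way to look at that string in isolation is to check whether it at least starts with a plain major.minor.patch prefix. The snippet below is only a sketch: it uses Python's re module rather than Boost.Regex, and both the pattern and the helper name split_version are made up for illustration; this is not the expression libparquet uses.

```python
import re

# Hypothetical pattern: a numeric "major.minor.patch" prefix followed by anything.
# This is NOT the expression libparquet uses; it only illustrates the check.
VERSION_PREFIX = re.compile(r"^(\d+)\.(\d+)\.(\d+)(.*)$")


def split_version(version):
    """Return (major, minor, patch, rest), or None if there is no numeric prefix."""
    match = VERSION_PREFIX.match(version)
    if match is None:
        return None
    major, minor, patch, rest = match.groups()
    return int(major), int(minor), int(patch), rest


if __name__ == "__main__":
    # The dev version string quoted in this comment.
    print(split_version("0.11.1.dev407+g8c413036.d20181217"))
```

If the prefix splits cleanly, the ".dev407+g..." suffix is the only unusual part left to suspect.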

$ lldb -- python -c "import pyarrow.parquet as pq; pq.read_table('bug.parquet')"
(lldb) process launch

thread #1: tid = 0x1f93bb, 0x000000010bfcc60f libparquet.12.dylib`boost::re_detail_106900::cpp_regex_traits_implementation<char>::transform(char const*, char const*) const + 47, queue = 'com.apple.main-thread', stop reason = EXC_BAD_ACCESS (code=1, address=0x0)
 * frame #0: 0x000000010bfcc60f libparquet.12.dylib`boost::re_detail_106900::cpp_regex_traits_implementation<char>::transform(char const*, char const*) const + 47
 frame #1: 0x000000010bfcc047 libparquet.12.dylib`unsigned int boost::re_detail_106900::find_sort_syntax<boost::re_detail_106900::cpp_regex_traits_implementation<char>, char>(boost::re_detail_106900::cpp_regex_traits_implementation<char> const*, char*) + 55
 frame #2: 0x000000010bfcbe06 libparquet.12.dylib`boost::re_detail_106900::cpp_regex_traits_implementation<char>::init() + 1174
 frame #3: 0x000000010bfcb911 libparquet.12.dylib`boost::re_detail_106900::cpp_regex_traits_implementation<char>::cpp_regex_traits_implementation(boost::re_detail_106900::cpp_regex_traits_base<char> const&) + 145
 frame #4: 0x000000010bfcb28f libparquet.12.dylib`boost::object_cache<boost::re_detail_106900::cpp_regex_traits_base<char>, boost::re_detail_106900::cpp_regex_traits_implementation<char> >::do_get(boost::re_detail_106900::cpp_regex_traits_base<char> const&, unsigned long) + 287
 frame #5: 0x000000010bfcb06f libparquet.12.dylib`boost::object_cache<boost::re_detail_106900::cpp_regex_traits_base<char>, boost::re_detail_106900::cpp_regex_traits_implementation<char> >::get(boost::re_detail_106900::cpp_regex_traits_base<char> const&, unsigned long) + 63
 frame #6: 0x000000010bfcaf6e libparquet.12.dylib`boost::re_detail_106900::regex_data<char, boost::regex_traits<char, boost::cpp_regex_traits<char> > >::regex_data() + 110
 frame #7: 0x000000010bfcacd4 libparquet.12.dylib`boost::basic_regex<char, boost::regex_traits<char, boost::cpp_regex_traits<char> > >::do_assign(char const*, char const*, unsigned int) + 180
 frame #8: 0x000000010bfc2de7 libparquet.12.dylib`parquet::ApplicationVersion::ApplicationVersion(std::__1::basic_string<char, std::__1::char_traits<char>, std::__1::allocator<char> > const&) + 119
 frame #9: 0x000000010bfc6956 libparquet.12.dylib`parquet::FileMetaData::FileMetaDataImpl::FileMetaDataImpl(unsigned char const*, unsigned int*) + 390
 frame #10: 0x000000010bfc24ef libparquet.12.dylib`parquet::FileMetaData::Make(unsigned char const*, unsigned int*) + 63
 frame #11: 0x000000010bfbdc47 libparquet.12.dylib`parquet::SerializedFile::ParseMetaData() + 391
 frame #12: 0x000000010bfbdec1 libparquet.12.dylib`parquet::ParquetFileReader::Open(std::__1::unique_ptr<parquet::RandomAccessSource, std::__1::default_delete<parquet::RandomAccessSource> >, parquet::ReaderProperties const&, std::__1::shared_ptr<parquet::FileMetaData> const&) + 145
 frame #13: 0x000000010bf48e78 libparquet.12.dylib`parquet::arrow::OpenFile(std::__1::shared_ptr<arrow::io::RandomAccessFile> const&, arrow::MemoryPool*, parquet::ReaderProperties const&, std::__1::shared_ptr<parquet::FileMetaData> const&, std::__1::unique_ptr<parquet::arrow::FileReader, std::__1::default_delete<parquet::arrow::FileReader> >*) + 88
 frame #14: 0x000000010e1ba4d4 _parquet.cpython-37m-darwin.so`__pyx_pw_7pyarrow_8_parquet_13ParquetReader_3open(_object*, _object*, _object*) + 564
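
Frames #8 to #11 put the fault in the metadata (footer) parsing path, so it may help to look at the footer without going through libparquet at all. A rough sketch, relying only on the Parquet file layout (the file ends with a 4-byte little-endian footer length followed by the magic bytes PAR1): read_footer and printable_runs are hypothetical helper names, and the search for ASCII runs containing "version" is a heuristic, not a Thrift decoder.

```python
import os
import re
import struct

PARQUET_MAGIC = b"PAR1"


def read_footer(path):
    """Return the raw (Thrift-encoded) footer bytes of a Parquet file."""
    size = os.path.getsize(path)
    # Smallest possible file: leading magic + footer length + trailing magic.
    if size < 12:
        raise ValueError("file too small to be a Parquet file")
    with open(path, "rb") as f:
        f.seek(-8, os.SEEK_END)
        tail = f.read(8)
        if tail[4:] != PARQUET_MAGIC:
            raise ValueError("missing trailing PAR1 magic")
        footer_len = struct.unpack("<I", tail[:4])[0]
        if footer_len > size - 12:
            raise ValueError("footer length larger than the file")
        f.seek(-(8 + footer_len), os.SEEK_END)
        return f.read(footer_len)


def printable_runs(data, needle=b"version"):
    """Heuristic: printable ASCII runs (4+ chars) in data that contain needle."""
    runs = re.findall(rb"[\x20-\x7e]{4,}", data)
    return [run.decode("ascii") for run in runs if needle in run]


if __name__ == "__main__":
    footer = read_footer("bug.parquet")
    print(len(footer), printable_runs(footer))
```

If this prints a sensible footer length and a readable writer/version string, the file itself is probably intact and the problem is more likely on the library side.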

> core dump on reading parquet file
> ---------------------------------
>
>                 Key: ARROW-4050
>                 URL: https://issues.apache.org/jira/browse/ARROW-4050
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++, Python
>            Reporter: Antonio Cavallo
>            Priority: Blocker
>         Attachments: bug.parquet
>
>
> Hi,
> I've a crash when doing this:
> {{import pyarrow.parquet as pq}}
> {{pq.read_table('bug.parquet')}}
> [^bug.parquet]
> (this is the same file as the one generated by
> arrow/python/pyarrow/tests/test_parquet.py(112)test_single_pylist_column_roundtrip())



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)