[jira] [Resolved] (ARROW-7638) [Python] Segfault when inspecting dataset.Source with invalid file/partitioning

Ben Kietzman (Jira) Thu, 23 Jan 2020 11:10:31 -0800


     [ 
https://issues.apache.org/jira/browse/ARROW-7638?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Ben Kietzman resolved ARROW-7638.
---------------------------------
    Fix Version/s: 0.16.0
       Resolution: Fixed

Issue resolved by pull request 6267
[https://github.com/apache/arrow/pull/6267]

> [Python] Segfault when inspecting dataset.Source with invalid 
> file/partitioning
> -------------------------------------------------------------------------------
>
>                 Key: ARROW-7638
>                 URL: https://issues.apache.org/jira/browse/ARROW-7638
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++ - Dataset
>            Reporter: Joris Van den Bossche
>            Assignee: Francois Saint-Jacques
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.16.0
>
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Getting a segfault with:
> {code}
> In [1]: import pyarrow.dataset as ds                                          
>                                                                               
>                                                        
> In [2]: !touch test_empty.txt                                                 
>                                                                               
>                                                        
> In [3]: source_factory = ds.source("test_empty.txt", 
> partitioning=ds.partitioning(field_names=['a', 'b']))                         
>                                                                               
>   
> In [4]: source_factory.inspect()                                              
>                                                                               
>                                                        
> Segmentation fault (core dumped)
> {code}
> Didn't yet further investigate what might be the reason (there are several 
> "wrong" things here: it's an empty file, it's not a valid file for the 
> parquet format, the partitioning does not match the files, etc)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Resolved] (ARROW-7638) [Python] Segfault when inspecting dataset.Source with invalid file/partitioning

Reply via email to