[ https://issues.apache.org/jira/browse/ARROW-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Francois Saint-Jacques updated ARROW-2801: ------------------------------------------ Labels: dataset datasets parquet pull-request-available (was: datasets parquet pull-request-available) > [Python] Implement splt_row_groups for ParquetDataset > ----------------------------------------------------- > > Key: ARROW-2801 > URL: https://issues.apache.org/jira/browse/ARROW-2801 > Project: Apache Arrow > Issue Type: New Feature > Components: Python > Reporter: Robbie Gruener > Assignee: Robbie Gruener > Priority: Minor > Labels: dataset, datasets, parquet, pull-request-available > Fix For: 1.0.0 > > Time Spent: 1h 50m > Remaining Estimate: 0h > > Currently the split_row_groups argument in ParquetDataset yields a not > implemented error. An easy and efficient way to implement this is by using > the summary metadata file instead of opening every footer file -- This message was sent by Atlassian Jira (v8.3.2#803003)