[ https://issues.apache.org/jira/browse/PARQUET-241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ryan Blue resolved PARQUET-241. ------------------------------- Resolution: Fixed Merged #164. Thanks [~mkim] for the contribution! (And sorry this took so long. Next time, feel free to ping the mailing list to remind us!) > ParquetInputFormat.getFooters() should return in the same order as what > listStatus() returns > -------------------------------------------------------------------------------------------- > > Key: PARQUET-241 > URL: https://issues.apache.org/jira/browse/PARQUET-241 > Project: Parquet > Issue Type: Bug > Affects Versions: 1.6.0 > Reporter: Mingyu Kim > Assignee: Mingyu Kim > Fix For: 1.9.0 > > > Because of how the footer cache is implemented, getFooters() returns the > footers in a different order than what listStatus() returns. > When I provided url > "hdfs://.../part-00001.parquet,hdfs://.../part-00002.parquet,hdfs://.../part-00003.parquet", > ParquetInputFormat.getSplits(), which internally calls getFooters(), > returned the splits in a wrong order. -- This message was sent by Atlassian JIRA (v6.3.4#6332)