[GitHub] drill pull request #976: DRILL-5797: Choose parquet reader from read columns

paul-rogers Fri, 06 Oct 2017 11:31:00 -0700

Github user paul-rogers commented on a diff in the pull request:

    https://github.com/apache/drill/pull/976#discussion_r143263332
  
    --- Diff: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/ParquetScanBatchCreator.java ---
    @@ -156,18 +160,39 @@ public ScanBatch getBatch(FragmentContext context, ParquetRowGroupScan rowGroupS
         return new ScanBatch(rowGroupScan, context, oContext, readers, implicitColumns);
       }
     
    -  private static boolean isComplex(ParquetMetadata footer) {
    -    MessageType schema = footer.getFileMetaData().getSchema();
    +  private static boolean isComplex(ParquetMetadata footer, List<SchemaPath> columns) {
    +    if (Utilities.isStarQuery(columns)) {
    --- End diff --
    
    Perhaps add a comment with some explanation? For a wildcard query, we read
    all columns, so check whether any of them is complex. For a project list,
    check only the projected columns.
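
    To make the suggestion concrete, here is a minimal, self-contained sketch
    of the logic with the comments in place. It is not the actual Drill code:
    a plain map stands in for the Parquet schema, column names are plain
    strings instead of SchemaPath, the local isStarQuery is a hypothetical
    stand-in for Utilities.isStarQuery, and treating a projected column that
    is missing from the file as "not complex" is an assumption.

```java
import java.util.List;
import java.util.Map;

public class ReaderChoiceSketch {

  // Hypothetical stand-in for Utilities.isStarQuery(columns): a wildcard
  // projection is modelled here as a column list containing "*".
  static boolean isStarQuery(List<String> columns) {
    return columns.contains("*");
  }

  /**
   * Decides whether the reader for complex types is needed.
   *
   * @param schema  top-level column name mapped to true if that column is complex
   * @param columns projected column names, or "*" for a wildcard query
   */
  static boolean isComplex(Map<String, Boolean> schema, List<String> columns) {
    if (isStarQuery(columns)) {
      // Wildcard query: every column in the file is read, so a single
      // complex column anywhere in the schema is enough.
      return schema.containsValue(true);
    }
    // Explicit project list: only the projected columns are read, so
    // complex columns outside the list do not matter.
    for (String column : columns) {
      // Assumption: a projected column absent from the file is not complex.
      if (schema.getOrDefault(column, false)) {
        return true;
      }
    }
    return false;
  }

  public static void main(String[] args) {
    Map<String, Boolean> schema = Map.of("id", false, "name", false, "tags", true);
    System.out.println(isComplex(schema, List.of("*")));
    System.out.println(isComplex(schema, List.of("id", "name")));
    System.out.println(isComplex(schema, List.of("id", "tags")));
  }
}
```

    With the comments spelled out this way, it is clear why a query that
    projects only simple columns can avoid the complex reader even when the
    file also contains complex columns.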

---
