[ 
https://issues.apache.org/jira/browse/DRILL-6554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16527923#comment-16527923
 ] 

ASF GitHub Bot commented on DRILL-6554:
---------------------------------------

vrozov commented on a change in pull request #1349: DRILL-6554: Minor code 
improvements in parquet statistics handling
URL: https://github.com/apache/drill/pull/1349#discussion_r199217282
 
 

 ##########
 File path: exec/java-exec/src/main/java/org/apache/drill/exec/store/parquet/ParquetReaderUtility.java
 ##########
 @@ -417,16 +417,9 @@ public static DateCorruptionStatus checkForCorruptDateValuesInStatistics(Parquet
             // column does not appear in this file, skip it
             continue;
           }
-          Statistics statistics = footer.getBlocks().get(rowGroupIndex).getColumns().get(colIndex).getStatistics();
-          Integer max = (Integer) statistics.genericGetMax();
-          if (statistics.hasNonNullValue()) {
-            if (max > ParquetReaderUtility.DATE_CORRUPTION_THRESHOLD) {
-              return DateCorruptionStatus.META_SHOWS_CORRUPTION;
-            }
-          } else {
-            // no statistics, go check the first page
-            return DateCorruptionStatus.META_UNCLEAR_TEST_VALUES;
-          }
+          IntStatistics statistics = (IntStatistics)footer.getBlocks().get(rowGroupIndex).getColumns().get(colIndex).getStatistics();
 
 Review comment:
   I don't see any specific code style with regard to spacing after a cast
(grep -I -R \(\([a-zA-Z0-9_]*\)[a-zA-Z0-9_] * | grep -c java). It seems to be a
matter of contributor preference.
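
   For anyone who wants to repeat a count like this outside the shell, here is
a rough sketch in Java. It is a heuristic, not the exact grep above: it assumes
a cast looks like `(TypeName)` with a capitalized type name followed directly,
or after one space, by an identifier. The class name `CastSpacingCount` and the
sample string are made up for illustration and are not part of Drill.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class CastSpacingCount {
    // Heuristic: "(Type)x" is treated as a cast with no space after the parenthesis.
    static final Pattern NO_SPACE =
        Pattern.compile("\\(([A-Z][A-Za-z0-9_]*)\\)[A-Za-z_]");
    // Heuristic: "(Type) x" is treated as a cast followed by exactly one space.
    static final Pattern WITH_SPACE =
        Pattern.compile("\\(([A-Z][A-Za-z0-9_]*)\\) [A-Za-z_]");

    // Counts non-overlapping matches of the pattern in the given source text.
    static int count(Pattern pattern, String source) {
        Matcher matcher = pattern.matcher(source);
        int n = 0;
        while (matcher.find()) {
            n++;
        }
        return n;
    }

    public static void main(String[] args) {
        // Hypothetical sample containing one cast of each style.
        String sample =
            "IntStatistics s = (IntStatistics)col.getStatistics(); "
            + "Integer max = (Integer) s.genericGetMax();";
        System.out.println("no space:   " + count(NO_SPACE, sample));
        System.out.println("with space: " + count(WITH_SPACE, sample));
    }
}
```

   The patterns will also match a few non-casts, such as a parenthesized
constant followed by a keyword, so the counts are only indicative of which
style is more common.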

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


> Minor code improvements in parquet statistics handling
> ------------------------------------------------------
>
>                 Key: DRILL-6554
>                 URL: https://issues.apache.org/jira/browse/DRILL-6554
>             Project: Apache Drill
>          Issue Type: Improvement
>            Reporter: Vlad Rozov
>            Assignee: Vlad Rozov
>            Priority: Minor
>             Fix For: 1.14.0
>
>
> Avoid setting statistics in Metadata when min and max are not defined, plus a
> few other minor code improvements.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
