[
https://issues.apache.org/jira/browse/DRILL-6353?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16489703#comment-16489703
]
ASF GitHub Bot commented on DRILL-6353:
---------------------------------------
vrozov commented on a change in pull request #1259: DRILL-6353: Upgrade Parquet
MR dependencies
URL: https://github.com/apache/drill/pull/1259#discussion_r190715129
##########
File path:
exec/java-exec/src/test/java/org/apache/drill/exec/store/parquet/TestParquetMetadataCache.java
##########
@@ -737,6 +738,7 @@ public void testBooleanPartitionPruning() throws Exception {
}
}
+ @Ignore
Review comment:
The tests fail during plan validation because the new version of the parquet
library ignores potentially wrong statistics for the data types used by the
queries. Even if the statistics are wrong for only a small portion of parquet
files, and are correct for the parquet files used by the tests, Drill can't
rely on potentially wrong statistics, as doing so leads to wrong query results.
Basically, the version of the parquet library that Drill currently uses has a
bug that may cause query results to be wrong. The bug is fixed in the new
version, which causes the 2 unit test failures. IMO, it is better to upgrade to
the new library sooner rather than later, even if it causes a slowdown for
some queries.
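The trade-off described above can be illustrated with a minimal sketch. The
class and method names below (StatsPruningSketch, ColumnStats, canSkipFile) are
hypothetical and are not Drill or Parquet APIs. They only model the idea that a
file may be skipped on the basis of min/max statistics when those statistics
are trusted, and must otherwise be scanned.

```java
// Hypothetical illustration only: these types are NOT Drill or Parquet APIs.
final class StatsPruningSketch {

    /** Min/max statistics for one column in one file, plus a trust flag. */
    static final class ColumnStats {
        final long min;
        final long max;
        final boolean trusted; // false models statistics the reader chooses to ignore

        ColumnStats(long min, long max, boolean trusted) {
            this.min = min;
            this.max = max;
            this.trusted = trusted;
        }
    }

    /**
     * Returns true only when the file can be skipped for the filter "column = value".
     * Untrusted statistics never justify skipping, because a wrong min/max could
     * hide rows that actually match.
     */
    static boolean canSkipFile(ColumnStats stats, long value) {
        if (stats == null || !stats.trusted) {
            return false; // must scan the file: slower, but the result stays correct
        }
        return value < stats.min || value > stats.max;
    }

    public static void main(String[] args) {
        ColumnStats trusted = new ColumnStats(10, 20, true);
        ColumnStats untrusted = new ColumnStats(10, 20, false);

        System.out.println(canSkipFile(trusted, 42));   // true: 42 is outside [10, 20]
        System.out.println(canSkipFile(untrusted, 42)); // false: statistics are ignored
        System.out.println(canSkipFile(trusted, 15));   // false: 15 may be present
    }
}
```

In this sketch, a plan that previously pruned a file would now include it
whenever the statistics are ignored. That is presumably the kind of plan
difference the affected tests detect during plan validation.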
> Upgrade Parquet MR dependencies
> -------------------------------
>
> Key: DRILL-6353
> URL: https://issues.apache.org/jira/browse/DRILL-6353
> Project: Apache Drill
> Issue Type: Task
> Reporter: Vlad Rozov
> Assignee: Vlad Rozov
> Priority: Major
> Fix For: 1.14.0
>
>
> Upgrade from a custom build {{1.8.1-drill-r0}} to Apache release {{1.10.0}}.