[
https://issues.apache.org/jira/browse/DRILL-5097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15752291#comment-15752291
]
ASF GitHub Bot commented on DRILL-5097:
---------------------------------------
GitHub user vdiravka opened a pull request:
https://github.com/apache/drill/pull/697
DRILL-5097: Using store.parquet.reader.int96_as_timestamp gives IOOB
whereas convert_from works
When an INT96 value is converted into a timestamp (a long in Java), the
nanosecond precision is truncated to milliseconds. The columnReader's
dataTypeLengthInBits must also be changed from 12 bytes to 8 bytes when the
Parquet fixed binary type INT96 is converted into a Drill TimeStamp.
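For illustration only, here is a minimal sketch of the 12-byte to 8-byte
narrowing described above. It is not the code from this pull request. It
assumes the layout commonly written by Impala and Spark: 8 little-endian bytes
of nanoseconds within the day, followed by 4 little-endian bytes holding the
Julian day number. The class and method names (Int96ToMillis, toEpochMillis)
are hypothetical.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

public class Int96ToMillis {
    // Julian day number of 1970-01-01, the Unix epoch.
    static final long JULIAN_DAY_OF_EPOCH = 2440588L;
    static final long MILLIS_PER_DAY = 86_400_000L;
    static final long NANOS_PER_MILLI = 1_000_000L;

    // Hypothetical helper: narrows a 12-byte INT96 value to an 8-byte long
    // holding milliseconds since the Unix epoch.
    // Assumed layout: bytes 0-7 = nanos of day, bytes 8-11 = Julian day,
    // both little-endian.
    public static long toEpochMillis(byte[] int96) {
        if (int96.length != 12) {
            throw new IllegalArgumentException("INT96 value must be 12 bytes");
        }
        ByteBuffer buf = ByteBuffer.wrap(int96).order(ByteOrder.LITTLE_ENDIAN);
        long nanosOfDay = buf.getLong();
        int julianDay = buf.getInt();
        // Integer division drops the sub-millisecond part of the value.
        return (julianDay - JULIAN_DAY_OF_EPOCH) * MILLIS_PER_DAY
                + nanosOfDay / NANOS_PER_MILLI;
    }

    public static void main(String[] args) {
        // One day after the epoch plus 1.5 ms, encoded as 12 bytes.
        ByteBuffer buf = ByteBuffer.allocate(12).order(ByteOrder.LITTLE_ENDIAN);
        buf.putLong(1_500_000L);
        buf.putInt(2440589);
        System.out.println(toEpochMillis(buf.array())); // prints 86400001
    }
}
```

Each input value occupies 12 bytes and each result 8 bytes, so any buffer
sizing that depends on the per-value width has to use the narrower width once
the conversion is enabled.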
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/vdiravka/drill DRILL-5097
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/drill/pull/697.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #697
----
commit ab6bad27469234587a059cccd7b5497279828450
Author: Vitalii Diravka <[email protected]>
Date: 2016-12-14T16:24:08Z
DRILL-5097: Using store.parquet.reader.int96_as_timestamp gives IOOB
whereas convert_from works
----
> Using store.parquet.reader.int96_as_timestamp gives IOOB whereas convert_from
> works
> -----------------------------------------------------------------------------------
>
> Key: DRILL-5097
> URL: https://issues.apache.org/jira/browse/DRILL-5097
> Project: Apache Drill
> Issue Type: Bug
> Components: Execution - Data Types, Storage - Parquet
> Affects Versions: 1.9.0
> Reporter: Vitalii Diravka
> Assignee: Vitalii Diravka
> Fix For: Future
>
> Attachments: data.snappy.parquet
>
>
> Using store.parquet.reader.int96_as_timestamp gives IOOB whereas convert_from
> works.
> The below query succeeds:
> {code}
> select c, convert_from(d, 'TIMESTAMP_IMPALA') from
> dfs.`/drill/testdata/parquet_timestamp/spark_generated/d3`;
> {code}
> The below query fails:
> {code}
> 0: jdbc:drill:zk=10.10.100.190:5181> alter session set
> `store.parquet.reader.int96_as_timestamp` = true;
> +-------+---------------------------------------------------+
> | ok | summary |
> +-------+---------------------------------------------------+
> | true | store.parquet.reader.int96_as_timestamp updated. |
> +-------+---------------------------------------------------+
> 1 row selected (0.231 seconds)
> 0: jdbc:drill:zk=10.10.100.190:5181> select c, d from
> dfs.`/drill/testdata/parquet_timestamp/spark_generated/d3`;
> Error: SYSTEM ERROR: IndexOutOfBoundsException: readerIndex: 0, writerIndex:
> 131076 (expected: 0 <= readerIndex <= writerIndex <= capacity(131072))
> Fragment 0:0
> [Error Id: bd94f477-7c01-420f-8920-06263212177b on qa-node190.qa.lab:31010]
> (state=,code=0)
> {code}