[
https://issues.apache.org/jira/browse/TAJO-1925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14993251#comment-14993251
]
ASF GitHub Bot commented on TAJO-1925:
--------------------------------------
GitHub user blrunner opened a pull request:
https://github.com/apache/tajo/pull/853
TAJO-1925: Improve hive compatibility with TIMESTAMP partition column.
This patch contains the following modifications:
* When creating partition directories, use partition names in the format
```YYYY-MM-DD HH:MM:SS.MS```.
* When pruning partitions, convert the UTC value of the EvalNode to the user's time zone.
* When building the partition filter in the catalog, convert the UTC value of the algebra
expression to the user's time zone.
* Add a timestamp cast operation with the number of milliseconds.
I also confirmed that the patch runs successfully with MySQLStore and
HiveCatalogStore.
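As a rough illustration of the partition-name handling described above, the following minimal sketch renders a UTC epoch value in a given time zone with millisecond precision and then escapes it for use in a directory name. It is not the code from this patch: the class `TimestampPartitionSketch`, the methods `toPartitionValue` and `escapePathName`, and the set of escaped characters are all hypothetical, and the `java.time` pattern `yyyy-MM-dd HH:mm:ss.SSS` is assumed to correspond to the `YYYY-MM-DD HH:MM:SS.MS` format.

```java
import java.time.Instant;
import java.time.ZoneId;
import java.time.format.DateTimeFormatter;

class TimestampPartitionSketch {
    // Assumed java.time equivalent of YYYY-MM-DD HH:MM:SS.MS (millisecond precision).
    private static final DateTimeFormatter FORMAT =
        DateTimeFormatter.ofPattern("yyyy-MM-dd HH:mm:ss.SSS");

    // Renders a UTC epoch value (in milliseconds) in the given time zone.
    static String toPartitionValue(long epochMillis, String zoneId) {
        return FORMAT.format(Instant.ofEpochMilli(epochMillis).atZone(ZoneId.of(zoneId)));
    }

    // Percent-encodes a few characters that are awkward in path names.
    // The character set here is an assumption, not the list Tajo or Hive uses.
    static String escapePathName(String value) {
        StringBuilder sb = new StringBuilder();
        for (char c : value.toCharArray()) {
            if (c == ':' || c == '/' || c == '=' || c == '%') {
                sb.append(String.format("%%%02X", (int) c));
            } else {
                sb.append(c);
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // 2015-11-05T07:46:48Z, shown in a fixed +09:00 offset.
        String value = toPartitionValue(1446709608000L, "+09:00");
        System.out.println(value);
        System.out.println(escapePathName(value));
    }
}
```

Keeping the time zone conversion separate from the escaping step means the same formatted value can be compared against partition names during pruning and escaped only when a path is actually built.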
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/blrunner/tajo TAJO-1925
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/tajo/pull/853.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #853
----
commit e98cf2c7c7bb3b69e871dec45dd53d9ae567b42b
Author: JaeHwa Jung <[email protected]>
Date: 2015-11-05T07:46:48Z
TAJO-1925: Improve hive compatibility with TIMESTAMP partition column.
commit 36e9ec498b7486cb63c78b54d63e438834a51a6a
Author: JaeHwa Jung <[email protected]>
Date: 2015-11-05T07:49:54Z
Remove debug logs
commit 88d1335b36df19d58e2d68b7671e6405df686eec
Author: JaeHwa Jung <[email protected]>
Date: 2015-11-05T08:08:07Z
Add more descriptions for added codes
commit 9856ddae8eb1022af36a8999169a5a429b652377
Author: JaeHwa Jung <[email protected]>
Date: 2015-11-05T08:16:17Z
Add more description for
PartitionFilterAlgebraVisitor::visitTimestampLiteral
commit 2743bfbf63855a80bf081e83adc80b4e1fbbbd22
Author: JaeHwa Jung <[email protected]>
Date: 2015-11-05T08:18:00Z
Remove unnecessary codes
commit 359d9886b61a33744ee9748a79e077e5b05a77ff
Author: JaeHwa Jung <[email protected]>
Date: 2015-11-06T03:30:01Z
Implement ColPartitionStoreExec::encodeTimestamp
commit 2e09493e91bccc44c2680408da81843df4b4191d
Author: JaeHwa Jung <[email protected]>
Date: 2015-11-06T03:33:43Z
Escape the path name of timestamp partition
commit 84c016426a817159c3b2dc40675d5951236b40a3
Author: JaeHwa Jung <[email protected]>
Date: 2015-11-06T03:45:59Z
Apply users timezone to partition pruning.
commit 0dad0601edbb710453899b8c9a68350f78bb4670
Author: JaeHwa Jung <[email protected]>
Date: 2015-11-06T05:20:59Z
Apply UTC timezone for casting operation.
commit 491d685cbe43066082e55f4ab36507451d4697b8
Author: JaeHwa Jung <[email protected]>
Date: 2015-11-06T06:56:28Z
Escape partition and unescape partition values
----
> Improve hive compatibility with TIMESTAMP partition column.
> -----------------------------------------------------------
>
> Key: TAJO-1925
> URL: https://issues.apache.org/jira/browse/TAJO-1925
> Project: Tajo
> Issue Type: Improvement
> Components: Catalog, Physical Operator
> Affects Versions: 0.11.0, 0.12.0
> Reporter: Jaehwa Jung
> Assignee: Jaehwa Jung
> Fix For: 0.12.0, 0.11.1
>
>
> -Currently, Tajo allows a TIMESTAMP column to be used as a partition key. But if
> users use a TIMESTAMP partition column, Tajo doesn't keep the original
> TIMESTAMP values. Instead, Tajo automatically converts TIMESTAMP values to
> STRING literals in the format YYYY-MM-DD HH:MM:SS. As a
> result, Tajo can't keep the milliseconds and can't provide proper Hive
> compatibility-
> We need to support Hive compatibility for all partition column types. But
> when using a TIMESTAMP partition column, users might occasionally not get the
> correct partitions because the Tajo partition name differs from the Hive
> partition name. Tajo automatically converts TIMESTAMP values to STRING
> literals in the format YYYY-MM-DD HH:MM:SS, whereas Hive
> converts them to STRING literals in the format
> YYYY-MM-DD HH:MM:SS.MS.