[
https://issues.apache.org/jira/browse/HIVE-8267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Muhammad Ehsan ul Haque updated HIVE-8267:
------------------------------------------
Status: Patch Available (was: Open)
Patch available.
I was unable to create a review request on Review Board because the patch
failed to upload. I am new to Review Board.
Feature documentation. Perhaps I should update the page
https://cwiki.apache.org/confluence/display/Hive/HBaseIntegration#HBaseIntegration-HiveHBaseIntegration
* A cell timestamp mapping using {{:timestamp:cf:qualifier}} must be mapped to
a Hive {{BIGINT}} column.
* A column family timestamp mapping using {{:timestamp:cf:}} or
{{:timestamp:cf:prefix.*}} must be mapped to a Hive {{MAP<HIVE PRIMITIVE
TYPE,BIGINT>}} column.
* Inserting only a timestamp without a cell value is not allowed. Set
{{"hbase.put.default.cell.value" = "default value"}} in the {{SERDEPROPERTIES}}
to supply a default cell value when the cell value is not mapped or may be
null.
* Inserting with a timestamp lower than the cell's current latest timestamp
writes the value as an older version.
* If the cell value and timestamp are both mapped and the timestamp field is
{{null}}, it is filled with the {{SERDEPROPERTIES}} value
{{hbase.put.timestamp}} if provided; otherwise it is filled with the current
HBase timestamp.
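As a rough sketch of how the write-side properties above might be combined, assuming the mapping syntax proposed in this patch. The Hive table name {{hive_hbase_ts_example}}, the source table {{src_table}} and the timestamp value are made up for illustration only; the timestamp is assumed to be in epoch milliseconds, which is what HBase uses by default.
{code}
-- Hypothetical example table; property names are the ones listed above.
-- "hbase.put.default.cell.value": written when the cell value is unmapped or null.
-- "hbase.put.timestamp": used when the mapped timestamp field is null (arbitrary value here).
CREATE TABLE hive_hbase_ts_example (key STRING, col1 STRING, col1_ts BIGINT)
STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
WITH SERDEPROPERTIES (
  "hbase.columns.mapping" = ":key,cf:qualifier,:timestamp:cf:qualifier",
  "hbase.put.default.cell.value" = "default value",
  "hbase.put.timestamp" = "1411776000000"
)
TBLPROPERTIES ("hbase.table.name" = "hbase_table");

-- src_table is a hypothetical source with STRING columns key and value.
-- The timestamp field is null here, so per the rules above it would be
-- filled from hbase.put.timestamp.
INSERT OVERWRITE TABLE hive_hbase_ts_example
SELECT key, value, CAST(NULL AS BIGINT) FROM src_table;
{code}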
> Exposing hbase cell latest timestamp through hbase columns mappings to hive
> columns.
> ------------------------------------------------------------------------------------
>
> Key: HIVE-8267
> URL: https://issues.apache.org/jira/browse/HIVE-8267
> Project: Hive
> Issue Type: Bug
> Components: HBase Handler
> Affects Versions: 0.14.0
> Reporter: Muhammad Ehsan ul Haque
> Priority: Minor
> Fix For: 0.14.0
>
> Attachments: HIVE-8267.0.patch
>
>
> Previous attempts: HIVE-2781 (not accepted) and HIVE-2828 (broken, and
> proposed with a restricted feature set).
> The feature makes the latest timestamp of an HBase cell accessible in Hive
> queries by mapping the cell timestamp to a Hive column, using a mapping format like
> {code}:timestamp:cf:[optional qualifier or qualifier prefix]{code}
> The Hive CREATE TABLE statement would look like this:
> h4. For mapping a cell's latest timestamp.
> {code}
> CREATE TABLE hive_hbase_table (key STRING, col1 STRING, col1_ts BIGINT)
> STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
> WITH SERDEPROPERTIES ("hbase.columns.mapping" = ":key,cf:qualifier,:timestamp:cf:qualifier")
> TBLPROPERTIES ("hbase.table.name" = "hbase_table");
> {code}
> h4. For mapping a column family's latest timestamps.
> {code}
> CREATE TABLE hive_hbase_table (key STRING, valuemap MAP<STRING, STRING>,
> timestampmap MAP<STRING, BIGINT>)
> STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
> WITH SERDEPROPERTIES ("hbase.columns.mapping" = ":key,cf:,:timestamp:cf:")
> TBLPROPERTIES ("hbase.table.name" = "hbase_table");
> {code}
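For illustration, with the two table definitions quoted above, the mapped timestamps could then be read like any other {{BIGINT}} or {{MAP}} column. This is only a sketch, assuming the patch is applied and that the cell timestamps are HBase's default epoch milliseconds; the qualifier {{q1}} is a hypothetical name.
{code}
-- Single-cell mapping: raw timestamp plus a readable form.
-- from_unixtime expects seconds, so the millisecond value is divided by 1000.
SELECT key, col1, col1_ts,
       from_unixtime(CAST(col1_ts / 1000 AS BIGINT)) AS col1_written_at
FROM hive_hbase_table;

-- Column family mapping: look up the value and timestamp of one qualifier.
SELECT key, valuemap['q1'], timestampmap['q1']
FROM hive_hbase_table;
{code}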