[
https://issues.apache.org/jira/browse/ORC-42?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15147027#comment-15147027
]
Dinesh S. Atreya commented on ORC-42:
-------------------------------------
{panel:title=MVCC_ORC Timestamp based}
Sample is given below for illustration.
|| Data ID || Activity || Data Create Time || Data Expiry Time ||
Comments ||
| 1 | Insert | T1 | MAX_TIME_VAL | Conventionally MAX_TIME_VAL
is null.
In order to maintain update size, MAX_TIME_VAL is pre-seeded for our purposes. |
| 1 | Delete | T1 | T2 | Marked as delete when time was T2. |
| 2 | Update (old Delete) | T3 | T4 | Mark old data is DELETE |
| 2 | Update (new insert) | T4 | MAX_TIME_VAL | Insert new data. |
{panel}
> Advance Hadoop Architecture (AHA) - Advance ORC (Umbrella) JIRA
> ---------------------------------------------------------------
>
> Key: ORC-42
> URL: https://issues.apache.org/jira/browse/ORC-42
> Project: Orc
> Issue Type: New Feature
> Reporter: Dinesh S. Atreya
>
> Link to Umbrella JIRA
> https://issues.apache.org/jira/browse/HADOOP-12620
> See
> https://issues.apache.org/jira/browse/HADOOP-12620?focusedCommentId=15046300&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-15046300
> for more details.
> This JIRA is an umbrella (parent/master) JIRA for advancing ORC given
> https://issues.apache.org/jira/browse/HDFS-9607.
> A number of capabilities that can be added to ORC once HDFS update is
> supported may include:
> JSON_ORC -- native processing of JSON (add MongoDB/CouchDB type capabilities
> in Hadoop)
> XML_ORC -- add native XML processing capability to ORC.
> RDF_ORC -- native processing of RDF documents
> MVCC_ORC -- Add Multi Version Concurrency MVCC support to ORC
> INDEX_ORC -- Create a variety of Indexes such as B-Tree, Bitmap etc. to other
> files in Hadoop.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)