[
https://issues.apache.org/jira/browse/HBASE-5612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13401899#comment-13401899
]
Enis Soztutar commented on HBASE-5612:
--------------------------------------
At the recent HBase hackaton, and the BOF sessions, we had some discussions
about adding some kind of schemas/data types to hbase, and Ian gave a short
talk about it. Other than the use cases for this jira, having optional
schema-data has the advantages of:
- HBase internals can make use of data types (like the block level encoding,
comparators for sub-fields in keys, etc)
- HBase shell can make use of the data types, and display the data correctly
- Hive/Pig can better map their own data-types to hbase types, and their
schemas to hbase schema, instead of managing it themselves.
- Client written coprocessors or system level coprocessors can do data
validation according to the schema and data types.
So, what I am trying to say is that we can start to think of a bigger picture
for the data types, rather than doing something only for compression/block
encoding. WDTY?
> Data types for HBase values
> ---------------------------
>
> Key: HBASE-5612
> URL: https://issues.apache.org/jira/browse/HBASE-5612
> Project: HBase
> Issue Type: Improvement
> Reporter: Mikhail Bautin
> Assignee: Mikhail Bautin
>
> In many real-life applications all values in a certain column family are of a
> certain data type, e.g. 64-bit integer. We could specify that in the column
> descriptor and enable data type-specific compression such as variable-length
> integer encoding.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira