[
https://issues.apache.org/jira/browse/CASSANDRA-6917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14039250#comment-14039250
]
Benedict commented on CASSANDRA-6917:
-------------------------------------
My main goal with enums is only uniquely representing a string value
efficiently. Supporting custom orderings on the data might be a possibility for
enums defined _up front_, however in this case I want to support denormalising
arbitrary string data, the universe of which could be moderately large
(certainly 100k+) and is not necessarily known in advance. An enum that must be
defined up front with a predetermined ordering is frankly just as easy to
implement client-side, so whilst it might be a nice feature to support
eventually, I consider it out of scope for this ticket, and I think
guaranteeing any specific order may be undesirable for write performance.
> enum data type
> --------------
>
> Key: CASSANDRA-6917
> URL: https://issues.apache.org/jira/browse/CASSANDRA-6917
> Project: Cassandra
> Issue Type: Improvement
> Components: Core
> Reporter: Benedict
> Priority: Minor
> Labels: performance
>
> It seems like it would be useful to support an enum data type, that
> automatically converts string data from the user into a fixed-width data type
> with guaranteed uniqueness across the cluster. This data would be replicated
> to all nodes for lookup, but ideally would use only the keyspace RF to
> determine nodes for coordinating quorum writes/consistency.
> This would not only permit improved local disk and inter-node network IO for
> symbology information (e.g. stock tickers, ISINs, etc), but also potentially
> for column identifiers also, which are currently stored as their full string
> representation.
> It should be possible then with later updates to propagate the enum map
> (lazily) to clients through the native protocol, reducing network IO further.
--
This message was sent by Atlassian JIRA
(v6.2#6252)