[
https://issues.apache.org/jira/browse/HIVE-18083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17073713#comment-17073713
]
David Mollitor commented on HIVE-18083:
---------------------------------------
If Hive is going to be a drop-in replacement for MySQL/MariaDB (or something
approaching that) then it needs to support UTF-8:
{quote}
Certain objects within MySQL, including database, table, index, column, alias,
view, stored procedure, partition, tablespace, resource group and other object
names are known as identifiers.
...
# Permitted characters in quoted identifiers include the full Unicode Basic
Multilingual Plane (BMP), except U+0000
{quote}
* https://dev.mysql.com/doc/refman/8.0/en/identifiers.html
> Support UTF8 in MySQL Metastore Backend
> ---------------------------------------
>
> Key: HIVE-18083
> URL: https://issues.apache.org/jira/browse/HIVE-18083
> Project: Hive
> Issue Type: Improvement
> Components: Metastore, Standalone Metastore
> Affects Versions: 3.0.0, 2.4.0
> Reporter: David Mollitor
> Priority: Major
>
> {code:sql|title=hive-schema-2.2.0.mysql.sql}
> CREATE TABLE IF NOT EXISTS `COLUMNS_V2` (
> `CD_ID` bigint(20) NOT NULL,
> `COMMENT` varchar(256) CHARACTER SET latin1 COLLATE latin1_bin DEFAULT NULL,
> `COLUMN_NAME` varchar(767) CHARACTER SET latin1 COLLATE latin1_bin NOT NULL,
> `TYPE_NAME` varchar(4000) DEFAULT NULL,
> `INTEGER_IDX` int(11) NOT NULL,
> PRIMARY KEY (`CD_ID`,`COLUMN_NAME`),
> KEY `COLUMNS_V2_N49` (`CD_ID`),
> CONSTRAINT `COLUMNS_V2_FK1` FOREIGN KEY (`CD_ID`) REFERENCES `CDS` (`CD_ID`)
> ) ENGINE=InnoDB DEFAULT CHARSET=latin1;
> {code}
> Hive explicitly defines a {{CHARACTER SET latin1 COLLATE latin1_bin}} in the
> schema design. This explicit definition should either be removed, so that it
> can fallback onto the database administrator's defaults, or changed to
> {{CHARACTER SET utf8 COLLATE utf8_bin}} to change the explicit definition to
> utf8.
> This will allow Hive to support UTF8 characters in MySQL backend databases
> for our international friends.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)