[jira] [Updated] (HIVE-13697) ListBucketing feature does not support uppercase string.
[ https://issues.apache.org/jira/browse/HIVE-13697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Stamatis Zampetakis updated HIVE-13697:
---
    Fix Version/s:     (was: 3.2.0)

I cleared the fixVersion field since this ticket is not resolved. Please review this ticket and if the fix is already committed to a specific version please set the version accordingly and mark the ticket as RESOLVED.

According to the JIRA guidelines (https://cwiki.apache.org/confluence/display/Hive/HowToContribute) the fixVersion should be set only when the issue is resolved/closed.

> ListBucketing feature does not support uppercase string.
>
>                 Key: HIVE-13697
>                 URL: https://issues.apache.org/jira/browse/HIVE-13697
>             Project: Hive
>          Issue Type: Bug
>          Components: Database/Schema
>    Affects Versions: 1.2.1
>         Environment: 1.2.1
>            Reporter: Hao Zhu
>            Assignee: Oleksiy Sayankin
>            Priority: Critical
>         Attachments: HIVE-13697.1.patch, HIVE-13697.2.patch
>
> This is the feature:
> https://cwiki.apache.org/confluence/display/Hive/ListBucketing
>
> 1. Good example:
> {code}
> CREATE TABLE testskew (id INT, a STRING)
> SKEWED BY (a) ON ('abc', 'xyz') STORED AS DIRECTORIES;
> set hive.mapred.supports.subdirectories=true;
> set mapred.input.dir.recursive=true;
> INSERT OVERWRITE TABLE testskew
> SELECT 123,'abc' FROM dual
> union all
> SELECT 123,'xyz' FROM dual
> union all
> SELECT 123,'others' FROM dual;
> {code}
> {code}
> # hadoop fs -ls /user/hive/warehouse/testskew
> Found 3 items
> drwxrwxrwx - mapr mapr 1 2016-05-05 14:56 /user/hive/warehouse/testskew/HIVE_DEFAULT_LIST_BUCKETING_DIR_NAME
> drwxrwxrwx - mapr mapr 1 2016-05-05 14:56 /user/hive/warehouse/testskew/a=abc
> drwxrwxrwx - mapr mapr 1 2016-05-05 14:56 /user/hive/warehouse/testskew/a=xyz
> {code}
> This is good, because both "abc" and "xyz" directories got created.
>
> 2. Bad example -- This is the issue
> {code}
> CREATE TABLE testskew2 (id INT, a STRING)
> SKEWED BY (a) ON ('aus', 'US') STORED AS DIRECTORIES;
> set hive.mapred.supports.subdirectories=true;
> set mapred.input.dir.recursive=true;
> INSERT OVERWRITE TABLE testskew2
> SELECT 123, 'aus' FROM dual
> union all
> SELECT 123, 'US' FROM dual
> union all
> SELECT 123, 'others' FROM dual;
> {code}
> You can see, only "aus" directory got created...
> {code}
> # hadoop fs -ls /user/hive/warehouse/testskew2
> Found 2 items
> drwxrwxrwx - mapr mapr 1 2016-05-05 15:11 /user/hive/warehouse/testskew2/HIVE_DEFAULT_LIST_BUCKETING_DIR_NAME
> drwxrwxrwx - mapr mapr 1 2016-05-05 15:11 /user/hive/warehouse/testskew2/a=aus
> {code}

--
This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Updated] (HIVE-13697) ListBucketing feature does not support uppercase string.
[ https://issues.apache.org/jira/browse/HIVE-13697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vineet Garg updated HIVE-13697:
---
    Fix Version/s:     (was: 3.1.0)
                       3.2.0

Deferring this to 3.2.0 since the branch for 3.1.0 has been cut off.
[jira] [Updated] (HIVE-13697) ListBucketing feature does not support uppercase string.
[ https://issues.apache.org/jira/browse/HIVE-13697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Oleksiy Sayankin updated HIVE-13697:
    Status: Patch Available  (was: In Progress)
[jira] [Updated] (HIVE-13697) ListBucketing feature does not support uppercase string.
[ https://issues.apache.org/jira/browse/HIVE-13697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Oleksiy Sayankin updated HIVE-13697:
    Status: In Progress  (was: Patch Available)
[jira] [Updated] (HIVE-13697) ListBucketing feature does not support uppercase string.
[ https://issues.apache.org/jira/browse/HIVE-13697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Oleksiy Sayankin updated HIVE-13697:
    Attachment: HIVE-13697.2.patch
[jira] [Updated] (HIVE-13697) ListBucketing feature does not support uppercase string.
[ https://issues.apache.org/jira/browse/HIVE-13697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vineet Garg updated HIVE-13697:
---
    Fix Version/s:     (was: 3.0.0)
                       3.1.0

Deferring this to 3.1.0 since the branch for 3.0.0 has been cut off. Please update the JIRA if you would like to get your patch in 3.0.0.
[jira] [Updated] (HIVE-13697) ListBucketing feature does not support uppercase string.
[ https://issues.apache.org/jira/browse/HIVE-13697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Oleksiy Sayankin updated HIVE-13697:
    Fix Version/s: 3.0.0
[jira] [Updated] (HIVE-13697) ListBucketing feature does not support uppercase string.
[ https://issues.apache.org/jira/browse/HIVE-13697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Oleksiy Sayankin updated HIVE-13697:
    Status: Patch Available  (was: In Progress)

ROOT-CAUSE: BaseSemanticAnalyzer applies toLowerCase() while reading the skewed values from the AST node, so the skewed values are stored in lower case only.

{code}
hive> desc formatted testskew2;
OK
# col_name              data_type               comment

id                      int
a                       string

# Detailed Table Information
Database:               default
Owner:                  hdfs
CreateTime:             Thu May 12 18:37:20 EEST 2016
LastAccessTime:         UNKNOWN
Protect Mode:           None
Retention:              0
Location:               hdfs:/user/hive/warehouse/testskew2
Table Type:             MANAGED_TABLE
Table Parameters:
        transient_lastDdlTime   1463067440

# Storage Information
SerDe Library:          org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
InputFormat:            org.apache.hadoop.mapred.TextInputFormat
OutputFormat:           org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
Compressed:             No
Num Buckets:            -1
Bucket Columns:         []
Sort Columns:           []
Stored As SubDirectories:       Yes
Skewed Columns:         [a]
Skewed Values:          [[aus], [us]]   < !!! ERROR !!!
Storage Desc Params:
        serialization.format    1
{code}

SOLUTION: Remove the unnecessary toLowerCase() call.
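The root cause given in the comment above can be illustrated with a small model. This is only a sketch in Python, not Hive's Java code: the helper names `store_skewed_values` and `directory_for` are hypothetical, and the exact, case-sensitive match used to route rows is an assumption chosen because it reproduces the directory listing in the report.

```python
DEFAULT_DIR = "HIVE_DEFAULT_LIST_BUCKETING_DIR_NAME"


def store_skewed_values(declared, lowercase):
    """Model how the declared skewed values end up in the table metadata.

    lowercase=True mimics the reported toLowerCase() call;
    lowercase=False mimics the proposed fix (values stored as written).
    """
    return [v.lower() if lowercase else v for v in declared]


def directory_for(column, row_value, stored_values):
    """Pick a list-bucketing directory for one row.

    Assumption: the row value is compared to the stored skewed values
    with an exact, case-sensitive match.
    """
    if row_value in stored_values:
        return f"{column}={row_value}"
    return DEFAULT_DIR


declared = ["aus", "US"]        # SKEWED BY (a) ON ('aus', 'US')
rows = ["aus", "US", "others"]  # values inserted into column a

buggy = store_skewed_values(declared, lowercase=True)
fixed = store_skewed_values(declared, lowercase=False)

buggy_dirs = sorted({directory_for("a", r, buggy) for r in rows})
fixed_dirs = sorted({directory_for("a", r, fixed) for r in rows})

print(buggy_dirs)  # no a=US directory: 'US' does not match the stored 'us'
print(fixed_dirs)  # a=US appears once the value keeps its original case
```

Under these assumptions the lowercased metadata yields only the default directory and `a=aus`, which matches the `hadoop fs -ls` output for testskew2, while storing the values as written also yields `a=US`.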
[jira] [Updated] (HIVE-13697) ListBucketing feature does not support uppercase string.
[ https://issues.apache.org/jira/browse/HIVE-13697?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Oleksiy Sayankin updated HIVE-13697:
    Attachment: HIVE-13697.1.patch