Satish Kotha created HUDI-1414:
----------------------------------
Summary: HoodieInputFormat support for bucketed partitions
Key: HUDI-1414
URL: https://issues.apache.org/jira/browse/HUDI-1414
Project: Apache Hudi
Issue Type: New Feature
Reporter: Satish Kotha
When querying a hoodie partition through presto, we get following error:
{code}
Presto error: {u'errorCode': 13, u'message': u'Presto cannot read bucketed
partition in an input format with UseFileSplitsFromInputFormat annotation:
HoodieInputFormat', u'errorType': u'USER_ERROR', u'failureInfo':
{u'suppressed': [], u'message': u'Presto cannot read bucketed partition in an
input format with UseFileSplitsFromInputFormat annotation: HoodieInputFormat',
u'type': u'com.facebook.presto.spi.PrestoException', u'stack':
[u'com.facebook.presto.hive.BackgroundHiveSplitLoader.lambda$loadPartition$5(BackgroundHiveSplitLoader.java:432)',
u'com.facebook.presto.hive.authentication.UserGroupInformationUtils.lambda$executeActionInDoAs$0(UserGroupInformationUtils.java:29)',
u'java.base/java.security.AccessController.doPrivileged(Native Method)',
u'java.base/javax.security.auth.Subject.doAs(Subject.java:361)',
u'org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1816)',
u'com.facebook.presto.hive.authentication.UserGroupInformationUtils.executeActionInDoAs(UserGroupInformationUtils.java:27)',
u'com.facebook.presto.hive.authentication.ImpersonatingHdfsAuthentication.doAs(ImpersonatingHdfsAuthentication.java:39)',
u'com.facebook.presto.hive.BackgroundHiveSplitLoader.loadPartition(BackgroundHiveSplitLoader.java:430)',
u'com.facebook.presto.hive.BackgroundHiveSplitLoader.loadSplits(BackgroundHiveSplitLoader.java:330)',
u'com.facebook.presto.hive.BackgroundHiveSplitLoader.access$300(BackgroundHiveSplitLoader.java:116)',
u'com.facebook.presto.hive.BackgroundHiveSplitLoader$HiveSplitLoaderTask.process(BackgroundHiveSplitLoader.java:259)',
u'com.facebook.presto.hive.util.ResumableTasks.safeProcessTask(ResumableTasks.java:47)',
u'com.facebook.presto.hive.util.ResumableTasks.access$000(ResumableTasks.java:20)',
u'com.facebook.presto.hive.util.ResumableTasks$1.run(ResumableTasks.java:35)',
u'com.facebook.airlift.concurrent.BoundedExecutor.drainQueue(BoundedExecutor.java:78)',
u'java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)',
u'java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)',
u'java.base/java.lang.Thread.run(Thread.java:834)']}, u'errorName':
u'NOT_SUPPORTED'}
{code}
Figure out how to add support for bucketed partitions.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)