[ https://issues.apache.org/jira/browse/SQOOP-1393?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14133060#comment-14133060 ]
Richard commented on SQOOP-1393: -------------------------------- {code} org.kitesdk.data.DatasetExistsException: Metadata already exists for dataset:null {code} This error means that the table has already existed in metadata in hive. This probably because you have tried to import into hive before, but it failed in the second step. (There are 2 steps when import into hive: 1. create metadata information in hive; 2. create parquet file in hdfs). So please delete table in metadata in hive if this error happens. > Import data from database to Hive as Parquet files > -------------------------------------------------- > > Key: SQOOP-1393 > URL: https://issues.apache.org/jira/browse/SQOOP-1393 > Project: Sqoop > Issue Type: Sub-task > Components: tools > Reporter: Qian Xu > Assignee: Richard > Fix For: 1.4.6 > > Attachments: patch.diff, patch_v2.diff, patch_v3.diff > > > Import data to Hive as Parquet file can be separated into two steps: > 1. Import an individual table from an RDBMS to HDFS as a set of Parquet files. > 2. Import the data into Hive by generating and executing a CREATE TABLE > statement to define the data's layout in Hive with Parquet format table -- This message was sent by Atlassian JIRA (v6.3.4#6332)