[ https://issues.apache.org/jira/browse/SQOOP-1334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14026553#comment-14026553 ]

Souvik Bose edited comment on SQOOP-1334 at 6/10/14 3:17 PM:
-------------------------------------------------------------

As you can see above, the input format is
org.apache.hadoop.mapred.TextInputFormat
and the output format is
org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat. I dug through
the logs in more detail and noticed one thing that looks unusual when
compared with the logs of other runs:

14/06/05 08:45:24 INFO mapred.FileInputFormat: Total input paths to process : 35244
14/06/05 08:45:43 DEBUG hcat.SqoopHCatExportFormat: Expected split count 4
14/06/05 08:45:43 DEBUG hcat.SqoopHCatExportFormat: HCatInputFormat provided split count 35244

In all of the successful cases I see that the two counts match, at 4. That is
expected, since HCatalog tables imported with Sqoop have 4 HDFS files,
generated with names like part0000-m. This case, however, is an external
table that references an HDFS location containing a large number of files. So
here the expected split count should be 35244 instead of only 4. Why it
assumes only 4 may be the issue.
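
To compare the two counts across several job logs, a small script along these
lines might help. It is only a sketch: the names `split_counts` and
`counts_differ` are made up for illustration, and it assumes the DEBUG lines
are worded exactly as in the excerpt above.

```python
import re

# Patterns for the two DEBUG lines written by hcat.SqoopHCatExportFormat.
# The wording is taken from the log excerpt in this comment (an assumption
# for any other Sqoop version).
EXPECTED_RE = re.compile(r"Expected split count (\d+)")
PROVIDED_RE = re.compile(r"HCatInputFormat provided split count (\d+)")

SAMPLE_LOG = """\
14/06/05 08:45:24 INFO mapred.FileInputFormat: Total input paths to process :
35244
14/06/05 08:45:43 DEBUG hcat.SqoopHCatExportFormat: Expected split count 4
14/06/05 08:45:43 DEBUG hcat.SqoopHCatExportFormat: HCatInputFormat provided
split count 35244
"""


def split_counts(log_text):
    """Return (expected, provided) split counts; None for a count not found."""
    # Collapse all whitespace so a message wrapped onto the next line still matches.
    flat = " ".join(log_text.split())
    expected = EXPECTED_RE.search(flat)
    provided = PROVIDED_RE.search(flat)
    return (
        int(expected.group(1)) if expected else None,
        int(provided.group(1)) if provided else None,
    )


def counts_differ(log_text):
    """True only when both counts are present in the log and are not equal."""
    expected, provided = split_counts(log_text)
    return expected is not None and provided is not None and expected != provided


if __name__ == "__main__":
    print(split_counts(SAMPLE_LOG))
    print(counts_differ(SAMPLE_LOG))
```

Running it over the logs of the successful exports and the failing one would
show whether the mismatch appears only in the failing case.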


> Sqoop Hcatalog export failed with exception of Failed to load foster storage 
> handler during map reduce
> ------------------------------------------------------------------------------------------------------
>
>                 Key: SQOOP-1334
>                 URL: https://issues.apache.org/jira/browse/SQOOP-1334
>             Project: Sqoop
>          Issue Type: Bug
>          Components: hive-integration
>    Affects Versions: 1.4.3
>         Environment: CDH4.4.0 with cloudera manager.
>            Reporter: Souvik Bose
>            Priority: Blocker
>              Labels: hcatalog, sqoop, storage-handler
>         Attachments: Sqoop_Hcatalog_Export_Log_verbose.txt
>
>
> I tried to export a Hive external table to MySQL using the Sqoop HCatalog
> export command:
> sqoop export --connect jdbc:mysql://example.com:3306/demo1 --username demo-password demo1234 bd_sqoop1 --hcatalog-table test_data
> I have set HCAT_HOME, HADOOP_CLASSPATH, HIVE_HOME and the paths of the jars.
> The job now progresses up to the MapReduce stage, but stops at map 0% reduce
> 0% with the exception below:
> 14/06/04 13:27:21 INFO mapred.JobClient: Task Id : attempt_201406021302_0043_m_000002_1, Status : FAILED
> java.io.IOException: Error initializing HCat record reader
>         at org.apache.sqoop.mapreduce.hcat.SqoopHCatRecordReader.initNextRecordReader(SqoopHCatRecordReader.java:146)
>         at org.apache.sqoop.mapreduce.hcat.SqoopHCatRecordReader.<init>(SqoopHCatRecordReader.java:57)
>         at org.apache.sqoop.mapreduce.hcat.SqoopHCatExportFormat.createRecordReader(SqoopHCatExportFormat.java:128)
>         at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:644)
>         at org.apache.hadoop.mapred.MapTask.run(MapTask.java:330)
>         at org.apache.hadoop.mapred.Child$4.run(Child.java:268)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at javax.security.auth.Subject.doAs(Subject.java:396)
>         at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1408)
>         at org.apache.hadoop.mapred.Child.main(Child.java:262)
> Caused by: java.io.IOException: Failed to load foster storage handler
>         at org.apache.hcatalog.common.HCatUtil.getStorageHandler(HCatUtil.java:416)
>         at org.apache.hcatalog.common.HCatUtil.getStorageHandl



