[
https://issues.apache.org/jira/browse/SQOOP-1392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14110509#comment-14110509
]
Richard commented on SQOOP-1392:
--------------------------------
For _Avro_, _Text_, and _Sequence_ files,
*FileOutputFormat.getWorkOutputPath(context)* is used to get the temporary work
output path, since the *OutputFormat* for these three types is a *FileOutputFormat*.
However, *DatasetKeyOutputFormat* extends *OutputFormat* directly instead of
*FileOutputFormat*. As a result, *FileOutputFormat.getWorkOutputPath(context)*
cannot be used for _Parquet_ files.
There are two ways to resolve this.
1. Make *DatasetKeyOutputFormat* extend *FileOutputFormat*. Is there any
special reason it was not implemented that way?
2. Generate the temporary work output path manually, as in the attached
*patch.diff*. This would have less impact on Kite SDK and Sqoop.
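To illustrate option 2, here is a minimal sketch of deriving a per-task temporary directory under the task working directory instead of /tmp. It is not the content of *patch.diff*: the class *TempWorkDirSketch*, the helper *tempWorkDir*, the "_temporary" subdirectory name, and the sample attempt ID are all assumptions made for illustration, and it uses java.nio.file rather than Hadoop's Path so that it runs standalone.

```java
import java.nio.file.Path;
import java.nio.file.Paths;

// Illustrative only: a hypothetical helper, not part of Sqoop, Kite SDK, or Hadoop.
final class TempWorkDirSketch {
    private TempWorkDirSketch() {}

    // Builds <taskWorkDir>/_temporary/<taskAttemptId>.
    // The "_temporary" name is an assumed convention for this sketch.
    static Path tempWorkDir(Path taskWorkDir, String taskAttemptId) {
        if (taskWorkDir == null) {
            throw new IllegalArgumentException("taskWorkDir must not be null");
        }
        if (taskAttemptId == null || taskAttemptId.isEmpty()) {
            throw new IllegalArgumentException("taskAttemptId must not be empty");
        }
        // Keying on the attempt ID keeps concurrent mappers from colliding.
        return taskWorkDir.resolve("_temporary").resolve(taskAttemptId);
    }

    public static void main(String[] args) {
        // Hypothetical working directory and attempt ID.
        Path workDir = Paths.get("job-output");
        System.out.println(tempWorkDir(workDir, "attempt_0001_m_000000_0"));
    }
}
```

Because the directory is keyed on the task attempt, each mapper gets its own location inside the working directory, so nothing is left behind in /tmp once the task directory is cleaned up.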
> Create the temporary directory inside task working dir rather than in tmp
> -------------------------------------------------------------------------
>
> Key: SQOOP-1392
> URL: https://issues.apache.org/jira/browse/SQOOP-1392
> Project: Sqoop
> Issue Type: Improvement
> Components: tools
> Reporter: Qian Xu
> Priority: Minor
> Attachments: patch.diff
>
>
> Several mappers create a temporary directory in /tmp rather than inside
> the task working directory. We should change the pattern.
--
This message was sent by Atlassian JIRA
(v6.2#6252)