[
https://issues.apache.org/jira/browse/SQOOP-428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13190068#comment-13190068
]
Tom White commented on SQOOP-428:
---------------------------------
Thanks for working on this Lars.
> The current AvroOutputFormat seems to be a copy from the Avro project or a
> very similar one at least.
Avro doesn't support the new MapReduce API yet, which is what Sqoop uses, hence
the AvroOutputFormat in Sqoop. When an Avro release does support the new API
(AVRO-593) Sqoop should use that.
For the test, I would just have a variant of TestAvroImport#testAvroImport()
that has compression enabled.
> AvroOutputFormat doesn't support compression even though documentation claims
> it does
> -------------------------------------------------------------------------------------
>
> Key: SQOOP-428
> URL: https://issues.apache.org/jira/browse/SQOOP-428
> Project: Sqoop
> Issue Type: Bug
> Components: docs
> Affects Versions: 1.4.0-incubating
> Reporter: Lars Francke
> Priority: Minor
> Labels: avro, document
>
> The documentation claims that Avro files can be compressed as well:
> {quote}
> By default, data is not compressed. You can compress your data by using the
> deflate (gzip) algorithm with the -z or --compress argument, or specify any
> Hadoop compression codec using the --compression-codec argument. This applies
> to SequenceFile, text, and Avro files.
> {quote}
> This is not true as the AvroOutputFormat currently doesn't support
> compression.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira