[ 
https://issues.apache.org/jira/browse/AVRO-1250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13576363#comment-13576363
 ] 

Doug Cutting commented on AVRO-1250:
------------------------------------

This will be a great addition!

A few improvements I'd like to see:
 - this should accept Hadoop URIs in addition to file names, perhaps by using 
tool.Util#inputOrStdin
 - running the command without any parameters should print a usage message
 - I'd prefer we call this simply 'concat' rather than 'bulkconcat'
                
> Add a Tool that Efficiently Concatenates Avro Files
> ---------------------------------------------------
>
>                 Key: AVRO-1250
>                 URL: https://issues.apache.org/jira/browse/AVRO-1250
>             Project: Avro
>          Issue Type: New Feature
>          Components: java
>    Affects Versions: 1.7.3
>            Reporter: Nick White
>              Labels: tools
>             Fix For: 1.7.4, 1.8.0
>
>         Attachments: AVRO-1250.0.patch
>
>
> It'd be useful to have a Tool that uses appendAllFrom (with recompress set to 
> false) to concatenate one or more of avro files to either another file or 
> stdout. The Tool would fail if any input files had a different schema or 
> metadata to ensure the tool doesn't try to decompress the contents.
> I've attached a patch that does this.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to