[ 
https://issues.apache.org/jira/browse/AVRO-991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13187346#comment-13187346
 ] 

Doug Cutting commented on AVRO-991:
-----------------------------------

On second thought, I don't think we ought to add this to the spec.  I think a 
tool that can read appended streams and write a single file would be useful, 
but I don't think we should require every implementation to be able to parse 
appended files.  That would be an incompatible change, and, as Scott points 
out, would also create difficult to split files.

I also think Scott's idea of permitting user-spec'd sync markers could be 
useful.
                
> Allow combining multiple Avro files within a stream. (no files on disk)
> -----------------------------------------------------------------------
>
>                 Key: AVRO-991
>                 URL: https://issues.apache.org/jira/browse/AVRO-991
>             Project: Avro
>          Issue Type: Improvement
>          Components: java
>    Affects Versions: 1.6.1
>            Reporter: Frank Grimes
>
> It would be nice to be able to do as follows:
>   cat file1.avro file2.avro | java -jar avro-tools.jar streamcombine > 
> combined-file.avro
> or similarly
>   
>   hadoop dfs -cat hdfs://hadoop/file1.avro hdfs://hadoop/file2.avro | java 
> -jar avro-tools.jar streamcombine | hdfs -put - 
> hdfs://hadoop/combined-file.avro
> See the following thread for details: 
> http://mail-archives.apache.org/mod_mbox/avro-user/201201.mbox/%[email protected]%3e

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to