[ https://issues.apache.org/jira/browse/AVRO-1980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15817860#comment-15817860 ]
Gabor Szadovszky commented on AVRO-1980:
----------------------------------------

DataFileWriter does not actually write record-by-record. It writes the records in blocks instead. You can read more about it at http://avro.apache.org/docs/1.8.1/spec.html#Object+Container+Files.
Or did I misunderstand your issue?

> Write to Avro File in Bulk
> ---------------------------
>
>                 Key: AVRO-1980
>                 URL: https://issues.apache.org/jira/browse/AVRO-1980
>             Project: Avro
>          Issue Type: Improvement
>          Components: build, java
>    Affects Versions: 1.8.1
>            Reporter: Santosh Balasubramanya
>
> when writing to Avro files usually append happens record by record.
> Can't it be done by buffering and then committing it to file?
> Below example
> DatumWriter<User> userDatumWriter = new SpecificDatumWriter<User>(User.class);
> DataFileWriter<User> dataFileWriter = new DataFileWriter<User>(userDatumWriter);
> dataFileWriter.create(user1.getSchema(), new File("users.avro"));
> dataFileWriter.append(user1);
> dataFileWriter.append(user2);
> dataFileWriter.append(user3);
> dataFileWriter.close();
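
To illustrate the block behaviour mentioned in the comment, here is a minimal, self-contained sketch of size-triggered block buffering. BlockBufferedWriter, its blockSizeBytes parameter and getBlocksWritten() are hypothetical names made up for this illustration. It is not Avro's DataFileWriter, and it leaves out the block headers, sync markers and compression described in the Object Container Files section of the spec. It only shows why calling append() per record does not have to mean one write to the file per record.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.OutputStream;
import java.nio.charset.StandardCharsets;

/** Hypothetical illustration of block buffering; not part of Avro. */
final class BlockBufferedWriter implements AutoCloseable {
    private final OutputStream out;
    private final int blockSizeBytes; // assumed threshold, chosen by the caller
    private final ByteArrayOutputStream buffer = new ByteArrayOutputStream();
    private int recordsInBlock = 0;
    private int blocksWritten = 0;

    BlockBufferedWriter(OutputStream out, int blockSizeBytes) {
        this.out = out;
        this.blockSizeBytes = blockSizeBytes;
    }

    /** Adds one record to the in-memory block; touches 'out' only when the block is full. */
    void append(String record) throws IOException {
        byte[] bytes = record.getBytes(StandardCharsets.UTF_8);
        buffer.write(bytes, 0, bytes.length);
        recordsInBlock++;
        if (buffer.size() >= blockSizeBytes) {
            writeBlock();
        }
    }

    /** Writes the buffered records to the underlying stream as one block. */
    private void writeBlock() throws IOException {
        if (recordsInBlock == 0) {
            return; // nothing buffered, nothing to write
        }
        buffer.writeTo(out);
        buffer.reset();
        recordsInBlock = 0;
        blocksWritten++;
    }

    int getBlocksWritten() {
        return blocksWritten;
    }

    /** Writes any partially filled block, then closes the underlying stream. */
    @Override
    public void close() throws IOException {
        writeBlock();
        out.close();
    }
}
```

With a 10-byte threshold, three 4-byte records end up in the underlying stream as a single block on the third append(), and close() writes whatever is left over. In Avro itself the comparable threshold should be adjustable through DataFileWriter.setSyncInterval(int), though please check the Javadoc for the exact semantics.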