[
https://issues.apache.org/jira/browse/NIFI-919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14736824#comment-14736824
]
Sean Busbey commented on NIFI-919:
----------------------------------
{code}
+ public static final PropertyDescriptor SPLIT_SIZE = new
PropertyDescriptor.Builder()
+ .name("Split Size")
+ .description("The number of Avro records to include per split.")
+ .addValidator(StandardValidators.INTEGER_VALIDATOR)
+ .required(true)
+ .defaultValue("1")
+ .build();
+
{code}
Can we do this as output file size instead? Maybe "no larger than X" so that we
have an easy time dealing with degenerate cases (like the last bit of records).
> Support Splitting Avro Files
> ----------------------------
>
> Key: NIFI-919
> URL: https://issues.apache.org/jira/browse/NIFI-919
> Project: Apache NiFi
> Issue Type: New Feature
> Reporter: Bryan Bende
> Assignee: Bryan Bende
> Priority: Minor
> Fix For: 0.4.0
>
> Attachments: NIFI-919.patch
>
>
> Provide a processor that splits an Avro file into multiple smaller files.
> Would be nice to have a configurable batch size so a user could produce
> single record files and also multi-record files of smaller size than the
> original. Also consider making the output format configurable, data file vs
> bare record.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)