Matt Burgess created NIFI-9814:
----------------------------------
Summary: Add range sampling to SampleRecord
Key: NIFI-9814
URL: https://issues.apache.org/jira/browse/NIFI-9814
Project: Apache NiFi
Issue Type: Improvement
Components: Extensions
Reporter: Matt Burgess
It would be nice to be able to specify exactly which records or ranges of
records are sampled from a FlowFile. For example, if the first 5 lines of a
comma-delimited file are free text (so it is not technically a CSV file from
NiFi's perspective, since those 5 lines do not constitute a header), it would
be handy to exclude them by specifying a range filter of "6-", meaning the 6th
and every following record should be output.

In that vein, SampleRecord could have a "range sampling" strategy where the
user could specify something like "2, 5-7, 25-", so that the second, fifth,
sixth, and seventh records, plus every record from the twenty-fifth on, would
be included in the outgoing FlowFile.
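
To make the proposed semantics concrete, here is a minimal sketch in Python of
how such a range expression might be parsed and applied. It is an illustration
only, not NiFi code: the names parse_ranges and sample_by_range are
hypothetical, and it assumes 1-based record positions, inclusive bounds, and
that a trailing "-" means "through the last record".

```python
from typing import Iterable, Iterator, List, Optional, Tuple, TypeVar

T = TypeVar("T")

# (start, end) with 1-based inclusive bounds; end=None means open-ended.
Range = Tuple[int, Optional[int]]


def parse_ranges(spec: str) -> List[Range]:
    """Parse an expression such as "2, 5-7, 25-" into a list of ranges.

    Hypothetical helper: the syntax is assumed from the issue description.
    """
    ranges: List[Range] = []
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate stray commas
        if "-" in part:
            start_text, end_text = part.split("-", 1)
            start = int(start_text)
            # An empty right-hand side ("25-") leaves the range open-ended.
            end = int(end_text) if end_text.strip() else None
        else:
            start = end = int(part)
        if start < 1 or (end is not None and end < start):
            raise ValueError(f"invalid range: {part!r}")
        ranges.append((start, end))
    return ranges


def sample_by_range(records: Iterable[T], spec: str) -> Iterator[T]:
    """Yield only the records whose 1-based position falls in some range."""
    ranges = parse_ranges(spec)
    for index, record in enumerate(records, start=1):
        if any(s <= index and (e is None or index <= e) for s, e in ranges):
            yield record


if __name__ == "__main__":
    records = [f"record-{i}" for i in range(1, 31)]
    print(list(sample_by_range(records, "2, 5-7, 25-")))
    print(list(sample_by_range(records, "6-"))[:3])
```

Checking every range for every record keeps the sketch short; a real
implementation would likely sort and merge the ranges so it can stream through
large FlowFiles with less work per record.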