[
https://issues.apache.org/jira/browse/NIFI-4728?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17863074#comment-17863074
]
Pushpraj Singh Jadon commented on NIFI-4728:
--------------------------------------------
Does the ReplaceText processor read the entire file content, or does it stop
reading the file when the replacement strategy is met? I have a scenario with
CSV files where the first three rows should be treated as headers. I need to
remove these three rows or keep the first row as the header and remove the
second and third rows from the flow file. Additionally, if I want to search
only for the first match of my replacement strategy, can the processor stop
reading the entire flow file to save time, especially for large files around 2
GB?
> Create ExtractHeader processor
> ------------------------------
>
> Key: NIFI-4728
> URL: https://issues.apache.org/jira/browse/NIFI-4728
> Project: Apache NiFi
> Issue Type: New Feature
> Components: Extensions
> Affects Versions: 1.4.0
> Reporter: Andy LoPresto
> Assignee: Jason Sherman
> Priority: Minor
> Labels: header, processor, text
>
> A frequent community request is to remove a header line from incoming text.
> The {{ExtractHeaderProcessor}} should accept arbitrary text, remove a
> (configurable number of) lines from the beginning, and route a new child
> flowfile containing the header to a {{header}} relationship, and the
> remaining contents to a {{body}} relationship.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)