Re: BufferedReader best option to search through large flowfiles?

Lars Winderling Mon, 05 Jun 2023 05:22:59 -0700

Hi James,

In case the NiFi processors such as ExtractText, ReplaceText and RouteOnContent (maybe several in a row or in parallel) do not match your use case, I'd definitely go with a buffered reader and line-wise processing. AFAIK, in Groovy you can get that as easily as

    new File("/path/to/my/file").eachLine { line -> ... }
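
For the multi-pattern scan itself, here is a rough sketch in plain Java (it should carry over to a Groovy script with few changes). The class name LineScanner and the method countMatchingLines are made up for illustration, UTF-8 is an assumption about your content, and counting matching lines is just a stand-in for whatever you need to do with a hit. In a NiFi script the InputStream would be the flowfile content rather than a file on disk.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical helper, not part of NiFi: scans a stream one line at a time.
public class LineScanner {

    // For each pattern, counts how many lines contain at least one match.
    // Only the current line is held in memory, never the whole content.
    public static Map<String, Integer> countMatchingLines(InputStream in, List<Pattern> patterns) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (Pattern p : patterns) {
            counts.put(p.pattern(), 0);
        }
        // Assumption: the content is UTF-8 text with line breaks.
        try (BufferedReader reader =
                 new BufferedReader(new InputStreamReader(in, StandardCharsets.UTF_8))) {
            String line;
            while ((line = reader.readLine()) != null) {
                for (Pattern p : patterns) {
                    if (p.matcher(line).find()) {
                        counts.merge(p.pattern(), 1, Integer::sum);
                    }
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return counts;
    }
}
```

Compile the patterns once up front and reuse them for every line, as above. Keep in mind that this only bounds memory if the content really has line breaks: a single very long line is still read into memory as a whole.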


Enjoy your day and take care!
Best,
Lars

On 23-06-05 14:09, James McMahon wrote:

Hello. I have a requirement to scan for multiple regex patterns in very large flowfiles. Given that my flowfiles can be very large, I think my best approach is to employ an ExecuteGroovyScript processor and a script using a BufferedReader to scan the file one line at a time.
I am concerned that I might exhaust JVM resources if I try to process such large content all at once. Is a BufferedReader the right call? Does anyone recommend a better approach?
Thanks in advance,
Jim
