Hi James,

In case the NiFi processors such as ExtractText, ReplaceText and RouteOnContent (maybe several in a row or in parallel) do not match your use case, I'd definitely go with a buffered reader and line-wise processing. AFAIK you can get it as easily as
    new File("/path/to/my/file").eachLine { line -> ... }
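
For several patterns at once, a rough sketch could look like the following. The helper name countMatchingLines, the pattern names and the sample input are made up for illustration, and obtaining the flowfile's InputStream inside ExecuteGroovyScript is left out here. Only one line is held in memory at a time.

```groovy
import java.nio.charset.StandardCharsets
import java.util.regex.Pattern

// Hypothetical helper (not part of NiFi): counts, for each named pattern,
// how many lines of the stream contain at least one match.
Map<String, Integer> countMatchingLines(InputStream input, Map<String, Pattern> patterns) {
    Map<String, Integer> counts = patterns.collectEntries { name, p -> [(name): 0] }
    def reader = new BufferedReader(new InputStreamReader(input, StandardCharsets.UTF_8))
    // Groovy's Reader.eachLine reads line by line and closes the reader afterwards.
    reader.eachLine { String line ->
        patterns.each { name, pattern ->
            if (pattern.matcher(line).find()) {
                counts[name] += 1
            }
        }
    }
    return counts
}

// Usage with a small in-memory stream standing in for the flowfile content (assumed data).
def sample = new ByteArrayInputStream(
        "ERROR disk full\nINFO ok\nERROR id=42\n".getBytes(StandardCharsets.UTF_8))
def result = countMatchingLines(sample, [error: ~/^ERROR/, id: ~/id=\d+/])
println result
```

Compiling the patterns once up front and reusing them for every line keeps the per-line cost low, and memory use depends on the longest line rather than on the file size.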

Enjoy your day and take care!
Best,
Lars

On 23-06-05 14:09, James McMahon wrote:
Hello. I have a requirement to scan for multiple regex patterns in very large flowfiles. Given that my flowfiles can be very large, I think my best approach is to employ an ExecuteGroovyScript processor and a script using a BufferedReader to scan the file one line at a time.

I am concerned that I might exhaust JVM resources if I try to process such large content all at once. Is a BufferedReader the right call? Does anyone recommend a better approach?

Thanks in advance,
Jim
