Joe:

Thanks, I did number one today and it worked great. I sent it a 700 meg file and it
just kept the first 10 lines.

    Thank You.

    Leslie Hartman

Joe Percivall wrote:
For number one, you can also use RouteText[1] with the matching strategy "Satisfies Expression". Then as a dynamic 
property use this expression "${lineNo:le(10)}". This will route first 10 lines to the "matched" relationship 
(assuming "Route to each matching Property Name" is not selected). This option also allows you to route those unmatched 
lines elsewhere if you need (if not just auto-terminate the "unmatched" relationship).
The for number two, instead of ReplaceText, you could also use RouteText. Set the matching strategy to "Matches Regular Expression". Then set the dynamic property to match everything and end with "unambiguously" (an example being "((\w|\W)*unambiguously)"). This will route all the text that matches the Regex apart from the end of the file and gives you the option to route the ending text differently if needed.

[1] 
https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.standard.RouteText/index.html


Joe- - - - - -
Joseph Percivall
linkedin.com/in/Percivall
e: [email protected]



On Sunday, June 5, 2016 4:41 AM, Leslie Hartman <[email protected]> wrote:



Matthew:

     The modifyBytes processor would be the best if it would allow
    one to
specify the bytes to keep. I could calculate the number of bytes to
    delete,
but when I try and place a variable in the End Offset it says it is
    not in the
<Data Size> <Data Unit> format.

     As for SegmentContent and SplitText I have tried both of these.
    The problem
is that it just takes the original file a splits it in to a bunch of
    little files. So if I wanted
say 256 Bytes of a 30 meg file, after running out of memory it would
    give me
125 Million 829 Thousand 119 Files to get rid of.

     For the 2nd case ReplaceText should work, I'm just having
    problems getting
the correct syntax. If someone could provide an example of the
    correct syntax
I would appreciate it.

     Thank You.

     Leslie Hartman


Matthew Clarke wrote:

You may also want to look at using the modifyBytes processor for number 1.
On Jun 4, 2016 1:49 PM, "Thad Guidry" <[email protected]> wrote:

For your 1st case, you can use either SegmentContent by your 256 bytes (or 
perhaps you can even use SplitText)
https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.standard.SegmentContent/index.html

https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.standard.SplitText/index.html



For your 2nd case, you can use ReplaceText
https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi.processors.standard.ReplaceText/index.html



Thad
+ThadGuidry



Reply via email to