Re: Duplicate Attribute Values in Extract Text Processor Output

2020-07-11 Thread muhyid72
Hi Mark, Hi Mark, I would like to say thank you for your advice. I did your described method. It is working and giving better performance. -- Sent from: http://apache-nifi-users-list.2361937.n4.nabble.com/

Re: Duplicate Attribute Values in Extract Text Processor Output

2020-06-26 Thread muhyid72
Hi Mark, Thanks for your answer Actually i don't have so much experience on NiFi I guess, i couldn't understand correctly your explanation I want to append extra words beginning of each line for example: my IIS Log File line like this: 2020-03-13 13:59:19 XXX-YYY GET /Maintenance/Status.svc

Re: Duplicate Attribute Values in Extract Text Processor Output

2020-06-26 Thread Mark Payne
You’ll want to connect FetchAzureBlob -> ReplaceText -> PutTCP. ReplaceText would use the Evaluation Mode of Line-by-Line to update the text. Or, alternatively, you could use UpdateRecord. Thanks -Mark > On Jun 26, 2020, at 2:36 PM, muhyid72 wrote: > > Hi Mark, > > Thank you so much for

Re: Duplicate Attribute Values in Extract Text Processor Output

2020-06-26 Thread muhyid72
Hi Mark, Thank you so much for valuable advice. I tried PutTCP it seems working. I would like to make a summary for your explanation and ask a questions If I understand correctly; 1. Getting IIS Log Files from Azure Blob Storage same as before 1.1. List Azure Blob Storage Processor 1.2. Route

Re: Duplicate Attribute Values in Extract Text Processor Output

2020-06-26 Thread Mark Payne
If performance is the problem, then you definitely want to get rid of any SplitText / Split* processors. These processors are great when they are necessary but they should be avoided if at all possible, because splitting the data apart results in huge overhead for NiFi and will harm performance

Re: Duplicate Attribute Values in Extract Text Processor Output

2020-06-26 Thread muhyid72
Hi Andy, Thank you for your great support My aim is transferring all IIS logs to syslog line by line. Therefore i am using split text for parsing line. I tried Route Text yesterday but i didn't accomplish to transfer line by line to syslog. Extract Text is transferring splitted line on the

Re: Duplicate Attribute Values in Extract Text Processor Output

2020-06-25 Thread Andy LoPresto
The resulting flowfile will always have at least two attributes because the whole match is extracted as an attribute and every capture group is extracted as an attribute, and the expression must contain at least one capture group. What is the objective you are trying to accomplish? If you want

Re: Duplicate Attribute Values in Extract Text Processor Output

2020-06-25 Thread muhyid72
Hi Andy, Thank you for your quick answer and interest. Actually I tried that but there were still 2 attributes on the flow file. As far as I understand it is by design, I can't set just one attribute, it has at least 2. Am i right? Can I use Route Text Processor instead of Extract Text (I have

Re: Duplicate Attribute Values in Extract Text Processor Output

2020-06-25 Thread Andy LoPresto
The regex you’re using contains a capture group, and so the entire string is captured as one attribute, and then the contained capture groups are also extracted as attributes. You can set the property “Include Capture Group 0” to false to remove one of them. The others are provided as expected.

Duplicate Attribute Values in Extract Text Processor Output

2020-06-25 Thread muhyid72
Dear All I need an information about Flow Files Attribute of Extract Text Processor. My flow is that; 1. Getting IIS Log files from Azure Blob Storage 2. Splitting each IIS Log File to line by line with Split Text Processor. 2.1. Line Split Count:1 2.2. Maximum Fragment Size: No value set 2.3.