Hi,
Thx for your answer. I have one more question, if I use "provided" as strategy 
with earliest and latest time range , I am getting duplicate records as many 
times, how to handle this? And also I have requirement that once we process the 
those records by provided time range, post that it should continue as current 
strategy.

Siva

On Nov 14, 2017, at 7:34 PM, Bryan Bende 
<[email protected]<mailto:[email protected]>> wrote:

Pierre is correct...

Currently you can reset the state back to the beginning by right-clicking on 
the processor and selecting View State and then Clear State.

From there you could use "Managed from Beginning" to start over from the 
beginning", but there is no way to start at a specific point in time, only 
beginning or current time.

On Tue, Nov 14, 2017 at 3:33 AM, Pierre Villard 
<[email protected]<mailto:[email protected]>> wrote:
Hi Siva,

The processor is storing a "state" in the state management back-end of NiFi 
(Zookeeper usually). There is no way for you to edit this value. However, some 
processors expose a property allowing you to manually set the initial value 
(Example GenerateTableFetch [1]) that the processor will use when it is 
started. I don't know Splunk processors but I believe this is what you'd like. 
If yes, I can only suggest you to submit a JIRA asking for this feature [2].

[1] 
https://nifi.apache.org/docs/nifi-docs/components/org.apache.nifi/nifi-standard-nar/1.4.0/org.apache.nifi.processors.standard.GenerateTableFetch/index.html
[2] https://issues.apache.org/jira/projects/NIFI (you need to register first)

Pierre

2017-11-14 3:44 GMT+01:00 Sivakumar, S 
<[email protected]<mailto:[email protected]>>:
Hello,
Even I provide Earliest Time and Latest Time with “Provided” strategy, it is 
causing data duplicate in the system. The same number of records are repeated 
as many times, when the workflow runs.

My question is, even if specify the "Managed" time strategy, how to go back to 
certain time ranges (basically the re-pulling the same records which already 
pulled), by changing the stored value somewhere in the system where processor 
refers to that. Could be from persistent provenance repo. How to tweak the time 
range values from those repo?

-Siva

From: Bryan Bende <[email protected]<mailto:[email protected]>>
Reply-To: "[email protected]<mailto:[email protected]>" 
<[email protected]<mailto:[email protected]>>
Date: Monday, November 13, 2017 at 11:57 PM
To: "[email protected]<mailto:[email protected]>" 
<[email protected]<mailto:[email protected]>>
Cc: "Bharani, Manish" 
<[email protected]<mailto:[email protected]>>
Subject: Re: Splunk Processor - Re-play

Hello,
If you want to specify Earliest Time and Latest Time, then you need to change 
Time Range Strategy to 'Provided".
The "Managed" time ranger strategies are meant to let the processor calculate 
the time ranges for you on each execution and you can not specify time ranges 
when using those strategies.
-Bryan

On Mon, Nov 13, 2017 at 9:03 AM, Sivakumar, S 
<[email protected]<mailto:[email protected]>> wrote:
Hi Folks,
I am newbie to nifi tool. I am using GetSplunk 1.4.0 processor to pull data 
from Splunk. Somewhat I have managed pulled the data for T-3, but I want to 
re-play and again want to pull data with some more additional transformation 
added in the splunk query.

I have below two problems

1. No data is pulled in the Flow control.
2. If I change the “Time Range Strategy “
                a. Provided, the SAME data is pulled as many number of times 
till the Flow control runs,
                b. Managed from Beginning, it is pulled huge volume of data.

Please advise me how to replay the flow control from where I want and continue 
from that point onwards


<image001.png>

-Siva



Reply via email to