Thanks, Joe. Given the fact that we would like to add a few attributes and
set them to be indexed for the provenance, the mentioned rate should be
alright?

Cheers,
Ali

On Sat, Feb 16, 2019 at 2:56 PM Joe Witt <joe.w...@gmail.com> wrote:

> Ali
>
> You certainly can and at the rates you mention you should be able to keep
> it for a good while.
>
> Just set the properties you need for your system and measure the rate at
> which prov storage fills.
>
> Thanks
>
> On Fri, Feb 15, 2019 at 10:29 PM Ali Nazemian <alinazem...@gmail.com>
> wrote:
>
>> I didn't mean to use Nifi provenance search for an external provenance
>> search. I meant to use it for internal search provenance but keep the
>> provenance for a longer time than usual. It means instead of expecting it
>> to keep provenance data for a few days, use it as an event store as it also
>> provides the search capability.
>>
>> Regards,
>> Ali
>>
>> On Sat, Feb 16, 2019 at 5:29 AM Andrew Grande <apere...@gmail.com> wrote:
>>
>>> NiFi provenance searches are not a good integration pattern for external
>>> systems. I.e. using it to periodicaly fetch history burdens the cluster
>>> (those searches can be heavy) and disrupt normal processing SLAs.
>>>
>>> Pushing provenance events out to an external system (pitebtially even
>>> filtered down to components of interest) is a much more predictable pattern
>>> and provides lots of flexibility on how to interpret the events.
>>>
>>> Andrew
>>>
>>> On Thu, Feb 14, 2019, 11:26 PM Ali Nazemian <alinazem...@gmail.com>
>>> wrote:
>>>
>>>> Can I expect the Nifi search provenance part do the job for me?
>>>>
>>>> On Fri, 15 Feb. 2019, 13:21 Mike Thomsen <mikerthom...@gmail.com wrote:
>>>>
>>>>> Ali,
>>>>>
>>>>> There is a site to site publishing task for provenance that you can
>>>>> add as a root controller service that would be great here. It'll just take
>>>>> all of your provenance data periodically and ship it off to another NiFi
>>>>> server or cluster that can process all of the provenance data as blocks of
>>>>> JSON data. A common pattern there is to filter down to the events you want
>>>>> and publish to ElasticSearch.
>>>>>
>>>>> On Thu, Feb 14, 2019 at 7:05 PM Ali Nazemian <alinazem...@gmail.com>
>>>>> wrote:
>>>>>
>>>>>> Hi All,
>>>>>>
>>>>>> I am investigating to see how Nifi provenance can be used as an event
>>>>>> store for a long period of time. Our use case is very burst based and
>>>>>> sometimes we may not receive any event for a period of time and sometimes
>>>>>> we may get burst traffic. On average we can say maybe around 1000 eps is
>>>>>> the expected throughput at this stage. Nifi has a powerful provenance 
>>>>>> that
>>>>>> gives you an ability to also index based on some attributes. I am
>>>>>> investigating how reliable is to use Nifi provenance store for a long
>>>>>> period of time and enable index for a few extra attributes. Has anybody
>>>>>> used Nifi provenance at this scale? Can lots of Lucene indices create 
>>>>>> other
>>>>>> issues within Nifi as provenance uses Lucene for the indexing?
>>>>>>
>>>>>> P.S: Our use case is pretty light for Nifi as we are not going to
>>>>>> have any ETL and Nifi is being used mostly as an Orchestrator of multiple
>>>>>> Microservices.
>>>>>>
>>>>>> Regards,
>>>>>> Ali
>>>>>>
>>>>>
>>
>> --
>> A.Nazemian
>>
>

-- 
A.Nazemian

Reply via email to