Mikhail,

Thanks. In your situation, I don't actually see any event files, so it may well be that all events have aged off. You may also want to increase the value of "nifi.provenance.repository.max.storage.time" from "24 hours" to something more long-term, like "365 days". The data will age off based on either max storage time or max storage size, whichever limit is reached first.
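As a rough sketch of that change (the value is just the example figure mentioned above, not a tuned recommendation), the relevant line in nifi.properties would look something like this:

```properties
# Keep provenance events for up to a year instead of the currently configured 24 hours.
# Events can still age off sooner if max.storage.size is reached first.
nifi.provenance.repository.max.storage.time=365 days
```

NiFi reads nifi.properties at startup, so the node would need a restart to pick this up.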
Thanks
-Mark

On Jul 12, 2019, at 12:48 PM, Mikhail Rolshud (BLOOMBERG/ 120 PARK) <[email protected]<mailto:[email protected]>> wrote:

I'll give it a try, but I am nowhere near 1G currently.

# du -ksh provenance_repository/*
144K provenance_repository/index-1561998104305
4.0K provenance_repository/toc

My flows are batch jobs that run once a day; they generate a few thousand events per day at most. A different node with 1.8.0 and default settings, running the same flows using PersistentProvenanceRepository, does not have this problem.

Thank you.

From: [email protected]<mailto:[email protected]>
At: 07/12/19 11:58:14
To: Mikhail Rolshud (BLOOMBERG/ 120 PARK ) <mailto:[email protected]>
Cc: [email protected]<mailto:[email protected]>
Subject: Re: 1.9.2 Does not show provenance events

OK, so the problem is probably these 2 lines:

nifi.provenance.repository.max.storage.size=1 GB
nifi.provenance.repository.index.shard.size=500 MB

You allow up to 1 GB of storage space, which covers both the index (used for searching/lookup) and the event files, which contain the actual events. Each index shard can take up to about 500 MB, and there will be 1-2 of them, which leaves very little space for event files.

These are the default settings and are okay for setting up a small flow on your laptop and playing around with it. But if you have any sort of high volumes at all, you'll want to increase the storage space. The issue is likely that the events are just aging off too quickly for you to see them. Possibly even before they are indexed, if it's running at a pretty high rate.

For a "tiny" production or integration system I'd say to set the max.storage.size to 10 GB instead of 1 GB. That should allow you to hold a lot more and will likely address the issue.
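To make the arithmetic concrete, here is a sketch of the two settings with the larger limit suggested above. The 10 GB figure is the "tiny" production starting point from this message, and the numbers in the comments are only a back-of-the-envelope estimate:

```properties
# Current: 1 GB total, minus 1-2 index shards of up to ~500 MB each,
# leaves roughly 0-500 MB for the event files themselves.
# With 10 GB total, the same 1-2 shards leave roughly 9-9.5 GB for events.
nifi.provenance.repository.max.storage.size=10 GB
nifi.provenance.repository.index.shard.size=500 MB
```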
Thanks
-Mark

On Jul 12, 2019, at 11:29 AM, Mikhail Rolshud (BLOOMBERG/ 120 PARK) <[email protected]<mailto:[email protected]>> wrote:

# Provenance Repository Properties
nifi.provenance.repository.implementation=org.apache.nifi.provenance.WriteAheadProvenanceRepository
nifi.provenance.repository.debug.frequency=1_000_000
nifi.provenance.repository.encryption.key.provider.implementation=
nifi.provenance.repository.encryption.key.provider.location=
nifi.provenance.repository.encryption.key.id=
nifi.provenance.repository.encryption.key=

# Persistent Provenance Repository Properties
nifi.provenance.repository.directory.default=./provenance_repository
nifi.provenance.repository.max.storage.time=24 hours
nifi.provenance.repository.max.storage.size=1 GB
nifi.provenance.repository.rollover.time=30 secs
nifi.provenance.repository.rollover.size=100 MB
nifi.provenance.repository.query.threads=2
nifi.provenance.repository.index.threads=2
nifi.provenance.repository.compress.on.rollover=true
nifi.provenance.repository.always.sync=false
# Comma-separated list of fields. Fields that are not indexed will not be searchable. Valid fields are:
# EventType, FlowFileUUID, Filename, TransitURI, ProcessorID, AlternateIdentifierURI, Relationship, Details
nifi.provenance.repository.indexed.fields=EventType, FlowFileUUID, Filename, ProcessorID, Relationship
# FlowFile Attributes that should be indexed and made searchable. Some examples to consider are filename, uuid, mime.type
nifi.provenance.repository.indexed.attributes=
# Large values for the shard size will result in more Java heap usage when searching the Provenance Repository
# but should provide better performance
nifi.provenance.repository.index.shard.size=500 MB
# Indicates the maximum length that a FlowFile attribute can be when retrieving a Provenance Event from
# the repository. If the length of any attribute exceeds this value, it will be truncated when the event is retrieved.
nifi.provenance.repository.max.attribute.length=65536
nifi.provenance.repository.concurrent.merge.threads=2

# Volatile Provenance Respository Properties
nifi.provenance.repository.buffer.size=100000

From: [email protected]
At: 07/12/19 10:53:13
To: Mikhail Rolshud (BLOOMBERG/ 120 PARK ) , [email protected]
Subject: Re: 1.9.2 Does not show provenance events

What do your provenance-related settings look like in nifi.properties?

Thanks
-Mark

On Jul 12, 2019, at 10:21 AM, Mikhail Rolshud (BLOOMBERG/ 120 PARK) <[email protected]<mailto:[email protected]>> wrote:

WriteAheadProvenanceRepository might be the culprit here. I have another instance running 1.8.0 and it is using PersistentProvenanceRepository. It's been running for months with the same flows and has no issues displaying provenance events. In 1.9.2 PersistentProvenanceRepository is deprecated, and by default nifi.properties has

nifi.provenance.repository.implementation=org.apache.nifi.provenance.WriteAheadProvenanceRepository

From: [email protected]
At: 07/12/19 00:42:17
To: [email protected]
Subject: Re: 1.9.2 Does not show provenance events

I have the same behaviour, with the same setup (container, “external” mount to network drive) on 1.9.2

On Thu, 11 Jul 2019, at 21:08, Mikhail Rolshud (BLOOMBERG/ 120 PARK) wrote:

This is a clean install of 1.9.2, and I can see the provenance events for a couple of days if I wipe out provenance_repository and restart the node. After a while NiFi seems to keep writing them but does not show anything in the UI.

From: [email protected]<mailto:[email protected]>
At: 07/11/19 15:01:21
To: Mikhail Rolshud (BLOOMBERG/ 120 PARK ) <mailto:[email protected]> , [email protected]<mailto:[email protected]>
Subject: Re: 1.9.2 Does not show provenance events

Hello,

I suspect you have to add the new policy. Please see the migration guide for upgrading from the version you had until now.
Thanks

On Thu, Jul 11, 2019, 2:56 PM Mikhail Rolshud (BLOOMBERG/ 120 PARK) <[email protected]<mailto:[email protected]>> wrote:

Hi,

We noticed that after some time 1.9.2 stops showing provenance events. The page just comes up blank. I checked the provenance_repository/ folder and I can see files there get modified as events are happening; I just can't see anything on the UI side.

We are running this build:

1.9.2
04/03/2019 15:25:53 UTC
Tagged nifi-1.9.2-RC2
From ff01ff6 on branch NIFI-6169-RC2

It runs inside a docker container with provenance_repository/ mapped to an external drive mounted to network storage.

# ls -ltr provenance_repository/
total 8
drwxrwxr-x. 2 nifi nifi 4096 Jul 7 12:00 toc
drwxrwxr-x. 2 nifi nifi 4096 Jul 11 18:52 index-1561998104305

There are files in provenance_repository/index-1561998104305/ that get updated as events are happening.

Thanks
