[
https://issues.apache.org/jira/browse/NIFI-3356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15868701#comment-15868701
]
ASF GitHub Bot commented on NIFI-3356:
--------------------------------------
Github user markap14 commented on a diff in the pull request:
https://github.com/apache/nifi/pull/1493#discussion_r101399653
--- Diff:
nifi-framework-api/src/main/java/org/apache/nifi/provenance/IdentifierLookup.java
---
@@ -0,0 +1,88 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one or more
+ * contributor license agreements. See the NOTICE file distributed with
+ * this work for additional information regarding copyright ownership.
+ * The ASF licenses this file to You under the Apache License, Version 2.0
+ * (the "License"); you may not use this file except in compliance with
+ * the License. You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.nifi.provenance;
+
+import java.util.Collections;
+import java.util.HashMap;
+import java.util.List;
+import java.util.Map;
+
+/**
+ * Provides a mechanism for obtaining the identifiers of components,
queues, etc.
+ */
+public interface IdentifierLookup {
+
+ /**
+ * @return the identifiers of components that may generate Provenance
Events
+ */
+ List<String> getComponentIdentifiers();
+
+ /**
+ * @return a list of component types that may generate Provenance
Events
+ */
+ List<String> getComponentTypes();
+
+ /**
+ *
+ * @return the identifiers of FlowFile Queues that are in the flow
+ */
+ List<String> getQueueIdentifiers();
+
+ default Map<String, Integer> invertQueueIdentifiers() {
+ return invertList(getQueueIdentifiers());
+ }
+
+ default Map<String, Integer> invertComponentTypes() {
+ return invertList(getComponentTypes());
+ }
+
+ default Map<String, Integer> invertComponentIdentifiers() {
+ return invertList(getComponentIdentifiers());
+ }
+
+ default Map<String, Integer> invertList(final List<String> values) {
--- End diff --
That is true. Should not be an issue, though, since these values are all
expected to be unique identifiers.
> Provide a newly refactored provenance repository
> ------------------------------------------------
>
> Key: NIFI-3356
> URL: https://issues.apache.org/jira/browse/NIFI-3356
> Project: Apache NiFi
> Issue Type: Task
> Components: Core Framework
> Reporter: Mark Payne
> Assignee: Mark Payne
> Fix For: 1.2.0
>
>
> The Persistent Provenance Repository has been redesigned a few different
> times over several years. The original design for the repository was to
> provide storage of events and sequential iteration over those events via a
> Reporting Task. After that, we added the ability to compress the data so that
> it could be held longer. We then introduced the notion of indexing and
> searching via Lucene. We've since made several more modifications to try to
> boost performance.
> At this point, however, the repository is still the bottleneck for many flows
> that handle large volumes of small FlowFiles. We need a new implementation
> that is based around the current goals for the repository and that can
> provide better throughput.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)