[14/51] [partial] nifi git commit: NIFI-850 removed nifi parent, updated nifi pom, moved all nifi subdirs up one level, fixed readme.

joewitt Sat, 15 Aug 2015 10:13:35 -0700

http://git-wip-us.apache.org/repos/asf/nifi/blob/aa998847/nifi-docs/src/main/asciidoc/user-guide.adoc
----------------------------------------------------------------------
diff --git a/nifi-docs/src/main/asciidoc/user-guide.adoc 
b/nifi-docs/src/main/asciidoc/user-guide.adoc
new file mode 100644
index 0000000..b793af4
--- /dev/null
+++ b/nifi-docs/src/main/asciidoc/user-guide.adoc
@@ -0,0 +1,1468 @@
+//
+// Licensed to the Apache Software Foundation (ASF) under one or more
+// contributor license agreements.  See the NOTICE file distributed with
+// this work for additional information regarding copyright ownership.
+// The ASF licenses this file to You under the Apache License, Version 2.0
+// (the "License"); you may not use this file except in compliance with
+// the License.  You may obtain a copy of the License at
+//
+//     http://www.apache.org/licenses/LICENSE-2.0
+//
+// Unless required by applicable law or agreed to in writing, software
+// distributed under the License is distributed on an "AS IS" BASIS,
+// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+// See the License for the specific language governing permissions and
+// limitations under the License.
+//
+Apache NiFi User Guide
+======================
+Apache NiFi Team <[email protected]>
+:homepage: http://nifi.apache.org
+
+
+Introduction
+------------
+Apache NiFi is a dataflow system based on the concepts of flow-based 
programming. It supports
+powerful and scalable directed graphs of data routing, transformation, and 
system mediation logic. NiFi has
+a web-based user interface for design, control, feedback, and monitoring of 
dataflows. It is highly configurable
+along several dimensions of quality of service, such as loss-tolerant versus 
guaranteed delivery, low latency versus
+high throughput, and priority-based queuing. NiFi provides fine-grained data 
provenance for all data received, forked, joined
+cloned, modified, sent, and ultimately dropped upon reaching its configured 
end-state.
+
+See the Admin Guide for information about system requirements, installation, 
and configuration. Once NiFi is installed,
+use a supported web browser to view the User Interface. Supported web browsers 
include:
+
+* Internet Explorer 9+ (see note below)
+* Mozilla FireFox 24+
+* Google Chrome 36+
+* Safari 8
+
+Note that there is a known issue in Internet Explorer (IE) 10 and 11 that can 
cause problems when moving items on the NiFi graph. If you encounter this 
problem, we suggest using a browser other than IE. This known issue is 
described here: https://connect.microsoft.com/IE/Feedback/Details/1050422.
+
+
+
+[template="glossary", id="terminology"]
+Terminology
+-----------
+*DataFlow Manager*: A DataFlow Manager (DFM) is a NiFi user who has 
permissions to add, remove, and modify components of a NiFi dataflow.
+
+*FlowFile*: The FlowFile represents a single piece of data in NiFi. A FlowFile 
is made up of two components:
+       FlowFile Attributes and FlowFile Content.
+       Content is the data that is represented by the FlowFile. Attributes are 
characteristics that provide information or
+       context about the data; they are made up of key-value pairs.
+       All FlowFiles have the following Standard Attributes:
+
+- *uuid*: A unique identifier for the FlowFile
+- *filename*: A human-readable filename that may be used when storing the data 
to disk or in an external service
+- *path*: A hierarchically structured value that can be used when storing data 
to disk or an external service so that the data is not stored in a single 
directory
+
+*Processor*: The Processor is the NiFi component that is used to listen for 
incoming data; pull data from external sources;
+       publish data to external sources; and route, transform, or extract 
information from FlowFiles.
+
+*Relationship*: Each Processor has zero or more Relationships defined for it. 
These Relationships are named to indicate the result of processing a FlowFile.
+       After a Processor has finished processing a FlowFile, it will route (or 
âtransferâ) the FlowFile to one of the Relationships.
+       A DFM is then able to connect each of these Relationships to other 
components in order to specify where the FlowFile should
+       go next under each potential processing result.
+
+*Connection*: A DFM creates an automated dataflow by dragging components from 
the Components part of the NiFi toolbar to the canvas
+       and then connecting the components together via Connections. Each 
connection consists of one or more Relationships.
+       For each Connection that is drawn, a DFM can determine which 
Relationships should be used for the Connection.
+       This allows data to be routed in different ways based on its processing 
outcome. Each connection houses a FlowFile Queue.
+       When a FlowFile is transferred to a particular Relationship, it is 
added to the queue belonging to the associated Connection.
+
+*Controller Service*: Controller Services are extension points that, after 
being added and configured by a DFM in the User Interface, will start up when 
NiFi starts up and provide information for use by other components (such as 
processors or other controller services). A common Controller Service used by 
several components is the StandardSSLContextService. It provides the ability to 
configure keystore and/or truststore properties once and reuse that 
configuration throughout the application. The idea is that, rather than 
configure this information in every processor that might need it, the 
controller service provides it for any processor to use as needed.
+
+*Reporting Task*: Reporting Tasks run in the background to provide statistical 
reports about what is happening in the NiFi instance. The DFM adds and 
configures Reporting Tasks in the User Interface as desired. Common reporting 
tasks include the ControllerStatusReportingTask, MonitorDiskUsage reporting 
task, MonitorMemory reporting task, and the StandardGangliaReporter.
+
+*Funnel*: A funnel is a NiFi component that is used to combine the data from 
several Connections into a single Connection.
+
+*Process Group*: When a dataflow becomes complex, it often is beneficial to 
reason about the dataflow at a higher, more abstract level.
+       NiFi allows multiple components, such as Processors, to be grouped 
together into a Process Group.
+       The NiFi User Interface then makes it easy for a DFM to connect 
together multiple Process Groups into a logical dataflow,
+       as well as allowing the DFM to enter a Process Group in order to see 
and manipulate the components within the Process Group.
+
+*Port*: Dataflows that are constructed using one or more Process Groups need a 
way to connect a Process Group to other dataflow components.
+       This is achieved by using Ports. A DFM can add any number of Input 
Ports and Output Ports to a Process Group and name these ports appropriately.
+
+*Remote Process Group*: Just as data is transferred into and out of a Process 
Group, it is sometimes necessary to transfer data from one instance of NiFi to 
another.
+       While NiFi provides many different mechanisms for transferring data 
from one system to another, Remote Process Groups are often the easiest way to 
accomplish
+       this if transferring data to another instance of NiFi.
+
+*Bulletin*: The NiFi User Interface provides a significant amount of 
monitoring and feedback about the current status of the application.
+       In addition to rolling statistics and the current status provided for 
each component, components are able to report Bulletins.
+       Whenever a component reports a Bulletin, a bulletin icon is displayed 
on that component. System-level bulletins are displayed on the Status bar near 
the top of the page.
+       Using the mouse to hover over that icon will provide a tool-tip that 
shows the time and severity (Debug, Info, Warning, Error) of the Bulletin,
+       as well as the message of the Bulletin.
+       Bulletins from all components can also be viewed and filtered in the 
Bulletin Board Page, available in the Management Toolbar.
+
+*Template*: Often times, a dataflow is comprised of many sub-flows that could 
be reused. NiFi allows DFMs to select a part of the dataflow
+       (or the entire dataflow) and create a Template. This Template is given 
a name and can then be dragged onto the canvas just like the other components.
+       As a result, several components may be combined together to make a 
larger building block from which to create a dataflow.
+       These templates can also be exported as XML and imported into another 
NiFi instance, allowing these building blocks to be shared.
+
+
+[[User_Interface]]
+NiFi User Interface
+-------------------
+
+The NiFi User Interface (UI) provides mechanisms for creating automated 
dataflows, as well as visualizing,
+editing, monitoring, and administering those dataflows. The UI can be broken 
down into several segments,
+each responsible for different functionality of the application. This section 
provides screenshots of the
+application and highlights the different segments of the UI. Each segment is 
discussed in further detail later
+in the document.
+
+When the application is started, the user is able to navigate to the User 
Interface by going to the default address of
+`http://<hostname>:8080/nifi` in a web browser. There are no permissions 
configured by default, so anyone is
+able to view and modify the dataflow. For information on securing the system, 
see the Systems Administrator guide.
+
+When a DFM navigates to the UI for the first time, a blank canvas is provided 
on which a dataflow can be built:
+
+image::new-flow.png["Empty Flow"]
+
+Along the top of the of the screen is a toolbar that contains several of these 
segments.
+To the left is the Components Toolbar. This toolbar consists of the different 
components that can be dragged onto the canvas.
+
+Next to the Components Toolbar is the Actions Toolbar. This toolbar consists 
of buttons to manipulate the existing
+components on the graph. To the right of the Actions Toolbar is the Search 
Toolbar. This toolbar consists of a single
+Search field that allows users to easily find components on the graph. Users 
are able to search by component name,
+type, identifier, configuration properties, and their values.
+
+The Management Toolbar sits to the right-hand side of the screen. This toolbar 
consists of buttons that are
+used by DFMs to manage the flow as well as by administrators who manage user 
access
+and configure system properties, such as how many system resources should be 
provided to the application.
+
+image::nifi-toolbar-components.png["NiFi Components Toolbar"]
+
+Next, we have segments that provide capabilities to easily navigate around the 
graph. On the left-hand side is a toolbar that
+provides the ability to pan around the graph and zoom in and out. On the 
right-hand side is a âBirds-Eye Viewâ of the dataflow.
+This provides a high-level view of the dataflow and allows the user to quickly 
and easily pan across large portions of the dataflow.
+Along the top of the screen is a trail of breadcrumbs. As users navigate into 
and out of Process Groups, the breadcrumbs show
+the depth in the flow and each Process Group that was entered to reach this 
depth. Each of the Process Groups listed in the breadcrumbs
+is a link that will take you back up to that level in the flow.
+
+image::nifi-navigation.png["NiFi Navigation"]
+
+[[status_bar]]
+Below the breadcrumbs lives the Status bar. The Status bar provides 
information about how many Processors exist in the graph in
+each state (Stopped, Running, Invalid, Disabled), how many Remote Process 
Groups exist on the graph in each state
+(Transmitting, Not Transmitting), the number of threads that are currently 
active in the flow, the amount of data that currently
+exists in the flow, and the timestamp at which all of this information was 
last refreshed. If there are any System-Level bulletins,
+these are shown in the Status bar as well. Additionally, if the instance of 
NiFi is clustered, the Status bar shows how many nodes
+are in the cluster and how many are currently connected.
+
+image::status-bar.png["NiFi Status Bar"]
+
+
+
+
+
+Building a DataFlow
+-------------------
+
+A DFM is able to build an automated dataflow using the NiFi User Interface 
(UI). Simply drag components from the toolbar to the canvas, configure the 
components to meet specific needs, and connect
+the components together.
+
+
+=== Adding Components to the Canvas
+
+In the User Interface section above outlined the different segments of the UI 
and pointed out a Components Toolbar.
+This section looks at each of the Components in that toolbar:
+
+image::components.png["Components"]
+
+[[processor]]
+image:iconProcessor.png["Processor", width=32]
+*Processor*: The Processor is the most commonly used component, as it is 
responsible for data ingress, egress, routing, and
+       manipulating. There are many different types of Processors. In fact, 
this is a very common Extension Point in NiFi,
+       meaning that many vendors may implement their own Processors to perform 
whatever functions are necessary for their use case.
+       When a Processor is dragged onto the graph, the user is presented with 
a dialog to choose which type of Processor to use:
+
+image::add-processor.png["Add Processor Dialog"]
+
+In the top-right corner, the user is able to filter the list based on the 
Processor Type or the Tags associated with a Processor.
+Processor developers have the ability to add Tags to their Processors. These 
tags are used in this dialog for filtering and are
+displayed on the left-hand side in a Tag Cloud. The more Processors that exist 
with a particular Tag, the larger the Tag appears
+in the Tag Cloud. Clicking a Tag in the Cloud will filter the available 
Processors to only those that contain that Tag. If multiple
+Tags are selected, only those Processors that contain all of those Tags are 
shown. For example, if we want to show only those
+Processors that allow us to ingest data via HTTP, we can select both the 
`http` Tag and the `ingest` Tag:
+
+image::add-processor-with-tag-cloud.png["Add Processor with Tag Cloud"]
+
+Clicking the `Add` button or double-clicking on a Processor Type will add the 
selected Processor to the canvas at the
+location that it was dropped.
+
+*Note*: For any component added to the graph, it is possible to select it with 
the mouse and move it anywhere on the graph. Also, it is possible to select 
multiple items at once by either holding down the Shift key and selecting each 
item or by holding down the Shift key and dragging a selection box around the 
desired components. 
+
+
+[[input_port]]
+image:iconInputPort.png["Input Port", width=32]
+*Input Port*: Input Ports provide a mechanism for transferring data into a 
Process Group. When an Input Port is dragged
+onto the canvas, the DFM is prompted to name the Port. All Ports within a 
Process Group must have unique names.
+
+All components exist only within a Process Group. When a user initially 
navigates to the NiFi page, the user is placed 
+in the Root Process Group. If the Input Port is dragged onto the Root Process 
Group, the Input Port provides a mechanism
+to receive data from remote instances of NiFi via 
<<site-to-site,Site-to-Site>>. In this case, the Input Port can be configured 
+to restrict access to appropriate users, if NiFi is configured to run 
securely. For information on configuring NiFi to run 
+securely, see the 
+link:administration-guide.html[Admin Guide].
+
+
+
+[[output_port]]
+image:iconOutputPort.png["Output Port", width=32]
+*Output Port*: Output Ports provide a mechanism for transferring data from a 
Process Group to destinations outside
+of the Process Group. When an Output Port is dragged onto the canvas, the DFM 
is prompted to name the Port. All Ports
+within a Process Group must have unique names.
+
+If the Output Port is dragged onto the Root Process Group, the Output Port 
provides a mechanism for sending data to
+remote instances of NiFi via <<site-to-site,Site-to-Site>>. In this case, the 
Port acts as a queue. As remote instances 
+of NiFi pull data from the port, that data is removed from the queues of the 
incoming Connections. If NiFi is configured 
+to run securely, the Output Port can be configured to restrict access to 
appropriate users. For information on configuring 
+NiFi to run securely, see the 
+link:administration-guide.html[Admin Guide].
+
+
+[[process_group]]
+image:iconProcessGroup.png["Process Group", width=32]
+*Process Group*: Process Groups can be used to logically group a set of 
components so that the dataflow is easier to understand
+and maintain. When a Process Group is dragged onto the canvas, the DFM is 
prompted to name the Process Group. All Process
+Groups within the same parent group must have unique names. The Process Group 
will then be nested within that parent group.
+
+
+
+[[remote_process_group]]
+image:iconRemoteProcessGroup.png["Remote Process Group", width=32]
+*Remote Process Group*: Remote Process Groups appear and behave similar to 
Process Groups. However, the Remote Process Group (RPG)
+references a remote instance of NiFi. When an RPG is dragged onto the canvas, 
rather than being prompted for a name, the DFM
+is prompted for the URL of the remote NiFi instance. If the remote NiFi is a 
clustered instance, the URL that should be used
+is the URL of the remote instance's NiFi Cluster Manager (NCM). When data is 
transferred to a clustered instance of NiFi
+via an RPG, the RPG it will first connect to the remote instance's NCM to 
determine which nodes are in the cluster and
+how busy each node is. This information is then used to load balance the data 
that is pushed to each node. The remote NCM is
+then interrogated periodically to determine information about any nodes that 
are dropped from or added to the cluster and to 
+recalculate the load balancing based on each node's load. For more 
information, see the section on <<site-to-site,Site-to-Site>>.
+
+
+[[funnel]]
+image:iconFunnel.png["Funnel", width=32]
+*Funnel*: Funnels are used to combine the data from many Connections into a 
single Connection. This has two advantages.
+First, if many Connections are created with the same destination, the canvas 
can become cluttered if those Connections
+have to span a large space. By funneling these Connections into a single 
Connection, that single Connection can then be
+drawn to span that large space instead. Secondly, Connections can be 
configured with FlowFile Prioritizers. Data from
+several Connections can be funneled into a single Connection, providing the 
ability to Prioritize all of the data on that
+one Connection, rather than prioritizing the data on each Connection 
independently.
+
+
+[[template]]
+image:iconTemplate.png["Template", width=32]
+*Template*: Templates can be created by DFMs from sections of the flow, or 
they can be imported from other
+dataflows. These Templates provide larger building blocks for creating a  
complex flow quickly. When the Template is
+dragged onto the canvas, the DFM is provided a dialog to choose which Template 
to add to the canvas:
+
+image::instantiate-template.png["Instantiate Template Dialog"]
+
+Clicking the drop-down box shows all available Templates. Any Template that 
was created with a description will show a question mark
+icon, indicating that there is more information. Hovering over the icon with 
the mouse will show this description:
+
+image::instantiate-template-description.png["Instantiate Template Dialog"]
+
+
+
+[[label]]
+image:iconLabel.png["Label"]
+*Label*: Labels are used to provide documentation to parts of a dataflow. When 
a Label is dropped onto the canvas,
+it is created with a default size. The Label can then be resized by dragging 
the handle in the bottom-right corner.
+The Label has no text when initially created. The text of the Label can be 
added by right-clicking on the Label and
+choosing `Configure...`
+
+
+
+
+=== Configuring a Processor
+
+Once a Processor has been dragged onto the Canvas, it is ready to configure. 
This is done by right-clicking on the
+Processor and clicking the `Configure...` option from the context menu. The 
configuration dialog is opened with four
+different tabs, each of which is discussed below. Once you have finished 
configuring the Processor, you can apply
+the changes by clicking the `Apply` button or cancel all changes by clicking 
the `Cancel` button.
+
+Note that after a Processor has been started, the context menu shown for the 
Processor no longer has a `Configure...`
+option but rather has a `View Configuration` option. Processor configuration 
cannot be changed while the Processor is
+running. You must first stop the Processor and wait for all of its active 
tasks to complete before configuring
+the Processor again.
+
+
+==== Settings Tab
+
+The first tab in the Processor Configuration dialog is the Settings tab:
+
+image::settings-tab.png["Settings Tab"]
+
+This tab contains several different configuration items. First, it allows the 
DFM to change the name of the Processor.
+The name of a Processor by default is the same as the Processor type. Next to 
the Processor Name is a checkbox, indicating
+ whether the Processor is Enabled. When a Processor is added to the graph, it 
is enabled. If the
+Processor is disabled, it cannot be started. The disabled state is used to 
indicate that when a group of Processors is started,
+such as when a DFM starts an entire Process Group, this (disabled) Processor 
should be excluded.
+
+Below the Name configuration, the Processor's unique identifier is displayed 
along with the Processor's type. These
+values cannot be modified.
+
+Next are two dialogues for configuring `Penalty duration' and `Yield 
duration'. During the normal course of processing a
+piece of data (a FlowFile), an event may occur that indicates that the data 
cannot be processed at this time but the
+data may be processable at a later time. When this occurs, the Processor may 
choose to Penalize the FlowFile. This will
+prevent the FlowFile from being Processed for some period of time. For 
example, if the Processor is to push the data
+to a remote service, but the remote service already has a file with the same 
name as the filename that the Processor
+is specifying, the Processor may penalize the FlowFile. The `Penalty duration' 
allows the DFM to specify how long the 
+FlowFile should be penalized. The default value is 30 seconds.
+
+Similarly, the Processor may determine that some situation exists such that 
the Processor can no longer make any progress,
+regardless of the data that it is processing. For example, if a Processor is 
to push data to a remote service and that
+service is not responding, the Processor cannot make any progress. As a 
result, the Processor should `yield,' which will
+prevent the Processor from being scheduled to run for some period of time. 
That period of time is specified by setting
+the `Yield duration.' The default value is 1 second.
+
+The last configurable option on the left-hand side of the Settings tab is the 
Bulletin level. Whenever the Processor writes
+to its log, the Processor also will generate a Bulletin. This setting 
indicates the lowest level of Bulletin that should be
+shown in the User Interface. By default, the Bulletin level is set to WARN, 
which means it will display all warning and error-level
+bulletins.
+
+The right-hand side of the Settings tab contains an `Auto-terminate 
relationships' section. Each of the Relationships that is
+defined by the Processor is listed here, along with its description. In order 
for a Processor to be considered valid and
+able to run, each Relationship defined by the Processor must be either 
connected to a downstream component or auto-terminated.
+If a Relationship is auto-terminated, any FlowFile that is routed to that 
Relationship will be removed from the flow and
+its processing considered complete. Any Relationship that is already connected 
to a downstream component cannot be auto-terminated.
+The Relationship must first be removed from any Connection that uses it. 
Additionally, for any Relationship that is selected to be
+auto-terminated, the auto-termination status will be cleared (turned off) if 
the Relationship is added to a Connection.
+
+
+
+
+==== Scheduling Tab
+
+The second tab in the Processor Configuration dialog is the Scheduling Tab:
+
+image::scheduling-tab.png["Scheduling Tab"]
+
+The first configuration option is the Scheduling Strategy. There are three 
options for scheduling components:
+
+- *Timer driven*: This is the default mode. The Processor will be scheduled to 
run on a regular interval. The interval
+       at which the Processor is run is defined by the `Run schedule' option 
(see below).
+- *Event driven*: When this mode is selected, the Processor will be triggered 
to run by an event, and that event occurs when FlowFiles enter Connections
+       feeding this Processor. This mode is currently considered experimental 
and is not supported by all Processors. When this mode is
+       selected, the `Run schedule' option is not configurable, as the 
Processor is not triggered to run periodically but
+        as the result of an event. Additionally, this is the only mode for 
which the `Concurrent tasks'
+       option can be set to 0. In this case, the number of threads is limited 
only by the size of the Event-Driven Thread Pool that
+       the administrator has configured.
+- *CRON driven*: When using the CRON driven scheduling mode, the Processor is 
scheduled to run periodically, similar to the
+       Timer driven scheduling mode. However, the CRON driven mode provides 
significantly more flexibility at the expense of
+       increasing the complexity of the configuration. This value is made up 
of six fields, each separated by a space. These
+       fields include:
++
+               . Seconds
+               . Minutes
+               . Hours
+               . Day of Month
+               . Month
+               . Day of Week
+               . Year
++
+The value for each of these fields should be a number, range, or increment.
+Range here refers to a syntax of <number>-<number>.
+For example,the Seconds field could be set to 0-30, meaning that the Processor 
should only be scheduled if the time is 0 to 30 seconds
+after the minute. Additionally, a value of `*` indicates that all values are 
valid for this field. Multiple values can also
+be entered using a `,` as a separator: `0,5,10,15,30`.
+An increment is written as <start value>/<increment>. For example, settings a 
value of `0/10` for the seconds fields means that valid
+values are 0, 10, 20, 30, 40, and 50. However, if we change this to `5/10`, 
valid values become 5, 15, 25, 35, 45, and 55.
++
+For the Month field, valid values are 1 (January) through 12 (December).
++
+For the Day of Week field, valid values are 1 (Sunday) through 7 (Saturday). 
Additionally, a value of `L` may be appended to one of these
+values to indicate the last occurrence of this day in the month. For example, 
`1L` can be used to indicate the last Monday of the month.
+
+
+Next, the Scheduling Tab provides a configuration option named `Concurrent 
tasks.' This controls how many threads the Processor
+will use. Said a different way, this controls how many FlowFiles should be 
processed by this Processor at the same time. Increasing
+this value will typically allow the Processor to handle more data in the same 
amount of time. However, it does this by using system
+resources that then are not usable by other Processors. This essentially 
provides a relative weighting of Processors -- it controls
+how much of the system's resources should be allocated to this Processor 
instead of other Processors. This field is available for
+most Processors. There are, however, some types of Processors that can only be 
scheduled with a single Concurrent task.
+
+The ``Run schedule'' dictates how often the Processor should be scheduled to 
run. The valid values for this field depend on the selected
+Scheduling Strategy (see above). If using the Event driven Scheduling 
Strategy, this field is not available. When using the Timer driven
+Scheduling Strategy, this value is a time duration specified by a number 
followed by a time unit. For example, `1 second` or `5 mins`.
+The default value of `0 sec` means that the Processor should run as often as 
possible as long as it has data to process. This is true
+for any time duration of 0, regardless of the time unit (i.e., `0 sec`, `0 
mins`, `0 days`). For an explanation of values that are
+applicable for the CRON driven Scheduling Strategy, see the description of the 
CRON driven Scheduling Strategy itself.
+
+The right-hand side of the tab contains a slider for choosing the `Run 
duration.' This controls how long the Processor should be scheduled
+to run each time that it is triggered. On the left-hand side of the slider, it 
is marked `Lower latency' while the right-hand side
+is marked `Higher throughput.' When a Processor finishes running, it must 
update the repository in order to transfer the FlowFiles to
+the next Connection. Updating the repository is expensive, so the more work 
that can be done at once before updating the repository,
+the more work the Processor can handle (Higher throughput). However, this 
means that the next Processor cannot start processing
+those FlowFiles until the previous Process updates this repository. As a 
result, the latency will be longer (the time required to process
+the FlowFile from beginning to end will be longer). As a result, the slider 
provides a spectrum from which the DFM can choose to favor
+Lower Latency or Higher Throughput.
+
+
+==== Properties Tab
+
+The Properties Tab provides a mechanism to configure Processor-specific 
behavior. There are no default properties. Each type of Processor
+must define which Properties make sense for its use case. Below, we see the 
Properties Tab for a RouteOnAttribute Processor:
+
+image::properties-tab.png["Properties Tab"]
+
+This Processor, by default, has only a single property: `Routing Strategy.' 
The default value is `Route on Property name.' Next to
+the name of this property is a small question-mark symbol (
+image:iconInfo.png["Question Mark"]
+). This help symbol is seen in other places throughout the User Interface, and 
it indicates that more information is available.
+Hovering over this symbol with the mouse will provide additional details about 
the property and the default value, as well as
+historical values that have been set for the Property.
+
+Clicking on the value for the property will allow a DFM to change the value. 
Depending on the values that are allowed for the property,
+the user is either provided a drop-down from which to choose a value or is 
given a text area to type a value:
+
+image::edit-property-dropdown.png["Edit Property with Dropdown"]
+
+In the top-right corner of the tab is a button for adding a New Property. 
Clicking this button will provide the DFM with a dialog to
+enter the name and value of a new property. Not all Processors allow 
User-Defined properties. In processors that do not allow them, 
+the Processor becomes invalid when User-Defined properties are applied. 
RouteOnAttribute, however, does allow User-Defined properties. 
+In fact, this Processor will not be valid until the user has added a property.
+
+image:edit-property-textarea.png["Edit Property with Text Area"]
+
+Note that after a User-Defined property has been added, an icon will appear on 
the right-hand side of that row (
+image:iconDelete.png["Delete Icon"]
+). Clicking this button will remove the User-Defined property from the 
Processor.
+
+Some processors also have an Advanced User Interface (UI) built into them. For 
example, the UpdateAttribute processor has an Advanced UI. To access the 
Advanced UI, click the `Advanced` button that appears at the bottom of the 
Configure Processor window. Only processors that have an Advanced UI will have 
this button. 
+
+Some processors have properties that refer to other components, such as 
Controller Services, which also need to be configured. For example, the GetHTTP 
processor has an SSLContextService property, which refers to the 
StandardSSLContextService controller service. When DFMs want to configure this 
property but have not yet created and configured the controller service, they 
have the option to create the service on the spot, as depicted in the image 
below. For more information about configuring Controller Services, see the 
<<Controller_Services_and_Reporting_Tasks>> section.
+
+image:create-service-ssl-context.png["Create Service", width=700]
+
+==== Comments Tab
+
+The last tab in the Processor configuration dialog is the Comments tab. This 
tab simply provides an area for users to include
+whatever comments are appropriate for this component. Use of the Comments tab 
is optional:
+
+image::comments-tab.png["Comments Tab"]
+
+
+=== Additional Help
+
+The user may access additional documentation about each Processor's usage by 
right-clicking
+on the Processor and then selecting `Usage' from the context menu. 
Alternatively, clicking the `Help' link in the top-right
+corner of the User Interface will provide a Help page with all of the 
documentation, including usage documentation
+for all the Processors that are available. Clicking on the desired Processor 
in the list will display its usage documentation.
+
+
+
+[[Controller_Services_and_Reporting_Tasks]]
+=== Controller Services and Reporting Tasks
+
+While DFMs have the ability to create Controller Services from the Configure 
Processor window, there is also a central place within the User Interface for 
adding and configuring both Controller Services and Reporting Tasks. To get 
there, click on the Controller Settings button in the Management section of the 
toolbar.
+
+image:controller-settings-button.png["Controller Settings Button", width=200]
+
+The Controller Settings window has three tabs across the top: General, 
Controller Services, and Reporting Tasks. The General tab is for settings that 
pertain to general information about the NiFi instance. For example, here, the 
DFM can provide a unique name for the overall dataflow, as well as comments 
that describe the flow. Be aware that this information is visible to any other 
NiFi instance that connects remotely to this instance (using Remote Process 
Groups, a.k.a., Site-to-Site).
+
+The General tab also provides settings for the overall maximum thread counts 
of the instance, as well as the ability to click "Back-up flow" to create a 
backup copy of the current flow, which is saved by default in the /conf/archive 
directory.
+
+image:settings-general-tab.png["Controller Settings General Tab", width=700]
+
+To the right of the General tab is the Controller Services tab. From this tab, 
the DFM may click the "+" button in the upper-right corner to create a new 
Controller Service.
+
+image:controller-services-tab.png["Controller Services Tab", width=900]
+
+The Add Controller Service window opens. This window is similar to the Add 
Processor window. It provides a list of the available Controller Services on 
the right and a tag cloud, showing the most common catagory tags used for 
Controller Services, on the left. The DFM may click any tag in the tag cloud in 
order to narrow down the list of Controller Services to those that fit the 
categories desired. The DFM may also use the Filter field at the top of the 
window to search for the desired Contoller Service. Upon selecting a Controller 
Service from the list, the DFM can see a description of the the service below. 
Select the desired controller service and click Add, or simply double-click the 
name of the service to add it.
+
+image:add-controller-service-window.png["Add Controller Service Window", 
width=700]
+
+Once a Controller Service has been added, the DFM may configure it by clicking 
the Edit button (pencil icon) in the far-right column. Other buttons in this 
column include the Enable button (to enable a configured service), the Remove 
button, and the Usage button, which links to the documentation for the 
particular Controller Service.
+
+image:controller-services-edit-buttons.png["Controller Services Buttons"]
+
+When the DFM clicks the Edit button, a Configure Controller Service window 
opens. It has three tabs: Settings, Properties, and Comments. This window is 
similar to the Configure Processor window. The Settings tab provides a place 
for the DFM to give the Controller Service a unique name (if desired). It also 
lists the UUID for the service and provides a list of other components 
(processors or other controller services) that reference the service.
+
+image:configure-controller-service-settings.png["Configure Controller Service 
Settings", width=700]
+
+The Properties tab lists the various properties that apply to the particular 
controller service. As with configuring processors, the DFM may hover the mouse 
over the question mark icons to see more information about each property.
+
+image:configure-controller-service-properties.png["Configure Controller 
Service Properties", width=700]
+
+The Comments tab is just an open-text field, where the DFM may include 
comments about the service. After configuring a Controller Service, click the 
Apply button to apply the configuration and close the window, or click the 
Cancel button to cancel the changes and close the window.
+
+Note that after a Controller Service has been configured, it must be enabled 
in order to run. Do this using the Enable button in the far-right column of the 
Controller Services tab of the Controller Settings window. Then, in order to 
modify an existing/running controller service, the DFM needs to stop/disable it 
(as well as all referencing processors, reporting tasks, and controller 
services). Rather than having to hunt down each component that is referenced by 
that controller service, the DFM has the ability to stop/disable them when 
disabling the controller service in question. Likewise, when enabling a 
controller service, the DFM has the option to start/enable all referencing 
processors, reporting tasks, and controller services.
+
+The Reporting Tasks tab behaves similarly to the Controller Services tab. The 
DFM has the option to add Reporting Tasks and configure them in the same way as 
Controller Services.
+
+image:reporting-tasks-tab.png["Reporting Tasks Tab", width=900]
+
+Once a Reporting Task has been added, the DFM may configure it by clicking the 
Edit (pencil icon) in the far-right column. Other buttons in this column 
include the Start button, Remove button, and Usage button, which links to the 
documentation for the particular Reporting Task.
+
+image:reporting-tasks-edit-buttons2.png["Reporting Tasks Buttons"]
+
+When the DFM clicks the Edit button, a Configure Reporting Task window opens. 
It has three tabs: Settings, Properties, and Comments. This window is also 
similar to the Configure Processor window. The Settings tab provides a place 
for the DFM to give the Reporting Task a unique name (if desired). It also 
lists a UUID for the Reporting Task and provides settings for the task's 
Scheduling Strategy and Run Schedule (similar to the same settings in a 
processor). The DFM may hover the mouse over the question mark icons to see 
more information about each setting.
+
+image:configure-reporting-task-settings.png["Configure Reporting Task 
Settings", width=700]
+
+The Properties tab for a Reporting Task lists the properties that may be 
configured for the task. The DFM may hover the mouse over the question mark 
icons to see more information about each property.
+
+image:configure-reporting-task-properties.png["Configure Reporting Task 
Properties", width=700]
+
+The Comments tab is just an open-text field, where the DFM may include 
comments about the task. After configuring the Reporting Task, click the Apply 
button to apply the configuration and close the window, or click Cancel to 
cancel the changes and close the window.
+
+When you want to run the Reporting Task, click the Start button in the 
far-right column of the Reporting Tasks tab.
+
+
+
+=== Connecting Components
+
+Once processors and other components have been added to the graph and 
configured, the next step is to connect them
+to one another so that NiFi knows what to do with each FlowFile after it has 
been processed. This is accomplished by creating a
+Connection between each component. When the user hovers the mouse over the 
center of a component, a new Connection icon (
+image:addConnect.png["Connection Bubble"]
+) appears:
+
+image:processor-connection-bubble.png["Processor with Connection Bubble"]
+
+The user drags the Connection bubble from one component to another until the 
second component is highlighted. When the user
+releases the mouse, a `Create Connection' dialog appears. This dialog consists 
of two tabs: `Details' and `Settings'. They are
+discussed in detail below. Note that it is possible to draw a connection so 
that it loops back on the same processor. This can be 
+useful if the DFM wants the processor to try to re-process FlowFiles if they 
go down a failure Relationship. To create this type of looping
+connection, simply drag the connection bubble away and then back to the same 
processor until it is highlighted. Then release the mouse and the same 'Create 
Connection' dialog appears. 
+
+==== Details Tab
+
+The Details Tab of the 'Create Connection' dialog provides information about 
the source and destination components, including the component name, the
+component type, and the Process Group in which the component lives:
+
+image::create-connection.png["Create Connection"]
+
+Additionally, this tab provides the ability to choose which Relationships 
should be included in this Connection. At least one
+Relationship must be selected. If only one Relationship is available, it is 
automatically selected.
+
+*Note*: If multiple Connections are added with the same Relationship, any 
FlowFile that is routed to that Relationship will
+automatically be `cloned', and a copy will be sent to each of those 
Connections.
+
+==== Settings
+
+The Settings Tab provides the ability to configure the Connection's name, 
FlowFile expiration, Back Pressure thresholds, and
+Prioritization:
+
+image:connection-settings.png["Connection Settings"]
+
+The Connection name is optional. If not specified, the name shown for the 
Connection will be names of the Relationships
+that are active for the Connection.
+
+File expiration is a concept by which data that cannot be processed in a 
timely fashion can be automatically removed from the flow.
+This is useful, for example, when the volume of data is expected to exceed the 
volume that can be sent to a remote site.
+In this case, the expiration can be used in conjunction with Prioritizers to 
ensure that the highest priority data is
+processed first and then anything that cannot be processed within a certain 
time period (one hour, for example) can be dropped. The expiration period is 
based on the time that the data entered the NiFi instance. In other words, if 
the file expiration on a given connection is set to '1 hour', and a file that 
has been in the NiFi instance for one hour reaches that connection, it will 
expire. The default
+value of `0 sec` indicates that the data will never expire. When a file 
expiration other than '0 sec' is set, a small clock icon appears on the 
connection label, so the DFM can see it at-a-glance when looking at a flow on 
the graph.
+
+
+NiFi provides two configuration elements for Back Pressure. These thresholds 
indicate how much data should be
+allowed to exist in the queue before the component that is the source of the 
Connection is no longer scheduled to run.
+This allows the system to avoid being overrun with data. The first option 
provided is the ``Back pressure object threshold.''
+This is the number of FlowFiles that can be in the queue before back pressure 
is applied. The second configuration option
+is the ``Back pressure data size threshold.'' 
+This specifies the maximum amount of data (in size) that should be queued up 
before
+applying back pressure. This value is configured by entering a number followed 
by a data size (`B` for bytes, `KB` for
+kilobytes, `MB` for megabytes, `GB` for gigabytes, or `TB` for terabytes).
+
+The right-hand side of the tab provides the ability to prioritize the data in 
the queue so that higher priority data is
+processed first. Prioritizers can be dragged from the top (`Available 
prioritizers') to the bottom (`Selected prioritizers').
+Multiple prioritizers can be selected. The prioritizer that is at the top of 
the `Selected prioritizers' list is the highest
+priority. If two FlowFiles have the same value according to this prioritizer, 
the second prioritizer will determine which
+FlowFile to process first, and so on. If a prioritizer is no longer desired, 
it can then be dragged from the `Selected
+prioritizers' list to the `Available prioritizers' list.
+
+The following prioritizers are available:
+
+- *FirstInFirstOutPrioritizer*: Given two FlowFiles, the on that reached the 
connection first will be processed first.
+- *NewestFlowFileFirstPrioritizer*: Given two FlowFiles, the one that is 
newest in the dataflow will be processed first.
+- *OldestFlowFileFirstPrioritizer*: Given two FlowFiles, the on that is oldest 
in the dataflow will be processed first. This is the default scheme that is 
used if no prioritizers are selected.
+- *PriorityAttributePrioritizer*: Given two FlowFiles that both have a 
"priority" attribute, the one that has the highest priority value will be 
prprocessed first. Note that an UpdateAttribute processor should be used to add 
the "priority" attribute to the FlowFiles before they reach a connection that 
has this prioritizer set. Values for the "priority" attribute may be 
alphanumeric, where "a" is a higher priority than "z", and "1" is a higher 
priority than "9", for example.
+
+*Note*: After a connection has been drawn between two components, the 
connection's configuration may be changed, and the connection may be moved to a 
new destination; however, the processors on either side of the connection must 
be stopped before a configuration or destination change may be made.
+
+=== Processor Validation
+
+Before trying to start a Processor, it's important to make sure that the 
Processor's configuration is valid.
+A status indicator is shown in the top-left of the Processor. If the Processor 
is invalid, the indicator
+will show a yellow Warning indicator with an exclamation mark indicating that 
there is a problem:
+
+image::invalid-processor.png["Invalid Processor"]
+
+In this case, hovering over the indicator icon with the mouse will provide a 
tooltip showing all of the validation
+errors for the Processor. Once all of the validation errors have been 
addressed, the status indicator will change
+to a Stop icon, indicating that the Processor is valid and ready to be started 
but currently is not running:
+
+image::valid-processor.png["Valid Processor"]
+
+
+
+[[site-to-site]]
+=== Site-to-Site
+
+When sending data from one instance of NiFi to another, there are many 
different protocols that can be used. The preferred
+protocol, though, is the NiFi Site-to-Site Protocol. Site-to-Site makes it 
easy to transfer data from one NiFi instance to
+another easily, efficiently, and securely.
+
+Using Site-to-Site provides the following benefits:
+
+* Easy to configure
+** After entering the URL of the remote NiFi instance, the available ports 
(endpoints) are automatically discovered and provided in a drop-down list
+
+* Secure
+** Site-to-Site optionally makes use of Certificates in order to encrypt data 
and provide authentication and authorization. Each port can be configured
+   to allow only specific users, and only those users will be able to see that 
the port even exists. For information on configuring the Certificates,
+   see the
+link:administration-guide.html#security-configuration[Security Configuration] 
section of the
+link:administration-guide.html[Admin Guide].
+
+* Scalable
+** As nodes in the remote cluster change, those changes are automatically 
detected and data is scaled out across all nodes in the cluster.
+
+* Efficient
+** Site-to-Site allows batches of FlowFiles to be sent at once in order to 
avoid the overhead of establishing connections and making multiple
+   round-trip requests between peers.
+
+* Reliable
+** Checksums are automatically produced by both the sender and receiver and 
compared after the data has been transmitted, in order
+   to ensure that no corruption has occurred. If the checksums don't match, 
the transaction will simply be canceled and tried again.
+
+* Automatically load balanced
+** As nodes come online or drop out of the remote cluster, or a node's load 
becomes heavier or lighter, the amount of data that is directed
+   to that node will automatically be adjusted.
+
+* FlowFiles maintain attributes
+** When a FlowFile is transferred over this protocol, all of the FlowFile's 
attributes
+   are automatically transferred with it. This can be very advantageous in 
many situations, as all of the context and enrichment
+   that has been determined by one instance of NiFi travels with the data, 
making for easy routing of the data and allowing users
+   to easily inspect the data.
+
+* Adaptable
+** As new technologies and ideas emerge, the protocol for handling 
Site-to-Site communications are able to change with them. When a connection is
+   made to a remote NiFi instance, a handshake is performed in order to 
negotiate which protocol and which version of the protocol will be used.
+   This allows new capabilities to be added while still maintaining backward 
compatibility with all older instances. Additionally, if a vulnerability
+   or deficiency is ever discovered in a protocol, it allows a newer version 
of NiFi to forbid communication over the compromised versions of the protocol.
+
+In order to communicate with a remote NiFi instance via Site-to-Site, simply 
drag a <<remote_process_group,Remote Process Group>> onto the graph
+and enter the URL of the remote NiFi instance (for more information on the 
components of a Remote Process Group, see 
+<<Remote_Group_Transmission,Remote Process Group Transmission>> section of 
this guide.) The URL is the same 
+URL you would use to go to that instance's User Interface. At that point, you 
can drag a connection to or from the Remote Process Group 
+in the same way you would drag a connection to or from a Processor or a local 
Process Group. When you drag the connection, you will have
+a chance to choose which Port to connect to. Note that it may take up to one 
minute for the Remote Process Group to determine
+which ports are available.
+
+If the connection is dragged starting from the Remote Process Group, the ports 
shown will be the Output Ports of the remote group,
+as this indicates that you will be pulling data from the remote instance. If 
the connection instead ends on the Remote Process Group,
+the ports shown will be the Input Ports of the remote group, as this implies 
that you will be pushing data to the remote instance.
+
+*Note*: if the remote instance is configured to use secure data transmission, 
you will see only ports that you are authorized to
+communicate with. For information on configuring NiFi to run securely, see the
+link:administration-guide.html[Admin Guide].
+
+In order to allow another NiFi instance to push data to your local instance, 
you can simply drag an <<input_port,Input Port>> onto the Root Process Group
+of your graph. After entering a name for the port, it will be added to your 
flow. You can now right-click on the Input Port and choose Configure in order
+to adjust the name and the number of concurrent tasks that are used for the 
port. If Site-to-Site is configured to run securely, you will also be given
+the ability to adjust who has access to the port. If secure, only those who 
have been granted access to communicate with the port will be able to see
+that the port exists.
+
+After being given access to a particular port, in order to see that port, the 
operator of a remote NiFi instance may need to right-click on their Remote
+Process Group and choose to "Refresh" the flow.
+
+Similar to an Input Port, a DataFlow Manager may choose to add an 
<<output_port,Output Port>> to the Root Process Group. The Output Port allows 
an 
+authorized NiFi instance to remotely connect to your instance and pull data 
from the Output Port. Configuring the Output Port will again allow the
+DFM to control how many concurrent tasks are allowed, as well as which NiFi 
instances are authorized to pull data from the instance being configured. 
+
+In addition to other instances of NiFi, some other applications may use a 
Site-to-Site client in order to push data to or receive data from a NiFi 
instance.
+For example, NiFi provides an Apache Storm spout and an Apache Spark Receiver 
that are able to pull data from NiFi's Root Group Output Ports.
+
+If your instance of NiFi is running securely, the first time that a client 
establishes a connection to your instance, the client will be forbidden and
+a request for an account for that client will automatically be generated. The 
client will need to be granted the 'NiFi' role in order to communicate
+via Site-to-Site. For more information on managing user accounts, see the
+link:administration-guide.html#controlling-levels-of-access[Controlling Levels 
of Access]
+section of the link:administration-guide.html[Admin Guide].
+
+For information on how to enable and configure Site-to-Site on a NiFi 
instance, see the
+link:administration-guide.html#site_to_site_properties[Site-to-Site 
Properties] section of the
+link:administration-guide.html[Admin Guide].
+
+
+
+
+=== Example Dataflow
+
+This section has described the steps required to build a dataflow. Now, to put 
it all together. The following example dataflow 
+consists of just two processors: GenerateFlowFile and LogAttribute. These 
processors are normally used for testing, but they can also be used
+to build a quick flow for demonstration purposes and see NiFi in action.
+
+After you drag the GenerateFlowFile and LogAttribute processors to the graph 
and connect them (using the guidelines provided above), configure them as 
follows:
+
+* Generate FlowFile
+** On the Scheduling tab, set Run schedule to: 5 sec. Note that the 
GenerateFlowFile processor can create many FlowFiles very quickly; that's why 
setting the Run schedule is important so that this flow does not overwhelm the 
system NiFi is running on.
+** On the Properties tab, set File Size to: 10 kb
+
+* Log Attribute
+** On the Settings tab, under Auto-terminate relationships, select the 
checkbox next to Success. This will terminate FlowFiles after this processor 
has successfully processed them.
+** Also on the Settings tab, set the Bulletin level to Info. This way, when 
the dataflow is running, this processor will display the bulletin icon (see 
<<processor_anatomy>>), and the user may hover over it with the mouse to see 
the attributes that the processor is logging.
+
+The dataflow should look like the following:
+
+image::simple-flow.png["Simple Flow", width=900]
+
+
+Now see the following section on how to start and stop the dataflow. When the 
dataflow is running, be sure to note the statistical information that is 
displayed on the face of each processor (see <<processor_anatomy>>).
+
+
+
+== Command and Control of DataFlow
+
+When a component is added to the NiFi canvas, it is in the Stopped state. In 
order to cause the component to
+be triggered, the component must be started. Once started, the component can 
be stopped at any time. From a
+Stopped state, the component can be configured, started, or disabled.
+
+=== Starting a Component
+
+In order to start a component, the following conditions must be met:
+
+- The component's configuration must be valid.
+
+- All defined Relationships for the component must be connected to another 
component or auto-terminated.
+
+- The component must be stopped.
+
+- The component must be enabled.
+
+- The component must have no active tasks. For more information about active 
tasks, see the ``Anatomy of ...'' 
+sections under <<monitoring>> (<<processor_anatomy>>, 
<<process_group_anatomy>>, <<remote_group_anatomy>>).
+
+Components can be started by selecting all of the components to start and then 
clicking the Start icon (
+image:iconRun.png["Start"]
+) in the
+Actions Toolbar or by right-clicking a single component and choosing Start 
from the context menu.
+
+If starting a Process Group, all components within that Process Group 
(including child Process Groups) will
+be started, with the exception of those components that are invalid or 
disabled.
+
+Once started, the status indicator of a Processor will change to a Play symbol 
(
+image:iconRun.png["Run"]
+).
+
+
+=== Stopping a Component
+
+A component can be stopped any time that it is running. A component is stopped 
by right-clicking on the component
+and clicking Stop from the context menu, or by selecting the component and 
clicking the Stop icon (
+image:iconStop.png["Stop"]
+) in the Actions Toolbar.
+
+If a Process Group is stopped, all of the components within the Process Group 
(including child Process Groups)
+will be stopped.
+
+Once stopped, the status indicator of a component will change to the Stop 
symbol (
+image:iconStop.png["Stop"]
+).
+
+Stopping a component does not interrupt its currently running tasks. Rather, 
it stops scheduling new tasks to
+be performed. The number of active tasks is shown in the top-right corner of 
the Processor (see <<processor_anatomy>>
+for more information).
+
+=== Enabling/Disabling a Component
+
+When a component is enabled, it is able to be started. Users may choose to 
disable components when they are part of a
+dataflow that is still being assembled, for example. Typically, if a component 
is not intended to be run, the component 
+is disabled, rather than being left in the Stopped state. This helps to 
distinguish between components that are 
+intentionally not running and those that may have been stopped temporarily 
(for instance, to change the component's 
+configuration) and inadvertently were never restarted. 
+
+When it is desirable to re-enable a component, it can be enabled by selecting 
the component and 
+clicking the Enable icon (
+image:iconEnable.png["Enable"]
+) in the Actions Toolbar. This is available only when the selected component 
or components are disabled.
+Alternatively, a component can be enabled by checking the checkbox next  to 
the ``Enabled'' option in 
+the Settings tab of the Processor configuration dialog or the configuration 
dialog for a Port.
+
+Once enabled, the component's status indicator will change to either Invalid (
+image:iconAlert.png["Invalid"]
+) or Stopped (
+image:iconStop.png["Stopped"]
+), depending on whether or not the component is valid.
+
+A component is then disabled by selecting the component and clicking the 
Disable icon (
+image:iconDisable.png["Disable"]
+) in the Actions Toolbar, or by clearing the checkbox next to the ``Enabled'' 
option in the Settings tab
+of the Processor configuration dialog or the configuration dialog for a Port.
+
+Only Ports and Processors can be enabled and disabled.
+
+
+[[Remote_Group_Transmission]]
+=== Remote Process Group Transmission
+
+Remote Process Groups provide a mechanism for sending data to or retrieving 
data from a remote instance
+of NiFi. When a Remote Process Group (RPG) is added to the canvas, it is added 
with the Transmision Disabled,
+as indicated by the icon (
+image:iconTransmissionInactive.png["Transmission Disabled"]
+) in the top-left corner. When Transmission is Disabled, it can be enabled by 
right-clicking on the
+RPG and clicking the ``Enable Transmission'' menu item. This will cause all 
ports for which there is a Connection
+to begin transmitting data. This will cause the status indicator to then 
change to the Transmission Enabled icon (
+image:iconTransmissionActive.png["Transmission Enabled"]
+). 
+
+If there are problems communicating with the Remote Process Group, a Warning 
indicator (
+image:iconAlert.png["Warning"]
+) may instead be present in the top-left corner. Hovering over this Warning 
indicator with the mouse will provide
+more information about the problem.
+
+[[Remote_Port_Configuration]]
+==== Individual Port Transmission
+
+There are times when the DFM may want to either enable or disable transmission 
for only a specific 
+Port within the Remote Process Group. This can be accomplished by 
right-clicking on the Remote Process Group
+and choosing the ``Remote ports'' menu item. This provides a configuration 
dialog from which each Port can be
+configured:
+
+image::remote-group-ports-dialog.png["Remote Process Groups"]
+
+The left-hand side lists all of the Input Ports that the remote instance of 
NiFi allows data to be sent to.
+The right-hand side lists all of the Output Ports from which this instance is 
able to pull data.
+If the remote instance is using secure communications (the URL of the NiFi 
instance begins with `https://`, 
+rather than `http://`), any Ports that the remote instance has not made 
available to this instance will not
+be shown.
+
+*Note*: If a Port that is expected to be shown is not shown in this dialog, 
ensure that the instance has proper
+permissions and that the Remote Process Group's flow is current. This can be 
checked by closing the Port
+Configuration Dialog and looking at the bottom-right corner of the Remote 
Process Group. The date at which
+the flow was last refreshed is shown. If the flow appears to be outdated, it 
can be updated by right-clicking
+on the Remote Process Group and selecting ``Refresh flow.'' (See 
<<remote_group_anatomy>> for more information).
+
+Each Port is shown with the Port name, followed by its description, currently 
configured number of Concurrent
+tasks, and whether or not data sent to this port will be compressed. To the 
left of this information is a switch
+to turn the Port on or off. Those Ports that have no Connections attached to 
them are grayed out:
+
+image::remote-port-connection-status.png["Remote Port Statuses"]
+
+The on/off switch provides a mechanism to enable and disable transmission for 
each Port in the Remote
+Process Group independently. Those Ports that are connected but are not 
currently transmitting can be 
+configured by clicking the pencil icon (
+image:iconEdit.png["Edit"]
+) below the on/off switch. Clicking this icon will allow the DFM to change the 
number of Concurrent tasks and whether
+or not compression should be used when transmitting data to or from this Port.
+
+
+
+[[navigating]]
+== Navigating within a DataFlow
+
+NiFi provides various mechanisms for getting around a dataflow. The 
<<User_Interface>> section discussed various ways to navigate around
+the NiFi graph; however, once a flow exists on the graph, there are additional 
ways to get from one component to another. The <<User Interface>> section 
showed that when multiple Process Groups exist in a flow, breadcrumbs appear 
under the toolbar, providing a way to navigate between them. In addition, to 
enter a Process Group that is currently visible on the graph, simply 
double-click it, thereby "drilling down" into it. Connections also provide a 
way to jump from one location to another within the flow. Right-click on a 
connection and select "Go to source" or "Go to destination" in order to jump to 
one end of the connection or another. This can be very useful in large, complex 
dataflows, where the connection lines may be long and span large areas of the 
graph. Finally, all components provide the ability to jump forward or backward 
within the flow. Right-click any component (e.g., a processor, process group, 
port, etc.) and select either "Upstream connections" or "Downstream connec
 tions". A dialog window will open, showing the available upstream or 
downstream connections that the user may jump to. This can be especially useful 
when trying to follow a dataflow in a backward direction. It is typically easy 
to follow the path of a dataflow from start to finish, drilling down into 
nested process groups; however, it can be more difficult to follow the dataflow 
in the other direction.
+
+
+
+[[monitoring]]
+== Monitoring of DataFlow
+
+NiFi provides a great deal of information about the status of the DataFlow in 
order to monitor the
+health and status. The Status bar provides information about the overall 
system health 
+(See <<status_bar>> above for more information). Processors, Process Groups, 
and Remote Process Groups
+provide fine-grained details about their operations. Connections and Process 
Groups provide information 
+about the amount of data in their queues. The Summary Page provides 
information about all of the components
+on the graph in a tabular format and also provides System Diagnostics 
information that includes disk usage,
+CPU utilization, and Java Heap and Garbage Collection information. In a 
clustered environment, this
+information is available per-node or as aggregates across the entire cluster. 
We will explore each of these 
+monitoring artifacts below.
+  
+
+[[processor_anatomy]]
+=== Anatomy of a Processor
+
+NiFi provides a significant amount of information about each Processor on the 
canvas. The following diagram
+shows the anatomy of a Processor:
+
+image:processor-anatomy.png["Anatomy of a Processor"]
+
+The image outlines the following elements:
+
+- *Processor Type*: NiFi provides several different types of Processors in 
order to allow for a wide range
+       of tasks to be performed. Each type of Processor is designed to perform 
one specific task. The Processor
+       type (PutFile, in this example) describes the task that this Processor 
performs. In this case, the
+       Processor writes a FlowFile to disk - or ``Puts'' a FlowFile to a File.
+       
+- *Bulletin Indicator*: When a Processor logs that some event has occurred, it 
generates a Bulletin to notify
+       those who are monitoring NiFi via the User Interface. The DFM is able 
to configure which
+       bulletins should be displayed in the User Interface by updating the 
``Bulletin level'' field in the
+       ``Settings'' tab of the Processor configuration dialog. The default 
value is `WARN`, which means that only
+       warnings and errors will be displayed in the UI. This icon is not 
present unless a Bulletin exists for this
+       Processor. When it is present, hovering over the icon with the mouse 
will provide a tooltip explaining the
+       message provided by the Processor as well as the Bulletin level. If the 
instance of NiFi is clustered,
+       it will also show the Node that emitted the Bulletin. Bulletins 
automatically expire after five minutes.
+       
+- *Status Indicator*: Shows the current Status of the Processor. The following 
indicators are possible:
+       ** image:iconRun.png["Running"]
+               *Running*: The Processor is currently running.
+       ** image:iconStop.png["Stopped"]
+               *Stopped*: The Processor is valid and enabled but is not 
running.
+       ** image:iconAlert.png["Invalid"]
+               *Invalid*: The Processor is enabled but is not currently valid 
and cannot be started. 
+               Hovering over this icon will provide a tooltip indicating why 
the Processor is not valid.
+       ** image:iconDisable.png["Disabled"]
+               *Disabled*: The Processor is not running and cannot be started 
until it has been enabled.
+               This status does not indicate whether or not the Processor is 
valid.
+               
+- *Processor Name*: This is the user-defined name of the Processor. By 
default, the name of the Processor is
+       the same as the Processor Type. In the example, this value is "Copy to 
/review".
+       
+- *Active Tasks*: The number of tasks that this Processor is currently 
executing. This number is constrained
+       by the ``Concurrent tasks'' setting in the ``Scheduling'' tab of the 
Processor configuration dialog.
+       Here, we can see that the Processor is currently performing two tasks. 
If the NiFi instance is clustered,
+       this value represents the number of tasks that are currently executing 
across all nodes in the cluster.
+       
+- *5-Minute Statistics*: The Processor shows several different statistics in 
tabular form. Each of these
+       statistics represents the amount of work that has been performed in the 
past five minutes. If the NiFi
+       instance is clustered, these values indicate how much work has been 
done by all of the Nodes combined
+       in the past five minutes. These metrics are:
+       
+       ** *In*: The amount of data that the Processor has pulled from the 
queues of its incoming Connections.
+               This value is represented as <count> / <size> where <count> is 
the number of FlowFiles that have been
+               pulled from the queues and <size> is the total size of those 
FlowFiles' content. In this example,
+               the Processor has pulled 884 FlowFiles from the input queues, 
for a total of 8.85 megabytes (MB).
+       ** *Read/Write*: The total size of the FlowFile content that the 
Processor has read from disk and written
+               to disk. This provides valuable information about the I/O 
performance that this Processor requires.
+               Some Processors may only read the data without writing anything 
while some will not read the data but
+               will only write data. Others will neither read nor write data, 
and some Processors will both read
+               and write data. In this example, we see that in the past five 
minutes, this Processor has read 4.7
+               MB of the FlowFile content and has written 4.7 MB as well. This 
is what we would expect,
+               since this Processor simply copies the contents of a FlowFile 
to disk. Note, however, that this is
+               not the same as the amount of data that it pulled from its 
input queues. This is because some of
+               the files that it pulled from the input queues already exist in 
the output directory, and the
+               Processor is configured to route FlowFiles to failure when this 
occurs. Therefore, for those files
+               which already existed in the output directory, data was neither 
read nor written to disk.
+       ** *Out*: The amount of data that the Processor has transferred to its 
outbound Connections. This does
+               not include FlowFiles that the Processor removes itself, or 
FlowFiles that are routed to connections
+               that are auto-terminated. Like the ``In'' metric above, this 
value is represented as <count> / <size> 
+               where <count> is the number of FlowFiles that have been 
transferred to outbound Connections and <size>
+               is the total size of those FlowFiles' content. In this example, 
all of the Relationships are configured to be
+               auto-terminated, so no FlowFiles are reported as having been 
transferred Out.
+       ** *Tasks/Time*: The number of times that this Processor has been 
triggered to run in the past 5 minutes, and
+               the amount of time taken to perform those tasks. The format of 
the time is <hour>:<minute>:<second>. Note
+               that the amount of time taken can exceed five minutes, because 
many tasks can be executed in parallel. For
+               instance, if the Processor is scheduled to run with 60 
Concurrent tasks, and each of those tasks takes one
+               second to complete, it is possible that all 60 tasks will be 
completed in a single second. However, in this
+               case we will see the Time metric showing that it took 60 
seconds, instead of 1 second. This time can be
+               thought of as ``System Time,'' or said another way, this value 
is 60 seconds because that's the amount of
+               time it would have taken to perform the action if only a single 
concurrent task were used.
+       
+                
+
+
+
+[[process_group_anatomy]]
+=== Anatomy of a Process Group
+
+The Process Group provides a mechanism for grouping components together into a 
logical construct in order
+to organize the DataFlow in a way that makes it more understandable from a 
higher level. 
+The following image highlights the different elements that make up the anatomy 
of a Process Group:
+
+image::process-group-anatomy.png["Anatomy of a Process Group"]
+
+The Process Group consists of the following elements:
+
+- *Name*: This is the user-defined name of the Process Group. This name is set 
when the Process Group
+       is added to the canvas. The name can later by changed by right-clicking 
on the Process Group and clicking
+       the ``Configure'' menu option. In this example, the name of the Process 
Group is ``Process Group ABC.''
+       
+- *Bulletin Indicator*: When a child component of a Process Group emits a 
bulletin, that bulletin is propagated to
+       the component's parent Process Group, as well. When any component has 
an active Bulletin, this indicator will appear,
+       allowing the user to hover over the icon with the mouse to see Bulletin.
+       
+- *Active Tasks*: The number of tasks that are currently executing by the 
components within this
+       Process Group. Here, we can see that the Process Group is currently 
performing one task. If the 
+       NiFi instance is clustered, this value represents the number of tasks 
that are currently executing 
+       across all nodes in the cluster.
+       
+- *Comments*: When the Process Group is added to the canvas, the user is given 
the option of specifying Comments in order
+       to provide information about the Process Group. The comments can later 
be changed by right-clicking on the Process
+       Group and clicking the ``Configure'' menu option. In this example, the 
Comments are set to ``Example Process Group.''
+
+- *Statistics*: Process Groups provide statistics about the amount of data 
that has been processed by the Process Group in
+       the past 5 minutes as well as the amount of data currently enqueued 
within the Process Group. The following elements
+       comprise the ``Statistics'' portion of a Process Group:
+       ** *Queued*: The number of FlowFiles currently enqueued within the 
Process Group.
+               This field is represented as <count> / <size> where <count> is 
the number of FlowFiles that are
+               currently enqueued in the Process Group and <size> is the total 
size of those FlowFiles' content. In this example,
+               the Process Group currently has 1,738 FlowFiles enqueued; those 
FlowFiles have a total size of 350.03 megabytes (MB).
+       
+       ** *In*: The number of FlowFiles that have been transferred into the 
Process Group through all of its Input Ports
+               over the past 5 minutes. This field is represented as <count> / 
<size> where <count> is the number of FlowFiles that
+               have entered the Process Group in the past 5 minutes and <size> 
is the total size of those FlowFiles' content. 
+               In this example, 686 FlowFiles have entered the Process Group 
and their total size is 214.01 MB.
+       
+       ** *Read/Write*: The total size of the FlowFile content that the 
components within the Process Group have 
+               read from disk and written to disk. This provides valuable 
information about the I/O performance that this 
+               Process Group requires. In this example, we see that in the 
past five minutes, components within this 
+               Process Group have read 72.9 MB of the FlowFile content and 
have written 686.65 MB.
+               
+       ** *Out*: The number of FlowFiles that have been transferred out of the 
Process Group through its Output Ports
+               over the past 5 minutes. This field is represented as <count> / 
<size> where <count> is the number of FlowFiles that
+               have exited the Process Group in the past 5 minutes and <size> 
is the total size of those FlowFiles' content.
+               In this example, 657 FlowFiles have exited the Process Group 
and their total size is 477.74 MB.
+ 
+- *Component Counts*: The Component Counts element provides information about 
how many components of each type exist
+       within the Process Group. The following provides information about each 
of these icons and their meanings:
+       
+       ** image:iconInputPortSmall.png["Input Port", width=16]
+               *Input Ports*: The number of Input Ports that exist directly 
within this Process Group. This does not include any
+                       Input Ports that exist within child Process Groups, as 
child groups' ports cannot be accessed directly.
+                       
+       ** image:iconOutputPortSmall.png["Output Port"]
+               *Output Ports*: The number of Output Ports that exist directly 
within this Process Group. This does not include any
+                        Output Ports that exist within child Process Group as 
child groups' ports cannot be accessed directly.
+                        
+       ** image:iconTransmissionActive.png["Transmission Active"]
+               *Transmitting Ports*: The number of Remote Process Group Ports 
that currently are configured to transmit data to remote
+                       instances of NiFi or pull data from remote instances of 
NiFi.
+                       
+       ** image:iconTransmissionInactive.png["Transmission Inactive"]
+               *Non-Transmitting Ports*: The number of Remote Process Group 
Ports that are currently connected to components within this
+                       Process Group but currently have their transmission 
disabled.
+                       
+       ** image:iconRun.png["Running"]
+               *Running Components*: The number of Processors, Input Ports, 
and Output Ports that are currently running within this
+                       Process Group.
+       
+       ** image:iconStop.png["Stopped Components"]
+               *Stopped Components*: The number of Processors, Input Ports, 
and Output Ports that are currently not running but are
+                       valid and enabled. These components are ready to be 
started.
+                       
+       ** image:iconAlert.png["Invalid Components"]
+               *Invalid Components*: The number of Processors, Input Ports, 
and Output Ports that are enabled but are currently 
+                       not in a valid state. This may be due to misconfigured 
properties or missing Relationships.
+               
+       ** image:iconDisable.png["Disabled Components"]
+               *Disabled Components*: The number of Processors, Input Ports, 
and Output Ports that are currently disabled. These
+                       components may or may not be valid. If the Process 
Group is started, these components will not cause any errors
+                       but will not be started. 
+
+
+
+
+
+[[remote_group_anatomy]]
+=== Anatomy of a Remote Process Group
+
+When creating a DataFlow, it is often necessary to transfer data from one 
instance of NiFi to another.
+In this case, the remote instance of NiFi can be thought of as a Process 
Group. For this reason, NiFi
+provides the concept of a Remote Process Group. From the User Interface, the 
Remote Process Group
+looks similar to the Process Group. However, rather than showing information 
about the inner workings
+and state of a Remote Process Group, such as queue sizes, the information 
rendered about a Remote
+Process Group is related to the interaction that occurs between this instance 
of NiFi and the remote
+instance.
+
+image::remote-group-anatomy.png["Anatomy of a Remote Process Group"]
+
+The image above shows the different elements that make up a Remote Process 
Group. Here, we provide an
+explanation of the icons and details about the information provided.
+
+- *Transmission Status*: The Transmission Status indicates whether or not data 
Transmission between this
+       instance of NiFi and the remote instance is currently enabled. The icon 
shown will be the
+       Transmission Enabled icon (
+image:iconTransmissionActive.png["Transmission Active"]
+       ) if any of the Input Ports or Output Ports is currently configured to 
transmit or the Transmission 
+       Disabled icon (
+image:iconTransmissionInactive.png["Transmission Inactive"]
+       ) if all of the Input Ports and Output Ports that are currently 
connected are stopped.
+
+- *Remote Instance Name*: This is the name of the NiFi instance that was 
reported by the remote instance.
+       When the Remote Process Group is first created, before this information 
has been obtained, the URL
+       of the remote instance will be shown here instead.
+
+- *Remote Instance URL*: This is the URL of the remote instance that the 
Remote Process Group points to.
+       This URL is entered when the Remote Process Group is added to the 
canvas and it cannot be changed.
+       
+- *Secure Indicator*: This icon indicates whether or not communications with 
the remote NiFi instance are
+       secure. If communications with the remote instance are secure, this 
will be indicated by the ``locked''
+       icon (
+image:iconSecure.png["Secure"]
+       ). If the communications are not secure, this will be indicated by the 
``unlocked'' icon (
+image:iconNotSecure.png["Not Secure"]
+       ). If the communications are secure, this instance of NiFi will not be 
able to communicate with the
+       remote instance until an administrator for the remote instance grants 
access. Whenever the Remote Process
+       Group is added to the canvas, this will automatically initiate a 
request to have a user for this instance of NiFi created on the
+       remote instance. This instance will be unable to communicate with the 
remote instance until an administrator 
+       on the remote instance adds the user to the system and adds the 
``NiFi'' role to the user.
+       In the event that communications are not secure, the Remote Process 
Group is able to receive data from anyone,
+       and the data is not encrypted while it is transferred between instances 
of NiFi.
+       
+- *Input Ports*: This section shows three pieces of information:
+       ** image:iconInputPortSmall.png["Input Ports"]
+               *Input Ports*: The number of Input Ports that are available to 
send data to on the remote instance of NiFi.
+                       If the remote instance is secure, only the ports to 
which this instance of NiFi has been granted access
+                       will be counted.
+                       
+       ** image:iconTransmissionActive.png["Transmitting"]
+               *Transmitting Ports*: The number of Input Ports to which this 
NiFi is connected and currently configured to
+                       send data to. Ports can be turned on and off by 
enabling and disabling transmission on the Remote Process
+                       Group (see <<Remote_Group_Transmission>>) or via the 
<<Remote_Port_Configuration>> dialog.
+       
+       ** image:iconTransmissionInactive.png["Not Transmitting"]
+               *Non-Transmitting Ports*: The number of Input Ports to which 
this NiFi is connected but is not currently configured
+                       to send data to. Ports can be turned on and off by 
enabling and disabling transmission on the Remote Process
+                       Group (see <<Remote_Group_Transmission>>) or via the 
<<Remote_Port_Configuration>> dialog.
+
+- *Output Ports*: Similar to the ``Input Ports'' section above, this element 
shows three pieces of information:
+       ** image:iconOutputPortSmall.png["Output Ports"]
+               *Output Ports*: The number of Output Ports that are available 
to pull data from the remote instance of NiFi.
+                       If the remote instance is secure, only the ports to 
which this instance of NiFi has been granted access
+                       will be counted.
+                       
+       ** image:iconTransmissionActive.png["Transmitting"]
+               *Transmitting Ports*: The number of Output Ports from whcih 
this NiFi is connected and currently configured
+                       to pull data from. Ports can be turned on and off by 
enabling and disabling transmission on the Remote Process
+                       Group (see <<Remote_Group_Transmission>>) or via the 
<<Remote_Port_Configuration>> dialog.
+                       
+       ** image:iconTransmissionInactive.png["Not Transmitting"]
+               *Non-Transmitting Ports*: The number of Output Ports to which 
this NiFi is connected but is not currently configured
+                       to pull data from. Ports can be turned on and off by 
enabling and disabling transmission on the Remote Process
+                       Group (see <<Remote_Group_Transmission>>) or via the 
<<Remote_Port_Configuration>> dialog.
+                       
+- *5-Minute Statistics*: Two statistics are shown for Remote Process Groups: 
*Sent* and *Received*. Both of these are
+       in the format <count> / <size> where <count> is the number of FlowFiles 
that have been sent or received in the previous
+       five minutes and <size> is the total size of those FlowFiles' content. 
+
+- *Comments*: The Comments that are provided for a Remote Process Group are 
not comments added by the users of this NiFi but
+       rather the Comments added by the administrators of the remote instance. 
These comments indicate the purpose of the NiFi
+       instance as a whole.
+
+- *Last Refreshed Time*: The information that is pulled from a remote instance 
and rendered on the Remote Process Group
+       in the User Interface is periodically refreshed in the background. This 
element indicates the time at which that refresh
+       last happened, or if the information has not been refreshed for a 
significant amount of time, the value will change to
+       indicate _Remote flow not current_. NiFi can be triggered to initiate a 
refresh of this information by right-clicking
+       on the Remote Process Group and choosing the ``Refresh flow'' menu item.
+
+
+
+
+[[Summary_Page]]
+=== Summary Page
+
+While the NiFi canvas is useful for understanding how the configured DataFlow 
is laid out, this view is not always optimal
+when trying to discern the status of the system. In order to help the user 
understand how the DataFlow is functioning
+at a higher level, NiFi provides a Summary page. This page is available in the 
Management Toolbar in the top-right corner
+of the User Interface. See the <<User_Interface>> section for more information 
about the location of this toolbar.
+
+The Summary Page is opened by clicking the Summary icon (
+image:iconSummary.png["Summary"]
+) from the Management Toolbar. This opens the Summary table dialog:
+
+image::summary-table.png["Summary Table"]
+
+This dialog provides a great deal of information about each of the components 
on the graph. Below, we have annotated
+the different elements within the dialog in order to make the discussion of 
the dialog easier.
+
+image::summary-annotated.png["Summary Table Annotated"]
+
+The Summary page consists mostly of a table that provides information about 
each of the components on the canvas. Above this
+table is a set of five tabs that can be used to view the different types of 
components. The information provided in the table
+is the same information that is provided for each component on the canvas. 
Each of the columns in the table may be sorted by
+double-clicking on the heading of the column. For more on the types of 
information displayed, see the sections
+<<processor_anatomy>>, <<process_group_anatomy>>, and <<remote_group_anatomy>> 
above.
+
+The Summary page also includes the following elements:
+
+- *Bulletin Indicator*: As in other places throughout the User Interface, when 
this icon is present, hovering over the icon will
+       provide information about the Bulletin that was generated, including 
the message, the severity level, the time at which
+       the Bulletin was generated, and (in a clustered environment) the node 
that generated the Bulletin. Like all the columns in the
+       Summary table, this column where bulletins are shown may be sorted
+       by double-clicking on the heading so that all the currently existing 
bulletins are shown at the top of the list.
+       
+- *Details*: Clicking the Details icon will provide the user with the details 
of the component. This dialog is the same as the
+       dialog provided when the user right-clicks on the component and chooses 
the ``View configuration'' menu item.
+       
+- *Go To*: Clicking this button will close the Summary page and take the user 
directly to the component on the NiFi canvas. This
+       may change the Process Group that the user is currently in. This icon 
is not available if the Summary page has been opened
+       in a new browser tab or window (by clicking the ``Pop Out'' button, as 
described below).
+       
+- *Stats History*: Clicking the Stats History icon will open a new dialog that 
shows a historical view of the statistics that
+       are rendered for this component. See the section <<Stats_History>> for 
more information.
+
+- *Refresh*: The Refresh button allows the user to refresh the information 
displayed without closing the dialog and opening it
+       again. The time at which the information was last refreshed is shown 
just to the right of the Refresh button. The information
+       on the page is not automatically refreshed.
+       
+- *Filter*: The Filter element allows users to filter the contents of the 
Summary table by typing in all or part of some criteria,
+       such as a Processor Type or Processor Name. The types of filters 
available differ according to the selected tab. For instance,
+       if viewing the Processor tab, the user is able to filter by name or by 
type. When viewing the Connections tab, the user
+       is able to filter by source name, destination name, or Connection name. 
The filter is automatically applied when the contents
+       of the text box are changed. Below the text box is an indicator of how 
many entries in the table match the filter and how many
+       entries exist in the table.
+
+- *Pop-Out*: When monitoring a flow, it is helpful to be able to open the 
Summary table in a separate browser tab or window. The
+       Pop-Out button, next to the Close button, will cause the entire Summary 
dialog to be opened in a new browser tab or window
+       (depending on the configuration of the browser). Once the page is 
``popped out'', the dialog is closed in the original
+       browser tab/window. In the new tab/window, the Pop-Out button and the 
Go-To button will no longer be available.
+
+- *System Diagnostics*: The System Diagnostics window provides information 
about how the system is performing with respect to
+       system resource utilization. While this is intended mostly for 
administrators, it is provided in this view because it
+       does provide a summary of the system. This dialog shows information 
such as CPU utilization, how full the disks are,
+       and Java-specific metrics, such as memory size and utilization, as well 
as Garbage Collection information.
+       
+
+
+
+[[Stats_History]]
+=== Historical Statistics of a Component
+
+While the Summary table and the canvas show numeric statistics pertaining to 
the performance of a component over the
+past five minutes, it is often useful to have a view of historical statistics 
as well. This information is available
+by right-clicking on a component and choosing the ``Stats'' menu option or by 
clicking on the Stats History in the Summary page (see <<Summary_Page>>
+for more information).
+
+The amount of historical information that is sto


<TRUNCATED>

[14/51] [partial] nifi git commit: NIFI-850 removed nifi parent, updated nifi pom, moved all nifi subdirs up one level, fixed readme.

Reply via email to