Modified: falcon/site/FalconCLI.html
URL: 
http://svn.apache.org/viewvc/falcon/site/FalconCLI.html?rev=1660589&r1=1660588&r2=1660589&view=diff
==============================================================================
--- falcon/site/FalconCLI.html (original)
+++ falcon/site/FalconCLI.html Wed Feb 18 10:55:56 2015
@@ -1,13 +1,13 @@
 <!DOCTYPE html>
 <!--
- | Generated by Apache Maven Doxia at 2015-01-11
+ | Generated by Apache Maven Doxia at 2015-02-18
  | Rendered using Apache Maven Fluido Skin 1.3.0
 -->
 <html xmlns="http://www.w3.org/1999/xhtml"; xml:lang="en" lang="en">
   <head>
     <meta charset="UTF-8" />
     <meta name="viewport" content="width=device-width, initial-scale=1.0" />
-    <meta name="Date-Revision-yyyymmdd" content="20150111" />
+    <meta name="Date-Revision-yyyymmdd" content="20150218" />
     <meta http-equiv="Content-Language" content="en" />
     <title>Falcon - FalconCLI</title>
     <link rel="stylesheet" href="./css/apache-maven-fluido-1.3.0.min.css" />
@@ -245,7 +245,7 @@
         
                 
                     
-                  <li id="publishDate" class="pull-right">Last Published: 
2015-01-11</li> 
+                  <li id="publishDate" class="pull-right">Last Published: 
2015-02-18</li> 
             
                             </ul>
       </div>
@@ -290,13 +290,18 @@
 <div class="section">
 <h4>Summary<a name="Summary"></a></h4>
 <p>Summary of entities of a particular type and a cluster will be listed. 
Entity summary has N most recent instances of entity.</p>
-<p>Usage: $FALCON_HOME/bin/falcon entity -type [cluster|feed|process] 
-summary</p>
+<p>Usage: $FALCON_HOME/bin/falcon entity -type [feed|process] -summary</p>
 <p>Optional Args : -start &quot;yyyy-MM-dd'T'HH:mm'Z'&quot; -end 
&quot;yyyy-MM-dd'T'HH:mm'Z'&quot; -fields &lt;&lt;field1,field2&gt;&gt; 
-filterBy &lt;&lt;field1:value1,field2:value2&gt;&gt; -tags 
&lt;&lt;tagkey=tagvalue,tagkey=tagvalue&gt;&gt; -orderBy &lt;&lt;field&gt;&gt; 
-sortOrder &lt;&lt;sortOrder&gt;&gt; -offset 0 -numResults 10 -numInstances 
7</p>
 <p><a href="./Restapi/EntitySummary.html">Optional params described 
here.</a></p></div>
 <div class="section">
 <h4>Update<a name="Update"></a></h4>
 <p>Update operation allows an already submitted/scheduled entity to be 
updated. Cluster update is currently not allowed.</p>
-<p>Usage: $FALCON_HOME/bin/falcon entity  -type [feed|process] -name 
&lt;&lt;name&gt;&gt; -update [-effective &lt;&lt;effective 
time&gt;&gt;]</p></div>
+<p>Usage: $FALCON_HOME/bin/falcon entity  -type [feed|process] -name 
&lt;&lt;name&gt;&gt; -update -file &lt;&lt;path_to_file&gt;&gt;</p>
+<p>Example: $FALCON_HOME/bin/falcon entity -type process -name <a 
href="./HourlyReportsGenerator.html">HourlyReportsGenerator</a> -update -file 
/process/definition.xml</p></div>
+<div class="section">
+<h4>Touch<a name="Touch"></a></h4>
+<p>Force Update operation allows an already submitted/scheduled entity to be 
updated.</p>
+<p>Usage: $FALCON_HOME/bin/falcon entity  -type [feed|process] -name 
&lt;&lt;name&gt;&gt; -touch</p></div>
 <div class="section">
 <h4>Status<a name="Status"></a></h4>
 <p>Status returns the current status of the entity.</p>
@@ -314,8 +319,7 @@
 <div class="section">
 <h4>Kill<a name="Kill"></a></h4>
 <p>Kill sub-command is used to kill all the instances of the specified process 
whose nominal time is between the given start time and end time.</p>
-<p>Note:  1. For all the instance management sub-commands, if end time is not 
specified, Falcon will perform the actions on all the instances whose instance 
time falls after the start time.</p>
-<p>2. The start time and end time needs to be specified in TZ format.  
Example:   01 Jan 2012 01:00  =&gt; 2012-01-01T01:00Z</p>
+<p>Note:  1. The start time and end time needs to be specified in TZ format. 
Example:   01 Jan 2012 01:00  =&gt; 2012-01-01T01:00Z</p>
 <p>3. Process name is compulsory parameter for each instance management 
command.</p>
 <p>Usage: $FALCON_HOME/bin/falcon instance -type &lt;&lt;feed/process&gt;&gt; 
-name &lt;&lt;name&gt;&gt; -kill -start &quot;yyyy-MM-dd'T'HH:mm'Z'&quot; -end 
&quot;yyyy-MM-dd'T'HH:mm'Z'&quot;</p></div>
 <div class="section">
@@ -324,19 +328,19 @@
 <p>Usage: $FALCON_HOME/bin/falcon instance -type &lt;&lt;feed/process&gt;&gt; 
-name &lt;&lt;name&gt;&gt; -suspend -start &quot;yyyy-MM-dd'T'HH:mm'Z'&quot; 
-end &quot;yyyy-MM-dd'T'HH:mm'Z'&quot;</p></div>
 <div class="section">
 <h4>Continue<a name="Continue"></a></h4>
-<p>Continue option is used to continue the failed workflow instance. This 
option is valid only for process instances in terminal state, i.e. SUCCEDDED, 
KILLED or FAILED.</p>
-<p>Usage: $FALCON_HOME/bin/falcon instance -type &lt;&lt;feed/process&gt;&gt; 
-name &lt;&lt;name&gt;&gt; -re-run -start &quot;yyyy-MM-dd'T'HH:mm'Z'&quot; 
-end &quot;yyyy-MM-dd'T'HH:mm'Z'&quot;</p></div>
+<p>Continue option is used to continue the failed workflow instance. This 
option is valid only for process instances in terminal state, i.e. KILLED or 
FAILED.</p>
+<p>Usage: $FALCON_HOME/bin/falcon instance -type &lt;&lt;feed/process&gt;&gt; 
-name &lt;&lt;name&gt;&gt; -continue -start &quot;yyyy-MM-dd'T'HH:mm'Z'&quot; 
-end &quot;yyyy-MM-dd'T'HH:mm'Z'&quot;</p></div>
 <div class="section">
 <h4>Rerun<a name="Rerun"></a></h4>
 <p>Rerun option is used to rerun instances of a given process. This option is 
valid only for process instances in terminal state, i.e. SUCCEDDED, KILLED or 
FAILED. Optionally, you can specify the properties to override.</p>
-<p>Usage: $FALCON_HOME/bin/falcon instance -type &lt;&lt;feed/process&gt;&gt; 
-name &lt;&lt;name&gt;&gt; -re-run -start &quot;yyyy-MM-dd'T'HH:mm'Z'&quot; 
-end &quot;yyyy-MM-dd'T'HH:mm'Z'&quot; [-file &lt;&lt;properties 
file&gt;&gt;]</p></div>
+<p>Usage: $FALCON_HOME/bin/falcon instance -type &lt;&lt;feed/process&gt;&gt; 
-name &lt;&lt;name&gt;&gt; -rerun -start &quot;yyyy-MM-dd'T'HH:mm'Z'&quot; -end 
&quot;yyyy-MM-dd'T'HH:mm'Z'&quot; [-file &lt;&lt;properties 
file&gt;&gt;]</p></div>
 <div class="section">
 <h4>Resume<a name="Resume"></a></h4>
 <p>Resume option is used to resume any instance that  is in suspended 
state.</p>
 <p>Usage: $FALCON_HOME/bin/falcon instance -type &lt;&lt;feed/process&gt;&gt; 
-name &lt;&lt;name&gt;&gt; -resume -start &quot;yyyy-MM-dd'T'HH:mm'Z'&quot; 
-end &quot;yyyy-MM-dd'T'HH:mm'Z'&quot;</p></div>
 <div class="section">
 <h4>Status<a name="Status"></a></h4>
-<p>Status option via CLI can be used to get the status of a single or multiple 
instances.  If the instance is not yet materialized but is within the process 
validity range, WAITING is returned as the state. Along with the status of the 
instance time is also returned. Log location gives the oozie workflow url If 
the instance is in WAITING state, missing dependencies are listed</p>
+<p>Status option via CLI can be used to get the status of a single or multiple 
instances.  If the instance is not yet materialized but is within the process 
validity range, WAITING is returned as the state. Along with the status of the 
instance time is also returned. Log location gives the oozie workflow url If 
the instance is in WAITING state, missing dependencies are listed. The job urls 
are populated for all actions of user workflow and non-succeeded actions of the 
main-workflow. The user then need not go to the underlying scheduler to get the 
job urls when needed to debug an issue in the job.</p>
 <p>Example : Suppose a process has 3 instance, one has succeeded,one is in 
running state and other one is waiting, the expected output is:</p>
 
<p>{&quot;status&quot;:&quot;SUCCEEDED&quot;,&quot;message&quot;:&quot;getStatus
 is 
successful&quot;,&quot;instances&quot;:[{&quot;instance&quot;:&quot;2012-05-07T05:02Z&quot;,&quot;status&quot;:&quot;SUCCEEDED&quot;,&quot;logFile&quot;:&quot;http://oozie-dashboard-url&quot;},{&quot;instance&quot;:&quot;2012-05-07T05:07Z&quot;,&quot;status&quot;:&quot;RUNNING&quot;,&quot;logFile&quot;:&quot;http://oozie-dashboard-url&quot;},
 
{&quot;instance&quot;:&quot;2010-01-02T11:05Z&quot;,&quot;status&quot;:&quot;WAITING&quot;}]</p>
 <p>Usage: $FALCON_HOME/bin/falcon instance -type &lt;&lt;feed/process&gt;&gt; 
-name &lt;&lt;name&gt;&gt; -status</p>
@@ -365,6 +369,12 @@
 <p>Optional Args : -colo &lt;&lt;colo&gt;&gt; -lifecycle 
&lt;&lt;lifecycles&gt;&gt; -filterBy 
&lt;&lt;field1:value1,field2:value2&gt;&gt; -orderBy &lt;&lt;field&gt;&gt; 
-sortOrder &lt;&lt;sortOrder&gt;&gt; -offset 0 -numResults 10</p>
 <p><a href="./Restapi/InstanceRunning.html">Optional params described 
here.</a></p></div>
 <div class="section">
+<h4>FeedInstanceListing<a name="FeedInstanceListing"></a></h4>
+<p>Get falcon feed instance availability.</p>
+<p>Usage: $FALCON_HOME/bin/falcon instance -entity feed -name 
&lt;&lt;name&gt;&gt; -listing</p>
+<p>Optional Args : -start &quot;yyyy-MM-dd'T'HH:mm'Z'&quot; -end 
&quot;yyyy-MM-dd'T'HH:mm'Z'&quot; -colo &lt;&lt;colo&gt;&gt;</p>
+<p><a href="./Restapi/FeedInstanceListing.html">Optional params described 
here.</a></p></div>
+<div class="section">
 <h4>Logs<a name="Logs"></a></h4>
 <p>Get logs for instance actions</p>
 <p>Usage: $FALCON_HOME/bin/falcon instance -type &lt;&lt;feed/process&gt;&gt; 
-name &lt;&lt;name&gt;&gt; -logs</p>
@@ -379,27 +389,44 @@
 <p>Displays the workflow params of a given instance. Where start time is 
considered as nominal time of that instance.</p>
 <p>Usage: $FALCON_HOME/bin/falcon instance -type &lt;&lt;feed/process&gt;&gt; 
-name &lt;&lt;name&gt;&gt; -params -start 
&quot;yyyy-MM-dd'T'HH:mm'Z'&quot;</p></div>
 <div class="section">
-<h3>Graphs Options<a name="Graphs_Options"></a></h3></div>
+<h3>Metadata Lineage Options<a name="Metadata_Lineage_Options"></a></h3></div>
+<div class="section">
+<h4>Lineage<a name="Lineage"></a></h4>
+<p><a href="http://www.graphviz.org/content/dot-language";>dot</a> format. You 
can use the output and view a graphical representation of DAG using an online 
graphviz viewer like <a href="http://graphviz-dev.appspot.com/";>this</a>.</p>
+<p>Usage:</p>
+<p>$FALCON_HOME/bin/falcon metadata -lineage -pipeline my-pipeline</p>
+<p>pipeline is a mandatory option.</p></div>
 <div class="section">
 <h4>Vertex<a name="Vertex"></a></h4>
 <p>Get the vertex with the specified id.</p>
-<p>Usage: $FALCON_HOME/bin/falcon graph -vertex -id &lt;&lt;id&gt;&gt;</p>
-<p>Example: $FALCON_HOME/bin/falcon graph -vertex -id 4</p></div>
+<p>Usage: $FALCON_HOME/bin/falcon metadata -vertex -id &lt;&lt;id&gt;&gt;</p>
+<p>Example: $FALCON_HOME/bin/falcon metadata -vertex -id 4</p></div>
 <div class="section">
 <h4>Vertices<a name="Vertices"></a></h4>
 <p>Get all vertices for a key index given the specified value.</p>
-<p>Usage: $FALCON_HOME/bin/falcon graph -vertices -key &lt;&lt;key&gt;&gt; 
-value &lt;&lt;value&gt;&gt;</p>
-<p>Example: $FALCON_HOME/bin/falcon graph -vertices -key type -value 
feed-instance</p></div>
+<p>Usage: $FALCON_HOME/bin/falcon metadata -vertices -key &lt;&lt;key&gt;&gt; 
-value &lt;&lt;value&gt;&gt;</p>
+<p>Example: $FALCON_HOME/bin/falcon metadata -vertices -key type -value 
feed-instance</p></div>
 <div class="section">
 <h4>Vertex Edges<a name="Vertex_Edges"></a></h4>
 <p>Get the adjacent vertices or edges of the vertex with the specified 
direction.</p>
-<p>Usage: $FALCON_HOME/bin/falcon graph -edges -id &lt;&lt;vertex-id&gt;&gt; 
-direction &lt;&lt;direction&gt;&gt;</p>
-<p>Example: $FALCON_HOME/bin/falcon graph -edges -id 4 -direction both 
$FALCON_HOME/bin/falcon graph -edges -id 4 -direction inE</p></div>
+<p>Usage: $FALCON_HOME/bin/falcon metadata -edges -id 
&lt;&lt;vertex-id&gt;&gt; -direction &lt;&lt;direction&gt;&gt;</p>
+<p>Example: $FALCON_HOME/bin/falcon metadata -edges -id 4 -direction both 
$FALCON_HOME/bin/falcon metadata -edges -id 4 -direction inE</p></div>
 <div class="section">
 <h4>Edge<a name="Edge"></a></h4>
 <p>Get the edge with the specified id.</p>
-<p>Usage: $FALCON_HOME/bin/falcon graph -edge -id &lt;&lt;id&gt;&gt;</p>
-<p>Example: $FALCON_HOME/bin/falcon graph -edge -id Q9n-Q-5g</p></div>
+<p>Usage: $FALCON_HOME/bin/falcon metadata -edge -id &lt;&lt;id&gt;&gt;</p>
+<p>Example: $FALCON_HOME/bin/falcon metadata -edge -id Q9n-Q-5g</p></div>
+<div class="section">
+<h3>Metadata Discovery Options<a 
name="Metadata_Discovery_Options"></a></h3></div>
+<div class="section">
+<h4>List<a name="List"></a></h4>
+<p>Lists of all dimensions of given type. If the user provides optional param 
cluster, only the dimensions related to the cluster are listed. Usage: 
$FALCON_HOME/bin/falcon metadata -list -type 
[cluster_entity|feed_entity|process_entity|user|colo|tags|groups|pipelines]</p>
+<p>Optional Args : -cluster &lt;&lt;cluster name&gt;&gt;</p>
+<p>Example: $FALCON_HOME/bin/falcon metadata -list -type process_entity 
-cluster primary-cluster $FALCON_HOME/bin/falcon metadata -list -type 
tags</p></div>
+<div class="section">
+<h4>Relations<a name="Relations"></a></h4>
+<p>List all dimensions related to specified Dimension identified by 
dimension-type and dimension-name. Usage: $FALCON_HOME/bin/falcon metadata 
-relations -type 
[cluster_entity|feed_entity|process_entity|user|colo|tags|groups|pipelines] 
-name &lt;&lt;Dimension Name&gt;&gt;</p>
+<p>Example: $FALCON_HOME/bin/falcon metadata -relations -type process_entity 
-name sample-process</p></div>
 <div class="section">
 <h3>Admin Options<a name="Admin_Options"></a></h3></div>
 <div class="section">
@@ -411,6 +438,14 @@
 <div class="section">
 <h4>Status<a name="Status"></a></h4>
 <p>Status returns the current state of Falcon (running or stopped). Usage: 
$FALCON_HOME/bin/falcon admin -status</p></div>
+<div class="section">
+<h3>Recipe Options<a name="Recipe_Options"></a></h3></div>
+<div class="section">
+<h4>Submit Recipe<a name="Submit_Recipe"></a></h4>
+<p>Submit the specified recipe.</p>
+<p>Usage: $FALCON_HOME/bin/falcon recipe -name &lt;name&gt; Name of the 
recipe. User should have defined &lt;name&gt;-template.xml and 
&lt;name&gt;.properties in the path specified by falcon.recipe.path in 
client.properties file. falcon.home path is used if its not specified in 
client.properties file. If its not specified in client.properties file and also 
if files cannot be found at falcon.home, Falcon CLI will fail.</p>
+<p>Optional Args : -tool &lt;recipeToolClassName&gt; Falcon provides a base 
tool that recipes can override. If this option is not specified the default 
Recipe Tool RecipeTool defined is used. This option is required if user defines 
his own recipe tool class.</p>
+<p>Example: $FALCON_HOME/bin/falcon recipe -name hdfs-replication</p></div>
                   </div>
           </div>
 

Modified: falcon/site/FalconDocumentation.html
URL: 
http://svn.apache.org/viewvc/falcon/site/FalconDocumentation.html?rev=1660589&r1=1660588&r2=1660589&view=diff
==============================================================================
--- falcon/site/FalconDocumentation.html (original)
+++ falcon/site/FalconDocumentation.html Wed Feb 18 10:55:56 2015
@@ -1,13 +1,13 @@
 <!DOCTYPE html>
 <!--
- | Generated by Apache Maven Doxia at 2015-01-11
+ | Generated by Apache Maven Doxia at 2015-02-18
  | Rendered using Apache Maven Fluido Skin 1.3.0
 -->
 <html xmlns="http://www.w3.org/1999/xhtml"; xml:lang="en" lang="en">
   <head>
     <meta charset="UTF-8" />
     <meta name="viewport" content="width=device-width, initial-scale=1.0" />
-    <meta name="Date-Revision-yyyymmdd" content="20150111" />
+    <meta name="Date-Revision-yyyymmdd" content="20150218" />
     <meta http-equiv="Content-Language" content="en" />
     <title>Falcon - Contents</title>
     <link rel="stylesheet" href="./css/apache-maven-fluido-1.3.0.min.css" />
@@ -245,7 +245,7 @@
         
                 
                     
-                  <li id="publishDate" class="pull-right">Last Published: 
2015-01-11</li> 
+                  <li id="publishDate" class="pull-right">Last Published: 
2015-02-18</li> 
             
                             </ul>
       </div>
@@ -269,10 +269,12 @@
 <li><a href="#Updating_process_and_feed_definition">Updating process and feed 
definition</a></li>
 <li><a href="#Handling_late_input_data">Handling late input data</a></li>
 <li><a href="#Idempotency">Idempotency</a></li>
-<li><a href="#Alerting_and_Monitoring">Alerting and Monitoring</a></li>
 <li><a href="#Falcon_EL_Expressions">Falcon EL Expressions</a></li>
 <li><a href="#Lineage">Lineage</a></li>
-<li><a href="#Security">Security</a></li></ul></div>
+<li><a href="#Security">Security</a></li>
+<li><a href="#Recipes">Recipes</a></li>
+<li><a href="#Monitoring">Monitoring</a></li>
+<li><a href="#Backwards_Compatibility">Backwards Compatibility 
Instructions</a></li></ul></div>
 <div class="section">
 <h3>Architecture<a name="Architecture"></a></h3></div>
 <div class="section">
@@ -298,10 +300,10 @@
 <p>There are two basic components of Falcon set up. Falcon Prism and Falcon 
Server. As the name suggests Falcon Prism splits the request it gets to the 
Falcon Servers. More details below:</p></div>
 <div class="section">
 <h4>Stand Alone Mode<a name="Stand_Alone_Mode"></a></h4>
-<p>Stand alone mode is useful when the hadoop jobs and relevant data 
processing involves only one hadoop cluster. In this mode there is single 
Falcon server that contacts with oozie to schedule jobs on Hadoop. All the 
process / feed request like submit, schedule, suspend, kill are sent to this 
server only. For running in this mode one should use the falcon which has been 
built for standalone mode, or build using standalone option if using source 
code.</p></div>
+<p>Stand alone mode is useful when the hadoop jobs and relevant data 
processing involves only one hadoop cluster. In this mode there is a single 
Falcon server that contacts Oozie to schedule jobs on Hadoop. All the 
process/feed requests like submit, schedule, suspend, kill etc. are sent to 
this server. For running falcon in this mode one should use the falcon which 
has been built using standalone option.</p></div>
 <div class="section">
 <h4>Distributed Mode<a name="Distributed_Mode"></a></h4>
-<p>Distributed mode is the mode which you might me using most of the time. 
This is for organisations which have multiple instances of hadoop clusters, and 
multiple workflow schedulers to handle them. Here we have 2 components: Prism 
and Server. Both Prism and server have there own setup (runtime and startup 
properties) and there config locations. In this mode Prism acts as a contact 
point for Falcon servers. Below are the requests that can be sent to prism and 
server in this mode:</p>
+<p>Distributed mode is for multiple (colos) instances of hadoop clusters, and 
multiple workflow schedulers to handle them. In this mode falcon has 2 
components: Prism and Server(s). Both Prism and servers have their own setup 
(runtime and startup properties) and their own config locations. In this mode 
Prism acts as a contact point for Falcon servers. While all commands are 
available through Prism, only read and instance api's are available through 
Server. Below are the requests that can be sent to each of these:</p>
 <p>Prism: submit, schedule, submitAndSchedule, Suspend, Resume, Kill, instance 
management  Server: schedule, suspend, resume, instance management</p>
 <p>As observed above submit and kill are kept exclusively as Prism operations 
to keep all the config stores in sync and to support feature of idempotency. 
Request may also be sent from prism but directed to a specific server using the 
option &quot;-colo&quot; from CLI or append the same in web request, if using 
API.</p>
 <p>When a cluster is submitted it is by default sent to all the servers 
configured in the prism. When is feed is SUBMIT / SCHEDULED request is only 
sent to the servers specified in the feed / process definitions. Servers are 
mentioned in the feed / process via CLUSTER tags in xml definition.</p>
@@ -389,13 +391,10 @@ catalog:$database-name:$table-name#parti
 <p>Delete operation on the entity removes any scheduled activity on the 
workflow engine, besides removing the entity from the falcon configuration 
store. Delete operation on an entity would only succeed if there are no 
dependent entities on the deleted entity.</p></div>
 <div class="section">
 <h4>Update<a name="Update"></a></h4>
-<p>Update operation allows an already submitted/scheduled entity to be 
updated. Cluster update is currently not allowed. Feed update can cause 
cascading update to all the processes already scheduled. Process update 
triggers update in falcon if entity is updated/the user specified workflow/lib 
is updated. The following set of actions are performed in Oozie to realize an 
update:</p>
+<p>Update operation allows an already submitted/scheduled entity to be 
updated. Cluster update is currently not allowed. Feed update can cause 
cascading update to all the processes already scheduled. Process update 
triggers update in falcon if entity is updated. The following set of actions 
are performed in scheduler to realize an update:</p>
 <ul>
-<li>Suspend the previously scheduled Oozie coordinator. This is to prevent any 
new action from being triggered.</li>
-<li>Update the coordinator to set the end time to &quot;now&quot;</li>
-<li>Resume the suspended coordinators</li>
-<li>Schedule as per the new process/feed definition with the start time as 
&quot;now&quot;</li></ul>
-<p>Update optionally takes effective time as a parameter which is used as the 
end time of previously scheduled coordinator. So, the updated configuration 
will be effective since the given timestamp.</p></div>
+<li>Update the old scheduled entity to set the end time to &quot;now&quot;</li>
+<li>Schedule as per the new process/feed definition with the start time as 
&quot;now&quot;</li></ul></div>
 <div class="section">
 <h3>Instance Management actions<a name="Instance_Management_actions"></a></h3>
 <p>Instance Manager gives user the option to control individual instances of 
the process based on their instance start time (start time of that instance). 
Start time needs to be given in standard TZ format. Example: 01 Jan 2012 01:00 
=&gt; 2012-01-01T01:00Z</p>
@@ -442,7 +441,7 @@ catalog:$database-name:$table-name#parti
 <p>With the integration of Hive, Falcon also provides retention for tables in 
Hive catalog.</p></div>
 <div class="section">
 <h4>Example:<a name="Example:"></a></h4>
-<p>If retention period is 10 hours, and the policy kicks in at time 't', the 
data retained by system is essentially the one falling in between [t-10h,t]. 
Any data in the boundaries [-&#xef;&#xbf;&#xbd;,t-10h) and 
(t,&#xef;&#xbf;&#xbd;] is removed from the system.</p>
+<p>If retention period is 10 hours, and the policy kicks in at time 't', the 
data retained by system is essentially the one in range [t-10h, t]. Any data 
before t-10h and after t is removed from the system.</p>
 <p>The 'action' attribute can attain values of DELETE/ARCHIVE. Based upon the 
tag value, the data eligible for removal is either deleted/archived.</p></div>
 <div class="section">
 <h4>NOTE: Falcon 0.1/0.2 releases support Delete operation only<a 
name="NOTE:_Falcon_0.10.2_releases_support_Delete_operation_only"></a></h4></div>
@@ -520,6 +519,26 @@ catalog:$database-name:$table-name#parti
 <ul>
 <li>The partition is not complete and hence not visible to users until all the 
data is committed on the secondary</li></ul>cluster, (no dirty reads)</div>
 <div class="section">
+<h4>Archival as Replication<a name="Archival_as_Replication"></a></h4>
+<p>Falcon allows users to archive data from on-premice to cloud, either Azure 
WASB or S3. It uses the underlying replication for archiving data from source 
to target. The archival URI is specified as the overridden location for the 
target cluster.</p>
+<p><b>Example:</b></p>
+<div class="source">
+<pre>
+    &lt;clusters&gt;
+        &lt;cluster name=&quot;on-premise-cluster&quot; 
type=&quot;source&quot;&gt;
+            &lt;validity start=&quot;2021-11-01T00:00Z&quot; 
end=&quot;2021-12-31T00:00Z&quot;/&gt;
+        &lt;/cluster&gt;
+        &lt;cluster name=&quot;cloud-cluster&quot; type=&quot;target&quot;&gt;
+            &lt;validity start=&quot;2011-11-01T00:00Z&quot; 
end=&quot;2011-12-31T00:00Z&quot;/&gt;
+            &lt;locations&gt;
+                &lt;location type=&quot;data&quot;
+                          
path=&quot;wasb://[email protected]/data/${YEAR}-${MONTH}-${DAY}-${HOUR}&quot;/&gt;
+            &lt;/locations&gt;
+        &lt;/cluster&gt;
+    &lt;/clusters&gt;
+
+</pre></div></div>
+<div class="section">
 <h4>Relation between feed's retention limit and feed's late arrival cut off 
period:<a 
name="Relation_between_feeds_retention_limit_and_feeds_late_arrival_cut_off_period:"></a></h4>
 <p>For reasons that are obvious, Falcon has an external validation that 
ensures that the user always specifies the feed retention limit to be more than 
the feed's allowed late arrival period. If this rule is violated by the user, 
the feed submission call itself throws back an error.</p></div>
 <div class="section">
@@ -656,58 +675,6 @@ validity start=&quot;2009-01-01T00:00Z&q
 <h3>Idempotency<a name="Idempotency"></a></h3>
 <p>All the operations in Falcon are Idempotent. That is if you make same 
request to the falcon server / prism again you will get a SUCCESSFUL return if 
it was SUCCESSFUL in the first attempt. For example, you submit a new process / 
feed and get SUCCESSFUL message return. Now if you run the same command / api 
request on same entity you will again get a SUCCESSFUL message. Same is true 
for other operations like schedule, kill, suspend and resume. Idempotency also 
by takes care of the condition when request is sent through prism and fails on 
one or more servers. For example prism is configured to send request to 3 
servers. First user sends a request to SUBMIT a process on all 3 of them, and 
receives a response SUCCESSFUL from all of them. Then due to some issue one of 
the servers goes down, and user send a request to schedule the submitted 
process. This time he will receive a response with PARTIAL status and a FAILURE 
message from the server that has gone down. If the users check he wi
 ll find the process would have been started and running on the 2 SUCCESSFUL 
servers. Now the issue with server is figured out and it is brought up. Sending 
the SCHEDULE request again through prism will result in a SUCCESSFUL response 
from prism as well as other three servers, but this time PROCESS will be 
SCHEDULED only on the server which had failed earlier and other two will keep 
running as before.</p></div>
 <div class="section">
-<h3>Alerting and Monitoring<a name="Alerting_and_Monitoring"></a></h3></div>
-<div class="section">
-<h4>Alerting<a name="Alerting"></a></h4>
-<p>Falcon provides monitoring of various events by capturing metrics of those 
events. The metric numbers can then be used to monitor performance and health 
of the Falcon system and the entire processing pipelines.</p>
-<p>Users can view the logs of these events in the metric.log file, by default 
this file is created under ${user.dir}/logs/ directory. Users may also extend 
the Falcon monitoring framework to send events to systems like 
Mondemand/lwes.</p>
-<p>The following events are captured by Falcon for logging the metrics:</p>
-<ol style="list-style-type: decimal">
-<li>New cluster definitions posted to Falcon (success &amp; failures)</li>
-<li>New feed definition posted to Falcon (success &amp; failures)</li>
-<li>New process definition posted to Falcon (success &amp; failures)</li>
-<li>Process update events (success &amp; failures)</li>
-<li>Feed update events (success &amp; failures)</li>
-<li>Cluster update events (success &amp; failures)</li>
-<li>Process suspend events (success &amp; failures)</li>
-<li>Feed suspend events (success &amp; failures)</li>
-<li>Process resume events (success &amp; failures)</li>
-<li>Feed resume events (success &amp; failures)</li>
-<li>Process remove events (success &amp; failures)</li>
-<li>Feed remove events (success &amp; failures)</li>
-<li>Cluster remove events (success &amp; failures)</li>
-<li>Process instance kill events (success &amp; failures)</li>
-<li>Process instance re-run events (success &amp; failures)</li>
-<li>Process instance generation events</li>
-<li>Process instance failure events</li>
-<li>Process instance auto-retry events</li>
-<li>Process instance retry exhaust events</li>
-<li>Feed instance deletion event</li>
-<li>Feed instance deletion failure event (no retries)</li>
-<li>Feed instance replication event</li>
-<li>Feed instance replication failure event</li>
-<li>Feed instance replication auto-retry event</li>
-<li>Feed instance replication retry exhaust event</li>
-<li>Feed instance late arrival event</li>
-<li>Feed instance post cut-off arrival event</li>
-<li>Process re-run due to late feed event</li>
-<li>Transaction rollback failed event</li></ol>
-<p>The metric logged for an event has the following properties:</p>
-<ol style="list-style-type: decimal">
-<li>Action - Name of the event.</li>
-<li>Dimensions - A list of name/value pairs of various attributes for a given 
action.</li>
-<li>Status- Status of an action FAILED/SUCCEEDED.</li>
-<li>Time-taken - Time taken in nanoseconds for a given action.</li></ol>
-<p>An example for an event logged for a submit of a new process definition:</p>
-<p>2012-05-04 12:23:34,026 {Action:submit, Dimensions:{entityType=process}, 
Status: SUCCEEDED, Time-taken:97087000 ns}</p>
-<p>Users may parse the metric.log or capture these events from custom 
monitoring frameworks and can plot various graphs  or send alerts according to 
their requirements.</p></div>
-<div class="section">
-<h4>Notifications<a name="Notifications"></a></h4>
-<p>Falcon creates a JMS topic for every process/feed that is scheduled in 
Falcon. The implementation class and the broker url of the JMS engine are read 
from the dependent cluster's definition. Users may register consumers on the 
required topic to check the availability or status of feed instances.</p>
-<p>For a given process that is scheduled, the name of the topic is same as the 
process name. Falcon sends a Map message for every feed produced by the 
instance of a process to the JMS topic. The JMS MapMessage sent to a topic has 
the following properties: entityName, feedNames, feedInstancePath, workflowId, 
runId, nominalTime, timeStamp, brokerUrl, brokerImplClass, entityType, 
operation, logFile, topicName, status, brokerTTL;</p>
-<p>For a given feed that is scheduled, the name of the topic is same as the 
feed name. Falcon sends a map message for every feed instance that is 
deleted/archived/replicated depending upon the retention policy set in the feed 
definition. The JMS MapMessage sent to a topic has the following properties: 
entityName, feedNames, feedInstancePath, workflowId, runId, nominalTime, 
timeStamp, brokerUrl, brokerImplClass, entityType, operation, logFile, 
topicName, status, brokerTTL;</p>
-<p>The JMS messages are automatically purged after a certain period (default 3 
days) by the Falcon JMS house-keeping service.TTL (Time-to-live) for JMS 
message can be configured in the Falcon's startup.properties file.</p></div>
-<div class="section">
 <h3>Falcon EL Expressions<a name="Falcon_EL_Expressions"></a></h3>
 <p>Falcon expression language can be used in process definition for giving the 
start and end instance for various feeds.</p>
 <p>Before going into how to use falcon EL expressions it is necessary to 
understand what does instance and instance start time refer to with respect to 
Falcon.</p>
@@ -805,15 +772,21 @@ validity start=&quot;2009-01-01T00:00Z&q
 <pre>
 config name: *.application.services
 config value: org.apache.falcon.metadata.MetadataMappingService
-&lt;verbatim&gt;
 
-Lineage is only captured for Process executions. A future release will capture 
lineage for
-lifecycle policies such as replication and retention.
-
---++ Security
-
-Security is detailed in [[Security][Security]].
-</pre></div></div>
+</pre></div>
+<p>Lineage is only captured for Process executions. A future release will 
capture lineage for lifecycle policies such as replication and 
retention.</p></div>
+<div class="section">
+<h3>Security<a name="Security"></a></h3>
+<p>Security is detailed in <a href="./Security.html">Security</a>.</p></div>
+<div class="section">
+<h3>Recipes<a name="Recipes"></a></h3>
+<p>Recipes is detailed in <a href="./Recipes.html">Recipes</a>.</p></div>
+<div class="section">
+<h3>Monitoring<a name="Monitoring"></a></h3>
+<p>Monitoring and Operationalizing Falcon is detailed in <a 
href="./Operability.html">Operability</a>.</p></div>
+<div class="section">
+<h3>Backwards Compatibility<a name="Backwards_Compatibility"></a></h3>
+<p>Backwards compatibility instructions are <a 
href="./Compatibility.html">detailed here.</a></p></div>
                   </div>
           </div>
 

Modified: falcon/site/HiveIntegration.html
URL: 
http://svn.apache.org/viewvc/falcon/site/HiveIntegration.html?rev=1660589&r1=1660588&r2=1660589&view=diff
==============================================================================
--- falcon/site/HiveIntegration.html (original)
+++ falcon/site/HiveIntegration.html Wed Feb 18 10:55:56 2015
@@ -1,13 +1,13 @@
 <!DOCTYPE html>
 <!--
- | Generated by Apache Maven Doxia at 2015-01-11
+ | Generated by Apache Maven Doxia at 2015-02-18
  | Rendered using Apache Maven Fluido Skin 1.3.0
 -->
 <html xmlns="http://www.w3.org/1999/xhtml"; xml:lang="en" lang="en">
   <head>
     <meta charset="UTF-8" />
     <meta name="viewport" content="width=device-width, initial-scale=1.0" />
-    <meta name="Date-Revision-yyyymmdd" content="20150111" />
+    <meta name="Date-Revision-yyyymmdd" content="20150218" />
     <meta http-equiv="Content-Language" content="en" />
     <title>Falcon - Hive Integration</title>
     <link rel="stylesheet" href="./css/apache-maven-fluido-1.3.0.min.css" />
@@ -245,7 +245,7 @@
         
                 
                     
-                  <li id="publishDate" class="pull-right">Last Published: 
2015-01-11</li> 
+                  <li id="publishDate" class="pull-right">Last Published: 
2015-02-18</li> 
             
                             </ul>
       </div>
@@ -288,8 +288,8 @@ catalog.service.impl=org.apache.falcon.c
 <p>Hence, Falcon for Hive support needs Oozie 4.x.</p></div>
 <div class="section">
 <h3>Oozie Shared Library setup<a name="Oozie_Shared_Library_setup"></a></h3>
-<p>Falcon post Hive integration depends heavily on the <a class="externalLink" 
href="http://oozie.apache.org/docs/4.0.0/WorkflowFunctionalSpec.html#a17_HDFS_Share_Libraries_for_Workflow_Applications_since_Oozie_2.3";>shared
 library feature of Oozie</a>. Since the sheer number of jars for HCatalog, Pig 
and Hive are in the many 10s in numbers, its quite daunting to redistribute the 
dependent jars from Falcon.</p>
-<p><a class="externalLink" 
href="http://oozie.apache.org/docs/4.0.0/DG_QuickStart.html#Oozie_Share_Lib_Installation";>This
 is a one time effort in Oozie setup and is quite straightforward.</a></p></div>
+<p>Falcon post Hive integration depends heavily on the <a class="externalLink" 
href="http://oozie.apache.org/docs/4.0.1/WorkflowFunctionalSpec.html#a17_HDFS_Share_Libraries_for_Workflow_Applications_since_Oozie_2.3";>shared
 library feature of Oozie</a>. Since the sheer number of jars for HCatalog, Pig 
and Hive are in the many 10s in numbers, its quite daunting to redistribute the 
dependent jars from Falcon.</p>
+<p><a class="externalLink" 
href="http://oozie.apache.org/docs/4.0.1/DG_QuickStart.html#Oozie_Share_Lib_Installation";>This
 is a one time effort in Oozie setup and is quite straightforward.</a></p></div>
 <div class="section">
 <h3>Approach<a name="Approach"></a></h3></div>
 <div class="section">
@@ -442,7 +442,7 @@ org.apache.hadoop.hive.ql.parse.ImportSe
         &lt;interface type=&quot;execute&quot; 
endpoint=&quot;localhost:10300&quot;
                    version=&quot;1.1.1&quot; /&gt;
         &lt;interface type=&quot;workflow&quot; 
endpoint=&quot;http://localhost:11010/oozie/&quot;
-                   version=&quot;3.3.0&quot; /&gt;
+                   version=&quot;4.0.1&quot; /&gt;
         &lt;interface type=&quot;registry&quot; 
endpoint=&quot;thrift://localhost:19083&quot;
                    version=&quot;0.11.0&quot; /&gt;
         &lt;interface type=&quot;messaging&quot; 
endpoint=&quot;tcp://localhost:61616?daemon=true&quot;
@@ -475,7 +475,7 @@ org.apache.hadoop.hive.ql.parse.ImportSe
         &lt;interface type=&quot;execute&quot; 
endpoint=&quot;localhost:20300&quot;
                    version=&quot;1.1.1&quot; /&gt;
         &lt;interface type=&quot;workflow&quot; 
endpoint=&quot;http://localhost:11020/oozie/&quot;
-                   version=&quot;3.3.0&quot; /&gt;
+                   version=&quot;4.0.1&quot; /&gt;
         &lt;interface type=&quot;registry&quot; 
endpoint=&quot;thrift://localhost:29083&quot;
                    version=&quot;0.11.0&quot; /&gt;
         &lt;interface type=&quot;messaging&quot; 
endpoint=&quot;tcp://localhost:61616?daemon=true&quot;

Modified: falcon/site/InstallationSteps.html
URL: 
http://svn.apache.org/viewvc/falcon/site/InstallationSteps.html?rev=1660589&r1=1660588&r2=1660589&view=diff
==============================================================================
--- falcon/site/InstallationSteps.html (original)
+++ falcon/site/InstallationSteps.html Wed Feb 18 10:55:56 2015
@@ -1,13 +1,13 @@
 <!DOCTYPE html>
 <!--
- | Generated by Apache Maven Doxia at 2015-01-11
+ | Generated by Apache Maven Doxia at 2015-02-18
  | Rendered using Apache Maven Fluido Skin 1.3.0
 -->
 <html xmlns="http://www.w3.org/1999/xhtml"; xml:lang="en" lang="en">
   <head>
     <meta charset="UTF-8" />
     <meta name="viewport" content="width=device-width, initial-scale=1.0" />
-    <meta name="Date-Revision-yyyymmdd" content="20150111" />
+    <meta name="Date-Revision-yyyymmdd" content="20150218" />
     <meta http-equiv="Content-Language" content="en" />
     <title>Falcon - Building & Installing Falcon</title>
     <link rel="stylesheet" href="./css/apache-maven-fluido-1.3.0.min.css" />
@@ -245,7 +245,7 @@
         
                 
                     
-                  <li id="publishDate" class="pull-right">Last Published: 
2015-01-11</li> 
+                  <li id="publishDate" class="pull-right">Last Published: 
2015-02-18</li> 
             
                             </ul>
       </div>
@@ -260,15 +260,22 @@
 <h4>Building Falcon<a name="Building_Falcon"></a></h4>
 <div class="source">
 <pre>
+You would need the following installed to build Falcon
+
+* JDK 1.7
+* Maven 3.x
+
 git clone https://git-wip-us.apache.org/repos/asf/falcon.git falcon
 
 cd falcon
 
-export MAVEN_OPTS=&quot;-Xmx1024m -XX:MaxPermSize=256m&quot; &amp;&amp; mvn 
clean install [For hadoop 1]
-export MAVEN_OPTS=&quot;-Xmx1024m -XX:MaxPermSize=256m&quot; &amp;&amp; mvn 
clean install -Phadoop-2 [For hadoop 2]
+export MAVEN_OPTS=&quot;-Xmx1024m -XX:MaxPermSize=256m -noverify&quot; 
&amp;&amp; mvn clean install
 
 [optionally -Dhadoop.version=&lt;&lt;hadoop.version&gt;&gt; can be appended to 
build for a specific version of hadoop]
-[optionally -Doozie.version=&lt;&lt;oozie version&gt;&gt; can be appended to 
build with a specific version of oozie. Oozie versions &gt;= 
3.oozie-3.2.0-incubating are supported]
+*Note:* Falcon drops support for Hadoop-1 and only supports Hadoop-2 from 
Falcon 0.6 onwards
+[optionally -Doozie.version=&lt;&lt;oozie version&gt;&gt; can be appended to 
build with a specific version of oozie.
+Oozie versions &gt;= 4 are supported]
+Falcon build with JDK 1.7 using -noverify option
 
 
 </pre></div>
@@ -277,12 +284,11 @@ export MAVEN_OPTS=&quot;-Xmx1024m -XX:Ma
 <div class="source">
 <pre>
 
-mvn clean assembly:assembly -DskipTests -DskipCheck=true [For hadoop 1]
-mvn clean assembly:assembly -DskipTests -DskipCheck=true -P hadoop-2 [For 
hadoop 2]
+mvn clean assembly:assembly -DskipTests -DskipCheck=true
 
 
 </pre></div>
-<p>Tar can be found in {project 
dir}/target/falcon-${project.version}-bin.tar.gz</p>
+<p>Tar can be found in {project 
dir}/target/apache-falcon-${project.version}-bin.tar.gz</p>
 <p>Tar is structured as follows</p>
 <div class="source">
 <pre>
@@ -318,12 +324,11 @@ mvn clean assembly:assembly -DskipTests
 <div class="source">
 <pre>
 
-mvn clean assembly:assembly -DskipTests -DskipCheck=true 
-Pdistributed,hadoop-1 [For hadoop 1]
-mvn clean assembly:assembly -DskipTests -DskipCheck=true 
-Pdistributed,hadoop-2 [For hadoop 2]
+mvn clean assembly:assembly -DskipTests -DskipCheck=true -Pdistributed,hadoop-2
 
 
 </pre></div>
-<p>Tar can be found in {project 
dir}/target/falcon-distributed-${project.version}-server.tar.gz</p>
+<p>Tar can be found in {project 
dir}/target/apache-falcon-distributed-${project.version}-server.tar.gz</p>
 <p>Tar is structured as follows</p>
 <div class="source">
 <pre>
@@ -412,15 +417,27 @@ cd falcon-distributed-${project.version}
 #export FALCON_EXPANDED_WEBAPP_DIR=
 
 </pre></div>
+<p><b>NOTE for Mac OS users</b></p>
+<div class="source">
+<pre>
+If you are using a Mac OS, you will need to configure the FALCON_SERVER_OPTS 
(explained above).
+
+In  {package dir}/conf/falcon-env.sh uncomment the following line
+#export FALCON_SERVER_OPTS=
+
+and change it to look as below
+export FALCON_SERVER_OPTS=&quot;-Djava.awt.headless=true 
-Djava.security.krb5.realm= -Djava.security.krb5.kdc=&quot;
+
+</pre></div>
 <p><b>Starting Falcon Server</b></p>
 <div class="source">
 <pre>
 bin/falcon-start [-port &lt;port&gt;]
 
 </pre></div>
-<p>By default,  * falcon server starts at port 15443 (https) by default . To 
change the port, use -port option</p>
+<p>By default, * If falcon.enableTLS is set to true explicitly or not set at 
all, falcon starts at port 15443 on <a class="externalLink" 
href="https://";>https://</a> by default. * If falcon.enableTLS is set to false 
explicitly, falcon starts at port 15000 on <a class="externalLink" 
href="http://.";>http://.</a> * To change the port, use -port option.</p>
 <ul>
-<li>falcon.enableTLS can be set to true or false explicitly to enable SSL, if 
not port that end with 443 will automatically put falcon on <a 
class="externalLink" href="https://";>https://</a></li></ul>* falcon server 
starts embedded active mq. To control this behaviour, set the following system 
properties using -D option in environment variable FALCON_OPTS:
+<li>If falcon.enableTLS is not set explicitly, port that ends with 443 will 
automatically put falcon on <a class="externalLink" 
href="https://.";>https://.</a> Any other port will put falcon on <a 
class="externalLink" href="http://.";>http://.</a></li></ul>* falcon server 
starts embedded active mq. To control this behaviour, set the following system 
properties using -D option in environment variable FALCON_OPTS:
 <ul>
 <li>falcon.embeddedmq=&lt;true/false&gt; - Should server start embedded active 
mq, default true</li>
 <li>falcon.embeddedmq.port=&lt;port&gt; - Port for embedded active mq, default 
61616</li>
@@ -450,6 +467,18 @@ bin/falcon help
 </pre></div>
 <p><b>Dashboard</b></p>
 <p>Once falcon / prism is started, you can view the status of falcon entities 
using the Web-based dashboard. The web UI works in both distributed and 
embedded mode. You can open your browser at the corresponding port to use the 
web UI.</p>
+<p>Falcon dashboard makes the REST api calls as user 
&quot;falcon-dashboard&quot;. If this user does not exist on your falcon and 
oozie servers, please create the user.</p>
+<div class="source">
+<pre>
+## create user.
+[root@falconhost ~] useradd -U -m falcon-dashboard -G users
+
+## verify user is created with membership in correct groups.
+[root@falconhost ~] groups falcon-dashboard
+falcon-dashboard : falcon-dashboard users
+[root@falconhost ~]
+
+</pre></div>
 <p><b>Stopping Falcon Server</b></p>
 <div class="source">
 <pre>
@@ -469,9 +498,10 @@ bin/prism-stop
 cd &lt;&lt;project home&gt;&gt;
 src/bin/package.sh &lt;&lt;hadoop-version&gt;&gt; &lt;&lt;oozie-version&gt;&gt;
 
-&gt;&gt; ex. src/bin/package.sh 1.1.2 3.1.3-incubating or src/bin/package.sh 
0.20.2-cdh3u5 4.0.0
-&gt;&gt; Falcon package is available in &lt;&lt;falcon 
home&gt;&gt;/target/falcon-&lt;&lt;version&gt;&gt;-bin.tar.gz
-&gt;&gt; Oozie package is available in &lt;&lt;falcon 
home&gt;&gt;/target/oozie-3.3.2-distro.tar.gz
+&gt;&gt; ex. src/bin/package.sh 1.1.2 4.0.1 or src/bin/package.sh 
0.20.2-cdh3u5 4.0.1
+&gt;&gt; ex. src/bin/package.sh 2.5.0 4.0.0
+&gt;&gt; Falcon package is available in &lt;&lt;falcon 
home&gt;&gt;/target/apache-falcon-&lt;&lt;version&gt;&gt;-bin.tar.gz
+&gt;&gt; Oozie package is available in &lt;&lt;falcon 
home&gt;&gt;/target/oozie-4.0.1-distro.tar.gz
 
 </pre></div></div>
 <div class="section">
@@ -481,7 +511,7 @@ src/bin/package.sh &lt;&lt;hadoop-versio
 bin/falcon-start
 
 </pre></div>
-<p>Make sure the hadoop and oozie endpoints are according to your setup in 
examples/entity/filesystem/standalone-cluster.xml</p>
+<p>Make sure the hadoop and oozie endpoints are according to your setup in 
examples/entity/filesystem/standalone-cluster.xml The cluster locations,staging 
and working dirs, MUST be created prior to submitting a cluster entity to 
Falcon. <b>staging</b> must have 777 permissions and the parent dirs must have 
execute permissions <b>working</b> must have 755 permissions and the parent 
dirs must have execute permissions</p>
 <div class="source">
 <pre>
 bin/falcon entity -submit -type cluster -file 
examples/entity/filesystem/standalone-cluster.xml

Added: falcon/site/MigrationInstructions.html
URL: 
http://svn.apache.org/viewvc/falcon/site/MigrationInstructions.html?rev=1660589&view=auto
==============================================================================
--- falcon/site/MigrationInstructions.html (added)
+++ falcon/site/MigrationInstructions.html Wed Feb 18 10:55:56 2015
@@ -0,0 +1,293 @@
+<!DOCTYPE html>
+<!--
+ | Generated by Apache Maven Doxia at 2015-02-18
+ | Rendered using Apache Maven Fluido Skin 1.3.0
+-->
+<html xmlns="http://www.w3.org/1999/xhtml"; xml:lang="en" lang="en">
+  <head>
+    <meta charset="UTF-8" />
+    <meta name="viewport" content="width=device-width, initial-scale=1.0" />
+    <meta name="Date-Revision-yyyymmdd" content="20150218" />
+    <meta http-equiv="Content-Language" content="en" />
+    <title>Falcon - Migration Instructions</title>
+    <link rel="stylesheet" href="./css/apache-maven-fluido-1.3.0.min.css" />
+    <link rel="stylesheet" href="./css/site.css" />
+    <link rel="stylesheet" href="./css/print.css" media="print" />
+
+      
+    <script type="text/javascript" 
src="./js/apache-maven-fluido-1.3.0.min.js"></script>
+
+                          
+        
+<script type="text/javascript">$( document ).ready( function() { $( 
'.carousel' ).carousel( { interval: 3500 } ) } );</script>
+          
+            </head>
+        <body class="topBarEnabled">
+          
+                        
+                    
+                
+
+    <div id="topbar" class="navbar navbar-fixed-top ">
+      <div class="navbar-inner">
+                                  <div class="container" style="width: 
68%;"><div class="nav-collapse">
+            
+                
+                                <ul class="nav">
+                          <li class="dropdown">
+        <a href="#" class="dropdown-toggle" data-toggle="dropdown">Falcon <b 
class="caret"></b></a>
+        <ul class="dropdown-menu">
+        
+                      <li>      <a href="index.html"  title="About">About</a>
+</li>
+                  
+                      <li>      <a 
href="https://cwiki.apache.org/confluence/display/FALCON/Index";  
title="Wiki">Wiki</a>
+</li>
+                  
+                      <li>      <a 
href="https://cwiki.apache.org/confluence/display/FALCON/News";  
title="News">News</a>
+</li>
+                  
+                      <li>      <a 
href="https://git-wip-us.apache.org/repos/asf/falcon.git";  title="Git">Git</a>
+</li>
+                  
+                      <li>      <a 
href="https://svn.apache.org/repos/asf/falcon";  title="Svn">Svn</a>
+</li>
+                  
+                      <li>      <a 
href="https://issues.apache.org/jira/browse/FALCON";  title="Jira">Jira</a>
+</li>
+                  
+                      <li>      <a 
href="https://cwiki.apache.org/confluence/display/FALCON/PoweredBy";  
title="Powered by">Powered by</a>
+</li>
+                  
+                      <li>      <a 
href="https://cwiki.apache.org/confluence/display/FALCON/Acknowledgements";  
title="Acknowledgements">Acknowledgements</a>
+</li>
+                  
+                      <li>      <a href="http://blogs.apache.org/falcon/";  
title="Blog">Blog</a>
+</li>
+                          </ul>
+      </li>
+                <li class="dropdown">
+        <a href="#" class="dropdown-toggle" data-toggle="dropdown">Project 
Information <b class="caret"></b></a>
+        <ul class="dropdown-menu">
+        
+                      <li>      <a href="project-info.html"  
title="Summary">Summary</a>
+</li>
+                  
+                      <li>      <a href="mail-lists.html"  title="Mailing 
Lists">Mailing Lists</a>
+</li>
+                  
+                      <li>      <a 
href="http://webchat.freenode.net?channels=apachefalcon&uio=d4";  
title="IRC">IRC</a>
+</li>
+                  
+                      <li>      <a href="team-list.html"  title="Team">Team</a>
+</li>
+                  
+                      <li>      <a href="issue-tracking.html"  title="Issue 
Tracking">Issue Tracking</a>
+</li>
+                  
+                      <li>      <a href="source-repository.html"  
title="Source Repository">Source Repository</a>
+</li>
+                  
+                      <li>      <a href="license.html"  
title="License">License</a>
+</li>
+                          </ul>
+      </li>
+                <li class="dropdown">
+        <a href="#" class="dropdown-toggle" data-toggle="dropdown">Releases <b 
class="caret"></b></a>
+        <ul class="dropdown-menu">
+        
+                      <li>      <a 
href="http://www.apache.org/dyn/closer.cgi/falcon/0.6-incubating";  
title="0.6-incubating">0.6-incubating</a>
+</li>
+                  
+                      <li>      <a 
href="http://archive.apache.org/dist/incubator/falcon/0.5-incubating";  
title="0.5-incubating">0.5-incubating</a>
+</li>
+                  
+                      <li>      <a 
href="http://archive.apache.org/dist/incubator/falcon/0.4-incubating";  
title="0.4-incubating">0.4-incubating</a>
+</li>
+                  
+                      <li>      <a 
href="http://archive.apache.org/dist/incubator/falcon/0.3-incubating";  
title="0.3-incubating">0.3-incubating</a>
+</li>
+                  
+                      <li>      <a 
href="https://cwiki.apache.org/confluence/display/FALCON/Roadmap";  
title="Coming soon">Coming soon</a>
+</li>
+                          </ul>
+      </li>
+                <li class="dropdown">
+        <a href="#" class="dropdown-toggle" 
data-toggle="dropdown">Documentation <b class="caret"></b></a>
+        <ul class="dropdown-menu">
+        
+                      <li>      <a href="0.6-incubating/index.html"  
title="current">current</a>
+</li>
+                  
+                      <li>      <a href="0.6-incubating/index.html"  
title="0.6-incubating">0.6-incubating</a>
+</li>
+                  
+                      <li>      <a href="0.5-incubating/index.html"  
title="0.5-incubating">0.5-incubating</a>
+</li>
+                  
+                      <li>      <a href="0.4-incubating/index.html"  
title="0.4-incubating">0.4-incubating</a>
+</li>
+                  
+                      <li>      <a href="0.3-incubating/index.html"  
title="0.3-incubating">0.3-incubating</a>
+</li>
+                          </ul>
+      </li>
+                <li class="dropdown">
+        <a href="#" class="dropdown-toggle" data-toggle="dropdown">Resources 
<b class="caret"></b></a>
+        <ul class="dropdown-menu">
+        
+                      <li>      <a href="slides/falcon-overview.html"  
title="Overview">Overview</a>
+</li>
+                  
+                      <li>      <a href="slides/falcon-user-guide.html"  
title="User Guide">User Guide</a>
+</li>
+                  
+                      <li>      <a href="GettingStarted.html"  title="Getting 
Started">Getting Started</a>
+</li>
+                  
+                      <li>      <a href="FalconDocumentation.html"  
title="Architecture">Architecture</a>
+</li>
+                  
+                      <li>      <a href="InstallationSteps.html"  
title="Installation">Installation</a>
+</li>
+                  
+                      <li>      <a href="OnBoarding.html"  title="On 
Boarding">On Boarding</a>
+</li>
+                  
+                      <li>      <a href="EntitySpecification.html"  
title="Entity Specification">Entity Specification</a>
+</li>
+                  
+                      <li>      <a href="FalconCLI.html"  title="Client 
(Falcon CLI)">Client (Falcon CLI)</a>
+</li>
+                  
+                      <li>      <a href="restapi/ResourceList.html"  
title="Rest API">Rest API</a>
+</li>
+                  
+                      <li>      <a href="HiveIntegration.html"  title="Hive 
Integration">Hive Integration</a>
+</li>
+                  
+                      <li>      <a href="Security.html"  
title="Security">Security</a>
+</li>
+                          </ul>
+      </li>
+                <li class="dropdown">
+        <a href="#" class="dropdown-toggle" data-toggle="dropdown">ASF <b 
class="caret"></b></a>
+        <ul class="dropdown-menu">
+        
+                      <li>      <a 
href="http://www.apache.org/foundation/how-it-works.html";  title="How Apache 
Works">How Apache Works</a>
+</li>
+                  
+                      <li>      <a href="http://www.apache.org/foundation/";  
title="Foundation">Foundation</a>
+</li>
+                  
+                      <li>      <a 
href="http://www.apache.org/foundation/sponsorship.html";  title="Sponsoring 
Apache">Sponsoring Apache</a>
+</li>
+                  
+                      <li>      <a 
href="http://www.apache.org/foundation/thanks.html";  title="Thanks">Thanks</a>
+</li>
+                          </ul>
+      </li>
+                  </ul>
+          
+          
+                            
+            
+            
+            
+    <iframe 
src="http://www.facebook.com/plugins/like.php?href=http://falcon.apache.org/falcon-website-general&send=false&layout=button_count&show-faces=false&action=like&colorscheme=dark";
+        scrolling="no" frameborder="0"
+        style="border:none; width:80px; height:20px; margin-top: 10px;"  
class="pull-right" ></iframe>
+                        
+    <script type="text/javascript" 
src="https://apis.google.com/js/plusone.js";></script>
+
+        <ul class="nav pull-right"><li style="margin-top: 10px;">
+    
+    <div class="g-plusone" 
data-href="http://falcon.apache.org/falcon-website-general"; data-size="medium"  
width="60px" align="right" ></div>
+
+        </li></ul>
+                              
+                   
+                      </div>
+          
+        </div>
+      </div>
+    </div>
+    
+        <div class="container">
+          <div id="banner">
+        <div class="pull-left">
+                                                  <a href="../index.html" 
id="bannerLeft">
+                                                                               
                 <img src="images/falcon-logo.png"  alt="Falcon" width="200px" 
height="45px"/>
+                </a>
+                      </div>
+        <div class="pull-right">                  <a 
href="http://www.apache.org"; id="bannerRight">
+                                                                               
                 <img src="images/apache-feather-tm.gif"  alt="Falcon" 
height="45px"/>
+                </a>
+      </div>
+        <div class="clear"><hr/></div>
+      </div>
+
+      <div id="breadcrumbs">
+        <ul class="breadcrumb">
+                
+                    
+                              <li class="">
+                    <a href="http://www.apache.org"; class="externalLink" 
title="Apache">
+        Apache</a>
+        </li>
+      <li class="divider ">/</li>
+            <li class="">
+                    <a href="index.html" title="Falcon">
+        Falcon</a>
+        </li>
+      <li class="divider ">/</li>
+        <li class="">Migration Instructions</li>
+        
+                
+                    
+                  <li id="publishDate" class="pull-right">Last Published: 
2015-02-18</li> 
+            
+                            </ul>
+      </div>
+
+      
+                        
+        <div id="bodyColumn" >
+                                  
+            <div class="section">
+<h2>Migration Instructions<a name="Migration_Instructions"></a></h2></div>
+<div class="section">
+<h3>Migrate from 0.5-incubating to 0.6-incubating<a 
name="Migrate_from_0.5-incubating_to_0.6-incubating"></a></h3>
+<p>This is a placeholder wiki for migration instructions from falcon 
0.5-incubating to 0.6-incubating.</p></div>
+<div class="section">
+<h4>Update Entities<a name="Update_Entities"></a></h4></div>
+<div class="section">
+<h4>Change cluster dir permissions<a 
name="Change_cluster_dir_permissions"></a></h4></div>
+<div class="section">
+<h4>Enable/Disable TLS<a name="EnableDisable_TLS"></a></h4></div>
+<div class="section">
+<h4>Authorization<a name="Authorization"></a></h4></div>
+                  </div>
+          </div>
+
+    <hr/>
+
+    <footer>
+            <div class="container">
+              <div class="row span12">Copyright &copy;                    
2013-2015
+                        <a href="http://www.apache.org";>Apache Software 
Foundation</a>.
+            All Rights Reserved.      
+                    
+      </div>
+
+                          
+                <p id="poweredBy" class="pull-right">
+                          <a href="http://maven.apache.org/"; title="Built by 
Maven" class="poweredBy">
+        <img class="builtBy" alt="Built by Maven" 
src="./images/logos/maven-feather.png" />
+      </a>
+              </p>
+        
+                </div>
+    </footer>
+  </body>
+</html>

Modified: falcon/site/OnBoarding.html
URL: 
http://svn.apache.org/viewvc/falcon/site/OnBoarding.html?rev=1660589&r1=1660588&r2=1660589&view=diff
==============================================================================
--- falcon/site/OnBoarding.html (original)
+++ falcon/site/OnBoarding.html Wed Feb 18 10:55:56 2015
@@ -1,13 +1,13 @@
 <!DOCTYPE html>
 <!--
- | Generated by Apache Maven Doxia at 2015-01-11
+ | Generated by Apache Maven Doxia at 2015-02-18
  | Rendered using Apache Maven Fluido Skin 1.3.0
 -->
 <html xmlns="http://www.w3.org/1999/xhtml"; xml:lang="en" lang="en">
   <head>
     <meta charset="UTF-8" />
     <meta name="viewport" content="width=device-width, initial-scale=1.0" />
-    <meta name="Date-Revision-yyyymmdd" content="20150111" />
+    <meta name="Date-Revision-yyyymmdd" content="20150218" />
     <meta http-equiv="Content-Language" content="en" />
     <title>Falcon - Contents</title>
     <link rel="stylesheet" href="./css/apache-maven-fluido-1.3.0.min.css" />
@@ -245,7 +245,7 @@
         
                 
                     
-                  <li id="publishDate" class="pull-right">Last Published: 
2015-01-11</li> 
+                  <li id="publishDate" class="pull-right">Last Published: 
2015-02-18</li> 
             
                             </ul>
       </div>
@@ -276,7 +276,7 @@
 <h4>Sample Pipeline<a name="Sample_Pipeline"></a></h4></div>
 <div class="section">
 <h5>Cluster   <a name="Cluster"></a></h5>
-<p>Cluster definition that contains end points for name node, job tracker, 
oozie and jms server:</p>
+<p>Cluster definition that contains end points for name node, job tracker, 
oozie and jms server: The cluster locations MUST be created prior to submitting 
a cluster entity to Falcon. <b>staging</b> must have 777 permissions and the 
parent dirs must have execute permissions <b>working</b> must have 755 
permissions and the parent dirs must have execute permissions</p>
 <div class="source">
 <pre>
 &lt;?xml version=&quot;1.0&quot;?&gt;
@@ -286,13 +286,13 @@
 &lt;cluster colo=&quot;ua2&quot; description=&quot;&quot; 
name=&quot;corp&quot; xmlns=&quot;uri:falcon:cluster:0.1&quot;
     xmlns:xsi=&quot;http://www.w3.org/2001/XMLSchema-instance&quot;&gt;    
     &lt;interfaces&gt;
-        &lt;interface type=&quot;readonly&quot; 
endpoint=&quot;hftp://name-node.com:50070&quot; 
version=&quot;0.20.2-cdh3u0&quot; /&gt;
+        &lt;interface type=&quot;readonly&quot; 
endpoint=&quot;hftp://name-node.com:50070&quot; version=&quot;2.5.0&quot; /&gt;
 
-        &lt;interface type=&quot;write&quot; 
endpoint=&quot;hdfs://name-node.com:54310&quot; 
version=&quot;0.20.2-cdh3u0&quot; /&gt;
+        &lt;interface type=&quot;write&quot; 
endpoint=&quot;hdfs://name-node.com:54310&quot; version=&quot;2.5.0&quot; /&gt;
 
-        &lt;interface type=&quot;execute&quot; 
endpoint=&quot;job-tracker:54311&quot; version=&quot;0.20.2-cdh3u0&quot; /&gt;
+        &lt;interface type=&quot;execute&quot; 
endpoint=&quot;job-tracker:54311&quot; version=&quot;2.5.0&quot; /&gt;
 
-        &lt;interface type=&quot;workflow&quot; 
endpoint=&quot;http://oozie.com:11000/oozie/&quot; version=&quot;3.1.4&quot; 
/&gt;
+        &lt;interface type=&quot;workflow&quot; 
endpoint=&quot;http://oozie.com:11000/oozie/&quot; version=&quot;4.0.1&quot; 
/&gt;
 
         &lt;interface type=&quot;messaging&quot; 
endpoint=&quot;tcp://jms-server.com:61616?daemon=true&quot; 
version=&quot;5.1.6&quot; /&gt;
     &lt;/interfaces&gt;

Added: falcon/site/Operability.html
URL: 
http://svn.apache.org/viewvc/falcon/site/Operability.html?rev=1660589&view=auto
==============================================================================
--- falcon/site/Operability.html (added)
+++ falcon/site/Operability.html Wed Feb 18 10:55:56 2015
@@ -0,0 +1,343 @@
+<!DOCTYPE html>
+<!--
+ | Generated by Apache Maven Doxia at 2015-02-18
+ | Rendered using Apache Maven Fluido Skin 1.3.0
+-->
+<html xmlns="http://www.w3.org/1999/xhtml"; xml:lang="en" lang="en">
+  <head>
+    <meta charset="UTF-8" />
+    <meta name="viewport" content="width=device-width, initial-scale=1.0" />
+    <meta name="Date-Revision-yyyymmdd" content="20150218" />
+    <meta http-equiv="Content-Language" content="en" />
+    <title>Falcon - Operationalizing Falcon</title>
+    <link rel="stylesheet" href="./css/apache-maven-fluido-1.3.0.min.css" />
+    <link rel="stylesheet" href="./css/site.css" />
+    <link rel="stylesheet" href="./css/print.css" media="print" />
+
+      
+    <script type="text/javascript" 
src="./js/apache-maven-fluido-1.3.0.min.js"></script>
+
+                          
+        
+<script type="text/javascript">$( document ).ready( function() { $( 
'.carousel' ).carousel( { interval: 3500 } ) } );</script>
+          
+            </head>
+        <body class="topBarEnabled">
+          
+                        
+                    
+                
+
+    <div id="topbar" class="navbar navbar-fixed-top ">
+      <div class="navbar-inner">
+                                  <div class="container" style="width: 
68%;"><div class="nav-collapse">
+            
+                
+                                <ul class="nav">
+                          <li class="dropdown">
+        <a href="#" class="dropdown-toggle" data-toggle="dropdown">Falcon <b 
class="caret"></b></a>
+        <ul class="dropdown-menu">
+        
+                      <li>      <a href="index.html"  title="About">About</a>
+</li>
+                  
+                      <li>      <a 
href="https://cwiki.apache.org/confluence/display/FALCON/Index";  
title="Wiki">Wiki</a>
+</li>
+                  
+                      <li>      <a 
href="https://cwiki.apache.org/confluence/display/FALCON/News";  
title="News">News</a>
+</li>
+                  
+                      <li>      <a 
href="https://git-wip-us.apache.org/repos/asf/falcon.git";  title="Git">Git</a>
+</li>
+                  
+                      <li>      <a 
href="https://svn.apache.org/repos/asf/falcon";  title="Svn">Svn</a>
+</li>
+                  
+                      <li>      <a 
href="https://issues.apache.org/jira/browse/FALCON";  title="Jira">Jira</a>
+</li>
+                  
+                      <li>      <a 
href="https://cwiki.apache.org/confluence/display/FALCON/PoweredBy";  
title="Powered by">Powered by</a>
+</li>
+                  
+                      <li>      <a 
href="https://cwiki.apache.org/confluence/display/FALCON/Acknowledgements";  
title="Acknowledgements">Acknowledgements</a>
+</li>
+                  
+                      <li>      <a href="http://blogs.apache.org/falcon/";  
title="Blog">Blog</a>
+</li>
+                          </ul>
+      </li>
+                <li class="dropdown">
+        <a href="#" class="dropdown-toggle" data-toggle="dropdown">Project 
Information <b class="caret"></b></a>
+        <ul class="dropdown-menu">
+        
+                      <li>      <a href="project-info.html"  
title="Summary">Summary</a>
+</li>
+                  
+                      <li>      <a href="mail-lists.html"  title="Mailing 
Lists">Mailing Lists</a>
+</li>
+                  
+                      <li>      <a 
href="http://webchat.freenode.net?channels=apachefalcon&uio=d4";  
title="IRC">IRC</a>
+</li>
+                  
+                      <li>      <a href="team-list.html"  title="Team">Team</a>
+</li>
+                  
+                      <li>      <a href="issue-tracking.html"  title="Issue 
Tracking">Issue Tracking</a>
+</li>
+                  
+                      <li>      <a href="source-repository.html"  
title="Source Repository">Source Repository</a>
+</li>
+                  
+                      <li>      <a href="license.html"  
title="License">License</a>
+</li>
+                          </ul>
+      </li>
+                <li class="dropdown">
+        <a href="#" class="dropdown-toggle" data-toggle="dropdown">Releases <b 
class="caret"></b></a>
+        <ul class="dropdown-menu">
+        
+                      <li>      <a 
href="http://www.apache.org/dyn/closer.cgi/falcon/0.6-incubating";  
title="0.6-incubating">0.6-incubating</a>
+</li>
+                  
+                      <li>      <a 
href="http://archive.apache.org/dist/incubator/falcon/0.5-incubating";  
title="0.5-incubating">0.5-incubating</a>
+</li>
+                  
+                      <li>      <a 
href="http://archive.apache.org/dist/incubator/falcon/0.4-incubating";  
title="0.4-incubating">0.4-incubating</a>
+</li>
+                  
+                      <li>      <a 
href="http://archive.apache.org/dist/incubator/falcon/0.3-incubating";  
title="0.3-incubating">0.3-incubating</a>
+</li>
+                  
+                      <li>      <a 
href="https://cwiki.apache.org/confluence/display/FALCON/Roadmap";  
title="Coming soon">Coming soon</a>
+</li>
+                          </ul>
+      </li>
+                <li class="dropdown">
+        <a href="#" class="dropdown-toggle" 
data-toggle="dropdown">Documentation <b class="caret"></b></a>
+        <ul class="dropdown-menu">
+        
+                      <li>      <a href="0.6-incubating/index.html"  
title="current">current</a>
+</li>
+                  
+                      <li>      <a href="0.6-incubating/index.html"  
title="0.6-incubating">0.6-incubating</a>
+</li>
+                  
+                      <li>      <a href="0.5-incubating/index.html"  
title="0.5-incubating">0.5-incubating</a>
+</li>
+                  
+                      <li>      <a href="0.4-incubating/index.html"  
title="0.4-incubating">0.4-incubating</a>
+</li>
+                  
+                      <li>      <a href="0.3-incubating/index.html"  
title="0.3-incubating">0.3-incubating</a>
+</li>
+                          </ul>
+      </li>
+                <li class="dropdown">
+        <a href="#" class="dropdown-toggle" data-toggle="dropdown">Resources 
<b class="caret"></b></a>
+        <ul class="dropdown-menu">
+        
+                      <li>      <a href="slides/falcon-overview.html"  
title="Overview">Overview</a>
+</li>
+                  
+                      <li>      <a href="slides/falcon-user-guide.html"  
title="User Guide">User Guide</a>
+</li>
+                  
+                      <li>      <a href="GettingStarted.html"  title="Getting 
Started">Getting Started</a>
+</li>
+                  
+                      <li>      <a href="FalconDocumentation.html"  
title="Architecture">Architecture</a>
+</li>
+                  
+                      <li>      <a href="InstallationSteps.html"  
title="Installation">Installation</a>
+</li>
+                  
+                      <li>      <a href="OnBoarding.html"  title="On 
Boarding">On Boarding</a>
+</li>
+                  
+                      <li>      <a href="EntitySpecification.html"  
title="Entity Specification">Entity Specification</a>
+</li>
+                  
+                      <li>      <a href="FalconCLI.html"  title="Client 
(Falcon CLI)">Client (Falcon CLI)</a>
+</li>
+                  
+                      <li>      <a href="restapi/ResourceList.html"  
title="Rest API">Rest API</a>
+</li>
+                  
+                      <li>      <a href="HiveIntegration.html"  title="Hive 
Integration">Hive Integration</a>
+</li>
+                  
+                      <li>      <a href="Security.html"  
title="Security">Security</a>
+</li>
+                          </ul>
+      </li>
+                <li class="dropdown">
+        <a href="#" class="dropdown-toggle" data-toggle="dropdown">ASF <b 
class="caret"></b></a>
+        <ul class="dropdown-menu">
+        
+                      <li>      <a 
href="http://www.apache.org/foundation/how-it-works.html";  title="How Apache 
Works">How Apache Works</a>
+</li>
+                  
+                      <li>      <a href="http://www.apache.org/foundation/";  
title="Foundation">Foundation</a>
+</li>
+                  
+                      <li>      <a 
href="http://www.apache.org/foundation/sponsorship.html";  title="Sponsoring 
Apache">Sponsoring Apache</a>
+</li>
+                  
+                      <li>      <a 
href="http://www.apache.org/foundation/thanks.html";  title="Thanks">Thanks</a>
+</li>
+                          </ul>
+      </li>
+                  </ul>
+          
+          
+                            
+            
+            
+            
+    <iframe 
src="http://www.facebook.com/plugins/like.php?href=http://falcon.apache.org/falcon-website-general&send=false&layout=button_count&show-faces=false&action=like&colorscheme=dark";
+        scrolling="no" frameborder="0"
+        style="border:none; width:80px; height:20px; margin-top: 10px;"  
class="pull-right" ></iframe>
+                        
+    <script type="text/javascript" 
src="https://apis.google.com/js/plusone.js";></script>
+
+        <ul class="nav pull-right"><li style="margin-top: 10px;">
+    
+    <div class="g-plusone" 
data-href="http://falcon.apache.org/falcon-website-general"; data-size="medium"  
width="60px" align="right" ></div>
+
+        </li></ul>
+                              
+                   
+                      </div>
+          
+        </div>
+      </div>
+    </div>
+    
+        <div class="container">
+          <div id="banner">
+        <div class="pull-left">
+                                                  <a href="../index.html" 
id="bannerLeft">
+                                                                               
                 <img src="images/falcon-logo.png"  alt="Falcon" width="200px" 
height="45px"/>
+                </a>
+                      </div>
+        <div class="pull-right">                  <a 
href="http://www.apache.org"; id="bannerRight">
+                                                                               
                 <img src="images/apache-feather-tm.gif"  alt="Falcon" 
height="45px"/>
+                </a>
+      </div>
+        <div class="clear"><hr/></div>
+      </div>
+
+      <div id="breadcrumbs">
+        <ul class="breadcrumb">
+                
+                    
+                              <li class="">
+                    <a href="http://www.apache.org"; class="externalLink" 
title="Apache">
+        Apache</a>
+        </li>
+      <li class="divider ">/</li>
+            <li class="">
+                    <a href="index.html" title="Falcon">
+        Falcon</a>
+        </li>
+      <li class="divider ">/</li>
+        <li class="">Operationalizing Falcon</li>
+        
+                
+                    
+                  <li id="publishDate" class="pull-right">Last Published: 
2015-02-18</li> 
+            
+                            </ul>
+      </div>
+
+      
+                        
+        <div id="bodyColumn" >
+                                  
+            <div class="section">
+<h2>Operationalizing Falcon<a name="Operationalizing_Falcon"></a></h2></div>
+<div class="section">
+<h3>Overview<a name="Overview"></a></h3>
+<p>Apache Falcon provides various tools to operationalize Falcon consisting of 
Alerts for unrecoverable errors, Audits of user actions, Metrics, and 
Notifications. They are detailed below.</p></div>
+<div class="section">
+<h3>Monitoring<a name="Monitoring"></a></h3>
+<p>Falcon provides monitoring of various events by capturing metrics of those 
events. The metric numbers can then be used to monitor performance and health 
of the Falcon system and the entire processing pipelines.</p>
+<p>Users can view the logs of these events in the metric.log file, by default 
this file is created under ${user.dir}/logs/ directory. Users may also extend 
the Falcon monitoring framework to send events to systems like Mondemand/lwes 
by implementingorg.apache.falcon.plugin.MonitoringPlugin interface.</p>
+<p>The following events are captured by Falcon for logging the metrics:</p>
+<ol style="list-style-type: decimal">
+<li>New cluster definitions posted to Falcon (success &amp; failures)</li>
+<li>New feed definition posted to Falcon (success &amp; failures)</li>
+<li>New process definition posted to Falcon (success &amp; failures)</li>
+<li>Process update events (success &amp; failures)</li>
+<li>Feed update events (success &amp; failures)</li>
+<li>Cluster update events (success &amp; failures)</li>
+<li>Process suspend events (success &amp; failures)</li>
+<li>Feed suspend events (success &amp; failures)</li>
+<li>Process resume events (success &amp; failures)</li>
+<li>Feed resume events (success &amp; failures)</li>
+<li>Process remove events (success &amp; failures)</li>
+<li>Feed remove events (success &amp; failures)</li>
+<li>Cluster remove events (success &amp; failures)</li>
+<li>Process instance kill events (success &amp; failures)</li>
+<li>Process instance re-run events (success &amp; failures)</li>
+<li>Process instance generation events</li>
+<li>Process instance failure events</li>
+<li>Process instance auto-retry events</li>
+<li>Process instance retry exhaust events</li>
+<li>Feed instance deletion event</li>
+<li>Feed instance deletion failure event (no retries)</li>
+<li>Feed instance replication event</li>
+<li>Feed instance replication failure event</li>
+<li>Feed instance replication auto-retry event</li>
+<li>Feed instance replication retry exhaust event</li>
+<li>Feed instance late arrival event</li>
+<li>Feed instance post cut-off arrival event</li>
+<li>Process re-run due to late feed event</li>
+<li>Transaction rollback failed event</li></ol>
+<p>The metric logged for an event has the following properties:</p>
+<ol style="list-style-type: decimal">
+<li>Action - Name of the event.</li>
+<li>Dimensions - A list of name/value pairs of various attributes for a given 
action.</li>
+<li>Status- Status of an action FAILED/SUCCEEDED.</li>
+<li>Time-taken - Time taken in nanoseconds for a given action.</li></ol>
+<p>An example for an event logged for a submit of a new process definition:</p>
+<p>2012-05-04 12:23:34,026 {Action:submit, Dimensions:{entityType=process}, 
Status: SUCCEEDED, Time-taken:97087000 ns}</p>
+<p>Users may parse the metric.log or capture these events from custom 
monitoring frameworks and can plot various graphs or send alerts according to 
their requirements.</p></div>
+<div class="section">
+<h3>Notifications<a name="Notifications"></a></h3>
+<p>Falcon creates a JMS topic for every process/feed that is scheduled in 
Falcon. The implementation class and the broker url of the JMS engine are read 
from the dependent cluster's definition. Users may register consumers on the 
required topic to check the availability or status of feed instances.</p>
+<p>For a given process that is scheduled, the name of the topic is same as the 
process name. Falcon sends a Map message for every feed produced by the 
instance of a process to the JMS topic. The JMS MapMessage sent to a topic has 
the following properties: entityName, feedNames, feedInstancePath, workflowId, 
runId, nominalTime, timeStamp, brokerUrl, brokerImplClass, entityType, 
operation, logFile, topicName, status, brokerTTL;</p>
+<p>For a given feed that is scheduled, the name of the topic is same as the 
feed name. Falcon sends a map message for every feed instance that is 
deleted/archived/replicated depending upon the retention policy set in the feed 
definition. The JMS MapMessage sent to a topic has the following properties: 
entityName, feedNames, feedInstancePath, workflowId, runId, nominalTime, 
timeStamp, brokerUrl, brokerImplClass, entityType, operation, logFile, 
topicName, status, brokerTTL;</p>
+<p>The JMS messages are automatically purged after a certain period (default 3 
days) by the Falcon JMS house-keeping service.TTL (Time-to-live) for JMS 
message can be configured in the Falcon's startup.properties file.</p></div>
+<div class="section">
+<h3>Alerts<a name="Alerts"></a></h3>
+<p>Falcon generates alerts for unrecoverable errors into a log file by 
default. Users can view these alerts in the alerts.log file, by default this 
file is created under ${user.dir}/logs/ directory.</p>
+<p>Users may also extend the Falcon Alerting plugin to send events to systems 
like Nagios, etc. by extending org.apache.falcon.plugin.AlertingPlugin 
interface.</p></div>
+<div class="section">
+<h3>Audits<a name="Audits"></a></h3>
+<p>Falcon audits all user activity and captures them into a log file by 
default. Users can view these audits in the audit.log file, by default this 
file is created under ${user.dir}/logs/ directory.</p>
+<p>Users may also extend the Falcon Audit plugin to send audits to systems 
like Apache Argus, etc. by extending org.apache.falcon.plugin.AuditingPlugin 
interface.</p></div>
+                  </div>
+          </div>
+
+    <hr/>
+
+    <footer>
+            <div class="container">
+              <div class="row span12">Copyright &copy;                    
2013-2015
+                        <a href="http://www.apache.org";>Apache Software 
Foundation</a>.
+            All Rights Reserved.      
+                    
+      </div>
+
+                          
+                <p id="poweredBy" class="pull-right">
+                          <a href="http://maven.apache.org/"; title="Built by 
Maven" class="poweredBy">
+        <img class="builtBy" alt="Built by Maven" 
src="./images/logos/maven-feather.png" />
+      </a>
+              </p>
+        
+                </div>
+    </footer>
+  </body>
+</html>

Modified: falcon/site/Security.html
URL: 
http://svn.apache.org/viewvc/falcon/site/Security.html?rev=1660589&r1=1660588&r2=1660589&view=diff
==============================================================================
--- falcon/site/Security.html (original)
+++ falcon/site/Security.html Wed Feb 18 10:55:56 2015
@@ -1,13 +1,13 @@
 <!DOCTYPE html>
 <!--
- | Generated by Apache Maven Doxia at 2015-01-11
+ | Generated by Apache Maven Doxia at 2015-02-18
  | Rendered using Apache Maven Fluido Skin 1.3.0
 -->
 <html xmlns="http://www.w3.org/1999/xhtml"; xml:lang="en" lang="en">
   <head>
     <meta charset="UTF-8" />
     <meta name="viewport" content="width=device-width, initial-scale=1.0" />
-    <meta name="Date-Revision-yyyymmdd" content="20150111" />
+    <meta name="Date-Revision-yyyymmdd" content="20150218" />
     <meta http-equiv="Content-Language" content="en" />
     <title>Falcon - Securing Falcon</title>
     <link rel="stylesheet" href="./css/apache-maven-fluido-1.3.0.min.css" />
@@ -245,7 +245,7 @@
         
                 
                     
-                  <li id="publishDate" class="pull-right">Last Published: 
2015-01-11</li> 
+                  <li id="publishDate" class="pull-right">Last Published: 
2015-02-18</li> 
             
                             </ul>
       </div>
@@ -289,19 +289,40 @@
 <div class="section">
 <h4>Super-User<a name="Super-User"></a></h4>
 <p>The super-user is the user with the same identity as falcon process itself. 
Loosely, if you started the falcon, then you are the super-user. The super-user 
can do anything in that permissions checks never fail for the super-user. There 
is no persistent notion of who was the super-user; when the falcon is started 
the process identity determines who is the super-user for now. The Falcon 
super-user does not have to be the super-user of the falcon host, nor is it 
necessary that all clusters have the same super-user. Also, an experimenter 
running Falcon on a personal workstation, conveniently becomes that 
installation's super-user without any configuration.</p>
-<p>Falcon also allows users to configure a super user group and allows users 
belonging to this group to be a super user.</p></div>
+<p>Falcon also allows users to configure a super user group and allows users 
belonging to this group to be a super user.</p>
+<p>ACL owner and group must be valid even if the authenticated user is a 
super-user.</p></div>
 <div class="section">
 <h4>Group Memberships<a name="Group_Memberships"></a></h4>
 <p>Once a user has been authenticated and a username has been determined, the 
list of groups is determined by a group mapping service, configured by the 
hadoop.security.group.mapping property in Hadoop. The default implementation, 
org.apache.hadoop.security.ShellBasedUnixGroupsMapping, will shell out to the 
Unix bash -c groups command to resolve a list of groups for a user.</p>
-<p>Note that Falcon stores the user and group of an Entity as strings; there 
is no conversion from user and group identity numbers as is conventional in 
Unix.</p></div>
+<p>Note that Falcon stores the user and group of an Entity as strings; there 
is no conversion from user and group identity numbers as is conventional in 
Unix.</p>
+<p>The only limitation is that a user cannot add a group in ACL that he does 
not belong to.</p></div>
 <div class="section">
 <h4>Authorization Provider<a name="Authorization_Provider"></a></h4>
 <p>Falcon provides a plugin-able provider interface for Authorization. It also 
ships with a default implementation that enforces the following authorization 
policy.</p></div>
 <div class="section">
 <h5>Entity and Instance Management Operations Policy<a 
name="Entity_and_Instance_Management_Operations_Policy"></a></h5>
-<p>* All Entity and Instance operations are authorized for users who created 
them, Owners and users with group memberships * Reference to entities with in a 
feed or process is allowed with out enforcing permissions Any Feed or Process 
can refer to a Cluster entity not owned by the Feed or Process owner Any 
Process can refer to a Feed entity not owned by the Process owner</p>
+<p></p>
+<ul>
+<li>All Entity and Instance operations are authorized for users who created 
them, Owners and users with group memberships</li>
+<li>Reference to entities with in a feed or process is allowed with out 
enforcing permissions</li></ul>
+<p>Any Feed or Process can refer to a Cluster entity not owned by the Feed or 
Process owner. Any Process can refer to a Feed entity not owned by the Process 
owner</p>
 <p>The authorization is enforced in the following way:</p>
-<p>if admin resource,      if authenticated user name matches the admin users 
configuration      Else if groups of the authenticated user matches the admin 
groups configuration      Else authorization exception is thrown Else if 
entities or instance resource      if the authenticated user matches the owner 
in ACL for the entity      Else if the groups of the authenticated user matches 
the group in ACL for the entity      Else authorization exception is thrown 
Else if lineage resource      All have read-only permissions, reason being 
folks should be able to examine the dependency      and allow reuse</p>
+<p></p>
+<ul>
+<li>if admin resource,
+<ul>
+<li>If authenticated user name matches the admin users configuration</li>
+<li>Else if groups of the authenticated user matches the admin groups 
configuration</li>
+<li>Else authorization exception is thrown</li></ul></li>
+<li>Else if entities or instance resource
+<ul>
+<li>If the authenticated user matches the owner in ACL for the entity</li>
+<li>Else if the groups of the authenticated user matches the group in ACL for 
the entity</li>
+<li>Else authorization exception is thrown</li></ul></li>
+<li>Else if lineage resource
+<ul>
+<li>All have read-only permissions, reason being folks should be able to 
examine the dependency and allow reuse</li></ul></li></ul>
+<p>To authenticate user for REST api calls, user should append 
&quot;user.name=&lt;username&gt;&quot; to the query.</p>
 <p><b>operations on Entity Resource</b></p>
 <p></p>
 <table border="0" class="table table-striped">
@@ -414,7 +435,7 @@
 <td>Admin User/Group</td></tr></table></div>
 <div class="section">
 <h5>Lineage Resource Policy<a name="Lineage_Resource_Policy"></a></h5>
-<p>Lineage is read-only and hence all users can look at lineage for their 
respective entities.</p></div>
+<p>Lineage is read-only and hence all users can look at lineage for their 
respective entities. <b>Note:</b> This gap will be fixed in a later 
release.</p></div>
 <div class="section">
 <h3>Authentication Configuration<a 
name="Authentication_Configuration"></a></h3>
 <p>Following is the Server Side Configuration Setup for 
Authentication.</p></div>
@@ -472,6 +493,9 @@
 # Comma separated list of black listed users
 *.falcon.http.authentication.blacklisted.users=
 
+# Increase Jetty request buffer size to accommodate the generated Kerberos 
token
+*.falcon.jetty.request.buffer.size=16192
+
 </pre></div></div>
 <div class="section">
 <h4>Pseudo/Simple Configuration<a name="PseudoSimple_Configuration"></a></h4>
@@ -572,22 +596,32 @@
 <td>Configuration Store</td>
 <td>${config.store.uri}</td>
 <td>falcon</td>
-<td>750</td></tr>
+<td>700</td></tr>
 <tr class="a">
-<td>Oozie coord/bundle XMLs</td>
-<td>${cluster.staging-location}/workflows/{entity}/{entity-name}</td>
+<td>Cluster Staging Location</td>
+<td>${cluster.staging-location}</td>
 <td>falcon</td>
-<td>644</td></tr>
+<td>777</td></tr>
 <tr class="b">
+<td>Cluster Working Location</td>
+<td>${cluster.working-location}</td>
+<td>falcon</td>
+<td>755</td></tr>
+<tr class="a">
 <td>Shared libs</td>
 <td>{cluster.working}/{lib,libext}</td>
 <td>falcon</td>
 <td>755</td></tr>
+<tr class="b">
+<td>Oozie coord/bundle XMLs</td>
+<td>${cluster.staging-location}/workflows/{entity}/{entity-name}</td>
+<td>$user</td>
+<td>cluster umask</td></tr>
 <tr class="a">
 <td>App logs</td>
 <td>${cluster.staging-location}/workflows/{entity}/{entity-name}/logs</td>
-<td>falcon</td>
-<td>777</td></tr></table></div>
+<td>$user</td>
+<td>cluster umask</td></tr></table><b>Note:</b> Please note that the cluster 
staging and working locations MUST be created prior to submitting a cluster 
entity to Falcon. Also, note that the the parent dirs must have execute 
permissions.</div>
 <div class="section">
 <h3>Backwards compatibility<a name="Backwards_compatibility"></a></h3></div>
 <div class="section">
@@ -606,13 +640,11 @@
 <p>The blacklist users used to have the following super users: hdfs, 
mapreduce, oozie, and falcon. The list is externalized from code into 
Startup.properties file and is empty now and needs to be configured 
specifically in the file.</p></div>
 <div class="section">
 <h4>Falcon Dashboard<a name="Falcon_Dashboard"></a></h4>
-<p>The dashboard assumes an anonymous user in Pseudo/Simple method and hence 
anonymous users must be enabled for it to work.</p>
-<div class="source">
-<pre>
-# Indicates if anonymous requests are allowed when using 'simple' 
authentication.
-*.falcon.http.authentication.simple.anonymous.allowed=true
-
-</pre></div>
+<p>To initialize the current user for dashboard, user should append query 
param &quot;user.name=&lt;username&gt;&quot; to the REST api call.</p>
+<p>If dashboard user wishes to change the current user, they should do the 
following.</p>
+<ul>
+<li>delete the hadoop.auth cookie from browser cache.</li>
+<li>append query param &quot;user.name=&lt;new_user&gt;&quot; to the next REST 
API call.</li></ul>
 <p>In Kerberos method, the browser must support HTTP Kerberos SPNEGO.</p></div>
 <div class="section">
 <h3>Known Limitations<a name="Known_Limitations"></a></h3>


Reply via email to