http://git-wip-us.apache.org/repos/asf/flink/blob/47070674/docs/ops/config.md
----------------------------------------------------------------------
diff --git a/docs/ops/config.md b/docs/ops/config.md
new file mode 100644
index 0000000..e84bd46
--- /dev/null
+++ b/docs/ops/config.md
@@ -0,0 +1,713 @@
+---
+title: "Configuration"
+nav-id: "config"
+nav-parent_id: ops
+nav-pos: 4
+---
+<!--
+Licensed to the Apache Software Foundation (ASF) under one
+or more contributor license agreements.  See the NOTICE file
+distributed with this work for additional information
+regarding copyright ownership.  The ASF licenses this file
+to you under the Apache License, Version 2.0 (the
+"License"); you may not use this file except in compliance
+with the License.  You may obtain a copy of the License at
+
+  http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing,
+software distributed under the License is distributed on an
+"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+KIND, either express or implied.  See the License for the
+specific language governing permissions and limitations
+under the License.
+-->
+
+**For single-node setups Flink is ready to go out of the box and you don't 
need to change the default configuration to get started.**
+
+The out-of-the-box configuration will use your default Java installation. You can set the environment variable `JAVA_HOME` or the configuration key `env.java.home` in `conf/flink-conf.yaml` if you want to manually override the Java runtime to use.
+
+This page lists the most common options that are typically needed to set up a well-performing (distributed) installation, followed by a full reference of all available configuration parameters. An example configuration combining the most common options is shown at the end of the next section.
+
+All configuration is done in `conf/flink-conf.yaml`, which is expected to be a 
flat collection of [YAML key value 
pairs](http://www.yaml.org/spec/1.2/spec.html) with format `key: value`.
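+
+For example, a minimal `conf/flink-conf.yaml` could look like the following sketch (the host name and sizes are purely illustrative):
+
+```yaml
+# Address of the JobManager (master) node -- hypothetical host name
+jobmanager.rpc.address: flink-master
+jobmanager.rpc.port: 6123
+
+# Heap sizes in megabytes -- illustrative values, not recommendations
+jobmanager.heap.mb: 1024
+taskmanager.heap.mb: 4096
+
+# Parallelism settings
+taskmanager.numberOfTaskSlots: 4
+parallelism.default: 8
+```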
+
+The system and run scripts parse the config at startup time. Changes to the 
configuration file require restarting the Flink JobManager and TaskManagers.
+
+The configuration files for the TaskManagers can differ; Flink does not assume uniform machines in the cluster.
+
+* This will be replaced by the TOC
+{:toc}
+
+## Common Options
+
+- `env.java.home`: The path to the Java installation to use (DEFAULT: system's 
default Java installation, if found). Needs to be specified if the startup 
scripts fail to automatically resolve the java home directory. Can be specified 
to point to a specific java installation or version. If this option is not 
specified, the startup scripts also evaluate the `$JAVA_HOME` environment 
variable.
+
+- `env.java.opts`: Set custom JVM options. This value is respected by Flink's start scripts, both JobManager and TaskManager, and Flink's YARN client. This can be used to set different garbage collectors or to include remote debuggers into the JVMs running Flink's services. Enclosing options in double quotes delays parameter substitution, allowing access to variables from Flink's startup scripts. Use `env.java.opts.jobmanager` and `env.java.opts.taskmanager` for JobManager or TaskManager-specific options, respectively.
+
+- `env.java.opts.jobmanager`: JobManager-specific JVM options. These are used 
in addition to the regular `env.java.opts`.
+
+- `env.java.opts.taskmanager`: TaskManager-specific JVM options. These are 
used in addition to the regular `env.java.opts`.
+
+- `jobmanager.rpc.address`: The external address of the JobManager, which is 
the master/coordinator of the distributed system (DEFAULT: localhost). 
**Note:** The address (host name or IP) should be accessible by all nodes 
including the client.
+
+- `jobmanager.rpc.port`: The port number of the JobManager (DEFAULT: 6123).
+
+- `jobmanager.heap.mb`: JVM heap size (in megabytes) for the JobManager. You 
may have to increase the heap size for the JobManager if you are running very 
large applications (with many operators), or if you are keeping a long history 
of them.
+
+- `taskmanager.heap.mb`: JVM heap size (in megabytes) for the TaskManagers, 
which are the parallel workers of the system. In contrast to Hadoop, Flink runs 
operators (e.g., join, aggregate) and user-defined functions (e.g., Map, 
Reduce, CoGroup) inside the TaskManager (including sorting/hashing/caching), so 
this value should be as large as possible. If the cluster is exclusively 
running Flink, the total amount of available memory per machine minus some 
memory for the operating system (maybe 1-2 GB) is a good value. On YARN setups, 
this value is automatically configured to the size of the TaskManager's YARN 
container, minus a certain tolerance value.
+
+- `taskmanager.numberOfTaskSlots`: The number of parallel operator or user 
function instances that a single TaskManager can run (DEFAULT: 1). If this 
value is larger than 1, a single TaskManager takes multiple instances of a 
function or operator. That way, the TaskManager can utilize multiple CPU cores, 
but at the same time, the available memory is divided between the different 
operator or function instances. This value is typically proportional to the 
number of physical CPU cores that the TaskManager's machine has (e.g., equal to 
the number of cores, or half the number of cores). [More about task 
slots](config.html#configuring-taskmanager-processing-slots).
+
+- `parallelism.default`: The default parallelism to use for programs that have no parallelism specified (DEFAULT: 1). For setups that have no concurrent jobs running, setting this value to NumTaskManagers * NumSlotsPerTaskManager will cause the system to use all available execution resources for the program's execution. **Note**: The default parallelism can be overwritten for an entire job by calling `setParallelism(int parallelism)` on the `ExecutionEnvironment` or by passing `-p <parallelism>` to the Flink command-line frontend. It can be overwritten for single transformations by calling `setParallelism(int parallelism)` on an operator. See [Parallel Execution]({{site.baseurl}}/dev/parallel.html) for more information about parallelism.
+
+- `fs.default-scheme`: The default filesystem scheme to be used, with the 
necessary authority to contact, e.g. the host:port of the NameNode in the case 
of HDFS (if needed).
+By default, this is set to `file:///` which points to the local filesystem. 
This means that the local
+filesystem is going to be used to search for user-specified files **without** 
an explicit scheme
+definition. As another example, if this is set to `hdfs://localhost:9000/`, 
then a user-specified file path
+without explicit scheme definition, such as `/user/USERNAME/in.txt`, is going 
to be transformed into
+`hdfs://localhost:9000/user/USERNAME/in.txt`. This scheme is used **ONLY** if 
no other scheme is specified (explicitly) in the user-provided `URI`.
+
+- `fs.hdfs.hadoopconf`: The absolute path to the Hadoop File System's (HDFS) 
configuration **directory** (OPTIONAL VALUE). Specifying this value allows 
programs to reference HDFS files using short URIs (`hdfs:///path/to/files`, 
without including the address and port of the NameNode in the file URI). 
Without this option, HDFS files can be accessed, but require fully qualified 
URIs like `hdfs://address:port/path/to/files`. This option also causes file 
writers to pick up the HDFS's default values for block sizes and replication 
factors. Flink will look for the "core-site.xml" and "hdfs-site.xml" files in 
the specified directory.
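+
+As a sketch, the common options above could be combined as follows for a hypothetical cluster of three machines with 8 cores and 16 GB of memory each (all paths, host names, and sizes are illustrative assumptions, not recommendations):
+
+```yaml
+env.java.home: /usr/lib/jvm/java-8-openjdk    # hypothetical path
+jobmanager.rpc.address: flink-master          # hypothetical host name
+jobmanager.heap.mb: 1024
+taskmanager.heap.mb: 14336                    # ~14 GB, leaving 1-2 GB for the OS
+taskmanager.numberOfTaskSlots: 8              # one slot per physical core
+parallelism.default: 24                       # 3 TaskManagers * 8 slots
+fs.hdfs.hadoopconf: /etc/hadoop/conf          # hypothetical Hadoop config directory
+```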
+
+## Advanced Options
+
+### Compute
+
+- `taskmanager.compute.numa`: When enabled, a TaskManager is started on each NUMA node for each worker listed in *conf/slaves* (DEFAULT: false). Note: only supported when deploying Flink as a standalone cluster.
+
+### Managed Memory
+
+By default, Flink allocates a fraction of `0.7` of the free memory (total 
memory configured via `taskmanager.heap.mb` minus memory used for network 
buffers) for its managed memory. Managed memory helps Flink to run the batch 
operators efficiently. It prevents `OutOfMemoryException`s because Flink knows 
how much memory it can use to execute operations. If Flink runs out of managed 
memory, it utilizes disk space. Using managed memory, some operations can be 
performed directly on the raw data without having to deserialize the data to 
convert it into Java objects. All in all, managed memory improves the 
robustness and speed of the system.
+
+The default fraction for managed memory can be adjusted using the 
`taskmanager.memory.fraction` parameter. An absolute value may be set using 
`taskmanager.memory.size` (overrides the fraction parameter). If desired, the 
managed memory may be allocated outside the JVM heap. This may improve 
performance in setups with large memory sizes.
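+
+For instance, the defaults could be adjusted as in the following sketch (illustrative values, not recommendations):
+
+```yaml
+# Reserve 80% of the free memory instead of the default fraction of 0.7 ...
+taskmanager.memory.fraction: 0.8
+# ... or reserve a fixed amount in megabytes, which overrides the fraction
+# taskmanager.memory.size: 8192
+
+# Allocate the managed memory outside the JVM heap
+taskmanager.memory.off-heap: true
+taskmanager.memory.preallocate: true
+```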
+
+- `taskmanager.memory.size`: The amount of memory (in megabytes) that the task 
manager reserves on-heap or off-heap (depending on 
`taskmanager.memory.off-heap`) for sorting, hash tables, and caching of 
intermediate results. If unspecified (-1), the memory manager will take a fixed 
ratio with respect to the size of the task manager JVM as specified by 
`taskmanager.memory.fraction`. (DEFAULT: -1)
+
+- `taskmanager.memory.fraction`: The relative amount of memory (with respect 
to `taskmanager.heap.mb`, after subtracting the amount of memory used by 
network buffers) that the task manager reserves for sorting, hash tables, and 
caching of intermediate results. For example, a value of `0.8` means that a 
task manager reserves 80% of its memory (on-heap or off-heap depending on 
`taskmanager.memory.off-heap`) for internal data buffers, leaving 20% of free 
memory for the task manager's heap for objects created by user-defined 
functions. (DEFAULT: 0.7) This parameter is only evaluated, if 
`taskmanager.memory.size` is not set.
+
+- `taskmanager.memory.off-heap`: If set to `true`, the task manager allocates 
memory which is used for sorting, hash tables, and caching of intermediate 
results outside of the JVM heap. For setups with larger quantities of memory, 
this can improve the efficiency of the operations performed on the memory 
(DEFAULT: false).
+
+- `taskmanager.memory.segment-size`: The size of memory buffers used by the 
memory manager and the network stack in bytes (DEFAULT: 32768 (= 32 KiBytes)).
+
+- `taskmanager.memory.preallocate`: Can be either `true` or `false`. Specifies whether task managers should allocate all managed memory when starting up (DEFAULT: false). When `taskmanager.memory.off-heap` is set to `true`, it is advised to set this option to `true` as well. If it is set to `false`, the allocated off-heap memory is cleaned up only when a full GC is triggered upon reaching the configured JVM parameter `MaxDirectMemorySize`. **Note:** For streaming setups, we highly recommend setting this value to `false` as the core state backends currently do not use the managed memory.
+
+### Memory and Performance Debugging
+
+These options are useful for debugging a Flink application for memory and 
garbage collection related issues, such as performance and out-of-memory 
process kills or exceptions.
+
+- `taskmanager.debug.memory.startLogThread`: Causes the TaskManagers to 
periodically log memory and Garbage collection statistics. The statistics 
include current heap-, off-heap, and other memory pool utilization, as well as 
the time spent on garbage collection, by heap memory pool.
+
+- `taskmanager.debug.memory.logIntervalMs`: The interval (in milliseconds) in 
which the TaskManagers log the memory and garbage collection statistics. Only 
has an effect, if `taskmanager.debug.memory.startLogThread` is set to true.
+
+### Kerberos-based Security
+
+Flink supports Kerberos authentication for the following services:
+
++ Hadoop Components, such as HDFS, YARN, or HBase *(version 2.6.1 and above; 
all other versions have critical bugs which might fail the Flink job 
unexpectedly)*.
++ Kafka Connectors *(version 0.9 and above)*.
++ Zookeeper
+
+Configuring Flink for Kerberos security involves three aspects, explained 
separately in the following sub-sections.
+
+##### 1. Providing the cluster with a Kerberos credential (i.e. a keytab or a 
ticket via `kinit`)
+
+To provide the cluster with a Kerberos credential, Flink supports using a 
Kerberos keytab file or ticket caches managed by `kinit`.
+
+- `security.kerberos.login.use-ticket-cache`: Indicates whether to read from 
your Kerberos ticket cache (default: `true`).
+
+- `security.kerberos.login.keytab`: Absolute path to a Kerberos keytab file 
that contains the user credentials.
+
+- `security.kerberos.login.principal`: Kerberos principal name associated with 
the keytab.
+
+If both `security.kerberos.login.keytab` and 
`security.kerberos.login.principal` have values provided, keytabs will be used 
for authentication.
+It is preferable to use keytabs for long-running jobs, to avoid ticket 
expiration issues.   If you prefer to use the ticket cache,
+talk to your administrator about increasing the Hadoop delegation token 
lifetime.
+
+Note that authentication using ticket caches is only supported when deploying 
Flink as a standalone cluster or on YARN.
+
+##### 2. Making the Kerberos credential available to components and connectors 
as needed
+
+For Hadoop components, Flink will automatically detect if the configured 
Kerberos credentials should be used when connecting to HDFS, HBase, and other 
Hadoop components depending on whether Hadoop security is enabled (in 
`core-site.xml`).
+
+For any connector or component that uses a JAAS configuration file, make the 
Kerberos credentials available to them by configuring JAAS login contexts for 
each one respectively, using the following configuration:
+
+- `security.kerberos.login.contexts`: A comma-separated list of login contexts 
to provide the Kerberos credentials to (for example, `Client,KafkaClient` to 
use the credentials for ZooKeeper authentication and for Kafka authentication).
+
+This allows enabling Kerberos authentication for different connectors or 
components independently. For example, you can enable Hadoop security without 
necessitating the use of Kerberos for ZooKeeper, or vice versa.
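+
+Putting the two steps above together, a keytab-based setup that provides credentials to both ZooKeeper and Kafka could look like the following sketch (the keytab path and principal are hypothetical):
+
+```yaml
+security.kerberos.login.use-ticket-cache: false
+security.kerberos.login.keytab: /path/to/flink.keytab      # hypothetical path
+security.kerberos.login.principal: flink-user@EXAMPLE.COM  # hypothetical principal
+security.kerberos.login.contexts: Client,KafkaClient
+```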
+
+You may also provide a static JAAS configuration file using the mechanisms 
described in the [Java SE 
Documentation](http://docs.oracle.com/javase/7/docs/technotes/guides/security/jgss/tutorials/LoginConfigFile.html),
 whose entries will override those produced by the above configuration option.
+
+##### 3. Configuring the component and/or connector to use Kerberos 
authentication
+
+Finally, be sure to configure the connector within your Flink program or 
component as necessary to use Kerberos authentication.
+
+Below is a list of the connectors and components for which Flink currently provides first-class Kerberos support:
+
+- Kafka: see 
[here]({{site.baseurl}}/dev/connectors/kafka.html#enabling-kerberos-authentication-for-versions-above-09-only)
 for details on configuring the Kafka connector to use Kerberos authentication.
+
+- Zookeeper (for HA): see 
[here]({{site.baseurl}}/ops/jobmanager_high_availability.html#configuring-for-zookeeper-security)
 for details on Zookeeper security configuration to work with the 
Kerberos-based security configurations mentioned here.
+
+For more information on how Flink internally sets up Kerberos-based security, please see [here]({{site.baseurl}}/ops/security-kerberos.html).
+
+### Other
+
+- `taskmanager.tmp.dirs`: The directory for temporary files, or a list of 
directories separated by the system's directory delimiter (for example ':' 
(colon) on Linux/Unix). If multiple directories are specified, then the 
temporary files will be distributed across the directories in a round-robin 
fashion. The I/O manager component will spawn one reading and one writing 
thread per directory. A directory may be listed multiple times to have the I/O 
manager use multiple threads for it (for example if it is physically stored on 
a very fast disc or RAID) (DEFAULT: The system's tmp dir).
+
+- `taskmanager.log.path`: The config parameter defining the taskmanager log file location.
+
+- `jobmanager.web.address`: Address of the JobManager's web interface 
(DEFAULT: anyLocalAddress()).
+
+- `jobmanager.web.port`: Port of the JobManager's web interface (DEFAULT: 
8081).
+
+- `jobmanager.web.tmpdir`: This configuration parameter allows defining the Flink web directory to be used by the web interface. The web interface will copy its static files into the directory. Uploaded job JARs are also stored in this directory, unless overridden. By default, the temporary directory is used.
+
+- `jobmanager.web.upload.dir`: The config parameter defining the directory for uploading the job jars. If not specified, a dynamic directory will be used under the directory specified by `jobmanager.web.tmpdir`.
+
+- `fs.overwrite-files`: Specifies whether file output writers should overwrite 
existing files by default. Set to *true* to overwrite by default, *false* 
otherwise. (DEFAULT: false)
+
+- `fs.output.always-create-directory`: File writers running with a parallelism larger than one create a directory for the output file path and put the different result files (one per parallel writer task) into that directory. If this option is set to *true*, writers with a parallelism of 1 will also create a directory and place a single result file into it. If the option is set to *false*, the writer will create the file directly at the output path, without creating a containing directory. (DEFAULT: false)
+
+- `taskmanager.network.memory.fraction`: Fraction of JVM memory to use for network buffers. This determines how many streaming data exchange channels a TaskManager can have at the same time and how well buffered the channels are. If a job is rejected or you get a warning that the system does not have enough buffers available, increase this value or the min/max values below. (DEFAULT: 0.1)
+
+- `taskmanager.network.memory.min`: Minimum memory size for network buffers in 
bytes (DEFAULT: 64 MB)
+
+- `taskmanager.network.memory.max`: Maximum memory size for network buffers in 
bytes (DEFAULT: 1 GB)
+
+- `state.backend`: The backend that will be used to store operator state checkpoints if checkpointing is enabled (see the example configuration after this list). Supported backends:
+   -  `jobmanager`: In-memory state, backup to JobManager's/ZooKeeper's memory. Should be used only for minimal state (Kafka offsets) or testing and local debugging.
+   -  `filesystem`: State is in-memory on the TaskManagers, and state snapshots are stored in a file system. All filesystems supported by Flink can be used, for example HDFS, S3, ...
+
+- `state.backend.fs.checkpointdir`: Directory for storing checkpoints in a 
Flink supported filesystem. Note: State backend must be accessible from the 
JobManager, use `file://` only for local setups.
+
+- `state.backend.rocksdb.checkpointdir`: The local directory for storing RocksDB files, or a list of directories separated by the system's directory delimiter (for example ':' (colon) on Linux/Unix). (DEFAULT value is `taskmanager.tmp.dirs`)
+
+- `state.checkpoints.dir`: The target directory for meta data of [externalized 
checkpoints]({{ site.baseurl 
}}/ops/state/checkpoints.html#externalized-checkpoints).
+
+- `state.checkpoints.num-retained`: The number of completed checkpoint instances to retain. Having more than one allows recovery to fall back to an earlier checkpoint if the latest checkpoint is corrupt. (Default: 1)
+
+- `high-availability.zookeeper.storageDir`: Required for HA. Directory for storing JobManager metadata; this is persisted in the state backend and only a pointer to this state is stored in ZooKeeper. Exactly like the checkpoint directory, it must be accessible from the JobManager, and a local filesystem should only be used for local deployments. Previously this key was named `recovery.zookeeper.storageDir`.
+
+- `blob.storage.directory`: Directory for storing blobs (such as user JARs) on 
the TaskManagers.
+
+- `blob.server.port`: Port definition for the blob server (serving user JARs) 
on the TaskManagers. By default the port is set to 0, which means that the 
operating system is picking an ephemeral port. Flink also accepts a list of 
ports ("50100,50101"), ranges ("50100-50200") or a combination of both. It is 
recommended to set a range of ports to avoid collisions when multiple 
JobManagers are running on the same machine.
+
+- `blob.service.ssl.enabled`: Flag to enable SSL for the BLOB client/server communication. This is applicable only when the global SSL flag `security.ssl.enabled` is set to true (DEFAULT: true).
+
+- `restart-strategy`: Default [restart 
strategy]({{site.baseurl}}/dev/restart_strategies.html) to use in case no
+restart strategy has been specified for the job.
+The options are:
+    - fixed delay strategy: `fixed-delay`.
+    - failure rate strategy: `failure-rate`.
+    - no restarts: `none`
+
+    Default value is `none` unless checkpointing is enabled for the job in 
which case the default is `fixed-delay` with `Integer.MAX_VALUE` restart 
attempts and `10s` delay.
+
+- `restart-strategy.fixed-delay.attempts`: Number of restart attempts, used if 
the default restart strategy is set to "fixed-delay".
+Default value is 1, unless "fixed-delay" was activated by enabling 
checkpoints, in which case the default is `Integer.MAX_VALUE`.
+
+- `restart-strategy.fixed-delay.delay`: Delay between restart attempts, used 
if the default restart strategy is set to "fixed-delay".
+Default value is the `akka.ask.timeout`, unless "fixed-delay" was activated by 
enabling checkpoints, in which case
+the default is 10s.
+
+- `restart-strategy.failure-rate.max-failures-per-interval`: Maximum number of 
restarts in given time interval before failing a job in "failure-rate" strategy.
+Default value is 1.
+
+- `restart-strategy.failure-rate.failure-rate-interval`: Time interval for 
measuring failure rate in "failure-rate" strategy.
+Default value is `1 minute`.
+
+- `restart-strategy.failure-rate.delay`: Delay between restart attempts, used 
if the default restart strategy is set to "failure-rate".
+Default value is the `akka.ask.timeout`.
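+
+As an illustration of the checkpointing and restart options above, a filesystem state backend combined with a fixed-delay restart strategy could be configured as in this sketch (the HDFS paths are hypothetical):
+
+```yaml
+state.backend: filesystem
+state.backend.fs.checkpointdir: hdfs://namenode:9000/flink/checkpoints  # hypothetical path
+state.checkpoints.dir: hdfs://namenode:9000/flink/ext-checkpoints       # hypothetical path
+
+restart-strategy: fixed-delay
+restart-strategy.fixed-delay.attempts: 3
+restart-strategy.fixed-delay.delay: 10 s
+```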
+
+## Full Reference
+
+### HDFS
+
+These parameters configure the default HDFS used by Flink. Setups that do not specify an HDFS configuration have to specify the full path to HDFS files (`hdfs://address:port/path/to/files`). Files will also be written with default HDFS parameters (block size, replication factor).
+
+- `fs.hdfs.hadoopconf`: The absolute path to the Hadoop configuration 
directory. The system will look for the "core-site.xml" and "hdfs-site.xml" 
files in that directory (DEFAULT: null).
+
+- `fs.hdfs.hdfsdefault`: The absolute path of Hadoop's own configuration file 
"hdfs-default.xml" (DEFAULT: null).
+
+- `fs.hdfs.hdfssite`: The absolute path of Hadoop's own configuration file 
"hdfs-site.xml" (DEFAULT: null).
+
+### JobManager &amp; TaskManager
+
+The following parameters configure Flink's JobManager and TaskManagers.
+
+- `jobmanager.rpc.address`: The external address of the JobManager, which is 
the master/coordinator of the distributed system (DEFAULT: **localhost**). 
**Note:** The address (host name or IP) should be accessible by all nodes 
including the client.
+
+- `jobmanager.rpc.port`: The port number of the JobManager (DEFAULT: **6123**).
+
+- `taskmanager.hostname`: The hostname of the network interface that the 
TaskManager binds to. By default, the TaskManager searches for network 
interfaces that can connect to the JobManager and other TaskManagers. This 
option can be used to define a hostname if that strategy fails for some reason. 
Because different TaskManagers need different values for this option, it 
usually is specified in an additional non-shared TaskManager-specific config 
file.
+
+- `taskmanager.rpc.port`: The task manager's IPC port (DEFAULT: **0**, which 
lets the OS choose a free port). Flink also accepts a list of ports 
("50100,50101"), ranges ("50100-50200") or a combination of both. It is 
recommended to set a range of ports to avoid collisions when multiple 
TaskManagers are running on the same machine.
+
+- `taskmanager.data.port`: The task manager's port used for data exchange 
operations (DEFAULT: **0**, which lets the OS choose a free port).
+
+- `taskmanager.data.ssl.enabled`: Enable SSL support for the TaskManager data transport. This is applicable only when the global SSL flag `security.ssl.enabled` is set to true (DEFAULT: **true**).
+
+- `jobmanager.heap.mb`: JVM heap size (in megabytes) for the JobManager 
(DEFAULT: **256**).
+
+- `taskmanager.heap.mb`: JVM heap size (in megabytes) for the TaskManagers, 
which are the parallel workers of the system. In contrast to Hadoop, Flink runs 
operators (e.g., join, aggregate) and user-defined functions (e.g., Map, 
Reduce, CoGroup) inside the TaskManager (including sorting/hashing/caching), so 
this value should be as large as possible (DEFAULT: **512**). On YARN setups, 
this value is automatically configured to the size of the TaskManager's YARN 
container, minus a certain tolerance value.
+
+- `taskmanager.numberOfTaskSlots`: The number of parallel operator or user 
function instances that a single TaskManager can run (DEFAULT: **1**). If this 
value is larger than 1, a single TaskManager takes multiple instances of a 
function or operator. That way, the TaskManager can utilize multiple CPU cores, 
but at the same time, the available memory is divided between the different 
operator or function instances. This value is typically proportional to the 
number of physical CPU cores that the TaskManager's machine has (e.g., equal to 
the number of cores, or half the number of cores).
+
+- `taskmanager.tmp.dirs`: The directory for temporary files, or a list of 
directories separated by the system's directory delimiter (for example ':' 
(colon) on Linux/Unix). If multiple directories are specified, then the 
temporary files will be distributed across the directories in a round robin 
fashion. The I/O manager component will spawn one reading and one writing 
thread per directory. A directory may be listed multiple times to have the I/O 
manager use multiple threads for it (for example if it is physically stored on 
a very fast disc or RAID) (DEFAULT: **The system's tmp dir**).
+
+- `taskmanager.network.memory.fraction`: Fraction of JVM memory to use for network buffers. This determines how many streaming data exchange channels a TaskManager can have at the same time and how well buffered the channels are. If a job is rejected or you get a warning that the system does not have enough buffers available, increase this value or the min/max values below. Also note that `taskmanager.network.memory.min` and `taskmanager.network.memory.max` may override this fraction. (DEFAULT: **0.1**)
+
+- `taskmanager.network.memory.min`: Minimum memory size for network buffers in 
bytes (DEFAULT: **64 MB**). Previously, this was determined from 
`taskmanager.network.numberOfBuffers` and `taskmanager.memory.segment-size`.
+
+- `taskmanager.network.memory.max`: Maximum memory size for network buffers in 
bytes (DEFAULT: **1 GB**). Previously, this was determined from 
`taskmanager.network.numberOfBuffers` and `taskmanager.memory.segment-size`.
+
+- `taskmanager.network.numberOfBuffers` (deprecated, replaced by the three 
parameters above): The number of buffers available to the network stack. This 
number determines how many streaming data exchange channels a TaskManager can 
have at the same time and how well buffered the channels are. If a job is 
rejected or you get a warning that the system has not enough buffers available, 
increase this value (DEFAULT: **2048**). If set, it will be mapped to 
`taskmanager.network.memory.min` and `taskmanager.network.memory.max` based on 
`taskmanager.memory.segment-size`.
+
+- `taskmanager.memory.size`: The amount of memory (in megabytes) that the task 
manager reserves on the JVM's heap space for sorting, hash tables, and caching 
of intermediate results. If unspecified (-1), the memory manager will take a 
fixed ratio of the heap memory available to the JVM, as specified by 
`taskmanager.memory.fraction`. (DEFAULT: **-1**)
+
+- `taskmanager.memory.fraction`: The relative amount of memory (with respect 
to `taskmanager.heap.mb`, after subtracting the amount of memory used by 
network buffers) that the task manager reserves for sorting, hash tables, and 
caching of intermediate results. For example, a value of `0.8` means that a 
task manager reserves 80% of its memory (on-heap or off-heap depending on 
`taskmanager.memory.off-heap`) for internal data buffers, leaving 20% of free 
memory for the task manager's heap for objects created by user-defined 
functions. (DEFAULT: 0.7) This parameter is only evaluated, if 
`taskmanager.memory.size` is not set.
+
+- `taskmanager.debug.memory.startLogThread`: Causes the TaskManagers to 
periodically log memory and Garbage collection statistics. The statistics 
include current heap-, off-heap, and other memory pool utilization, as well as 
the time spent on garbage collection, by heap memory pool.
+
+- `taskmanager.debug.memory.logIntervalMs`: The interval (in milliseconds) in 
which the TaskManagers log the memory and garbage collection statistics. Only 
has an effect, if `taskmanager.debug.memory.startLogThread` is set to true.
+
+- `taskmanager.maxRegistrationDuration`: Defines the maximum time it can take 
for the TaskManager registration. If the duration is exceeded without a 
successful registration, then the TaskManager terminates. The max registration 
duration requires a time unit specifier (ms/s/min/h/d) (e.g. "10 min"). 
(DEFAULT: **Inf**)
+
+- `taskmanager.initial-registration-pause`: The initial registration pause 
between two consecutive registration attempts. The pause is doubled for each 
new registration attempt until it reaches the maximum registration pause. The 
initial registration pause requires a time unit specifier (ms/s/min/h/d) (e.g. 
"5 s"). (DEFAULT: **500 ms**)
+
+- `taskmanager.max-registration-pause`: The maximum registration pause between 
two consecutive registration attempts. The max registration pause requires a 
time unit specifier (ms/s/min/h/d) (e.g. "5 s"). (DEFAULT: **30 s**)
+
+- `taskmanager.refused-registration-pause`: The pause after a registration has 
been refused by the job manager before retrying to connect. The refused 
registration pause requires a time unit specifier (ms/s/min/h/d) (e.g. "5 s"). 
(DEFAULT: **10 s**)
+
+- `taskmanager.jvm-exit-on-oom`: Indicates that the TaskManager should 
immediately terminate the JVM if the task thread throws an `OutOfMemoryError` 
(DEFAULT: **false**).
+
+- `blob.fetch.retries`: The number of retries for the TaskManager to download 
BLOBs (such as JAR files) from the JobManager (DEFAULT: **50**).
+
+- `blob.fetch.num-concurrent`: The number of concurrent BLOB fetches (such as JAR file downloads) that the JobManager serves (DEFAULT: **50**).
+
+- `blob.fetch.backlog`: The maximum number of queued BLOB fetches (such as JAR 
file downloads) that the JobManager allows (DEFAULT: **1000**).
+
+- `task.cancellation-interval`: Time interval between two successive task 
cancellation attempts in milliseconds (DEFAULT: **30000**).
+
+- `taskmanager.exit-on-fatal-akka-error`: Whether the TaskManager shall be 
terminated in case of a fatal Akka error (quarantining event). (DEFAULT: 
**false**)
+
+### Distributed Coordination (via Akka)
+
+- `akka.ask.timeout`: Timeout used for all futures and blocking Akka calls. If 
Flink fails due to timeouts then you should try to increase this value. 
Timeouts can be caused by slow machines or a congested network. The timeout 
value requires a time-unit specifier (ms/s/min/h/d) (DEFAULT: **10 s**).
+
+- `akka.lookup.timeout`: Timeout used for the lookup of the JobManager. The 
timeout value has to contain a time-unit specifier (ms/s/min/h/d) (DEFAULT: 
**10 s**).
+
+- `akka.client.timeout`: Timeout used by Flink clients (e.g. `CliFrontend`, 
`ClusterClient`) when communicating with the Flink cluster. The timeout value 
has to contain a time-unit specifier (ms/s/min/h/d) (DEFAULT: **60 s**).
+
+- `akka.framesize`: Maximum size of messages which are sent between the 
JobManager and the TaskManagers. If Flink fails because messages exceed this 
limit, then you should increase it. The message size requires a size-unit 
specifier (DEFAULT: **10485760b**).
+
+- `akka.watch.heartbeat.interval`: Heartbeat interval for Akka's DeathWatch 
mechanism to detect dead TaskManagers. If TaskManagers are wrongly marked dead 
because of lost or delayed heartbeat messages, then you should increase this 
value. A thorough description of Akka's DeathWatch can be found 
[here](http://doc.akka.io/docs/akka/snapshot/scala/remoting.html#failure-detector)
 (DEFAULT: **10 s**).
+
+- `akka.watch.heartbeat.pause`: Acceptable heartbeat pause for Akka's DeathWatch mechanism. A low value does not allow for irregular heartbeats. A thorough description of Akka's DeathWatch can be found [here](http://doc.akka.io/docs/akka/snapshot/scala/remoting.html#failure-detector) (DEFAULT: **60 s**).
+
+- `akka.watch.threshold`: Threshold for the DeathWatch failure detector. A low 
value is prone to false positives whereas a high value increases the time to 
detect a dead TaskManager. A thorough description of Akka's DeathWatch can be 
found 
[here](http://doc.akka.io/docs/akka/snapshot/scala/remoting.html#failure-detector)
 (DEFAULT: **12**).
+
+- `akka.transport.heartbeat.interval`: Heartbeat interval for Akka's transport 
failure detector. Since Flink uses TCP, the detector is not necessary. 
Therefore, the detector is disabled by setting the interval to a very high 
value. In case you should need the transport failure detector, set the interval 
to some reasonable value. The interval value requires a time-unit specifier 
(ms/s/min/h/d) (DEFAULT: **1000 s**).
+
+- `akka.transport.heartbeat.pause`: Acceptable heartbeat pause for Akka's 
transport failure detector. Since Flink uses TCP, the detector is not 
necessary. Therefore, the detector is disabled by setting the pause to a very 
high value. In case you should need the transport failure detector, set the 
pause to some reasonable value. The pause value requires a time-unit specifier 
(ms/s/min/h/d) (DEFAULT: **6000 s**).
+
+- `akka.transport.threshold`: Threshold for the transport failure detector. 
Since Flink uses TCP, the detector is not necessary and, thus, the threshold is 
set to a high value (DEFAULT: **300**).
+
+- `akka.tcp.timeout`: Timeout for all outbound connections. If you should 
experience problems with connecting to a TaskManager due to a slow network, you 
should increase this value (DEFAULT: **20 s**).
+
+- `akka.throughput`: Number of messages that are processed in a batch before 
returning the thread to the pool. Low values denote a fair scheduling whereas 
high values can increase the performance at the cost of unfairness (DEFAULT: 
**15**).
+
+- `akka.log.lifecycle.events`: Turns on Akka's remote logging of events. Set this value to 'true' when debugging (DEFAULT: **false**).
+
+- `akka.startup-timeout`: Timeout after which the startup of a remote component is considered to have failed (DEFAULT: **akka.ask.timeout**).
+
+- `akka.ssl.enabled`: Turns on SSL for Akka's remote communication. This is applicable only when the global SSL flag `security.ssl.enabled` is set to true (DEFAULT: **true**).
+
+### SSL Settings
+
+- `security.ssl.enabled`: Turns on SSL for internal network communication. 
This can be optionally overridden by flags defined in different transport 
modules (DEFAULT: **false**).
+
+- `security.ssl.keystore`: The Java keystore file to be used by the Flink endpoint for its SSL key and certificate.
+
+- `security.ssl.keystore-password`: The secret to decrypt the keystore file.
+
+- `security.ssl.key-password`: The secret to decrypt the server key in the 
keystore.
+
+- `security.ssl.truststore`: The truststore file containing the public CA certificates to be used by Flink endpoints to verify the peer's certificate.
+
+- `security.ssl.truststore-password`: The secret to decrypt the truststore.
+
+- `security.ssl.protocol`: The SSL protocol version to be supported for the SSL transport (DEFAULT: **TLSv1.2**). Note that this option does not support a comma-separated list.
+
+- `security.ssl.algorithms`: The comma separated list of standard SSL 
algorithms to be supported. Read more 
[here](http://docs.oracle.com/javase/8/docs/technotes/guides/security/StandardNames.html#ciphersuites)
 (DEFAULT: **TLS_RSA_WITH_AES_128_CBC_SHA**).
+
+- `security.ssl.verify-hostname`: Flag to enable the peer's hostname verification during the SSL handshake (DEFAULT: **true**).
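+
+A sketch of a keystore-based SSL setup might look as follows (all paths and secrets are placeholders):
+
+```yaml
+security.ssl.enabled: true
+security.ssl.keystore: /path/to/flink.keystore        # hypothetical path
+security.ssl.keystore-password: keystore_secret       # placeholder
+security.ssl.key-password: key_secret                 # placeholder
+security.ssl.truststore: /path/to/flink.truststore    # hypothetical path
+security.ssl.truststore-password: truststore_secret   # placeholder
+```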
+
+### Network communication (via Netty)
+
+These parameters allow for advanced tuning. The default values are sufficient 
when running concurrent high-throughput jobs on a large cluster.
+
+- `taskmanager.net.num-arenas`: The number of Netty arenas (DEFAULT: 
**taskmanager.numberOfTaskSlots**).
+
+- `taskmanager.net.server.numThreads`: The number of Netty server threads 
(DEFAULT: **taskmanager.numberOfTaskSlots**).
+
+- `taskmanager.net.client.numThreads`: The number of Netty client threads 
(DEFAULT: **taskmanager.numberOfTaskSlots**).
+
+- `taskmanager.net.server.backlog`: The Netty server connection backlog.
+
+- `taskmanager.net.client.connectTimeoutSec`: The Netty client connection 
timeout (DEFAULT: **120 seconds**).
+
+- `taskmanager.net.sendReceiveBufferSize`: The Netty send and receive buffer 
size. This defaults to the system buffer size (`cat 
/proc/sys/net/ipv4/tcp_[rw]mem`) and is 4 MiB in modern Linux.
+
+- `taskmanager.net.transport`: The Netty transport type, either "nio" or 
"epoll" (DEFAULT: **nio**).
+
+### JobManager Web Frontend
+
+- `jobmanager.web.port`: Port of the JobManager's web interface that displays 
status of running jobs and execution time breakdowns of finished jobs (DEFAULT: 
8081). Setting this value to `-1` disables the web frontend.
+
+- `jobmanager.web.history`: The number of latest jobs that the JobManager's web front-end retains in its history (DEFAULT: 5).
+
+- `jobmanager.web.checkpoints.disable`: Disables checkpoint statistics 
(DEFAULT: `false`).
+
+- `jobmanager.web.checkpoints.history`: Number of checkpoint statistics to 
remember (DEFAULT: `10`).
+
+- `jobmanager.web.backpressure.cleanup-interval`: Time after which cached 
stats are cleaned up if not accessed (DEFAULT: `600000`, 10 mins).
+
+- `jobmanager.web.backpressure.refresh-interval`: Time after which available 
stats are deprecated and need to be refreshed (DEFAULT: `60000`, 1 min).
+
+- `jobmanager.web.backpressure.num-samples`: Number of stack trace samples to 
take to determine back pressure (DEFAULT: `100`).
+
+- `jobmanager.web.backpressure.delay-between-samples`: Delay between stack 
trace samples to determine back pressure (DEFAULT: `50`, 50 ms).
+
+- `jobmanager.web.ssl.enabled`: Enable HTTPS access to the web frontend. This is applicable only when the global SSL flag `security.ssl.enabled` is set to true (DEFAULT: `true`).
+
+- `jobmanager.web.access-control-allow-origin`: The value to set for the `Access-Control-Allow-Origin` header of the web frontend (DEFAULT: `*`).
+
+### File Systems
+
+These parameters define the behavior of tasks that create result files.
+
+- `fs.default-scheme`: The default filesystem scheme to be used, with the 
necessary authority to contact, e.g. the host:port of the NameNode in the case 
of HDFS (if needed).
+By default, this is set to `file:///` which points to the local filesystem. 
This means that the local
+filesystem is going to be used to search for user-specified files **without** 
an explicit scheme
+definition. This scheme is used **ONLY** if no other scheme is specified 
(explicitly) in the user-provided `URI`.
+
+- `fs.overwrite-files`: Specifies whether file output writers should overwrite 
existing files by default. Set to *true* to overwrite by default, *false* 
otherwise. (DEFAULT: false)
+
+- `fs.output.always-create-directory`: File writers running with a parallelism larger than one create a directory for the output file path and put the different result files (one per parallel writer task) into that directory. If this option is set to *true*, writers with a parallelism of 1 will also create a directory and place a single result file into it. If the option is set to *false*, the writer will create the file directly at the output path, without creating a containing directory. (DEFAULT: false)
+
+### Compiler/Optimizer
+
+- `compiler.delimited-informat.max-line-samples`: The maximum number of line 
samples taken by the compiler for delimited inputs. The samples are used to 
estimate the number of records. This value can be overridden for a specific 
input with the input format's parameters (DEFAULT: 10).
+
+- `compiler.delimited-informat.min-line-samples`: The minimum number of line 
samples taken by the compiler for delimited inputs. The samples are used to 
estimate the number of records. This value can be overridden for a specific 
input with the input format's parameters (DEFAULT: 2).
+
+- `compiler.delimited-informat.max-sample-len`: The maximal length of a line 
sample that the compiler takes for delimited inputs. If the length of a single 
sample exceeds this value (possible because of misconfiguration of the parser), 
the sampling aborts. This value can be overridden for a specific input with the 
input format's parameters (DEFAULT: 2097152 (= 2 MiBytes)).
+
+### Runtime Algorithms
+
+- `taskmanager.runtime.hashjoin-bloom-filters`: Flag to activate/deactivate bloom filters in the hybrid hash join implementation. In cases where the hash join needs to spill to disk (datasets larger than the reserved fraction of memory), these bloom filters can greatly reduce the number of spilled records, at the cost of some CPU cycles. (DEFAULT: false)
+
+- `taskmanager.runtime.max-fan`: The maximal fan-in for external merge joins 
and fan-out for spilling hash tables. Limits the number of file handles per 
operator, but may cause intermediate merging/partitioning, if set too small 
(DEFAULT: 128).
+
+- `taskmanager.runtime.sort-spilling-threshold`: A sort operation starts 
spilling when this fraction of its memory budget is full (DEFAULT: 0.8).
+
+### Resource Manager
+
+The configuration keys in this section are independent of the resource management framework used (YARN, Mesos, Standalone, ...).
+
+- `resourcemanager.rpc.port`: The config parameter defining the network port to connect to for communication with the resource manager. By default, this is the port of the JobManager, because the same ActorSystem is used. It is not possible to use this configuration key to define port ranges.
+
+
+### YARN
+
+- `containerized.heap-cutoff-ratio`: (Default 0.25) Percentage of heap space to remove from containers started by YARN. When a user requests a certain amount of memory for each TaskManager container (for example 4 GB), we cannot pass this amount as the maximum heap space for the JVM (`-Xmx` argument) because the JVM also allocates memory outside the heap. YARN is very strict about killing containers which use more memory than requested. Therefore, this ratio of the requested memory is removed from the heap as a safety margin.
+
+- `containerized.heap-cutoff-min`: (Default 600 MB) Minimum amount of memory 
to cut off the requested heap size.
+
+- `yarn.maximum-failed-containers` (Default: number of requested containers). 
Maximum number of containers the system is going to reallocate in case of a 
failure.
+
+- `yarn.application-attempts` (Default: 1). Number of ApplicationMaster restarts. Note that the entire Flink cluster will restart and the YARN Client will lose the connection. Also, the JobManager address will change and you'll need to set the JM host:port manually. It is recommended to leave this option at 1.
+
+- `yarn.heartbeat-delay` (Default: 5 seconds). Time between heartbeats with 
the ResourceManager.
+
+- `yarn.properties-file.location` (Default: temp directory). When a Flink job is submitted to YARN, the JobManager's host and the number of available processing slots are written into a properties file, so that the Flink client is able to pick those details up. This configuration parameter allows changing the default location of that file (for example for environments sharing a Flink installation between users).
+
+- `yarn.containers.vcores` The number of virtual cores (vcores) per YARN 
container. By default, the number of `vcores` is set to the number of slots per 
TaskManager, if set, or to 1, otherwise.
+
+- `containerized.master.env.`*ENV_VAR1=value* Configuration values prefixed 
with `containerized.master.env.` will be passed as environment variables to the 
ApplicationMaster/JobManager process. For example for passing `LD_LIBRARY_PATH` 
as an env variable to the ApplicationMaster, set:
+
+    `containerized.master.env.LD_LIBRARY_PATH: "/usr/lib/native"`
+
+- `containerized.taskmanager.env.` Similar to the configuration prefix above, this prefix allows setting custom environment variables for the TaskManager processes.
+
+- `yarn.container-start-command-template`: Flink uses the following template 
when starting on YARN:
+`%java% %jvmmem% %jvmopts% %logging% %class% %args% %redirects%`. This 
configuration parameter allows users
+to pass custom settings (such as JVM paths, arguments etc.). Note that in most 
cases, it is sufficient to
+use the `env.java.opts` setting, which is the `%jvmopts%` variable in the 
String.
+
+- `yarn.application-master.port` (Default: 0, which lets the OS choose an ephemeral port) With this configuration option, users can specify a port, a range of ports or a list of ports for the Application Master (and JobManager) RPC port. We recommend keeping the default value (0) to let the operating system choose an appropriate port. In particular when multiple AMs are running on the same physical host, fixed port assignments prevent the AM from starting.
+
+  For example, when running Flink on YARN in an environment with a restrictive firewall, this option allows specifying a range of allowed ports, as shown in the sketch after this list.
+
+- `yarn.tags` A comma-separated list of tags to apply to the Flink YARN 
application.
+
+- `yarn.per-job-cluster.include-user-jar` (Default: ORDER) Control whether and 
how the user-jar is included in the system class path for per-job clusters. 
Setting this parameter to `DISABLED` causes the jar to be included in the user 
class path instead. Setting this parameter to one of `FIRST`, `LAST` or `ORDER` 
causes the jar to be included in the system class path, with the jar either 
being placed at the beginning of the class path (`FIRST`), at the end (`LAST`), 
or based on the lexicographic order (`ORDER`).
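+
+For instance, restricting the ApplicationMaster RPC port to a range allowed by a firewall could look like this (the range itself is only an illustration):
+
+```yaml
+yarn.application-master.port: 50100-50200
+```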
+
+### Mesos
+
+
+- `mesos.initial-tasks`: The initial workers to bring up when the master 
starts (**DEFAULT**: The number of workers specified at cluster startup).
+
+- `mesos.constraints.hard.hostattribute`: Constraints for task placement on 
mesos (**DEFAULT**: None).
+
+- `mesos.maximum-failed-tasks`: The maximum number of failed workers before 
the cluster fails (**DEFAULT**: Number of initial workers).
+May be set to -1 to disable this feature.
+
+- `mesos.master`: The Mesos master URL. The value should be in one of the 
following forms:
+  * `host:port`
+  * `zk://host1:port1,host2:port2,.../path`
+  * `zk://username:password@host1:port1,host2:port2,.../path`
+  * `file:///path/to/file`
+
+
+- `mesos.failover-timeout`: The failover timeout in seconds for the Mesos 
scheduler, after which running tasks are automatically shut down (**DEFAULT:** 
600).
+
+- `mesos.resourcemanager.artifactserver.port`: The config parameter defining the Mesos artifact server port to use. Setting the port to 0 will let the OS choose an available port.
+
+- `mesos.resourcemanager.framework.name`: Mesos framework name (**DEFAULT:** 
Flink)
+
+- `mesos.resourcemanager.framework.role`: Mesos framework role definition 
(**DEFAULT:** *)
+
+- `mesos.resourcemanager.framework.principal`: Mesos framework principal (**NO 
DEFAULT**)
+
+- `mesos.resourcemanager.framework.secret`: Mesos framework secret (**NO 
DEFAULT**)
+
+- `mesos.resourcemanager.framework.user`: Mesos framework user (**DEFAULT:** "")
+
+- `mesos.resourcemanager.artifactserver.ssl.enabled`: Enables SSL for the Flink artifact server (**DEFAULT**: true). Note that `security.ssl.enabled` also needs to be set to `true` to enable encryption.
+
+- `mesos.resourcemanager.tasks.mem`: Memory to assign to the Mesos workers in 
MB (**DEFAULT**: 1024)
+
+- `mesos.resourcemanager.tasks.cpus`: CPUs to assign to the Mesos workers 
(**DEFAULT**: 0.0)
+
+- `mesos.resourcemanager.tasks.container.type`: Type of the containerization used: "mesos" or "docker" (DEFAULT: mesos).
+
+- `mesos.resourcemanager.tasks.container.image.name`: Image name to use for 
the container (**NO DEFAULT**)
+
+- `mesos.resourcemanager.tasks.container.volumes`: A comma-separated list of [host_path:]container_path[:RO|RW]. This allows for mounting additional volumes into your container. (**NO DEFAULT**)
+
+- `high-availability.zookeeper.path.mesos-workers`: The ZooKeeper root path 
for persisting the Mesos worker information.
+
+### High Availability (HA)
+
+- `high-availability`: Defines the high availability mode used for the cluster 
execution. Currently, Flink supports the following modes:
+  - `none` (default): No high availability. A single JobManager runs and no 
JobManager state is checkpointed.
+  - `zookeeper`: Supports the execution of multiple JobManagers and JobManager 
state checkpointing. Among the group of JobManagers, ZooKeeper elects one of 
them as the leader which is responsible for the cluster execution. In case of a 
JobManager failure, a standby JobManager will be elected as the new leader and 
is given the last checkpointed JobManager state. In order to use the 
'zookeeper' mode, it is mandatory to also define the 
`high-availability.zookeeper.quorum` configuration value.
+
+- `high-availability.cluster-id`: (Default `/default_ns` in standalone cluster mode, or the <yarn-application-id> under YARN) Defines the subdirectory under the root dir where the ZooKeeper HA mode will create znodes. This allows isolating multiple applications on the same ZooKeeper installation. Previously this key was named `recovery.zookeeper.path.namespace` and `high-availability.zookeeper.path.namespace`.
+
+Previously this key was named `recovery.mode` and the default value was 
`standalone`.
+
+#### ZooKeeper-based HA Mode
+
+- `high-availability.zookeeper.quorum`: Defines the ZooKeeper quorum URL which 
is used to connect to the ZooKeeper cluster when the 'zookeeper' HA mode is 
selected. Previously this key was named `recovery.zookeeper.quorum`.
+
+- `high-availability.zookeeper.path.root`: (Default `/flink`) Defines the root dir under which the ZooKeeper HA mode will create namespace directories. Previously this key was named `recovery.zookeeper.path.root`.
+
+- `high-availability.zookeeper.path.latch`: (Default `/leaderlatch`) Defines 
the znode of the leader latch which is used to elect the leader. Previously 
this key was named `recovery.zookeeper.path.latch`.
+
+- `high-availability.zookeeper.path.leader`: (Default `/leader`) Defines the 
znode of the leader which contains the URL to the leader and the current leader 
session ID. Previously this key was named `recovery.zookeeper.path.leader`.
+
+- `high-availability.storageDir`: Defines the directory in the state backend 
where the JobManager metadata will be stored (ZooKeeper only keeps pointers to 
it). Required for HA. Previously this key was named 
`recovery.zookeeper.storageDir` and `high-availability.zookeeper.storageDir`.
+
+- `high-availability.zookeeper.client.session-timeout`: (Default `60000`) Defines the session timeout for the ZooKeeper session in ms. Previously this key was named `recovery.zookeeper.client.session-timeout`.
+
+- `high-availability.zookeeper.client.connection-timeout`: (Default `15000`) 
Defines the connection timeout for ZooKeeper in ms. Previously this key was 
named `recovery.zookeeper.client.connection-timeout`.
+
+- `high-availability.zookeeper.client.retry-wait`: (Default `5000`) Defines 
the pause between consecutive retries in ms. Previously this key was named 
`recovery.zookeeper.client.retry-wait`.
+
+- `high-availability.zookeeper.client.max-retry-attempts`: (Default `3`) 
Defines the number of connection retries before the client gives up. Previously 
this key was named `recovery.zookeeper.client.max-retry-attempts`.
+
+- `high-availability.job.delay`: (Default `akka.ask.timeout`) Defines the 
delay before persisted jobs are recovered in case of a master recovery 
situation. Previously this key was named `recovery.job.delay`.
+
+- `high-availability.zookeeper.client.acl`: (Default `open`) Defines the ACL (open|creator) to be configured on ZooKeeper znodes. The configuration value can be set to "creator" if the ZooKeeper server configuration has the "authProvider" property mapped to use SASLAuthenticationProvider and the cluster is configured to run in secure mode (Kerberos). The ACL options are based on https://zookeeper.apache.org/doc/r3.1.2/zookeeperProgrammers.html#sc_BuiltinACLSchemes
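+
+A minimal ZooKeeper HA configuration might look like the following sketch (quorum host names, cluster id, and storage path are hypothetical):
+
+```yaml
+high-availability: zookeeper
+high-availability.zookeeper.quorum: zk1:2181,zk2:2181,zk3:2181  # hypothetical quorum
+high-availability.cluster-id: /my_flink_cluster                 # hypothetical id
+high-availability.storageDir: hdfs://namenode:9000/flink/ha     # hypothetical path
+```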
+
+#### ZooKeeper Security
+
+- `zookeeper.sasl.disable`: (Default: `true`) Defines whether SASL-based authentication is disabled. The configuration value can be set to `false` if the ZooKeeper cluster is running in secure mode (Kerberos) and SASL authentication should be used.
+
+- `zookeeper.sasl.service-name`: (Default: `zookeeper`) If the ZooKeeper server is configured with a different service name, it can be supplied using this configuration. A mismatch in service name between client and server configuration will cause the authentication to fail.
+
+### Kerberos-based Security
+
+- `security.kerberos.login.use-ticket-cache`: Indicates whether to read from 
your Kerberos ticket cache (default: `true`).
+
+- `security.kerberos.login.keytab`: Absolute path to a Kerberos keytab file 
that contains the user credentials.
+
+- `security.kerberos.login.principal`: Kerberos principal name associated with 
the keytab.
+
+- `security.kerberos.login.contexts`: A comma-separated list of login contexts 
to provide the Kerberos credentials to (for example, `Client,KafkaClient` to 
use the credentials for ZooKeeper authentication and for Kafka authentication).
+
+### Environment
+
+- `env.log.dir`: (Defaults to the `log` directory under Flink's home) Defines 
the directory where the Flink logs are saved. It has to be an absolute path.
+
+- `env.log.max`: (Default: `5`) The maximum number of old log files to keep.
+
+- `env.ssh.opts`: Additional command line options passed to SSH clients when 
starting or stopping JobManager, TaskManager, and Zookeeper services 
(start-cluster.sh, stop-cluster.sh, start-zookeeper-quorum.sh, 
stop-zookeeper-quorum.sh).
+
+### Queryable State
+
+#### Server
+
+- `query.server.enable`: Enable queryable state (Default: `true`).
+
+- `query.server.port`: Port to bind queryable state server to (Default: `0`, 
binds to random port).
+
+- `query.server.network-threads`: Number of network (Netty's event loop) threads for the queryable state server (Default: `0`, picks number of slots).
+
+- `query.server.query-threads`: Number of query threads for the queryable state server (Default: `0`, picks number of slots).
+
+#### Client
+
+- `query.client.network-threads`: Number of network (Netty's event loop) threads for the queryable state client (Default: `0`, picks number of available cores as returned by `Runtime.getRuntime().availableProcessors()`).
+
+- `query.client.lookup.num-retries`: Number of retries on KvState lookup 
failure due to unavailable JobManager (Default: `3`).
+
+- `query.client.lookup.retry-delay`: Retry delay in milliseconds on KvState 
lookup failure due to unavailable JobManager (Default: `1000`).
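+
+For instance, pinning the queryable state server to a fixed port and loosening the client retry behavior could look like this (illustrative values):
+
+```yaml
+query.server.enable: true
+query.server.port: 9069               # illustrative fixed port
+query.client.lookup.num-retries: 5
+query.client.lookup.retry-delay: 2000 # milliseconds
+```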
+
+### Metrics
+
+- `metrics.reporters`: The list of named reporters, i.e. "foo,bar" (see the example at the end of this section).
+
+- `metrics.reporter.<name>.<config>`: Generic setting `<config>` for the 
reporter named `<name>`.
+
+- `metrics.reporter.<name>.class`: The reporter class to use for the reporter 
named `<name>`.
+
+- `metrics.reporter.<name>.interval`: The reporter interval to use for the 
reporter named `<name>`.
+
+- `metrics.scope.jm`: (Default: &lt;host&gt;.jobmanager) Defines the scope 
format string that is applied to all metrics scoped to a JobManager.
+
+- `metrics.scope.jm.job`: (Default: &lt;host&gt;.jobmanager.&lt;job_name&gt;) 
Defines the scope format string that is applied to all metrics scoped to a job 
on a JobManager.
+
+- `metrics.scope.tm`: (Default: &lt;host&gt;.taskmanager.&lt;tm_id&gt;) 
Defines the scope format string that is applied to all metrics scoped to a 
TaskManager.
+
+- `metrics.scope.tm.job`: (Default: 
&lt;host&gt;.taskmanager.&lt;tm_id&gt;.&lt;job_name&gt;) Defines the scope 
format string that is applied to all metrics scoped to a job on a TaskManager.
+
+- `metrics.scope.task`: (Default: 
&lt;host&gt;.taskmanager.&lt;tm_id&gt;.&lt;job_name&gt;.&lt;task_name&gt;.&lt;subtask_index&gt;)
 Defines the scope format string that is applied to all metrics scoped to a 
task.
+
+- `metrics.scope.operator`: (Default: 
&lt;host&gt;.taskmanager.&lt;tm_id&gt;.&lt;job_name&gt;.&lt;operator_name&gt;.&lt;subtask_index&gt;)
 Defines the scope format string that is applied to all metrics scoped to an 
operator.
+
+- `metrics.latency.history-size`: (Default: `128`) Defines the number of measured latencies to maintain at each operator.
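+
+Putting the reporter options together, a configuration sketch could look as follows; the reporter name `my_reporter` and the class `com.example.MyReporter` are hypothetical and stand in for a real reporter implementation:
+
+```
+# Sketch only: one named reporter with a hypothetical implementation class
+metrics.reporters: my_reporter
+metrics.reporter.my_reporter.class: com.example.MyReporter
+# a reporter-specific setting, following the generic <config> pattern above
+metrics.reporter.my_reporter.port: 9999
+```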
+
+### History Server
+
+You have to configure `jobmanager.archive.fs.dir` in order to archive terminated jobs, and add that directory to the list of monitored directories via `historyserver.archive.fs.dir` if you want to display them via the HistoryServer's web frontend.
+
+- `jobmanager.archive.fs.dir`: Directory to upload information about 
terminated jobs to. You have to add this directory to the list of monitored 
directories of the history server via `historyserver.archive.fs.dir`.
+
+- `historyserver.archive.fs.dir`: Comma separated list of directories to fetch 
archived jobs from. The history server will monitor these directories for 
archived jobs. You can configure the JobManager to archive jobs to a directory 
via `jobmanager.archive.fs.dir`.
+
+- `historyserver.archive.fs.refresh-interval`: Interval in milliseconds for 
refreshing the archived job directories (DEFAULT: `10000`).
+
+- `historyserver.web.tmpdir`: Local directory used by the HistoryServer web interface. The web interface will copy its static files into this directory (DEFAULT: local system temporary directory).
+
+- `historyserver.web.address`: Address of the HistoryServer's web interface 
(DEFAULT: `anyLocalAddress()`).
+
+- `historyserver.web.port`: Port of the HistoryServer's web interface (DEFAULT: `8082`).
+
+- `historyserver.web.ssl.enabled`: Enable HTTPS access to the HistoryServer web frontend. This is applicable only when the global SSL flag `security.ssl.enabled` is set to true (DEFAULT: `false`).
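+
+For instance, the following sketch wires the JobManager archive directory to the HistoryServer; the HDFS path is a placeholder:
+
+```
+# Sketch only: archive terminated jobs and serve them via the HistoryServer
+jobmanager.archive.fs.dir: hdfs:///completed-jobs/
+historyserver.archive.fs.dir: hdfs:///completed-jobs/
+historyserver.archive.fs.refresh-interval: 10000
+```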
+
+## Background
+
+
+### Configuring the Network Buffers
+
+If you ever see the Exception `java.io.IOException: Insufficient number of 
network buffers`, you
+need to adapt the amount of memory used for network buffers in order for your 
program to run on your
+task managers.
+
+Network buffers are a critical resource for the communication layers. They are 
used to buffer
+records before transmission over a network, and to buffer incoming data before 
dissecting it into
+records and handing them to the application. A sufficient number of network 
buffers is critical to
+achieve a good throughput.
+
+<div class="alert alert-info">
+Since Flink 1.3, you may follow the idiom "more is better" without any penalty 
on the latency (we
+prevent excessive buffering in each outgoing and incoming channel, i.e. 
*buffer bloat*, by limiting
+the actual number of buffers used by each channel).
+</div>
+
+In general, configure the task manager to have enough buffers that each 
logical network connection
+you expect to be open at the same time has a dedicated buffer. A logical 
network connection exists
+for each point-to-point exchange of data over the network, which typically 
happens at
+repartitioning or broadcasting steps (shuffle phase). In those, each parallel 
task inside the
+TaskManager has to be able to talk to all other parallel tasks.
+
+#### Setting Memory Fractions
+
+Previously, the number of network buffers had to be set manually, which was quite an error-prone task (see below). Since Flink 1.3, it is possible to define a fraction of memory that is used for network buffers with the following configuration parameters:
+
+- `taskmanager.network.memory.fraction`: Fraction of JVM memory to use for 
network buffers (DEFAULT: 0.1),
+- `taskmanager.network.memory.min`: Minimum memory size for network buffers in 
bytes (DEFAULT: 64 MB),
+- `taskmanager.network.memory.max`: Maximum memory size for network buffers in 
bytes (DEFAULT: 1 GB), and
+- `taskmanager.memory.segment-size`: Size of memory buffers used by the memory 
manager and the
+network stack in bytes (DEFAULT: 32768 (= 32 KiBytes)).
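+
+As a sketch, the following entries spell out the defaults explicitly: 10% of the JVM memory, bounded between 64 MB and 1 GB:
+
+```
+# Sketch only: the defaults, written out explicitly
+taskmanager.network.memory.fraction: 0.1
+taskmanager.network.memory.min: 67108864    # 64 MB in bytes
+taskmanager.network.memory.max: 1073741824  # 1 GB in bytes
+```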
+
+#### Setting the Number of Network Buffers directly
+
+<div class="alert alert-warning">
+  <strong>Note:</strong> This way of configuring the amount of memory used for 
network buffers is deprecated. Please consider using the method above by 
defining a fraction of memory to use.
+</div>
+
+The required number of buffers on a task manager is
+*total-degree-of-parallelism* (number of targets) \* *intra-node-parallelism* 
(number of sources in one task manager) \* *n*
+with *n* being a constant that defines how many repartitioning-/broadcasting 
steps you expect to be
+active at the same time. Since the *intra-node-parallelism* is typically the 
number of cores, and
+more than 4 repartitioning or broadcasting channels are rarely active in 
parallel, it frequently
+boils down to
+
+```
+#slots-per-TM^2 * #TMs * 4
+```
+
+Where `#slots-per-TM` is the [number of slots per TaskManager](#configuring-taskmanager-processing-slots) and `#TMs` is the total number of task managers.
+
+To support, for example, a cluster of 20 8-slot machines, you should use 
roughly 5000 network
+buffers for optimal throughput.
+
+Each network buffer has by default a size of 32 KiBytes. In the example above, the system would thus allocate roughly 160 MiBytes for network buffers (8^2 \* 20 \* 4 = 5120 buffers at 32 KiBytes each).
+
+The number and size of network buffers can be configured with the following 
parameters:
+
+- `taskmanager.network.numberOfBuffers`, and
+- `taskmanager.memory.segment-size`.
+
+### Configuring Temporary I/O Directories
+
+Although Flink aims to process as much data in main memory as possible, it is 
not uncommon that more data needs to be processed than memory is available. 
Flink's runtime is designed to write temporary data to disk to handle these 
situations.
+
+The `taskmanager.tmp.dirs` parameter specifies a list of directories into 
which Flink writes temporary files. The paths of the directories need to be 
separated by ':' (colon character). Flink will concurrently write (or read) one 
temporary file to (from) each configured directory. This way, temporary I/O can 
be evenly distributed over multiple independent I/O devices such as hard disks 
to improve performance. To leverage fast I/O devices (e.g., SSD, RAID, NAS), it 
is possible to specify a directory multiple times.
+
+If the `taskmanager.tmp.dirs` parameter is not explicitly specified, Flink 
writes temporary data to the temporary directory of the operating system, such 
as */tmp* in Linux systems.
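+
+For example, to spread temporary I/O over two independent disks (the paths are placeholders):
+
+```
+# Sketch only: two disks, colon-separated placeholder paths
+taskmanager.tmp.dirs: /disk1/flink-tmp:/disk2/flink-tmp
+```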
+
+### Configuring TaskManager processing slots
+
+Flink executes a program in parallel by splitting it into subtasks and 
scheduling these subtasks to processing slots.
+
+Each Flink TaskManager provides processing slots in the cluster. The number of 
slots is typically proportional to the number of available CPU cores __of 
each__ TaskManager. As a general recommendation, the number of available CPU 
cores is a good default for `taskmanager.numberOfTaskSlots`.
+
+When starting a Flink application, users can supply the default parallelism to use for that job. The corresponding command line option is called `-p` (for parallelism). In addition, it is possible to [set the parallelism in the programming APIs]({{site.baseurl}}/dev/parallel.html) for the whole application and for individual operators.
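+
+For example, on machines with eight CPU cores one might configure (the value is an assumption for illustration):
+
+```
+# Sketch only: one slot per CPU core on an 8-core machine
+taskmanager.numberOfTaskSlots: 8
+```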
+
+<img src="{{ site.baseurl }}/fig/slots_parallelism.svg" class="img-responsive" 
/>

http://git-wip-us.apache.org/repos/asf/flink/blob/47070674/docs/ops/deployment/aws.md
----------------------------------------------------------------------
diff --git a/docs/ops/deployment/aws.md b/docs/ops/deployment/aws.md
new file mode 100644
index 0000000..57c7e58
--- /dev/null
+++ b/docs/ops/deployment/aws.md
@@ -0,0 +1,374 @@
+---
+title: "Amazon Web Services (AWS)"
+nav-title: AWS
+nav-parent_id: deployment
+nav-pos: 5
+---
+<!--
+Licensed to the Apache Software Foundation (ASF) under one
+or more contributor license agreements.  See the NOTICE file
+distributed with this work for additional information
+regarding copyright ownership.  The ASF licenses this file
+to you under the Apache License, Version 2.0 (the
+"License"); you may not use this file except in compliance
+with the License.  You may obtain a copy of the License at
+
+  http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing,
+software distributed under the License is distributed on an
+"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+KIND, either express or implied.  See the License for the
+specific language governing permissions and limitations
+under the License.
+-->
+
+Amazon Web Services offers cloud computing services on which you can run Flink.
+
+* ToC
+{:toc}
+
+## EMR: Elastic MapReduce
+
+[Amazon Elastic MapReduce](https://aws.amazon.com/elasticmapreduce/) (Amazon EMR) is a web service that makes it easy to quickly set up a Hadoop cluster. This is the **recommended way** to run Flink on AWS as it takes care of setting up everything.
+
+### Standard EMR Installation
+
+Flink is a supported application on Amazon EMR. [Amazon's 
documentation](http://docs.aws.amazon.com/emr/latest/ReleaseGuide/emr-flink.html)
+describes configuring Flink, creating and monitoring a cluster, and working 
with jobs.
+
+### Custom EMR Installation
+
+Amazon EMR services are regularly updated to new releases, but a version of Flink that is not yet available can be manually installed on a stock EMR cluster.
+
+**Create EMR Cluster**
+
+The EMR documentation contains [examples showing how to start an EMR 
cluster](http://docs.aws.amazon.com/ElasticMapReduce/latest/ManagementGuide/emr-gs-launch-sample-cluster.html).
 You can follow that guide and install any EMR release. You don't need to 
install the *All Applications* part of the EMR release, but can stick to *Core 
Hadoop*.
+
+{% warn Note %}
+Access to S3 buckets requires
+[configuration of IAM 
roles](http://docs.aws.amazon.com/ElasticMapReduce/latest/ManagementGuide/emr-iam-roles.html)
+when creating an EMR cluster.
+
+**Install Flink on EMR Cluster**
+
+After creating your cluster, you can [connect to the master 
node](http://docs.aws.amazon.com/ElasticMapReduce/latest/ManagementGuide/emr-connect-master-node.html)
 and install Flink:
+
+1. Go to the [Downloads Page]({{ site.download_url }}) and **download a binary version of Flink matching the Hadoop version** of your EMR cluster, e.g. Hadoop 2.7 for EMR releases 4.3.0, 4.4.0, or 4.5.0.
+2. Extract the Flink distribution and you are ready to deploy [Flink jobs via 
YARN](yarn_setup.html) after **setting the Hadoop config directory**:
+
+```bash
+HADOOP_CONF_DIR=/etc/hadoop/conf bin/flink run -m yarn-cluster 
examples/streaming/WordCount.jar
+```
+
+{% top %}
+
+## S3: Simple Storage Service
+
+[Amazon Simple Storage Service](http://aws.amazon.com/s3/) (Amazon S3) provides cloud object storage for a variety of use cases. You can use S3 with Flink for **reading** and **writing data** as well as in conjunction with the [streaming **state backends**]({{ site.baseurl}}/ops/state/state_backends.html).
+
+You can use S3 objects like regular files by specifying paths in the following 
format:
+
+```
+s3://<your-bucket>/<endpoint>
+```
+
+The endpoint can either be a single file or a directory, for example:
+
+```java
+// Read from S3 bucket
+env.readTextFile("s3://<bucket>/<endpoint>");
+
+// Write to S3 bucket
+stream.writeAsText("s3://<bucket>/<endpoint>");
+
+// Use S3 as FsStateBackend
+env.setStateBackend(new FsStateBackend("s3://<your-bucket>/<endpoint>"));
+```
+
+Note that these examples are *not* exhaustive and you can use S3 in other 
places as well, including your [high availability 
setup](../jobmanager_high_availability.html) or the [RocksDBStateBackend]({{ 
site.baseurl }}/ops/state/state_backends.html#the-rocksdbstatebackend); 
everywhere that Flink expects a FileSystem URI.
+
+### Set S3 FileSystem
+
+{% panel **Note:** You don't have to configure this manually if you are 
running [Flink on EMR](#emr-elastic-mapreduce). %}
+
+S3 is treated by Flink as a regular FileSystem. Interaction with S3 happens 
via a Hadoop [S3 FileSystem client](https://wiki.apache.org/hadoop/AmazonS3).
+
+There are two popular S3 file system implementations available:
+
+1. `S3AFileSystem` (**recommended**): file system for reading and writing regular files using Amazon's SDK internally. There is no maximum file size and it works with IAM roles.
+2. `NativeS3FileSystem`: file system for reading and writing regular files. The maximum object size is 5GB and it does not work with IAM roles.
+
+#### `S3AFileSystem` (Recommended)
+
+This is the recommended S3 FileSystem implementation to use. It uses Amazon's 
SDK internally and works with IAM roles (see [Configure Access 
Credential](#configure-access-credentials)).
+
+You need to point Flink to a valid Hadoop configuration, which contains the 
following properties in `core-site.xml`:
+
+```xml
+<configuration>
+
+<property>
+  <name>fs.s3.impl</name>
+  <value>org.apache.hadoop.fs.s3a.S3AFileSystem</value>
+</property>
+
+<!-- Comma separated list of local directories used to buffer
+     large results prior to transmitting them to S3. -->
+<property>
+  <name>fs.s3.buffer.dir</name>
+  <value>/tmp</value>
+</property>
+
+</configuration>
+```
+
+This registers `S3AFileSystem` as the default FileSystem for URIs with the 
`s3://` scheme.
+
+#### `NativeS3FileSystem`
+
+This file system is limited to files up to 5GB in size and it does not work with IAM roles (see [Configure Access Credential](#configure-access-credentials)), meaning that you have to manually configure your AWS credentials in the Hadoop config file.
+
+You need to point Flink to a valid Hadoop configuration, which contains the 
following property in `core-site.xml`:
+
+```xml
+<property>
+  <name>fs.s3.impl</name>
+  <value>org.apache.hadoop.fs.s3native.NativeS3FileSystem</value>
+</property>
+```
+
+This registers `NativeS3FileSystem` as the default FileSystem for URIs with 
the `s3://` scheme.
+
+#### Hadoop Configuration
+
+You can specify the [Hadoop configuration](../config.html#hdfs) in various 
ways, for example by configuring the path to the Hadoop configuration directory 
in `flink-conf.yaml`:
+
+```
+fs.hdfs.hadoopconf: /path/to/etc/hadoop
+```
+
+This registers `/path/to/etc/hadoop` as Hadoop's configuration directory with 
Flink. Flink will look for the `core-site.xml` and `hdfs-site.xml` files in the 
specified directory.  
+
+{% top %}
+
+### Configure Access Credentials
+
+{% panel **Note:** You don't have to configure this manually if you are 
running [Flink on EMR](#emr-elastic-mapreduce). %}
+
+After setting up the S3 FileSystem, you need to make sure that Flink is 
allowed to access your S3 buckets.
+
+#### Identity and Access Management (IAM) (Recommended)
+
+When using `S3AFileSystem` the recommended way of setting up credentials on 
AWS is via [Identity and Access Management 
(IAM)](http://docs.aws.amazon.com/IAM/latest/UserGuide/introduction.html). You 
can use IAM features to securely give Flink instances the credentials that they 
need in order to access S3 buckets. Details about how to do this are beyond the 
scope of this documentation. Please refer to the AWS user guide. What you are 
looking for are [IAM 
Roles](http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/iam-roles-for-amazon-ec2.html).
+
+If you set this up correctly, you can manage access to S3 within AWS and don't 
need to distribute any access keys to Flink.
+
+Note that this only works with `S3AFileSystem` and not `NativeS3FileSystem`.
+
+{% top %}
+
+#### Access Keys with S3AFileSystem (Discouraged)
+
+Access to S3 can be granted via your **access and secret key pair**. Please 
note that this is discouraged since the [introduction of IAM 
roles](https://blogs.aws.amazon.com/security/post/Tx1XG3FX6VMU6O5/A-safer-way-to-distribute-AWS-credentials-to-EC2).
+
+For `S3AFileSystem` you need to configure both `fs.s3a.access.key` and 
`fs.s3a.secret.key`  in Hadoop's  `core-site.xml`:
+
+```xml
+<property>
+  <name>fs.s3a.access.key</name>
+  <value></value>
+</property>
+
+<property>
+  <name>fs.s3a.secret.key</name>
+  <value></value>
+</property>
+```
+
+{% top %}
+
+#### Access Keys with NativeS3FileSystem (Discouraged)
+
+Access to S3 can be granted via your **access and secret key pair**. But this 
is discouraged and you should use `S3AFileSystem` [with the required IAM 
roles](https://blogs.aws.amazon.com/security/post/Tx1XG3FX6VMU6O5/A-safer-way-to-distribute-AWS-credentials-to-EC2).
+
+For `NativeS3FileSystem` you need to configure both `fs.s3.awsAccessKeyId` and 
`fs.s3.awsSecretAccessKey`  in Hadoop's  `core-site.xml`:
+
+```xml
+<property>
+  <name>fs.s3.awsAccessKeyId</name>
+  <value></value>
+</property>
+
+<property>
+  <name>fs.s3.awsSecretAccessKey</name>
+  <value></value>
+</property>
+```
+
+{% top %}
+
+### Provide S3 FileSystem Dependency
+
+{% panel **Note:** You don't have to configure this manually if you are 
running [Flink on EMR](#emr-elastic-mapreduce). %}
+
+Hadoop's S3 FileSystem clients are packaged in the `hadoop-aws` artifact. This JAR and all its dependencies need to be added to Flink's classpath, i.e. the class path of both JobManagers and TaskManagers. Depending on which FileSystem implementation and which Flink and Hadoop version you use, you need to provide different dependencies (see below).
+
+There are multiple ways of adding JARs to Flink's class path, the easiest 
being simply to drop the JARs in Flink's `/lib` folder. You need to copy the 
`hadoop-aws` JAR with all its dependencies. You can also export the directory 
containing these JARs as part of the `HADOOP_CLASSPATH` environment variable on 
all machines.
+
+#### Flink for Hadoop 2.7
+
+Depending on which file system you use, please add the following dependencies. 
You can find these as part of the Hadoop binaries in 
`hadoop-2.7/share/hadoop/tools/lib`:
+
+- `S3AFileSystem`:
+  - `hadoop-aws-2.7.2.jar`
+  - `aws-java-sdk-1.7.4.jar`
+  - `httpcore-4.2.5.jar`
+  - `httpclient-4.2.5.jar`
+
+- `NativeS3FileSystem`:
+  - `hadoop-aws-2.7.2.jar`
+  - `guava-11.0.2.jar`
+
+Note that `hadoop-common` is available as part of Flink, but Guava is shaded 
by Flink.
+
+#### Flink for Hadoop 2.6
+
+Depending on which file system you use, please add the following dependencies. 
You can find these as part of the Hadoop binaries in 
`hadoop-2.6/share/hadoop/tools/lib`:
+
+- `S3AFileSystem`:
+  - `hadoop-aws-2.6.4.jar`
+  - `aws-java-sdk-1.7.4.jar`
+  - `httpcore-4.2.5.jar`
+  - `httpclient-4.2.5.jar`
+
+- `NativeS3FileSystem`:
+  - `hadoop-aws-2.6.4.jar`
+  - `guava-11.0.2.jar`
+
+Note that `hadoop-common` is available as part of Flink, but Guava is shaded 
by Flink.
+
+#### Flink for Hadoop 2.4 and earlier
+
+These Hadoop versions only have support for `NativeS3FileSystem`. This comes 
pre-packaged with Flink for Hadoop 2 as part of `hadoop-common`. You don't need 
to add anything to the classpath.
+
+{% top %}
+
+## Common Issues
+
+The following sections list common issues when working with Flink on AWS.
+
+### Missing S3 FileSystem Configuration
+
+If your job submission fails with an Exception message noting that `No file system found with scheme s3`, this means that no FileSystem has been configured for S3. Please check out the [FileSystem Configuration section](#set-s3-filesystem) for details on how to configure this properly.
+
+```
+org.apache.flink.client.program.ProgramInvocationException: The program 
execution failed:
+  Failed to submit job cd927567a81b62d7da4c18eaa91c3c39 (WordCount Example) 
[...]
+Caused by: org.apache.flink.runtime.JobException: Creating the input splits 
caused an error:
+  No file system found with scheme s3, referenced in file URI 
's3://<bucket>/<endpoint>'. [...]
+Caused by: java.io.IOException: No file system found with scheme s3,
+  referenced in file URI 's3://<bucket>/<endpoint>'.
+    at o.a.f.core.fs.FileSystem.get(FileSystem.java:296)
+    at o.a.f.core.fs.Path.getFileSystem(Path.java:311)
+    at 
o.a.f.api.common.io.FileInputFormat.createInputSplits(FileInputFormat.java:450)
+    at 
o.a.f.api.common.io.FileInputFormat.createInputSplits(FileInputFormat.java:57)
+    at 
o.a.f.runtime.executiongraph.ExecutionJobVertex.<init>(ExecutionJobVertex.java:156)
+```
+
+{% top %}
+
+### AWS Access Key ID and Secret Access Key Not Specified
+
+If you see your job failing with an Exception noting that the `AWS Access Key 
ID and Secret Access Key must be specified as the username or password`, your 
access credentials have not been set up properly. Please refer to the [access 
credential section](#configure-access-credentials) for details on how to 
configure this.
+
+```
+org.apache.flink.client.program.ProgramInvocationException: The program 
execution failed:
+  Failed to submit job cd927567a81b62d7da4c18eaa91c3c39 (WordCount Example) 
[...]
+Caused by: java.io.IOException: The given file URI (s3://<bucket>/<endpoint>) 
points to the
+  HDFS NameNode at <bucket>, but the File System could not be initialized with 
that address:
+  AWS Access Key ID and Secret Access Key must be specified as the username or 
password
+  (respectively) of a s3n URL, or by setting the fs.s3n.awsAccessKeyId
+  or fs.s3n.awsSecretAccessKey properties (respectively) [...]
+Caused by: java.lang.IllegalArgumentException: AWS Access Key ID and Secret 
Access Key must
+  be specified as the username or password (respectively) of a s3 URL, or by 
setting
+  the fs.s3n.awsAccessKeyId or fs.s3n.awsSecretAccessKey properties 
(respectively) [...]
+    at o.a.h.fs.s3.S3Credentials.initialize(S3Credentials.java:70)
+    at 
o.a.h.fs.s3native.Jets3tNativeFileSystemStore.initialize(Jets3tNativeFileSystemStore.java:80)
+    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
+    at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
+    at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
+    at java.lang.reflect.Method.invoke(Method.java:606)
+    at 
o.a.h.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:187)
+    at 
o.a.h.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:102)
+    at o.a.h.fs.s3native.$Proxy6.initialize(Unknown Source)
+    at 
o.a.h.fs.s3native.NativeS3FileSystem.initialize(NativeS3FileSystem.java:330)
+    at 
o.a.f.runtime.fs.hdfs.HadoopFileSystem.initialize(HadoopFileSystem.java:321)
+```
+
+{% top %}
+
+### ClassNotFoundException: NativeS3FileSystem/S3AFileSystem Not Found
+
+If you see this Exception, the S3 FileSystem is not part of the class path of 
Flink. Please refer to [S3 FileSystem dependency 
section](#provide-s3-filesystem-dependency) for details on how to configure 
this properly.
+
+```
+Caused by: java.lang.RuntimeException: java.lang.RuntimeException: 
java.lang.ClassNotFoundException: Class 
org.apache.hadoop.fs.s3native.NativeS3FileSystem not found
+  at org.apache.hadoop.conf.Configuration.getClass(Configuration.java:2186)
+  at 
org.apache.flink.runtime.fs.hdfs.HadoopFileSystem.getHadoopWrapperClassNameForFileSystem(HadoopFileSystem.java:460)
+  at 
org.apache.flink.core.fs.FileSystem.getHadoopWrapperClassNameForFileSystem(FileSystem.java:352)
+  at org.apache.flink.core.fs.FileSystem.get(FileSystem.java:280)
+  at org.apache.flink.core.fs.Path.getFileSystem(Path.java:311)
+  at 
org.apache.flink.api.common.io.FileInputFormat.createInputSplits(FileInputFormat.java:450)
+  at 
org.apache.flink.api.common.io.FileInputFormat.createInputSplits(FileInputFormat.java:57)
+  at 
org.apache.flink.runtime.executiongraph.ExecutionJobVertex.<init>(ExecutionJobVertex.java:156)
+  ... 25 more
+Caused by: java.lang.RuntimeException: java.lang.ClassNotFoundException: Class 
org.apache.hadoop.fs.s3native.NativeS3FileSystem not found
+  at org.apache.hadoop.conf.Configuration.getClass(Configuration.java:2154)
+  at org.apache.hadoop.conf.Configuration.getClass(Configuration.java:2178)
+  ... 32 more
+Caused by: java.lang.ClassNotFoundException: Class 
org.apache.hadoop.fs.s3native.NativeS3FileSystem not found
+  at 
org.apache.hadoop.conf.Configuration.getClassByName(Configuration.java:2060)
+  at org.apache.hadoop.conf.Configuration.getClass(Configuration.java:2152)
+  ... 33 more
+```
+
+{% top %}
+
+### IOException: `400: Bad Request`
+
+If you have configured everything properly, but get a `Bad Request` Exception **and** your S3 bucket is located in region `eu-central-1`, you might be running an S3 client which does not support [Amazon's signature version 4](http://docs.aws.amazon.com/AmazonS3/latest/API/sig-v4-authenticating-requests.html).
+
+Currently, this includes all Hadoop versions up to 2.7.2 running 
`NativeS3FileSystem`, which depend on `JetS3t 0.9.0` instead of a version [>= 
0.9.4](http://www.jets3t.org/RELEASE_NOTES.html).
+
+The only workaround is to change the bucket region.
+
+```
+[...]
+Caused by: java.io.IOException: s3://<bucket-in-eu-central-1>/<endpoint> : 400 
: Bad Request [...]
+Caused by: org.jets3t.service.impl.rest.HttpException [...]
+```
+
+{% top %}
+
+### NullPointerException at org.apache.hadoop.fs.LocalDirAllocator
+
+This Exception is usually caused by skipping the local buffer directory 
configuration `fs.s3.buffer.dir` for the `S3AFileSystem`. Please refer to the 
[S3AFileSystem configuration](#s3afilesystem-recommended) section to see how to 
configure the `S3AFileSystem` properly.
+
+```
+[...]
+Caused by: java.lang.NullPointerException at
+o.a.h.fs.LocalDirAllocator$AllocatorPerContext.confChanged(LocalDirAllocator.java:268)
 at
+o.a.h.fs.LocalDirAllocator$AllocatorPerContext.getLocalPathForWrite(LocalDirAllocator.java:344)
 at
+o.a.h.fs.LocalDirAllocator$AllocatorPerContext.createTmpFileForWrite(LocalDirAllocator.java:416)
 at
+o.a.h.fs.LocalDirAllocator.createTmpFileForWrite(LocalDirAllocator.java:198) at
+o.a.h.fs.s3a.S3AOutputStream.<init>(S3AOutputStream.java:87) at
+o.a.h.fs.s3a.S3AFileSystem.create(S3AFileSystem.java:410) at
+o.a.h.fs.FileSystem.create(FileSystem.java:907) at
+o.a.h.fs.FileSystem.create(FileSystem.java:888) at
+o.a.h.fs.FileSystem.create(FileSystem.java:785) at
+o.a.f.runtime.fs.hdfs.HadoopFileSystem.create(HadoopFileSystem.java:404) at
+o.a.f.runtime.fs.hdfs.HadoopFileSystem.create(HadoopFileSystem.java:48) at
+... 25 more
+```

http://git-wip-us.apache.org/repos/asf/flink/blob/47070674/docs/ops/deployment/cluster_setup.md
----------------------------------------------------------------------
diff --git a/docs/ops/deployment/cluster_setup.md 
b/docs/ops/deployment/cluster_setup.md
new file mode 100644
index 0000000..c8a5d0b
--- /dev/null
+++ b/docs/ops/deployment/cluster_setup.md
@@ -0,0 +1,151 @@
+---
+title: "Standalone Cluster"
+nav-parent_id: deployment
+nav-pos: 1
+---
+<!--
+Licensed to the Apache Software Foundation (ASF) under one
+or more contributor license agreements.  See the NOTICE file
+distributed with this work for additional information
+regarding copyright ownership.  The ASF licenses this file
+to you under the Apache License, Version 2.0 (the
+"License"); you may not use this file except in compliance
+with the License.  You may obtain a copy of the License at
+
+  http://www.apache.org/licenses/LICENSE-2.0
+
+Unless required by applicable law or agreed to in writing,
+software distributed under the License is distributed on an
+"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+KIND, either express or implied.  See the License for the
+specific language governing permissions and limitations
+under the License.
+-->
+
+This page provides instructions on how to run Flink in a *fully distributed 
fashion* on a *static* (but possibly heterogeneous) cluster.
+
+* This will be replaced by the TOC
+{:toc}
+
+## Requirements
+
+### Software Requirements
+
+Flink runs on all *UNIX-like environments*, e.g. **Linux**, **Mac OS X**, and **Cygwin** (for Windows) and expects the cluster to consist of **one master node** and **one or more worker nodes**. Before you start to set up the system, make sure you have the following software installed **on each node**:
+
+- **Java 1.8.x** or higher,
+- **ssh** (sshd must be running to use the Flink scripts that manage
+  remote components)
+
+If your cluster does not fulfill these software requirements, you will need to install or upgrade the respective software.
+
+Having __passwordless SSH__ and
+__the same directory structure__ on all your cluster nodes will allow you to 
use our scripts to control
+everything.
+
+{% top %}
+
+### `JAVA_HOME` Configuration
+
+Flink requires the `JAVA_HOME` environment variable to be set on the master 
and all worker nodes and point to the directory of your Java installation.
+
+You can set this variable in `conf/flink-conf.yaml` via the `env.java.home` 
key.
+
+{% top %}
+
+## Flink Setup
+
+Go to the [downloads page]({{ site.download_url }}) and get the ready-to-run 
package. Make sure to pick the Flink package **matching your Hadoop version**. 
If you don't plan to use Hadoop, pick any version.
+
+After downloading the latest release, copy the archive to your master node and 
extract it:
+
+~~~bash
+tar xzf flink-*.tgz
+cd flink-*
+~~~
+
+### Configuring Flink
+
+After having extracted the system files, you need to configure Flink for the 
cluster by editing *conf/flink-conf.yaml*.
+
+Set the `jobmanager.rpc.address` key to point to your master node. You should 
also define the maximum amount of main memory the JVM is allowed to allocate on 
each node by setting the `jobmanager.heap.mb` and `taskmanager.heap.mb` keys.
+
+These values are given in MB. If some worker nodes have more main memory that you want to allocate to the Flink system, you can overwrite the default value by setting the environment variable `FLINK_TM_HEAP` on those specific nodes.
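+
+A *conf/flink-conf.yaml* sketch with placeholder values to adapt to your cluster:
+
+~~~yaml
+# Sketch only: placeholder address and heap sizes
+jobmanager.rpc.address: 10.0.0.1
+jobmanager.heap.mb: 1024
+taskmanager.heap.mb: 2048
+~~~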
+
+Finally, you must provide a list of all nodes in your cluster which shall be used as worker nodes. To do so, similar to the HDFS configuration, edit the file *conf/slaves* and enter the IP/host name of each worker node. Each worker node will later run a TaskManager.
+
+The following example illustrates the setup with three nodes (with IP 
addresses from _10.0.0.1_
+to _10.0.0.3_ and hostnames _master_, _worker1_, _worker2_) and shows the 
contents of the
+configuration files (which need to be accessible at the same path on all 
machines):
+
+<div class="row">
+  <div class="col-md-6 text-center">
+    <img src="{{ site.baseurl }}/page/img/quickstart_cluster.png" 
style="width: 60%">
+  </div>
+<div class="col-md-6">
+  <div class="row">
+    <p class="lead text-center">
+      /path/to/<strong>flink/conf/<br>flink-conf.yaml</strong>
+    <pre>jobmanager.rpc.address: 10.0.0.1</pre>
+    </p>
+  </div>
+<div class="row" style="margin-top: 1em;">
+  <p class="lead text-center">
+    /path/to/<strong>flink/<br>conf/slaves</strong>
+  <pre>
+10.0.0.2
+10.0.0.3</pre>
+  </p>
+</div>
+</div>
+</div>
+
+The Flink directory must be available on every worker under the same path. You 
can use a shared NFS directory, or copy the entire Flink directory to every 
worker node.
+
+Please see the [configuration page](../config.html) for details and additional 
configuration options.
+
+In particular,
+
+ * the amount of available memory per JobManager (`jobmanager.heap.mb`),
+ * the amount of available memory per TaskManager (`taskmanager.heap.mb`),
+ * the number of available CPUs per machine (`taskmanager.numberOfTaskSlots`),
+ * the total number of CPUs in the cluster (`parallelism.default`) and
+ * the temporary directories (`taskmanager.tmp.dirs`)
+
+are very important configuration values.
+
+{% top %}
+
+### Starting Flink
+
+The following script starts a JobManager on the local node and connects via SSH to all worker nodes listed in the *slaves* file to start the TaskManager on each node. After the script completes, your Flink system is up and running. The JobManager running on the local node will now accept jobs at the configured RPC port.
+
+Assuming that you are on the master node and inside the Flink directory:
+
+~~~bash
+bin/start-cluster.sh
+~~~
+
+To stop Flink, there is also a `stop-cluster.sh` script.
+
+{% top %}
+
+### Adding JobManager/TaskManager Instances to a Cluster
+
+You can add both JobManager and TaskManager instances to your running cluster 
with the `bin/jobmanager.sh` and `bin/taskmanager.sh` scripts.
+
+#### Adding a JobManager
+
+~~~bash
+bin/jobmanager.sh ((start|start-foreground) cluster)|stop|stop-all
+~~~
+
+#### Adding a TaskManager
+
+~~~bash
+bin/taskmanager.sh start|start-foreground|stop|stop-all
+~~~
+
+Make sure to call these scripts on the hosts on which you want to start/stop 
the respective instance.
+
+{% top %}
