[GitHub] flink pull request #2465: [FLINK-4447] [docs] Include NettyConfig options on...

2016-11-01 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/flink/pull/2465




[GitHub] flink pull request #2465: [FLINK-4447] [docs] Include NettyConfig options on...

2016-09-19 Thread StephanEwen
Github user StephanEwen commented on a diff in the pull request:

https://github.com/apache/flink/pull/2465#discussion_r79476468
  
--- Diff: docs/setup/config.md ---
@@ -169,58 +169,111 @@ Default value is the `akka.ask.timeout`.
 These parameters configure the default HDFS used by Flink. Setups that do 
not specify a HDFS configuration have to specify the full path to HDFS files 
(`hdfs://address:port/path/to/files`) Files will also be written with default 
HDFS parameters (block size, replication factor).
 
 - `fs.hdfs.hadoopconf`: The absolute path to the Hadoop configuration 
directory. The system will look for the "core-site.xml" and "hdfs-site.xml" 
files in that directory (DEFAULT: null).
+
 - `fs.hdfs.hdfsdefault`: The absolute path of Hadoop's own configuration 
file "hdfs-default.xml" (DEFAULT: null).
+
 - `fs.hdfs.hdfssite`: The absolute path of Hadoop's own configuration file 
"hdfs-site.xml" (DEFAULT: null).
 
 ### JobManager & TaskManager
 
 The following parameters configure Flink's JobManager and TaskManagers.
 
 - `jobmanager.rpc.address`: The IP address of the JobManager, which is the 
master/coordinator of the distributed system (DEFAULT: **localhost**).
+
 - `jobmanager.rpc.port`: The port number of the JobManager (DEFAULT: 
**6123**).
+
 - `taskmanager.hostname`: The hostname of the network interface that the 
TaskManager binds to. By default, the TaskManager searches for network 
interfaces that can connect to the JobManager and other TaskManagers. This 
option can be used to define a hostname if that strategy fails for some reason. 
Because different TaskManagers need different values for this option, it 
usually is specified in an additional non-shared TaskManager-specific config 
file.
+
 - `taskmanager.rpc.port`: The task manager's IPC port (DEFAULT: **0**, 
which lets the OS choose a free port).
+
 - `taskmanager.data.port`: The task manager's port used for data exchange 
operations (DEFAULT: **0**, which lets the OS choose a free port).
+
 - `jobmanager.heap.mb`: JVM heap size (in megabytes) for the JobManager 
(DEFAULT: **256**).
+
 - `taskmanager.heap.mb`: JVM heap size (in megabytes) for the 
TaskManagers, which are the parallel workers of the system. In contrast to 
Hadoop, Flink runs operators (e.g., join, aggregate) and user-defined functions 
(e.g., Map, Reduce, CoGroup) inside the TaskManager (including 
sorting/hashing/caching), so this value should be as large as possible 
(DEFAULT: **512**). On YARN setups, this value is automatically configured to 
the size of the TaskManager's YARN container, minus a certain tolerance value.
+
 - `taskmanager.numberOfTaskSlots`: The number of parallel operator or user 
function instances that a single TaskManager can run (DEFAULT: **1**). If this 
value is larger than 1, a single TaskManager takes multiple instances of a 
function or operator. That way, the TaskManager can utilize multiple CPU cores, 
but at the same time, the available memory is divided between the different 
operator or function instances. This value is typically proportional to the 
number of physical CPU cores that the TaskManager's machine has (e.g., equal to 
the number of cores, or half the number of cores).
+
 - `taskmanager.tmp.dirs`: The directory for temporary files, or a list of 
directories separated by the systems directory delimiter (for example ':' 
(colon) on Linux/Unix). If multiple directories are specified, then the 
temporary files will be distributed across the directories in a round robin 
fashion. The I/O manager component will spawn one reading and one writing 
thread per directory. A directory may be listed multiple times to have the I/O 
manager use multiple threads for it (for example if it is physically stored on 
a very fast disc or RAID) (DEFAULT: **The system's tmp dir**).
+
 - `taskmanager.network.numberOfBuffers`: The number of buffers available 
to the network stack. This number determines how many streaming data exchange 
channels a TaskManager can have at the same time and how well buffered the 
channels are. If a job is rejected or you get a warning that the system has not 
enough buffers available, increase this value (DEFAULT: **2048**).
+
 - `taskmanager.memory.size`: The amount of memory (in megabytes) that the 
task manager reserves on the JVM's heap space for sorting, hash tables, and 
caching of intermediate results. If unspecified (-1), the memory manager will 
take a fixed ratio of the heap memory available to the JVM, as specified by 
`taskmanager.memory.fraction`. (DEFAULT: **-1**)
+
 - `taskmanager.memory.fraction`: The relative amount of memory that the 
task manager reserves for sorting, hash tables, and caching of intermediate 
results. For example, a value of 0.8 means that TaskManagers reserve 80% of the 
JVM's heap space for internal data buffers, leaving 20% o

[GitHub] flink pull request #2465: [FLINK-4447] [docs] Include NettyConfig options on...

2016-09-19 Thread StephanEwen
Github user StephanEwen commented on a diff in the pull request:

https://github.com/apache/flink/pull/2465#discussion_r79476351
  
--- Diff: docs/setup/config.md ---
@@ -169,58 +169,111 @@ Default value is the `akka.ask.timeout`.

[GitHub] flink pull request #2465: [FLINK-4447] [docs] Include NettyConfig options on...

2016-09-19 Thread StephanEwen
Github user StephanEwen commented on a diff in the pull request:

https://github.com/apache/flink/pull/2465#discussion_r79474627
  
--- Diff: docs/setup/config.md ---
@@ -169,58 +169,111 @@ Default value is the `akka.ask.timeout`.

[GitHub] flink pull request #2465: [FLINK-4447] [docs] Include NettyConfig options on...

2016-09-02 Thread greghogan
Github user greghogan commented on a diff in the pull request:

https://github.com/apache/flink/pull/2465#discussion_r77364407
  
--- Diff: docs/setup/config.md ---
@@ -169,58 +169,111 @@ Default value is the `akka.ask.timeout`.

[GitHub] flink pull request #2465: [FLINK-4447] [docs] Include NettyConfig options on...

2016-09-02 Thread greghogan
Github user greghogan commented on a diff in the pull request:

https://github.com/apache/flink/pull/2465#discussion_r77363847
  
--- Diff: docs/setup/config.md ---
@@ -169,58 +169,111 @@ Default value is the `akka.ask.timeout`.

[GitHub] flink pull request #2465: [FLINK-4447] [docs] Include NettyConfig options on...

2016-09-02 Thread greghogan
Github user greghogan commented on a diff in the pull request:

https://github.com/apache/flink/pull/2465#discussion_r77363711
  
--- Diff: docs/setup/config.md ---
@@ -169,58 +169,111 @@ Default value is the `akka.ask.timeout`.

[GitHub] flink pull request #2465: [FLINK-4447] [docs] Include NettyConfig options on...

2016-09-02 Thread greghogan
Github user greghogan commented on a diff in the pull request:

https://github.com/apache/flink/pull/2465#discussion_r77363499
  
--- Diff: docs/setup/config.md ---
@@ -169,58 +169,111 @@ Default value is the `akka.ask.timeout`.
 These parameters configure the default HDFS used by Flink. Setups that do 
not specify a HDFS configuration have to specify the full path to HDFS files 
(`hdfs://address:port/path/to/files`) Files will also be written with default 
HDFS parameters (block size, replication factor).
 
 - `fs.hdfs.hadoopconf`: The absolute path to the Hadoop configuration 
directory. The system will look for the "core-site.xml" and "hdfs-site.xml" 
files in that directory (DEFAULT: null).
+
 - `fs.hdfs.hdfsdefault`: The absolute path of Hadoop's own configuration 
file "hdfs-default.xml" (DEFAULT: null).
+
 - `fs.hdfs.hdfssite`: The absolute path of Hadoop's own configuration file 
"hdfs-site.xml" (DEFAULT: null).
 
 ### JobManager & TaskManager
 
 The following parameters configure Flink's JobManager and TaskManagers.
 
 - `jobmanager.rpc.address`: The IP address of the JobManager, which is the 
master/coordinator of the distributed system (DEFAULT: **localhost**).
+
 - `jobmanager.rpc.port`: The port number of the JobManager (DEFAULT: 
**6123**).
+
 - `taskmanager.hostname`: The hostname of the network interface that the 
TaskManager binds to. By default, the TaskManager searches for network 
interfaces that can connect to the JobManager and other TaskManagers. This 
option can be used to define a hostname if that strategy fails for some reason. 
Because different TaskManagers need different values for this option, it 
is usually specified in an additional non-shared TaskManager-specific config 
file.
+
 - `taskmanager.rpc.port`: The task manager's IPC port (DEFAULT: **0**, 
which lets the OS choose a free port).
+
 - `taskmanager.data.port`: The task manager's port used for data exchange 
operations (DEFAULT: **0**, which lets the OS choose a free port).
+
 - `jobmanager.heap.mb`: JVM heap size (in megabytes) for the JobManager 
(DEFAULT: **256**).
+
 - `taskmanager.heap.mb`: JVM heap size (in megabytes) for the 
TaskManagers, which are the parallel workers of the system. In contrast to 
Hadoop, Flink runs operators (e.g., join, aggregate) and user-defined functions 
(e.g., Map, Reduce, CoGroup) inside the TaskManager (including 
sorting/hashing/caching), so this value should be as large as possible 
(DEFAULT: **512**). On YARN setups, this value is automatically configured to 
the size of the TaskManager's YARN container, minus a certain tolerance value.
+
 - `taskmanager.numberOfTaskSlots`: The number of parallel operator or user 
function instances that a single TaskManager can run (DEFAULT: **1**). If this 
value is larger than 1, a single TaskManager takes multiple instances of a 
function or operator. That way, the TaskManager can utilize multiple CPU cores, 
but at the same time, the available memory is divided between the different 
operator or function instances. This value is typically proportional to the 
number of physical CPU cores that the TaskManager's machine has (e.g., equal to 
the number of cores, or half the number of cores).
+
 - `taskmanager.tmp.dirs`: The directory for temporary files, or a list of 
directories separated by the system's directory delimiter (for example ':' 
(colon) on Linux/Unix). If multiple directories are specified, then the 
temporary files will be distributed across the directories in a round-robin 
fashion. The I/O manager component will spawn one reading and one writing 
thread per directory. A directory may be listed multiple times to have the I/O 
manager use multiple threads for it (for example if it is physically stored on 
a very fast disk or RAID) (DEFAULT: **The system's tmp dir**).
+
 - `taskmanager.network.numberOfBuffers`: The number of buffers available 
to the network stack. This number determines how many streaming data exchange 
channels a TaskManager can have at the same time and how well buffered the 
channels are. If a job is rejected or you get a warning that the system does 
not have enough buffers available, increase this value (DEFAULT: **2048**).
+
 - `taskmanager.memory.size`: The amount of memory (in megabytes) that the 
task manager reserves on the JVM's heap space for sorting, hash tables, and 
caching of intermediate results. If unspecified (-1), the memory manager will 
take a fixed ratio of the heap memory available to the JVM, as specified by 
`taskmanager.memory.fraction`. (DEFAULT: **-1**)
+
 - `taskmanager.memory.fraction`: The relative amount of memory that the 
task manager reserves for sorting, hash tables, and caching of intermediate 
results. For example, a value of 0.8 means that TaskManagers reserve 80% of the 
JVM's heap space for internal data buffers, leaving 20% of 

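The interplay between `taskmanager.memory.size` and `taskmanager.memory.fraction` described in the quoted diff can be sketched as follows. This is a simplified illustration, not Flink's actual implementation: the function name `managed_memory_mb`, its parameters, and the range check on the fraction are assumptions made for this example, and the sketch applies the fraction to the whole heap without accounting for memory already in use.

```python
from typing import Optional


def managed_memory_mb(heap_mb: float,
                      memory_size_mb: float = -1,
                      memory_fraction: Optional[float] = None) -> float:
    """Hypothetical helper: megabytes reserved for sorting, hash tables, and caching."""
    if memory_size_mb != -1:
        # An explicit taskmanager.memory.size takes precedence over the fraction.
        return memory_size_mb
    # Assumed validation: the fraction must be given and lie strictly between 0 and 1.
    if memory_fraction is None or not 0.0 < memory_fraction < 1.0:
        raise ValueError("memory_fraction must be in (0, 1) when memory_size_mb is -1")
    # Size is unspecified (-1): reserve a fixed ratio of the heap instead.
    return heap_mb * memory_fraction
```

For instance, `managed_memory_mb(512, memory_fraction=0.8)` reserves 80% of a 512 MB heap (about 409.6 MB), while `managed_memory_mb(512, memory_size_mb=300)` returns the explicit 300 MB regardless of the fraction.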
[GitHub] flink pull request #2465: [FLINK-4447] [docs] Include NettyConfig options on...

2016-09-02 Thread greghogan
GitHub user greghogan opened a pull request:

https://github.com/apache/flink/pull/2465

[FLINK-4447] [docs] Include NettyConfig options on Configurations page



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/greghogan/flink 4447_include_nettyconfig_options_on_configurations_page

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/flink/pull/2465.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #2465


commit 9417a1046be1653a1d5c9109cb4d314375693da6
Author: Greg Hogan 
Date:   2016-09-02T15:10:55Z

[FLINK-4447] [docs] Include NettyConfig options on Configurations page



