Added: aurora/site/publish/documentation/0.18.1/reference/configuration-tutorial/index.html URL: http://svn.apache.org/viewvc/aurora/site/publish/documentation/0.18.1/reference/configuration-tutorial/index.html?rev=1813982&view=auto ============================================================================== --- aurora/site/publish/documentation/0.18.1/reference/configuration-tutorial/index.html (added) +++ aurora/site/publish/documentation/0.18.1/reference/configuration-tutorial/index.html Wed Nov 1 18:39:52 2017 @@ -0,0 +1,674 @@ +<!DOCTYPE html> +<html lang="en"> + <head> + <meta charset="utf-8"> + <meta name="viewport" content="width=device-width, initial-scale=1"> + <title>Apache Aurora</title> + <link rel="stylesheet" href="https://maxcdn.bootstrapcdn.com/bootstrap/3.3.1/css/bootstrap.min.css"> + <link href="/assets/css/main.css" rel="stylesheet"> + <!-- Analytics --> + <script type="text/javascript"> + var _gaq = _gaq || []; + _gaq.push(['_setAccount', 'UA-45879646-1']); + _gaq.push(['_setDomainName', 'apache.org']); + _gaq.push(['_trackPageview']); + + (function() { + var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true; + ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + '.google-analytics.com/ga.js'; + var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s); + })(); + </script> + </head> + <body> + <div class="container-fluid section-header"> + <div class="container"> + <div class="nav nav-bar"> + <a href="/"><img src="/assets/img/aurora_logo_dkbkg.svg" width="300" alt="Transparent Apache Aurora logo with dark background"/></a> + <ul class="nav navbar-nav navbar-right"> + <li><a href="/documentation/latest/">Documentation</a></li> + <li><a href="/community/">Community</a></li> + <li><a href="/downloads/">Downloads</a></li> + <li><a href="/blog/">Blog</a></li> + </ul> + </div> + </div> +</div> + + <div class="container-fluid"> + <div class="container content"> + <div class="col-md-12 documentation"> +<h5 class="page-header text-uppercase">Documentation +<select onChange="window.location.href='/documentation/' + this.value + '/reference/configuration-tutorial/'" + value="0.18.1"> + <option value="0.18.1" + selected="selected"> + 0.18.1 + (latest) + </option> + <option value="0.18.0" + > + 0.18.0 + </option> + <option value="0.17.0" + > + 0.17.0 + </option> + <option value="0.16.0" + > + 0.16.0 + </option> + <option value="0.15.0" + > + 0.15.0 + </option> + <option value="0.14.0" + > + 0.14.0 + </option> + <option value="0.13.0" + > + 0.13.0 + </option> + <option value="0.12.0" + > + 0.12.0 + </option> + <option value="0.11.0" + > + 0.11.0 + </option> + <option value="0.10.0" + > + 0.10.0 + </option> + <option value="0.9.0" + > + 0.9.0 + </option> + <option value="0.8.0" + > + 0.8.0 + </option> + <option value="0.7.0-incubating" + > + 0.7.0-incubating + </option> + <option value="0.6.0-incubating" + > + 0.6.0-incubating + </option> + <option value="0.5.0-incubating" + > + 0.5.0-incubating + </option> +</select> +</h5> +<h1 id="aurora-configuration-tutorial">Aurora Configuration Tutorial</h1> + +<p>How to write Aurora configuration files, including feature descriptions +and best practices. When writing a configuration file, make use of +<code>aurora job inspect</code>. It takes the same job key and configuration file +arguments as <code>aurora job create</code> or <code>aurora update start</code>. It first ensures the +configuration parses, then outputs it in human-readable form.</p> + +<p>You should read this after going through the general <a href="../../getting-started/tutorial/">Aurora Tutorial</a>.</p> + +<ul> +<li><a href="#the-basics">The Basics</a> + +<ul> +<li><a href="#use-bottom-to-top-object-ordering">Use Bottom-To-Top Object Ordering</a></li> +</ul></li> +<li><a href="#an-example-configuration-file">An Example Configuration File</a></li> +<li><a href="#defining-process-objects">Defining Process Objects</a></li> +<li><a href="#getting-your-code-into-the-sandbox">Getting Your Code Into The Sandbox</a></li> +<li><a href="#defining-task-objects">Defining Task Objects</a> + +<ul> +<li><a href="#sequentialtask-running-processes-in-parallel-or-sequentially">SequentialTask: Running Processes in Parallel or Sequentially</a></li> +<li><a href="#simpletask">SimpleTask</a></li> +<li><a href="#combining-tasks">Combining tasks</a></li> +</ul></li> +<li><a href="#defining-job-objects">Defining Job Objects</a></li> +<li><a href="#the-jobs-list">The jobs List</a></li> +<li><a href="#basic-examples">Basic Examples</a></li> +</ul> + +<h2 id="the-basics">The Basics</h2> + +<p>To run a job on Aurora, you must specify a configuration file that tells +Aurora what it needs to know to schedule the job, what Mesos needs to +run the tasks the job is made up of, and what Thermos needs to run the +processes that make up the tasks. This file must have +a<code>.aurora</code> suffix.</p> + +<p>A configuration file defines a collection of objects, along with parameter +values for their attributes. An Aurora configuration file contains the +following three types of objects:</p> + +<ul> +<li>Job</li> +<li>Task</li> +<li>Process</li> +</ul> + +<p>A configuration also specifies a list of <code>Job</code> objects assigned +to the variable <code>jobs</code>.</p> + +<ul> +<li>jobs (list of defined Jobs to run)</li> +</ul> + +<p>The <code>.aurora</code> file format is just Python. However, <code>Job</code>, <code>Task</code>, +<code>Process</code>, and other classes are defined by a type-checked dictionary +templating library called <em>Pystachio</em>, a powerful tool for +configuration specification and reuse. Pystachio objects are tailored +via {{}} surrounded templates.</p> + +<p>When writing your <code>.aurora</code> file, you may use any Pystachio datatypes, as +well as any objects shown in the <a href="../configuration/"><em>Aurora Configuration +Reference</em></a>, without <code>import</code> statements - the +Aurora config loader injects them automatically. Other than that, an <code>.aurora</code> +file works like any other Python script.</p> + +<p><a href="../configuration/"><em>Aurora Configuration Reference</em></a> +has a full reference of all Aurora/Thermos defined Pystachio objects.</p> + +<h3 id="use-bottom-to-top-object-ordering">Use Bottom-To-Top Object Ordering</h3> + +<p>A well-structured configuration starts with structural templates (if +any). Structural templates encapsulate in their attributes all the +differences between Jobs in the configuration that are not directly +manipulated at the <code>Job</code> level, but typically at the <code>Process</code> or <code>Task</code> +level. For example, if certain processes are invoked with slightly +different settings or input.</p> + +<p>After structural templates, define, in order, <code>Process</code>es, <code>Task</code>s, and +<code>Job</code>s.</p> + +<p>Structural template names should be <em>UpperCamelCased</em> and their +instantiations are typically <em>UPPER_SNAKE_CASED</em>. <code>Process</code>, <code>Task</code>, +and <code>Job</code> names are typically <em>lower_snake_cased</em>. Indentation is typically 2 +spaces.</p> + +<h2 id="an-example-configuration-file">An Example Configuration File</h2> + +<p>The following is a typical configuration file. Don’t worry if there are +parts you don’t understand yet, but you may want to refer back to this +as you read about its individual parts. Note that names surrounded by +curly braces {{}} are template variables, which the system replaces with +bound values for the variables.</p> +<pre class="highlight plaintext"><code># --- templates here --- +class Profile(Struct): + package_version = Default(String, 'live') + java_binary = Default(String, '/usr/lib/jvm/java-1.7.0-openjdk/bin/java') + extra_jvm_options = Default(String, '') + parent_environment = Default(String, 'prod') + parent_serverset = Default(String, + '/foocorp/service/bird/{{parent_environment}}/bird') + +# --- processes here --- +main = Process( + name = 'application', + cmdline = '{{profile.java_binary}} -server -Xmx1792m ' + '{{profile.extra_jvm_options}} ' + '-jar application.jar ' + '-upstreamService {{profile.parent_serverset}}' +) + +# --- tasks --- +base_task = SequentialTask( + name = 'application', + processes = [ + Process( + name = 'fetch', + cmdline = 'curl -O + https://packages.foocorp.com/{{profile.package_version}}/application.jar'), + ] +) + + # not always necessary but often useful to have separate task + # resource classes + staging_task = base_task(resources = + Resources(cpu = 1.0, + ram = 2048*MB, + disk = 1*GB)) +production_task = base_task(resources = + Resources(cpu = 4.0, + ram = 2560*MB, + disk = 10*GB)) + +# --- job template --- +job_template = Job( + name = 'application', + role = 'myteam', + contact = 'myteam-t...@foocorp.com', + instances = 20, + service = True, + task = production_task +) + +# -- profile instantiations (if any) --- +PRODUCTION = Profile() +STAGING = Profile( + extra_jvm_options = '-Xloggc:gc.log', + parent_environment = 'staging' +) + +# -- job instantiations -- +jobs = [ + job_template(cluster = 'cluster1', environment = 'prod') + .bind(profile = PRODUCTION), + + job_template(cluster = 'cluster2', environment = 'prod') + .bind(profile = PRODUCTION), + + job_template(cluster = 'cluster1', + environment = 'staging', + service = False, + task = staging_task, + instances = 2) + .bind(profile = STAGING), +] +</code></pre> + +<h2 id="defining-process-objects">Defining Process Objects</h2> + +<p>Processes are handled by the Thermos system. A process is a single +executable step run as a part of an Aurora task, which consists of a +bash-executable statement.</p> + +<p>The key (and required) <code>Process</code> attributes are:</p> + +<ul> +<li> <code>name</code>: Any string which is a valid Unix filename (no slashes, +NULLs, or leading periods). The <code>name</code> value must be unique relative +to other Processes in a <code>Task</code>.</li> +<li> <code>cmdline</code>: A command line run in a bash subshell, so you can use +bash scripts. Nothing is supplied for command-line arguments, +so <code>$*</code> is unspecified.</li> +</ul> + +<p>Many tiny processes make managing configurations more difficult. For +example, the following is a bad way to define processes.</p> +<pre class="highlight plaintext"><code>copy = Process( + name = 'copy', + cmdline = 'curl -O https://packages.foocorp.com/app.zip' +) +unpack = Process( + name = 'unpack', + cmdline = 'unzip app.zip' +) +remove = Process( + name = 'remove', + cmdline = 'rm -f app.zip' +) +run = Process( + name = 'app', + cmdline = 'java -jar app.jar' +) +run_task = Task( + processes = [copy, unpack, remove, run], + constraints = order(copy, unpack, remove, run) +) +</code></pre> + +<p>Since <code>cmdline</code> runs in a bash subshell, you can chain commands +with <code>&&</code> or <code>||</code>.</p> + +<p>When defining a <code>Task</code> that is just a list of Processes run in a +particular order, use <code>SequentialTask</code>, as described in the <a href="#Task"><em>Defining</em> +<code>Task</code> <em>Objects</em></a> section. The following simplifies and combines the +above multiple <code>Process</code> definitions into just two.</p> +<pre class="highlight plaintext"><code>stage = Process( + name = 'stage', + cmdline = 'curl -O https://packages.foocorp.com/app.zip && ' + 'unzip app.zip && rm -f app.zip') + +run = Process(name = 'app', cmdline = 'java -jar app.jar') + +run_task = SequentialTask(processes = [stage, run]) +</code></pre> + +<p><code>Process</code> also has optional attributes to customize its behaviour. Details can be found in the <a href="../configuration/#process-objects">Aurora Configuration Reference</a>.</p> + +<h2 id="getting-your-code-into-the-sandbox">Getting Your Code Into The Sandbox</h2> + +<p>When using Aurora, you need to get your executable code into its “sandbox”, specifically +the Task sandbox where the code executes for the Processes that make up that Task.</p> + +<p>Each Task has a sandbox created when the Task starts and garbage +collected when it finishes. All of a Task’s processes run in its +sandbox, so processes can share state by using a shared current +working directory.</p> + +<p>Typically, you save this code somewhere. You then need to define a Process +in your <code>.aurora</code> configuration file that fetches the code from that somewhere +to where the agent can see it. For a public cloud, that can be anywhere public on +the Internet, such as S3. For a private cloud internal storage, you need to put in +on an accessible HDFS cluster or similar storage.</p> + +<p>The template for this Process is:</p> +<pre class="highlight plaintext"><code><name> = Process( + name = '<name>' + cmdline = '<command to copy and extract code archive into current working directory>' +) +</code></pre> + +<p>Note: Be sure the extracted code archive has an executable.</p> + +<h2 id="getting-environment-variables-into-the-sandbox">Getting Environment Variables Into The Sandbox</h2> + +<p>Every time a process is forked the Thermos executor checks for the existence of the +<code>.thermos_profile</code> file, if the <code>.thermos_profile</code> file exists it will be sourced. +You can utilize this process to pass environment variables to the sandbox.</p> + +<p>An example for this Process is:</p> +<pre class="highlight plaintext"><code>setup_env = Process( + name = 'setup' + cmdline = '''cat <<EOF > .thermos_profile + export RESULT=hello + EOF''' +) + +read_env = Process( + name = 'read' + cmdline = 'echo $RESULT' +) +</code></pre> + +<h2 id="defining-task-objects">Defining Task Objects</h2> + +<p>Tasks are handled by Mesos. A task is a collection of processes that +runs in a shared sandbox. It’s the fundamental unit Aurora uses to +schedule the datacenter; essentially what Aurora does is find places +in the cluster to run tasks.</p> + +<p>The key (and required) parts of a Task are:</p> + +<ul> +<li><p><code>name</code>: A string giving the Task’s name. By default, if a Task is +not given a name, it inherits the first name in its Process list.</p></li> +<li><p><code>processes</code>: An unordered list of Process objects bound to the Task. +The value of the optional <code>constraints</code> attribute affects the +contents as a whole. Currently, the only constraint, <code>order</code>, determines if +the processes run in parallel or sequentially.</p></li> +<li><p><code>resources</code>: A <code>Resource</code> object defining the Task’s resource + footprint. A <code>Resource</code> object has three attributes: + - <code>cpu</code>: A Float, the fractional number of cores the Task + requires. + - <code>ram</code>: An Integer, RAM bytes the Task requires. + - <code>disk</code>: An integer, disk bytes the Task requires.</p></li> +</ul> + +<p>A basic Task definition looks like:</p> +<pre class="highlight plaintext"><code>Task( + name="hello_world", + processes=[Process(name = "hello_world", cmdline = "echo hello world")], + resources=Resources(cpu = 1.0, + ram = 1*GB, + disk = 1*GB)) +</code></pre> + +<p>A Task has optional attributes to customize its behaviour. Details can be found in the <a href="../configuration/#task-object">Aurora Configuration Reference</a></p> + +<h3 id="sequentialtask-running-processes-in-parallel-or-sequentially">SequentialTask: Running Processes in Parallel or Sequentially</h3> + +<p>By default, a Task with several Processes runs them in parallel. There +are two ways to run Processes sequentially:</p> + +<ul> +<li><p>Include an <code>order</code> constraint in the Task definition’s <code>constraints</code> +attribute whose arguments specify the processes’ run order:</p> +<pre class="highlight plaintext"><code>Task( ... processes=[process1, process2, process3], + constraints = order(process1, process2, process3), ...) +</code></pre></li> +<li><p>Use <code>SequentialTask</code> instead of <code>Task</code>; it automatically runs +processes in the order specified in the <code>processes</code> attribute. No +<code>constraint</code> parameter is needed:</p> +<pre class="highlight plaintext"><code>SequentialTask( ... processes=[process1, process2, process3] ...) +</code></pre></li> +</ul> + +<h3 id="simpletask">SimpleTask</h3> + +<p>For quickly creating simple tasks, use the <code>SimpleTask</code> helper. It +creates a basic task from a provided name and command line using a +default set of resources. For example, in a .<code>aurora</code> configuration +file:</p> +<pre class="highlight plaintext"><code>SimpleTask(name="hello_world", command="echo hello world") +</code></pre> + +<p>is equivalent to</p> +<pre class="highlight plaintext"><code>Task(name="hello_world", + processes=[Process(name = "hello_world", cmdline = "echo hello world")], + resources=Resources(cpu = 1.0, + ram = 1*GB, + disk = 1*GB)) +</code></pre> + +<p>The simplest idiomatic Job configuration thus becomes:</p> +<pre class="highlight plaintext"><code>import os +hello_world_job = Job( + task=SimpleTask(name="hello_world", command="echo hello world"), + role=os.getenv('USER'), + cluster="cluster1") +</code></pre> + +<p>When written to <code>hello_world.aurora</code>, you invoke it with a simple +<code>aurora job create cluster1/$USER/test/hello_world hello_world.aurora</code>.</p> + +<h3 id="combining-tasks">Combining tasks</h3> + +<p><code>Tasks.concat</code>(synonym,<code>concat_tasks</code>) and +<code>Tasks.combine</code>(synonym,<code>combine_tasks</code>) merge multiple Task definitions +into a single Task. It may be easier to define complex Jobs +as smaller constituent Tasks. But since a Job only includes a single +Task, the subtasks must be combined before using them in a Job. +Smaller Tasks can also be reused between Jobs, instead of having to +repeat their definition for multiple Jobs.</p> + +<p>With both methods, the merged Task takes the first Task’s name. The +difference between the two is the result Task’s process ordering.</p> + +<ul> +<li><p><code>Tasks.combine</code> runs its subtasks’ processes in no particular order. +The new Task’s resource consumption is the sum of all its subtasks’ +consumption.</p></li> +<li><p><code>Tasks.concat</code> runs its subtasks in the order supplied, with each +subtask’s processes run serially between tasks. It is analogous to +the <code>order</code> constraint helper, except at the Task level instead of +the Process level. The new Task’s resource consumption is the +maximum value specified by any subtask for each Resource attribute +(cpu, ram and disk).</p></li> +</ul> + +<p>For example, given the following:</p> +<pre class="highlight plaintext"><code>setup_task = Task( + ... + processes=[download_interpreter, update_zookeeper], + # It is important to note that {{Tasks.concat}} has + # no effect on the ordering of the processes within a task; + # hence the necessity of the {{order}} statement below + # (otherwise, the order in which {{download_interpreter}} + # and {{update_zookeeper}} run will be non-deterministic) + constraints=order(download_interpreter, update_zookeeper), + ... +) + +run_task = SequentialTask( + ... + processes=[download_application, start_application], + ... +) + +combined_task = Tasks.concat(setup_task, run_task) +</code></pre> + +<p>The <code>Tasks.concat</code> command merges the two Tasks into a single Task and +ensures all processes in <code>setup_task</code> run before the processes +in <code>run_task</code>. Conceptually, the task is reduced to:</p> +<pre class="highlight plaintext"><code>task = Task( + ... + processes=[download_interpreter, update_zookeeper, + download_application, start_application], + constraints=order(download_interpreter, update_zookeeper, + download_application, start_application), + ... +) +</code></pre> + +<p>In the case of <code>Tasks.combine</code>, the two schedules run in parallel:</p> +<pre class="highlight plaintext"><code>task = Task( + ... + processes=[download_interpreter, update_zookeeper, + download_application, start_application], + constraints=order(download_interpreter, update_zookeeper) + + order(download_application, start_application), + ... +) +</code></pre> + +<p>In the latter case, each of the two sequences may operate in parallel. +Of course, this may not be the intended behavior (for example, if +the <code>start_application</code> Process implicitly relies +upon <code>download_interpreter</code>). Make sure you understand the difference +between using one or the other.</p> + +<h2 id="defining-job-objects">Defining Job Objects</h2> + +<p>A job is a group of identical tasks that Aurora can run in a Mesos cluster.</p> + +<p>A <code>Job</code> object is defined by the values of several attributes, some +required and some optional. The required attributes are:</p> + +<ul> +<li><p><code>task</code>: Task object to bind to this job. Note that a Job can +only take a single Task.</p></li> +<li><p><code>role</code>: Job’s role account; in other words, the user account to run +the job as on a Mesos cluster machine. A common value is +<code>os.getenv('USER')</code>; using a Python command to get the user who +submits the job request. The other common value is the service +account that runs the job, e.g. <code>www-data</code>.</p></li> +<li><p><code>environment</code>: Job’s environment, typical values +are <code>devel</code>, <code>test</code>, or <code>prod</code>.</p></li> +<li><p><code>cluster</code>: Aurora cluster to schedule the job in, defined in +<code>/etc/aurora/clusters.json</code> or <code>~/.clusters.json</code>. You can specify +jobs where the only difference is the <code>cluster</code>, then at run time +only run the Job whose job key includes your desired cluster’s name.</p></li> +</ul> + +<p>You usually see a <code>name</code> parameter. By default, <code>name</code> inherits its +value from the Job’s associated Task object, but you can override this +default. For these four parameters, a Job definition might look like:</p> +<pre class="highlight plaintext"><code>foo_job = Job( name = 'foo', cluster = 'cluster1', + role = os.getenv('USER'), environment = 'prod', + task = foo_task) +</code></pre> + +<p>In addition to the required attributes, there are several optional +attributes. Details can be found in the <a href="../configuration/#job-objects">Aurora Configuration Reference</a>.</p> + +<h2 id="the-jobs-list">The jobs List</h2> + +<p>At the end of your <code>.aurora</code> file, you need to specify a list of the +file’s defined Jobs. For example, the following exports the jobs <code>job1</code>, +<code>job2</code>, and <code>job3</code>.</p> +<pre class="highlight plaintext"><code>jobs = [job1, job2, job3] +</code></pre> + +<p>This allows the aurora client to invoke commands on those jobs, such as +starting, updating, or killing them.</p> + +<h1 id="basic-examples">Basic Examples</h1> + +<p>These are provided to give a basic understanding of simple Aurora jobs.</p> + +<h3 id="hello_world-aurora">hello_world.aurora</h3> + +<p>Put the following in a file named <code>hello_world.aurora</code>, substituting your own values +for values such as <code>cluster</code>s.</p> +<pre class="highlight plaintext"><code>import os +hello_world_process = Process(name = 'hello_world', cmdline = 'echo hello world') + +hello_world_task = Task( + resources = Resources(cpu = 0.1, ram = 16 * MB, disk = 16 * MB), + processes = [hello_world_process]) + +hello_world_job = Job( + cluster = 'cluster1', + role = os.getenv('USER'), + task = hello_world_task) + +jobs = [hello_world_job] +</code></pre> + +<p>Then issue the following commands to create and kill the job, using your own values for the job key.</p> +<pre class="highlight plaintext"><code>aurora job create cluster1/$USER/test/hello_world hello_world.aurora + +aurora job kill cluster1/$USER/test/hello_world +</code></pre> + +<h3 id="environment-tailoring">Environment Tailoring</h3> + +<p>Put the following in a file named <code>hello_world_productionized.aurora</code>, substituting your own values +for values such as <code>cluster</code>s.</p> +<pre class="highlight plaintext"><code>include('hello_world.aurora') + +production_resources = Resources(cpu = 1.0, ram = 512 * MB, disk = 2 * GB) +staging_resources = Resources(cpu = 0.1, ram = 32 * MB, disk = 512 * MB) +hello_world_template = hello_world( + name = "hello_world-{{cluster}}" + task = hello_world(resources=production_resources)) + +jobs = [ + # production jobs + hello_world_template(cluster = 'cluster1', instances = 25), + hello_world_template(cluster = 'cluster2', instances = 15), + + # staging jobs + hello_world_template( + cluster = 'local', + instances = 1, + task = hello_world(resources=staging_resources)), +] +</code></pre> + +<p>Then issue the following commands to create and kill the job, using your own values for the job key</p> +<pre class="highlight plaintext"><code>aurora job create cluster1/$USER/test/hello_world-cluster1 hello_world_productionized.aurora + +aurora job kill cluster1/$USER/test/hello_world-cluster1 +</code></pre> + +</div> + + </div> + </div> + <div class="container-fluid section-footer buffer"> + <div class="container"> + <div class="row"> + <div class="col-md-2 col-md-offset-1"><h3>Quick Links</h3> + <ul> + <li><a href="/downloads/">Downloads</a></li> + <li><a href="/community/">Mailing Lists</a></li> + <li><a href="http://issues.apache.org/jira/browse/AURORA">Issue Tracking</a></li> + <li><a href="/documentation/latest/contributing/">How To Contribute</a></li> + </ul> + </div> + <div class="col-md-2"><h3>The ASF</h3> + <ul> + <li><a href="http://www.apache.org/licenses/">License</a></li> + <li><a href="http://www.apache.org/foundation/sponsorship.html">Sponsorship</a></li> + <li><a href="http://www.apache.org/foundation/thanks.html">Thanks</a></li> + <li><a href="http://www.apache.org/security/">Security</a></li> + </ul> + </div> + <div class="col-md-6"> + <p class="disclaimer">© 2014-2017 <a href="http://www.apache.org/">Apache Software Foundation</a>. Licensed under the <a href="http://www.apache.org/licenses/">Apache License v2.0</a>. The <a href="https://www.flickr.com/photos/trondk/12706051375/">Aurora Borealis IX photo</a> displayed on the homepage is available under a <a href="https://creativecommons.org/licenses/by-nc-nd/2.0/">Creative Commons BY-NC-ND 2.0 license</a>. Apache, Apache Aurora, and the Apache feather logo are trademarks of The Apache Software Foundation.</p> + </div> + </div> + </div> + + </body> +</html>
Added: aurora/site/publish/documentation/0.18.1/reference/configuration/index.html URL: http://svn.apache.org/viewvc/aurora/site/publish/documentation/0.18.1/reference/configuration/index.html?rev=1813982&view=auto ============================================================================== --- aurora/site/publish/documentation/0.18.1/reference/configuration/index.html (added) +++ aurora/site/publish/documentation/0.18.1/reference/configuration/index.html Wed Nov 1 18:39:52 2017 @@ -0,0 +1,1274 @@ +<!DOCTYPE html> +<html lang="en"> + <head> + <meta charset="utf-8"> + <meta name="viewport" content="width=device-width, initial-scale=1"> + <title>Apache Aurora</title> + <link rel="stylesheet" href="https://maxcdn.bootstrapcdn.com/bootstrap/3.3.1/css/bootstrap.min.css"> + <link href="/assets/css/main.css" rel="stylesheet"> + <!-- Analytics --> + <script type="text/javascript"> + var _gaq = _gaq || []; + _gaq.push(['_setAccount', 'UA-45879646-1']); + _gaq.push(['_setDomainName', 'apache.org']); + _gaq.push(['_trackPageview']); + + (function() { + var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true; + ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + '.google-analytics.com/ga.js'; + var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s); + })(); + </script> + </head> + <body> + <div class="container-fluid section-header"> + <div class="container"> + <div class="nav nav-bar"> + <a href="/"><img src="/assets/img/aurora_logo_dkbkg.svg" width="300" alt="Transparent Apache Aurora logo with dark background"/></a> + <ul class="nav navbar-nav navbar-right"> + <li><a href="/documentation/latest/">Documentation</a></li> + <li><a href="/community/">Community</a></li> + <li><a href="/downloads/">Downloads</a></li> + <li><a href="/blog/">Blog</a></li> + </ul> + </div> + </div> +</div> + + <div class="container-fluid"> + <div class="container content"> + <div class="col-md-12 documentation"> +<h5 class="page-header text-uppercase">Documentation +<select onChange="window.location.href='/documentation/' + this.value + '/reference/configuration/'" + value="0.18.1"> + <option value="0.18.1" + selected="selected"> + 0.18.1 + (latest) + </option> + <option value="0.18.0" + > + 0.18.0 + </option> + <option value="0.17.0" + > + 0.17.0 + </option> + <option value="0.16.0" + > + 0.16.0 + </option> + <option value="0.15.0" + > + 0.15.0 + </option> + <option value="0.14.0" + > + 0.14.0 + </option> + <option value="0.13.0" + > + 0.13.0 + </option> + <option value="0.12.0" + > + 0.12.0 + </option> + <option value="0.11.0" + > + 0.11.0 + </option> + <option value="0.10.0" + > + 0.10.0 + </option> + <option value="0.9.0" + > + 0.9.0 + </option> + <option value="0.8.0" + > + 0.8.0 + </option> + <option value="0.7.0-incubating" + > + 0.7.0-incubating + </option> + <option value="0.6.0-incubating" + > + 0.6.0-incubating + </option> + <option value="0.5.0-incubating" + > + 0.5.0-incubating + </option> +</select> +</h5> +<h1 id="aurora-configuration-reference">Aurora Configuration Reference</h1> + +<p>Don’t know where to start? The Aurora configuration schema is very +powerful, and configurations can become quite complex for advanced use +cases.</p> + +<p>For examples of simple configurations to get something up and running +quickly, check out the <a href="../../getting-started/tutorial/">Tutorial</a>. When you feel comfortable with the basics, move +on to the <a href="../configuration-tutorial/">Configuration Tutorial</a> for more in-depth coverage of +configuration design.</p> + +<ul> +<li><a href="#process-schema">Process Schema</a> + +<ul> +<li><a href="#process-objects">Process Objects</a></li> +</ul></li> +<li><a href="#task-schema">Task Schema</a> + +<ul> +<li><a href="#task-object">Task Object</a></li> +<li><a href="#constraint-object">Constraint Object</a></li> +<li><a href="#resource-object">Resource Object</a></li> +</ul></li> +<li><a href="#job-schema">Job Schema</a> + +<ul> +<li><a href="#job-objects">Job Objects</a></li> +<li><a href="#updateconfig-objects">UpdateConfig Objects</a></li> +<li><a href="#healthcheckconfig-objects">HealthCheckConfig Objects</a></li> +<li><a href="#announcer-objects">Announcer Objects</a></li> +<li><a href="#container">Container Objects</a></li> +<li><a href="#lifecycleconfig-objects">LifecycleConfig Objects</a></li> +</ul></li> +<li><a href="#specifying-scheduling-constraints">Specifying Scheduling Constraints</a></li> +<li><a href="#template-namespaces">Template Namespaces</a> + +<ul> +<li><a href="#mesos-namespace">mesos Namespace</a></li> +<li><a href="#thermos-namespace">thermos Namespace</a></li> +</ul></li> +</ul> + +<h1 id="process-schema">Process Schema</h1> + +<p>Process objects consist of required <code>name</code> and <code>cmdline</code> attributes. You can customize Process +behavior with its optional attributes. Remember, Processes are handled by Thermos.</p> + +<h3 id="process-objects">Process Objects</h3> + +<table><thead> +<tr> +<th><strong>Attribute Name</strong></th> +<th style="text-align: center"><strong>Type</strong></th> +<th><strong>Description</strong></th> +</tr> +</thead><tbody> +<tr> +<td><strong>name</strong></td> +<td style="text-align: center">String</td> +<td>Process name (Required)</td> +</tr> +<tr> +<td><strong>cmdline</strong></td> +<td style="text-align: center">String</td> +<td>Command line (Required)</td> +</tr> +<tr> +<td><strong>max_failures</strong></td> +<td style="text-align: center">Integer</td> +<td>Maximum process failures (Default: 1)</td> +</tr> +<tr> +<td><strong>daemon</strong></td> +<td style="text-align: center">Boolean</td> +<td>When True, this is a daemon process. (Default: False)</td> +</tr> +<tr> +<td><strong>ephemeral</strong></td> +<td style="text-align: center">Boolean</td> +<td>When True, this is an ephemeral process. (Default: False)</td> +</tr> +<tr> +<td><strong>min_duration</strong></td> +<td style="text-align: center">Integer</td> +<td>Minimum duration between process restarts in seconds. (Default: 15)</td> +</tr> +<tr> +<td><strong>final</strong></td> +<td style="text-align: center">Boolean</td> +<td>When True, this process is a finalizing one that should run last. (Default: False)</td> +</tr> +<tr> +<td><strong>logger</strong></td> +<td style="text-align: center">Logger</td> +<td>Struct defining the log behavior for the process. (Default: Empty)</td> +</tr> +</tbody></table> + +<h4 id="name">name</h4> + +<p>The name is any valid UNIX filename string (specifically no +slashes, NULLs or leading periods). Within a Task object, each Process name +must be unique.</p> + +<h4 id="cmdline">cmdline</h4> + +<p>The command line run by the process. The command line is invoked in a bash +subshell, so can involve fully-blown bash scripts. However, nothing is +supplied for command-line arguments so <code>$*</code> is unspecified.</p> + +<h4 id="max_failures">max_failures</h4> + +<p>The maximum number of failures (non-zero exit statuses) this process can +have before being marked permanently failed and not retried. If a +process permanently fails, Thermos looks at the failure limit of the task +containing the process (usually 1) to determine if the task has +failed as well.</p> + +<p>Setting <code>max_failures</code> to 0 makes the process retry +indefinitely until it achieves a successful (zero) exit status. +It retries at most once every <code>min_duration</code> seconds to prevent +an effective denial of service attack on the coordinating Thermos scheduler.</p> + +<h4 id="daemon">daemon</h4> + +<p>By default, Thermos processes are non-daemon. If <code>daemon</code> is set to True, a +successful (zero) exit status does not prevent future process runs. +Instead, the process reinvokes after <code>min_duration</code> seconds. +However, the maximum failure limit still applies. A combination of +<code>daemon=True</code> and <code>max_failures=0</code> causes a process to retry +indefinitely regardless of exit status. This should be avoided +for very short-lived processes because of the accumulation of +checkpointed state for each process run. When running in Mesos +specifically, <code>max_failures</code> is capped at 100.</p> + +<h4 id="ephemeral">ephemeral</h4> + +<p>By default, Thermos processes are non-ephemeral. If <code>ephemeral</code> is set to +True, the process’ status is not used to determine if its containing task +has completed. For example, consider a task with a non-ephemeral +webserver process and an ephemeral logsaver process +that periodically checkpoints its log files to a centralized data store. +The task is considered finished once the webserver process has +completed, regardless of the logsaver’s current status.</p> + +<h4 id="min_duration">min_duration</h4> + +<p>Processes may succeed or fail multiple times during a single task’s +duration. Each of these is called a <em>process run</em>. <code>min_duration</code> is +the minimum number of seconds the scheduler waits before running the +same process.</p> + +<h4 id="final">final</h4> + +<p>Processes can be grouped into two classes: ordinary processes and +finalizing processes. By default, Thermos processes are ordinary. They +run as long as the task is considered healthy (i.e., no failure +limits have been reached.) But once all regular Thermos processes +finish or the task reaches a certain failure threshold, it +moves into a “finalization” stage and runs all finalizing +processes. These are typically processes necessary for cleaning up the +task, such as log checkpointers, or perhaps e-mail notifications that +the task completed.</p> + +<p>Finalizing processes may not depend upon ordinary processes or +vice-versa, however finalizing processes may depend upon other +finalizing processes and otherwise run as a typical process +schedule.</p> + +<h4 id="logger">logger</h4> + +<p>The default behavior of Thermos is to store stderr/stdout logs in files which grow unbounded. +In the event that you have large log volume, you may want to configure Thermos to automatically +rotate logs after they grow to a certain size, which can prevent your job from using more than its +allocated disk space.</p> + +<p>Logger objects specify a <code>destination</code> for Process logs which is, by default, <code>file</code> - a pair of +<code>stdout</code> and <code>stderr</code> files. Its also possible to specify <code>console</code> to get logs output to +the Process stdout and stderr streams, <code>none</code> to suppress any logs output or <code>both</code> to send logs to +files and console streams.</p> + +<p>The default Logger <code>mode</code> is <code>standard</code> which lets the stdout and stderr streams grow without bound.</p> + +<table><thead> +<tr> +<th><strong>Attribute Name</strong></th> +<th style="text-align: center"><strong>Type</strong></th> +<th><strong>Description</strong></th> +</tr> +</thead><tbody> +<tr> +<td><strong>destination</strong></td> +<td style="text-align: center">LoggerDestination</td> +<td>Destination of logs. (Default: <code>file</code>)</td> +</tr> +<tr> +<td><strong>mode</strong></td> +<td style="text-align: center">LoggerMode</td> +<td>Mode of the logger. (Default: <code>standard</code>)</td> +</tr> +<tr> +<td><strong>rotate</strong></td> +<td style="text-align: center">RotatePolicy</td> +<td>An optional rotation policy. (Default: <code>Empty</code>)</td> +</tr> +</tbody></table> + +<p>A RotatePolicy describes log rotation behavior for when <code>mode</code> is set to <code>rotate</code> and it is ignored +otherwise. If <code>rotate</code> is <code>Empty</code> or <code>RotatePolicy()</code> when the <code>mode</code> is set to <code>rotate</code> the +defaults below are used.</p> + +<table><thead> +<tr> +<th><strong>Attribute Name</strong></th> +<th style="text-align: center"><strong>Type</strong></th> +<th><strong>Description</strong></th> +</tr> +</thead><tbody> +<tr> +<td><strong>log_size</strong></td> +<td style="text-align: center">Integer</td> +<td>Maximum size (in bytes) of an individual log file. (Default: 100 MiB)</td> +</tr> +<tr> +<td><strong>backups</strong></td> +<td style="text-align: center">Integer</td> +<td>The maximum number of backups to retain. (Default: 5)</td> +</tr> +</tbody></table> + +<p>An example process configuration is as follows:</p> +<pre class="highlight plaintext"><code> process = Process( + name='process', + logger=Logger( + destination=LoggerDestination('both'), + mode=LoggerMode('rotate'), + rotate=RotatePolicy(log_size=5*MB, backups=5) + ) + ) +</code></pre> + +<h1 id="task-schema">Task Schema</h1> + +<p>Tasks fundamentally consist of a <code>name</code> and a list of Process objects stored as the +value of the <code>processes</code> attribute. Processes can be further constrained with +<code>constraints</code>. By default, <code>name</code>’s value inherits from the first Process in the +<code>processes</code> list, so for simple <code>Task</code> objects with one Process, <code>name</code> +can be omitted. In Mesos, <code>resources</code> is also required.</p> + +<h3 id="task-object">Task Object</h3> + +<table><thead> +<tr> +<th><strong>param</strong></th> +<th style="text-align: center"><strong>type</strong></th> +<th><strong>description</strong></th> +</tr> +</thead><tbody> +<tr> +<td><code>name</code></td> +<td style="text-align: center">String</td> +<td>Process name (Required) (Default: <code>processes0.name</code>)</td> +</tr> +<tr> +<td><code>processes</code></td> +<td style="text-align: center">List of <code>Process</code> objects</td> +<td>List of <code>Process</code> objects bound to this task. (Required)</td> +</tr> +<tr> +<td><code>constraints</code></td> +<td style="text-align: center">List of <code>Constraint</code> objects</td> +<td>List of <code>Constraint</code> objects constraining processes.</td> +</tr> +<tr> +<td><code>resources</code></td> +<td style="text-align: center"><code>Resource</code> object</td> +<td>Resource footprint. (Required)</td> +</tr> +<tr> +<td><code>max_failures</code></td> +<td style="text-align: center">Integer</td> +<td>Maximum process failures before being considered failed (Default: 1)</td> +</tr> +<tr> +<td><code>max_concurrency</code></td> +<td style="text-align: center">Integer</td> +<td>Maximum number of concurrent processes (Default: 0, unlimited concurrency.)</td> +</tr> +<tr> +<td><code>finalization_wait</code></td> +<td style="text-align: center">Integer</td> +<td>Amount of time allocated for finalizing processes, in seconds. (Default: 30)</td> +</tr> +</tbody></table> + +<h4 id="name">name</h4> + +<p><code>name</code> is a string denoting the name of this task. It defaults to the name of the first Process in +the list of Processes associated with the <code>processes</code> attribute.</p> + +<h4 id="processes">processes</h4> + +<p><code>processes</code> is an unordered list of <code>Process</code> objects. To constrain the order +in which they run, use <code>constraints</code>.</p> + +<h5 id="constraints">constraints</h5> + +<p>A list of <code>Constraint</code> objects. Currently it supports only one type, +the <code>order</code> constraint. <code>order</code> is a list of process names +that should run in the order given. For example,</p> +<pre class="highlight plaintext"><code> process = Process(cmdline = "echo hello {{name}}") + task = Task(name = "echoes", + processes = [process(name = "jim"), process(name = "bob")], + constraints = [Constraint(order = ["jim", "bob"])) +</code></pre> + +<p>Constraints can be supplied ad-hoc and in duplicate. Not all +Processes need be constrained, however Tasks with cycles are +rejected by the Thermos scheduler.</p> + +<p>Use the <code>order</code> function as shorthand to generate <code>Constraint</code> lists. +The following:</p> +<pre class="highlight plaintext"><code> order(process1, process2) +</code></pre> + +<p>is shorthand for</p> +<pre class="highlight plaintext"><code> [Constraint(order = [process1.name(), process2.name()])] +</code></pre> + +<p>The <code>order</code> function accepts Process name strings <code>('foo', 'bar')</code> or the processes +themselves, e.g. <code>foo=Process(name='foo', ...)</code>, <code>bar=Process(name='bar', ...)</code>, +<code>constraints=order(foo, bar)</code>.</p> + +<h4 id="resources">resources</h4> + +<p>Takes a <code>Resource</code> object, which specifies the amounts of CPU, memory, and disk space resources +to allocate to the Task.</p> + +<h4 id="max_failures">max_failures</h4> + +<p><code>max_failures</code> is the number of failed processes needed for the <code>Task</code> to be +marked as failed.</p> + +<p>For example, assume a Task has two Processes and a <code>max_failures</code> value of <code>2</code>:</p> +<pre class="highlight plaintext"><code> template = Process(max_failures=10) + task = Task( + name = "fail", + processes = [ + template(name = "failing", cmdline = "exit 1"), + template(name = "succeeding", cmdline = "exit 0") + ], + max_failures=2) +</code></pre> + +<p>The <code>failing</code> Process could fail 10 times before being marked as permanently +failed, and the <code>succeeding</code> Process could succeed on the first run. However, +the task would succeed despite only allowing for two failed processes. To be more +specific, there would be 10 failed process runs yet 1 failed process. Both processes +would have to fail for the Task to fail.</p> + +<h4 id="max_concurrency">max_concurrency</h4> + +<p>For Tasks with a number of expensive but otherwise independent +processes, you may want to limit the amount of concurrency +the Thermos scheduler provides rather than artificially constraining +it via <code>order</code> constraints. For example, a test framework may +generate a task with 100 test run processes, but wants to run it on +a machine with only 4 cores. You can limit the amount of parallelism to +4 by setting <code>max_concurrency=4</code> in your task configuration.</p> + +<p>For example, the following task spawns 180 Processes (“mappers”) +to compute individual elements of a 180 degree sine table, all dependent +upon one final Process (“reducer”) to tabulate the results:</p> +<pre class="highlight plaintext"><code>def make_mapper(id): + return Process( + name = "mapper%03d" % id, + cmdline = "echo 'scale=50;s(%d\*4\*a(1)/180)' | bc -l > + temp.sine_table.%03d" % (id, id)) + +def make_reducer(): + return Process(name = "reducer", cmdline = "cat temp.\* | nl \> sine\_table.txt + && rm -f temp.\*") + +processes = map(make_mapper, range(180)) + +task = Task( + name = "mapreduce", + processes = processes + [make\_reducer()], + constraints = [Constraint(order = [mapper.name(), 'reducer']) for mapper + in processes], + max_concurrency = 8) +</code></pre> + +<h4 id="finalization_wait">finalization_wait</h4> + +<p>Process execution is organizued into three active stages: <code>ACTIVE</code>, +<code>CLEANING</code>, and <code>FINALIZING</code>. The <code>ACTIVE</code> stage is when ordinary processes run. +This stage lasts as long as Processes are running and the Task is healthy. +The moment either all Processes have finished successfully or the Task has reached a +maximum Process failure limit, it goes into <code>CLEANING</code> stage and send +SIGTERMs to all currently running Processes and their process trees. +Once all Processes have terminated, the Task goes into <code>FINALIZING</code> stage +and invokes the schedule of all Processes with the “final” attribute set to True.</p> + +<p>This whole process from the end of <code>ACTIVE</code> stage to the end of <code>FINALIZING</code> +must happen within <code>finalization_wait</code> seconds. If it does not +finish during that time, all remaining Processes are sent SIGKILLs +(or if they depend upon uncompleted Processes, are +never invoked.)</p> + +<p>When running on Aurora, the <code>finalization_wait</code> is capped at 60 seconds.</p> + +<h3 id="constraint-object">Constraint Object</h3> + +<p>Current constraint objects only support a single ordering constraint, <code>order</code>, +which specifies its processes run sequentially in the order given. By +default, all processes run in parallel when bound to a <code>Task</code> without +ordering constraints.</p> + +<table><thead> +<tr> +<th>param</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td>order</td> +<td style="text-align: center">List of String</td> +<td>List of processes by name (String) that should be run serially.</td> +</tr> +</tbody></table> + +<h3 id="resource-object">Resource Object</h3> + +<p>Specifies the amount of CPU, Ram, and disk resources the task needs. See the +<a href="../../features/resource-isolation/">Resource Isolation document</a> for suggested values and to understand how +resources are allocated.</p> + +<table><thead> +<tr> +<th>param</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>cpu</code></td> +<td style="text-align: center">Float</td> +<td>Fractional number of cores required by the task.</td> +</tr> +<tr> +<td><code>ram</code></td> +<td style="text-align: center">Integer</td> +<td>Bytes of RAM required by the task.</td> +</tr> +<tr> +<td><code>disk</code></td> +<td style="text-align: center">Integer</td> +<td>Bytes of disk required by the task.</td> +</tr> +<tr> +<td><code>gpu</code></td> +<td style="text-align: center">Integer</td> +<td>Number of GPU cores required by the task</td> +</tr> +</tbody></table> + +<h1 id="job-schema">Job Schema</h1> + +<h3 id="job-objects">Job Objects</h3> + +<p><em>Note: Specifying a <code>Container</code> object as the value of the <code>container</code> property is + deprecated in favor of setting its value directly to the appropriate <code>Docker</code> or <code>Mesos</code> + container type</em></p> + +<p><em>Note: Specifying preemption behavior of tasks through <code>production</code> flag is deprecated in favor of + electing appropriate task tier via <code>tier</code> attribute.</em></p> + +<table><thead> +<tr> +<th>name</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>task</code></td> +<td style="text-align: center">Task</td> +<td>The Task object to bind to this job. Required.</td> +</tr> +<tr> +<td><code>name</code></td> +<td style="text-align: center">String</td> +<td>Job name. (Default: inherited from the task attribute’s name)</td> +</tr> +<tr> +<td><code>role</code></td> +<td style="text-align: center">String</td> +<td>Job role account. Required.</td> +</tr> +<tr> +<td><code>cluster</code></td> +<td style="text-align: center">String</td> +<td>Cluster in which this job is scheduled. Required.</td> +</tr> +<tr> +<td><code>environment</code></td> +<td style="text-align: center">String</td> +<td>Job environment, default <code>devel</code>. Must be one of <code>prod</code>, <code>devel</code>, <code>test</code> or <code>staging<number></code>.</td> +</tr> +<tr> +<td><code>contact</code></td> +<td style="text-align: center">String</td> +<td>Best email address to reach the owner of the job. For production jobs, this is usually a team mailing list.</td> +</tr> +<tr> +<td><code>instances</code></td> +<td style="text-align: center">Integer</td> +<td>Number of instances (sometimes referred to as replicas or shards) of the task to create. (Default: 1)</td> +</tr> +<tr> +<td><code>cron_schedule</code></td> +<td style="text-align: center">String</td> +<td>Cron schedule in cron format. May only be used with non-service jobs. See <a href="../../features/cron-jobs/">Cron Jobs</a> for more information. Default: None (not a cron job.)</td> +</tr> +<tr> +<td><code>cron_collision_policy</code></td> +<td style="text-align: center">String</td> +<td>Policy to use when a cron job is triggered while a previous run is still active. KILL<em>EXISTING Kill the previous run, and schedule the new run CANCEL</em>NEW Let the previous run continue, and cancel the new run. (Default: KILL_EXISTING)</td> +</tr> +<tr> +<td><code>update_config</code></td> +<td style="text-align: center"><code>UpdateConfig</code> object</td> +<td>Parameters for controlling the rate and policy of rolling updates.</td> +</tr> +<tr> +<td><code>constraints</code></td> +<td style="text-align: center">dict</td> +<td>Scheduling constraints for the tasks. See the section on the <a href="#specifying-scheduling-constraints">constraint specification language</a></td> +</tr> +<tr> +<td><code>service</code></td> +<td style="text-align: center">Boolean</td> +<td>If True, restart tasks regardless of success or failure. (Default: False)</td> +</tr> +<tr> +<td><code>max_task_failures</code></td> +<td style="text-align: center">Integer</td> +<td>Maximum number of failures after which the task is considered to have failed (Default: 1) Set to -1 to allow for infinite failures</td> +</tr> +<tr> +<td><code>priority</code></td> +<td style="text-align: center">Integer</td> +<td>Preemption priority to give the task (Default 0). Tasks with higher priorities may preempt tasks at lower priorities.</td> +</tr> +<tr> +<td><code>production</code></td> +<td style="text-align: center">Boolean</td> +<td>(Deprecated) Whether or not this is a production task that may <a href="../../features/multitenancy/#preemption">preempt</a> other tasks (Default: False). Production job role must have the appropriate <a href="../../features/multitenancy/#preemption">quota</a>.</td> +</tr> +<tr> +<td><code>health_check_config</code></td> +<td style="text-align: center"><code>HealthCheckConfig</code> object</td> +<td>Parameters for controlling a task’s health checks. HTTP health check is only used if a health port was assigned with a command line wildcard.</td> +</tr> +<tr> +<td><code>container</code></td> +<td style="text-align: center">Choice of <code>Container</code>, <code>Docker</code> or <code>Mesos</code> object</td> +<td>An optional container to run all processes inside of.</td> +</tr> +<tr> +<td><code>lifecycle</code></td> +<td style="text-align: center"><code>LifecycleConfig</code> object</td> +<td>An optional task lifecycle configuration that dictates commands to be executed on startup/teardown. HTTP lifecycle is enabled by default if the “health” port is requested. See <a href="#lifecycleconfig-objects">LifecycleConfig Objects</a> for more information.</td> +</tr> +<tr> +<td><code>tier</code></td> +<td style="text-align: center">String</td> +<td>Task tier type. The default scheduler tier configuration allows for 3 tiers: <code>revocable</code>, <code>preemptible</code>, and <code>preferred</code>. If a tier is not elected, Aurora assigns the task to a tier based on its choice of <code>production</code> (that is <code>preferred</code> for production and <code>preemptible</code> for non-production jobs). See the section on <a href="../../features/multitenancy/#configuration-tiers">Configuration Tiers</a> for more information.</td> +</tr> +<tr> +<td><code>announce</code></td> +<td style="text-align: center"><code>Announcer</code> object</td> +<td>Optionally enable Zookeeper ServerSet announcements. See [Announcer Objects] for more information.</td> +</tr> +<tr> +<td><code>enable_hooks</code></td> +<td style="text-align: center">Boolean</td> +<td>Whether to enable <a href="../client-hooks/">Client Hooks</a> for this job. (Default: False)</td> +</tr> +</tbody></table> + +<h3 id="updateconfig-objects">UpdateConfig Objects</h3> + +<p>Parameters for controlling the rate and policy of rolling updates.</p> + +<table><thead> +<tr> +<th>object</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>batch_size</code></td> +<td style="text-align: center">Integer</td> +<td>Maximum number of shards to be updated in one iteration (Default: 1)</td> +</tr> +<tr> +<td><code>watch_secs</code></td> +<td style="text-align: center">Integer</td> +<td>Minimum number of seconds a shard must remain in <code>RUNNING</code> state before considered a success (Default: 45)</td> +</tr> +<tr> +<td><code>max_per_shard_failures</code></td> +<td style="text-align: center">Integer</td> +<td>Maximum number of restarts per shard during update. Increments total failure count when this limit is exceeded. (Default: 0)</td> +</tr> +<tr> +<td><code>max_total_failures</code></td> +<td style="text-align: center">Integer</td> +<td>Maximum number of shard failures to be tolerated in total during an update. Cannot be greater than or equal to the total number of tasks in a job. (Default: 0)</td> +</tr> +<tr> +<td><code>rollback_on_failure</code></td> +<td style="text-align: center">boolean</td> +<td>When False, prevents auto rollback of a failed update (Default: True)</td> +</tr> +<tr> +<td><code>wait_for_batch_completion</code></td> +<td style="text-align: center">boolean</td> +<td>When True, all threads from a given batch will be blocked from picking up new instances until the entire batch is updated. This essentially simulates the legacy sequential updater algorithm. (Default: False)</td> +</tr> +<tr> +<td><code>pulse_interval_secs</code></td> +<td style="text-align: center">Integer</td> +<td>Indicates a <a href="../../features/job-updates/#coordinated-job-updates">coordinated update</a>. If no pulses are received within the provided interval the update will be blocked. Beta-updater only. Will fail on submission when used with client updater. (Default: None)</td> +</tr> +</tbody></table> + +<h3 id="healthcheckconfig-objects">HealthCheckConfig Objects</h3> + +<p>Parameters for controlling a task’s health checks via HTTP or a shell command.</p> + +<table><thead> +<tr> +<th>param</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>health_checker</code></td> +<td style="text-align: center">HealthCheckerConfig</td> +<td>Configure what kind of health check to use.</td> +</tr> +<tr> +<td><code>initial_interval_secs</code></td> +<td style="text-align: center">Integer</td> +<td>Initial grace period (during which health-check failures are ignored) while performing health checks. (Default: 15)</td> +</tr> +<tr> +<td><code>interval_secs</code></td> +<td style="text-align: center">Integer</td> +<td>Interval on which to check the task’s health. (Default: 10)</td> +</tr> +<tr> +<td><code>max_consecutive_failures</code></td> +<td style="text-align: center">Integer</td> +<td>Maximum number of consecutive failures that will be tolerated before considering a task unhealthy (Default: 0)</td> +</tr> +<tr> +<td><code>min_consecutive_successes</code></td> +<td style="text-align: center">Integer</td> +<td>Minimum number of consecutive successful health checks required before considering a task healthy (Default: 1)</td> +</tr> +<tr> +<td><code>timeout_secs</code></td> +<td style="text-align: center">Integer</td> +<td>Health check timeout. (Default: 1)</td> +</tr> +</tbody></table> + +<h3 id="healthcheckerconfig-objects">HealthCheckerConfig Objects</h3> + +<table><thead> +<tr> +<th>param</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>http</code></td> +<td style="text-align: center">HttpHealthChecker</td> +<td>Configure health check to use HTTP. (Default)</td> +</tr> +<tr> +<td><code>shell</code></td> +<td style="text-align: center">ShellHealthChecker</td> +<td>Configure health check via a shell command.</td> +</tr> +</tbody></table> + +<h3 id="httphealthchecker-objects">HttpHealthChecker Objects</h3> + +<table><thead> +<tr> +<th>param</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>endpoint</code></td> +<td style="text-align: center">String</td> +<td>HTTP endpoint to check (Default: /health)</td> +</tr> +<tr> +<td><code>expected_response</code></td> +<td style="text-align: center">String</td> +<td>If not empty, fail the HTTP health check if the response differs. Case insensitive. (Default: ok)</td> +</tr> +<tr> +<td><code>expected_response_code</code></td> +<td style="text-align: center">Integer</td> +<td>If not zero, fail the HTTP health check if the response code differs. (Default: 0)</td> +</tr> +</tbody></table> + +<h3 id="shellhealthchecker-objects">ShellHealthChecker Objects</h3> + +<table><thead> +<tr> +<th>param</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>shell_command</code></td> +<td style="text-align: center">String</td> +<td>An alternative to HTTP health checking. Specifies a shell command that will be executed. Any non-zero exit status will be interpreted as a health check failure.</td> +</tr> +</tbody></table> + +<h3 id="announcer-objects">Announcer Objects</h3> + +<p>If the <code>announce</code> field in the Job configuration is set, each task will be +registered in the ServerSet <code>/aurora/role/environment/jobname</code> in the +zookeeper ensemble configured by the executor (which can be optionally overriden by specifying +<code>zk_path</code> parameter). If no Announcer object is specified, +no announcement will take place. For more information about ServerSets, see the <a href="../../features/service-discovery/">Service Discover</a> +documentation.</p> + +<p>By default, the hostname in the registered endpoints will be the <code>--hostname</code> parameter +that is passed to the mesos agent. To override the hostname value, the executor can be started +with <code>--announcer-hostname=<overriden_value></code>. If you decide to use <code>--announcer-hostname</code> and if +the overriden value needs to change for every executor, then the executor has to be started inside a wrapper, see <a href="../../operations/configuration/#thermos-executor-wrapper">Executor Wrapper</a>.</p> + +<p>For example, if you want the hostname in the endpoint to be an IP address instead of the hostname, +the <code>--hostname</code> parameter to the mesos agent can be set to the machine IP or the executor can +be started with <code>--announcer-hostname=<host_ip></code> while wrapping the executor inside a script.</p> + +<table><thead> +<tr> +<th>object</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>primary_port</code></td> +<td style="text-align: center">String</td> +<td>Which named port to register as the primary endpoint in the ServerSet (Default: <code>http</code>)</td> +</tr> +<tr> +<td><code>portmap</code></td> +<td style="text-align: center">dict</td> +<td>A mapping of additional endpoints to be announced in the ServerSet (Default: <code>{ 'aurora': '{{primary_port}}' }</code>)</td> +</tr> +<tr> +<td><code>zk_path</code></td> +<td style="text-align: center">String</td> +<td>Zookeeper serverset path override (executor must be started with the <code>--announcer-allow-custom-serverset-path</code> parameter)</td> +</tr> +</tbody></table> + +<h4 id="port-aliasing-with-the-announcer-portmap">Port aliasing with the Announcer <code>portmap</code></h4> + +<p>The primary endpoint registered in the ServerSet is the one allocated to the port +specified by the <code>primary_port</code> in the <code>Announcer</code> object, by default +the <code>http</code> port. This port can be referenced from anywhere within a configuration +as <code>{{thermos.ports[http]}}</code>.</p> + +<p>Without the port map, each named port would be allocated a unique port number. +The <code>portmap</code> allows two different named ports to be aliased together. The default +<code>portmap</code> aliases the <code>aurora</code> port (i.e. <code>{{thermos.ports[aurora]}}</code>) to +the <code>http</code> port. Even though the two ports can be referenced independently, +only one port is allocated by Mesos. Any port referenced in a <code>Process</code> object +but which is not in the portmap will be allocated dynamically by Mesos and announced as well.</p> + +<p>It is possible to use the portmap to alias names to static port numbers, e.g. +<code>{'http': 80, 'https': 443, 'aurora': 'http'}</code>. In this case, referencing +<code>{{thermos.ports[aurora]}}</code> would look up <code>{{thermos.ports[http]}}</code> then +find a static port 80. No port would be requested of or allocated by Mesos.</p> + +<p>Static ports should be used cautiously as Aurora does nothing to prevent two +tasks with the same static port allocations from being co-scheduled. +External constraints such as agent attributes should be used to enforce such +guarantees should they be needed.</p> + +<h3 id="container-objects">Container Objects</h3> + +<p>Describes the container the job’s processes will run inside. If not using Docker or the Mesos +unified-container, the container can be omitted from your job config.</p> + +<table><thead> +<tr> +<th>param</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>mesos</code></td> +<td style="text-align: center">Mesos</td> +<td>A native Mesos container to use.</td> +</tr> +<tr> +<td><code>docker</code></td> +<td style="text-align: center">Docker</td> +<td>A Docker container to use (via Docker engine)</td> +</tr> +</tbody></table> + +<h3 id="mesos-object">Mesos Object</h3> + +<table><thead> +<tr> +<th>param</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>image</code></td> +<td style="text-align: center">Choice(AppcImage, DockerImage)</td> +<td>An optional filesystem image to use within this container.</td> +</tr> +<tr> +<td><code>volumes</code></td> +<td style="text-align: center">List(Volume)</td> +<td>An optional list of volume mounts for this container.</td> +</tr> +</tbody></table> + +<h3 id="volume-object">Volume Object</h3> + +<table><thead> +<tr> +<th>param</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>container_path</code></td> +<td style="text-align: center">String</td> +<td>Path on the host to mount.</td> +</tr> +<tr> +<td><code>volume_path</code></td> +<td style="text-align: center">String</td> +<td>Mount point in the container.</td> +</tr> +<tr> +<td><code>mode</code></td> +<td style="text-align: center">Enum</td> +<td>Mode of the mount, can be ‘RW’ or 'RO’.</td> +</tr> +</tbody></table> + +<h3 id="appcimage">AppcImage</h3> + +<p>Describes an AppC filesystem image.</p> + +<table><thead> +<tr> +<th>param</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>name</code></td> +<td style="text-align: center">String</td> +<td>The name of the appc image.</td> +</tr> +<tr> +<td><code>image_id</code></td> +<td style="text-align: center">String</td> +<td>The <a href="https://github.com/appc/spec/blob/master/spec/aci.md#image-id">image id</a> of the appc image.</td> +</tr> +</tbody></table> + +<h3 id="dockerimage">DockerImage</h3> + +<p>Describes a Docker filesystem image.</p> + +<table><thead> +<tr> +<th>param</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>name</code></td> +<td style="text-align: center">String</td> +<td>The name of the docker image.</td> +</tr> +<tr> +<td><code>tag</code></td> +<td style="text-align: center">String</td> +<td>The tag that identifies the docker image.</td> +</tr> +</tbody></table> + +<h3 id="docker-object">Docker Object</h3> + +<p><em>Note: In order to correctly execute processes inside a job, the Docker container must have Python 2.7 installed.</em> +<em>Note: For private docker registry, mesos mandates the docker credential file to be named as <code>.dockercfg</code>, even though docker may create a credential file with a different name on various platforms. Also, the <code>.dockercfg</code> file needs to be copied into the sandbox using the <code>-thermos_executor_resources</code> flag, specified while starting Aurora.</em></p> + +<table><thead> +<tr> +<th>param</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>image</code></td> +<td style="text-align: center">String</td> +<td>The name of the docker image to execute. If the image does not exist locally it will be pulled with <code>docker pull</code>.</td> +</tr> +<tr> +<td><code>parameters</code></td> +<td style="text-align: center">List(Parameter)</td> +<td>Additional parameters to pass to the Docker engine.</td> +</tr> +</tbody></table> + +<h3 id="docker-parameter-object">Docker Parameter Object</h3> + +<p>Docker CLI parameters. This needs to be enabled by the scheduler <code>-allow_docker_parameters</code> option. +See <a href="https://docs.docker.com/reference/commandline/run/">Docker Command Line Reference</a> for valid parameters.</p> + +<table><thead> +<tr> +<th>param</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>name</code></td> +<td style="text-align: center">String</td> +<td>The name of the docker parameter. E.g. volume</td> +</tr> +<tr> +<td><code>value</code></td> +<td style="text-align: center">String</td> +<td>The value of the parameter. E.g. /usr/local/bin:/usr/bin:rw</td> +</tr> +</tbody></table> + +<h3 id="lifecycleconfig-objects">LifecycleConfig Objects</h3> + +<p><em>Note: The only lifecycle configuration supported is the HTTP lifecycle via the HttpLifecycleConfig.</em></p> + +<table><thead> +<tr> +<th>param</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>http</code></td> +<td style="text-align: center">HttpLifecycleConfig</td> +<td>Configure the lifecycle manager to send lifecycle commands to the task via HTTP.</td> +</tr> +</tbody></table> + +<h3 id="httplifecycleconfig-objects">HttpLifecycleConfig Objects</h3> + +<table><thead> +<tr> +<th>param</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>port</code></td> +<td style="text-align: center">String</td> +<td>The named port to send POST commands (Default: health)</td> +</tr> +<tr> +<td><code>graceful_shutdown_endpoint</code></td> +<td style="text-align: center">String</td> +<td>Endpoint to hit to indicate that a task should gracefully shutdown. (Default: /quitquitquit)</td> +</tr> +<tr> +<td><code>shutdown_endpoint</code></td> +<td style="text-align: center">String</td> +<td>Endpoint to hit to give a task its final warning before being killed. (Default: /abortabortabort)</td> +</tr> +</tbody></table> + +<h4 id="gracefulshutdownendpoint">graceful<em>shutdown</em>endpoint</h4> + +<p>If the Job is listening on the port as specified by the HttpLifecycleConfig +(default: <code>health</code>), a HTTP POST request will be sent over localhost to this +endpoint to request that the task gracefully shut itself down. This is a +courtesy call before the <code>shutdown_endpoint</code> is invoked a fixed amount of +time later.</p> + +<h4 id="shutdown_endpoint">shutdown_endpoint</h4> + +<p>If the Job is listening on the port as specified by the HttpLifecycleConfig +(default: <code>health</code>), a HTTP POST request will be sent over localhost to this +endpoint to request as a final warning before being shut down. If the task +does not shut down on its own after this, it will be forcefully killed</p> + +<h1 id="specifying-scheduling-constraints">Specifying Scheduling Constraints</h1> + +<p>In the <code>Job</code> object there is a map <code>constraints</code> from String to String +allowing the user to tailor the schedulability of tasks within the job.</p> + +<p>The constraint map’s key value is the attribute name in which we +constrain Tasks within our Job. The value is how we constrain them. +There are two types of constraints: <em>limit constraints</em> and <em>value +constraints</em>.</p> + +<table><thead> +<tr> +<th>constraint</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td>Limit</td> +<td>A string that specifies a limit for a constraint. Starts with <code>'limit:</code> followed by an Integer and closing single quote, such as <code>'limit:1'</code>.</td> +</tr> +<tr> +<td>Value</td> +<td>A string that specifies a value for a constraint. To include a list of values, separate the values using commas. To negate the values of a constraint, start with a <code>!</code> <code>.</code></td> +</tr> +</tbody></table> + +<p>Further details can be found in the <a href="../../features/constraints/">Scheduling Constraints</a> feature +description.</p> + +<h1 id="template-namespaces">Template Namespaces</h1> + +<p>Currently, a few Pystachio namespaces have special semantics. Using them +in your configuration allow you to tailor application behavior +through environment introspection or interact in special ways with the +Aurora client or Aurora-provided services.</p> + +<h3 id="mesos-namespace">mesos Namespace</h3> + +<p>The <code>mesos</code> namespace contains variables which relate to the <code>mesos</code> agent +which launched the task. The <code>instance</code> variable can be used +to distinguish between Task replicas.</p> + +<table><thead> +<tr> +<th>variable name</th> +<th style="text-align: center">type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td><code>instance</code></td> +<td style="text-align: center">Integer</td> +<td>The instance number of the created task. A job with 5 replicas has instance numbers 0, 1, 2, 3, and 4.</td> +</tr> +<tr> +<td><code>hostname</code></td> +<td style="text-align: center">String</td> +<td>The instance hostname that the task was launched on.</td> +</tr> +</tbody></table> + +<p>Please note, there is no uniqueness guarantee for <code>instance</code> in the presence of +network partitions. If that is required, it should be baked in at the application +level using a distributed coordination service such as Zookeeper.</p> + +<h3 id="thermos-namespace">thermos Namespace</h3> + +<p>The <code>thermos</code> namespace contains variables that work directly on the +Thermos platform in addition to Aurora. This namespace is fully +compatible with Tasks invoked via the <code>thermos</code> CLI.</p> + +<table><thead> +<tr> +<th style="text-align: center">variable</th> +<th>type</th> +<th>description</th> +</tr> +</thead><tbody> +<tr> +<td style="text-align: center"><code>ports</code></td> +<td>map of string to Integer</td> +<td>A map of names to port numbers</td> +</tr> +<tr> +<td style="text-align: center"><code>task_id</code></td> +<td>string</td> +<td>The task ID assigned to this task.</td> +</tr> +</tbody></table> + +<p>The <code>thermos.ports</code> namespace is automatically populated by Aurora when +invoking tasks on Mesos. When running the <code>thermos</code> command directly, +these ports must be explicitly mapped with the <code>-P</code> option.</p> + +<p>For example, if ’{{<code>thermos.ports[http]</code>}}’ is specified in a <code>Process</code> +configuration, it is automatically extracted and auto-populated by +Aurora, but must be specified with, for example, <code>thermos -P http:12345</code> +to map <code>http</code> to port 12345 when running via the CLI.</p> + +</div> + + </div> + </div> + <div class="container-fluid section-footer buffer"> + <div class="container"> + <div class="row"> + <div class="col-md-2 col-md-offset-1"><h3>Quick Links</h3> + <ul> + <li><a href="/downloads/">Downloads</a></li> + <li><a href="/community/">Mailing Lists</a></li> + <li><a href="http://issues.apache.org/jira/browse/AURORA">Issue Tracking</a></li> + <li><a href="/documentation/latest/contributing/">How To Contribute</a></li> + </ul> + </div> + <div class="col-md-2"><h3>The ASF</h3> + <ul> + <li><a href="http://www.apache.org/licenses/">License</a></li> + <li><a href="http://www.apache.org/foundation/sponsorship.html">Sponsorship</a></li> + <li><a href="http://www.apache.org/foundation/thanks.html">Thanks</a></li> + <li><a href="http://www.apache.org/security/">Security</a></li> + </ul> + </div> + <div class="col-md-6"> + <p class="disclaimer">© 2014-2017 <a href="http://www.apache.org/">Apache Software Foundation</a>. Licensed under the <a href="http://www.apache.org/licenses/">Apache License v2.0</a>. The <a href="https://www.flickr.com/photos/trondk/12706051375/">Aurora Borealis IX photo</a> displayed on the homepage is available under a <a href="https://creativecommons.org/licenses/by-nc-nd/2.0/">Creative Commons BY-NC-ND 2.0 license</a>. Apache, Apache Aurora, and the Apache feather logo are trademarks of The Apache Software Foundation.</p> + </div> + </div> + </div> + + </body> +</html> Added: aurora/site/publish/documentation/0.18.1/reference/observer-configuration/index.html URL: http://svn.apache.org/viewvc/aurora/site/publish/documentation/0.18.1/reference/observer-configuration/index.html?rev=1813982&view=auto ============================================================================== --- aurora/site/publish/documentation/0.18.1/reference/observer-configuration/index.html (added) +++ aurora/site/publish/documentation/0.18.1/reference/observer-configuration/index.html Wed Nov 1 18:39:52 2017 @@ -0,0 +1,225 @@ +<!DOCTYPE html> +<html lang="en"> + <head> + <meta charset="utf-8"> + <meta name="viewport" content="width=device-width, initial-scale=1"> + <title>Apache Aurora</title> + <link rel="stylesheet" href="https://maxcdn.bootstrapcdn.com/bootstrap/3.3.1/css/bootstrap.min.css"> + <link href="/assets/css/main.css" rel="stylesheet"> + <!-- Analytics --> + <script type="text/javascript"> + var _gaq = _gaq || []; + _gaq.push(['_setAccount', 'UA-45879646-1']); + _gaq.push(['_setDomainName', 'apache.org']); + _gaq.push(['_trackPageview']); + + (function() { + var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true; + ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + '.google-analytics.com/ga.js'; + var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s); + })(); + </script> + </head> + <body> + <div class="container-fluid section-header"> + <div class="container"> + <div class="nav nav-bar"> + <a href="/"><img src="/assets/img/aurora_logo_dkbkg.svg" width="300" alt="Transparent Apache Aurora logo with dark background"/></a> + <ul class="nav navbar-nav navbar-right"> + <li><a href="/documentation/latest/">Documentation</a></li> + <li><a href="/community/">Community</a></li> + <li><a href="/downloads/">Downloads</a></li> + <li><a href="/blog/">Blog</a></li> + </ul> + </div> + </div> +</div> + + <div class="container-fluid"> + <div class="container content"> + <div class="col-md-12 documentation"> +<h5 class="page-header text-uppercase">Documentation +<select onChange="window.location.href='/documentation/' + this.value + '/reference/observer-configuration/'" + value="0.18.1"> + <option value="0.18.1" + selected="selected"> + 0.18.1 + (latest) + </option> + <option value="0.18.0" + > + 0.18.0 + </option> + <option value="0.17.0" + > + 0.17.0 + </option> + <option value="0.16.0" + > + 0.16.0 + </option> + <option value="0.15.0" + > + 0.15.0 + </option> + <option value="0.14.0" + > + 0.14.0 + </option> + <option value="0.13.0" + > + 0.13.0 + </option> + <option value="0.12.0" + > + 0.12.0 + </option> + <option value="0.11.0" + > + 0.11.0 + </option> + <option value="0.10.0" + > + 0.10.0 + </option> + <option value="0.9.0" + > + 0.9.0 + </option> + <option value="0.8.0" + > + 0.8.0 + </option> + <option value="0.7.0-incubating" + > + 0.7.0-incubating + </option> + <option value="0.6.0-incubating" + > + 0.6.0-incubating + </option> + <option value="0.5.0-incubating" + > + 0.5.0-incubating + </option> +</select> +</h5> +<h1 id="observer-configuration-reference">Observer Configuration Reference</h1> + +<p>The Aurora/Thermos observer can take a variety of configuration options through command-line arguments. +A list of the available options can be seen by running <code>thermos_observer --long-help</code>.</p> + +<p>Please refer to the <a href="../../operations/configuration/">Operator Configuration Guide</a> for details on how +to properly set the most important options.</p> +<pre class="highlight plaintext"><code>$ thermos_observer.pex --long-help +Options: + -h, --help, --short-help + show this help message and exit. + --long-help show options from all registered modules, not just the + __main__ module. + --mesos-root=MESOS_ROOT + The mesos root directory to search for Thermos + executor sandboxes [default: /var/lib/mesos] + --ip=IP The IP address the observer will bind to. [default: + 0.0.0.0] + --port=PORT The port on which the observer should listen. + [default: 1338] + --polling_interval_secs=POLLING_INTERVAL_SECS + The number of seconds between observer refresh + attempts. [default: 5] + --task_process_collection_interval_secs=TASK_PROCESS_COLLECTION_INTERVAL_SECS + The number of seconds between per task process + resource collections. [default: 20] + --task_disk_collection_interval_secs=TASK_DISK_COLLECTION_INTERVAL_SECS + The number of seconds between per task disk resource + collections. [default: 60] + + From module twitter.common.app: + --app_daemonize Daemonize this application. [default: False] + --app_profile_output=FILENAME + Dump the profiling output to a binary profiling + format. [default: None] + --app_daemon_stderr=TWITTER_COMMON_APP_DAEMON_STDERR + Direct this app's stderr to this file if daemonized. + [default: /dev/null] + --app_debug Print extra debugging information during application + initialization. [default: False] + --app_rc_filename Print the filename for the rc file and quit. [default: + False] + --app_daemon_stdout=TWITTER_COMMON_APP_DAEMON_STDOUT + Direct this app's stdout to this file if daemonized. + [default: /dev/null] + --app_profiling Run profiler on the code while it runs. Note this can + cause slowdowns. [default: False] + --app_ignore_rc_file + Ignore default arguments from the rc file. [default: + False] + --app_pidfile=TWITTER_COMMON_APP_PIDFILE + The pidfile to use if --app_daemonize is specified. + [default: None] + + From module twitter.common.log.options: + --log_to_stdout=[scheme:]LEVEL + OBSOLETE - legacy flag, use --log_to_stderr instead. + [default: ERROR] + --log_to_stderr=[scheme:]LEVEL + The level at which logging to stderr [default: ERROR]. + Takes either LEVEL or scheme:LEVEL, where LEVEL is one + of ['INFO', 'NONE', 'WARN', 'ERROR', 'DEBUG', 'FATAL'] + and scheme is one of ['google', 'plain']. + --log_to_disk=[scheme:]LEVEL + The level at which logging to disk [default: INFO]. + Takes either LEVEL or scheme:LEVEL, where LEVEL is one + of ['INFO', 'NONE', 'WARN', 'ERROR', 'DEBUG', 'FATAL'] + and scheme is one of ['google', 'plain']. + --log_dir=DIR The directory into which log files will be generated + [default: /var/tmp]. + --log_simple Write a single log file rather than one log file per + log level [default: False]. + --log_to_scribe=[scheme:]LEVEL + The level at which logging to scribe [default: NONE]. + Takes either LEVEL or scheme:LEVEL, where LEVEL is one + of ['INFO', 'NONE', 'WARN', 'ERROR', 'DEBUG', 'FATAL'] + and scheme is one of ['google', 'plain']. + --scribe_category=CATEGORY + The category used when logging to the scribe daemon. + [default: python_default]. + --scribe_buffer Buffer messages when scribe is unavailable rather than + dropping them. [default: False]. + --scribe_host=HOST The host running the scribe daemon. [default: + localhost]. + --scribe_port=PORT The port used to connect to the scribe daemon. + [default: 1463]. +</code></pre> + +</div> + + </div> + </div> + <div class="container-fluid section-footer buffer"> + <div class="container"> + <div class="row"> + <div class="col-md-2 col-md-offset-1"><h3>Quick Links</h3> + <ul> + <li><a href="/downloads/">Downloads</a></li> + <li><a href="/community/">Mailing Lists</a></li> + <li><a href="http://issues.apache.org/jira/browse/AURORA">Issue Tracking</a></li> + <li><a href="/documentation/latest/contributing/">How To Contribute</a></li> + </ul> + </div> + <div class="col-md-2"><h3>The ASF</h3> + <ul> + <li><a href="http://www.apache.org/licenses/">License</a></li> + <li><a href="http://www.apache.org/foundation/sponsorship.html">Sponsorship</a></li> + <li><a href="http://www.apache.org/foundation/thanks.html">Thanks</a></li> + <li><a href="http://www.apache.org/security/">Security</a></li> + </ul> + </div> + <div class="col-md-6"> + <p class="disclaimer">© 2014-2017 <a href="http://www.apache.org/">Apache Software Foundation</a>. Licensed under the <a href="http://www.apache.org/licenses/">Apache License v2.0</a>. The <a href="https://www.flickr.com/photos/trondk/12706051375/">Aurora Borealis IX photo</a> displayed on the homepage is available under a <a href="https://creativecommons.org/licenses/by-nc-nd/2.0/">Creative Commons BY-NC-ND 2.0 license</a>. Apache, Apache Aurora, and the Apache feather logo are trademarks of The Apache Software Foundation.</p> + </div> + </div> + </div> + + </body> +</html>