Author: jihoonson
Date: Mon Nov 9 15:32:06 2015
New Revision: 1713460
URL: http://svn.apache.org/viewvc?rev=1713460&view=rev
Log:
update devel configuration
Modified:
tajo/site/docs/devel/_sources/configuration/tajo-site-xml.txt
tajo/site/docs/devel/_sources/getting_started.txt
tajo/site/docs/devel/_sources/sql_language/queries.txt
tajo/site/docs/devel/_sources/table_management/text.txt
tajo/site/docs/devel/_sources/time_zone.txt
tajo/site/docs/devel/configuration/tajo-site-xml.html
tajo/site/docs/devel/getting_started.html
tajo/site/docs/devel/index.html
tajo/site/docs/devel/searchindex.js
tajo/site/docs/devel/sql_language/queries.html
tajo/site/docs/devel/table_management/text.html
tajo/site/docs/devel/time_zone.html
Modified: tajo/site/docs/devel/_sources/configuration/tajo-site-xml.txt
URL:
http://svn.apache.org/viewvc/tajo/site/docs/devel/_sources/configuration/tajo-site-xml.txt?rev=1713460&r1=1713459&r2=1713460&view=diff
==============================================================================
--- tajo/site/docs/devel/_sources/configuration/tajo-site-xml.txt (original)
+++ tajo/site/docs/devel/_sources/configuration/tajo-site-xml.txt Mon Nov 9
15:32:06 2015
@@ -2,23 +2,455 @@
The tajo-site.xml File
**********************
-To the ``core-site.xml`` file on every host in your cluster, you must add the
following information:
+You can add more configurations in the ``tajo-site.xml`` file. Note that you
should replicate this file to the whole hosts in your cluster once you edited.
+If you are looking for the configurations for the master and the worker,
please refer to :doc:`tajo_master_configuration` and
:doc:`worker_configuration`.
+Also, catalog configurations are found here :doc:`catalog_configuration`.
+
+=========================
+Join Query Settings
+=========================
+
+""""""""""""""""""""""""""""""""""""""
+`tajo.dist-query.join.auto-broadcast`
+""""""""""""""""""""""""""""""""""""""
+
+A flag to enable or disable the use of broadcast join.
+
+ * Property value type: Boolean
+ * Default value: true
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.dist-query.join.auto-broadcast</name>
+ <value>true</value>
+ </property>
+
+"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""
+`tajo.dist-query.broadcast.non-cross-join.threshold-kb`
+"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""
+
+A threshold for non-cross joins. When a non-cross join query is executed with
the broadcast join, the whole size of broadcasted tables won't exceed this
threshold.
+
+ * Property value type: Integer
+ * Unit: KB
+ * Default value: 5120
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.dist-query.broadcast.non-cross-join.threshold-kb</name>
+ <value>5120</value>
+ </property>
+
+"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""
+`tajo.dist-query.broadcast.cross-join.threshold-kb`
+"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""
+
+A threshold for cross joins. When a cross join query is executed, the whole
size of broadcasted tables won't exceed this threshold.
+
+ * Property value type: Integer
+ * Unit: KB
+ * Default value: 1024
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.dist-query.broadcast.cross-join.threshold-kb</name>
+ <value>1024</value>
+ </property>
+
+.. warning::
+ In Tajo, the broadcast join is only the way to perform cross joins. Since
the cross join is a very expensive operation, this value need to be tuned
carefully.
+
+""""""""""""""""""""""""""""""""""""""
+`tajo.dist-query.join.task-volume-mb`
+""""""""""""""""""""""""""""""""""""""
+
+The repartition join is executed in two stages. When a join query is executed
with the repartition join, this value indicates the amount of input data
processed by each task at the second stage.
+As a result, it determines the degree of the parallel processing of the join
query.
+
+ * Property value type: Integer
+ * Unit: MB
+ * Default value: 64
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.dist-query.join.task-volume-mb</name>
+ <value>64</value>
+ </property>
+
+"""""""""""""""""""""""""""""""""""""""""""
+`tajo.dist-query.join.partition-volume-mb`
+"""""""""""""""""""""""""""""""""""""""""""
+
+The repartition join is executed in two stages. When a join query is executed
with the repartition join,
+this value indicates the output size of each task at the first stage, which
determines the number of partitions to be shuffled between two stages.
+
+ * Property value type: Integer
+ * Unit: MB
+ * Default value: 128
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.dist-query.join.partition-volume-mb</name>
+ <value>128</value>
+ </property>
+
+""""""""""""""""""""""""""""""""""""""""""""""""""""""""
+`tajo.executor.join.common.in-memory-hash-threshold-mb`
+""""""""""""""""""""""""""""""""""""""""""""""""""""""""
+
+This value provides the criterion to decide the algorithm to perform a join in
a task.
+If the input data is smaller than this value, join is performed with the
in-memory hash join.
+Otherwise, the sort-merge join is used.
+
+ * Property value type: Integer
+ * Unit: MB
+ * Default value: 64
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.executor.join.common.in-memory-hash-threshold-mb</name>
+ <value>64</value>
+ </property>
+
+.. warning::
+ This value is the size of the input stored on file systems. So, when the
input data is loaded into JVM heap,
+ its actual size is usually much larger than the configured value, which
means that too large threshold can cause unexpected OutOfMemory errors.
+ This value should be tuned carefully.
+
+""""""""""""""""""""""""""""""""""""""""""""""""""""""""
+`tajo.executor.join.inner.in-memory-hash-threshold-mb`
+""""""""""""""""""""""""""""""""""""""""""""""""""""""""
+
+This value provides the criterion to decide the algorithm to perform an inner
join in a task.
+If the input data is smaller than this value, the inner join is performed with
the in-memory hash join.
+Otherwise, the sort-merge join is used.
+
+ * Property value type: Integer
+ * Unit: MB
+ * Default value: 64
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.executor.join.inner.in-memory-hash-threshold-mb</name>
+ <value>64</value>
+ </property>
+
+.. warning::
+ This value is the size of the input stored on file systems. So, when the
input data is loaded into JVM heap,
+ its actual size is usually much larger than the configured value, which
means that too large threshold can cause unexpected OutOfMemory errors.
+ This value should be tuned carefully.
+
+""""""""""""""""""""""""""""""""""""""""""""""""""""""""
+`tajo.executor.join.outer.in-memory-hash-threshold-mb`
+""""""""""""""""""""""""""""""""""""""""""""""""""""""""
+
+This value provides the criterion to decide the algorithm to perform an outer
join in a task.
+If the input data is smaller than this value, the outer join is performed with
the in-memory hash join.
+Otherwise, the sort-merge join is used.
+
+ * Property value type: Integer
+ * Unit: MB
+ * Default value: 64
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.executor.join.outer.in-memory-hash-threshold-mb</name>
+ <value>64</value>
+ </property>
+
+.. warning::
+ This value is the size of the input stored on file systems. So, when the
input data is loaded into JVM heap,
+ its actual size is usually much larger than the configured value, which
means that too large threshold can cause unexpected OutOfMemory errors.
+ This value should be tuned carefully.
+
+"""""""""""""""""""""""""""""""""""""
+`tajo.executor.join.hash-table.size`
+"""""""""""""""""""""""""""""""""""""
+
+The initial size of hash table for in-memory hash join.
+
+ * Property value type: Integer
+ * Default value: 100000
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.executor.join.hash-table.size</name>
+ <value>100000</value>
+ </property>
======================
-System Config
+Sort Query Settings
======================
+""""""""""""""""""""""""""""""""""""""
+`tajo.dist-query.sort.task-volume-mb`
+""""""""""""""""""""""""""""""""""""""
+
+The sort operation is executed in two stages. When a sort query is executed,
this value indicates the amount of input data processed by each task at the
second stage.
+As a result, it determines the degree of the parallel processing of the sort
query.
+
+ * Property value type: Integer
+ * Unit: MB
+ * Default value: 64
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.dist-query.sort.task-volume-mb</name>
+ <value>64</value>
+ </property>
+
+""""""""""""""""""""""""""""""""""""""""
+`tajo.executor.external-sort.buffer-mb`
+""""""""""""""""""""""""""""""""""""""""
+
+A threshold to choose the sort algorithm. If the input data is larger than
this threshold, the external sort algorithm is used.
+
+ * Property value type: Integer
+ * Unit: MB
+ * Default value: 200
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.executor.external-sort.buffer-mb</name>
+ <value>200</value>
+ </property>
+
+""""""""""""""""""""""""""""""""""""""
+`tajo.executor.sort.list.size`
+""""""""""""""""""""""""""""""""""""""
+The initial size of list for in-memory sort.
+
+ * Property value type: Integer
+ * Default value: 100000
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.executor.sort.list.size</name>
+ <value>100000</value>
+ </property>
+
+=========================
+Group by Query Settings
+=========================
+
+""""""""""""""""""""""""""""""""""""""""""""
+`tajo.dist-query.groupby.multi-level-aggr`
+""""""""""""""""""""""""""""""""""""""""""""
+
+A flag to enable the multi-level algorithm for distinct aggregation. If this
value is set, 3-phase aggregation algorithm is used.
+Otherwise, 2-phase aggregation algorithm is used.
+
+ * Property value type: Boolean
+ * Default value: true
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.dist-query.groupby.multi-level-aggr</name>
+ <value>true</value>
+ </property>
+
+""""""""""""""""""""""""""""""""""""""""""""""
+`tajo.dist-query.groupby.partition-volume-mb`
+""""""""""""""""""""""""""""""""""""""""""""""
+
+The aggregation is executed in two stages. When an aggregation query is
executed,
+this value indicates the output size of each task at the first stage, which
determines the number of partitions to be shuffled between two stages.
+
+ * Property value type: Integer
+ * Unit: MB
+ * Default value: 256
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.dist-query.groupby.partition-volume-mb</name>
+ <value>256</value>
+ </property>
+
+""""""""""""""""""""""""""""""""""""""""""""""
+`tajo.dist-query.groupby.task-volume-mb`
+""""""""""""""""""""""""""""""""""""""""""""""
+
+The aggregation operation is executed in two stages. When an aggregation query
is executed, this value indicates the amount of input data processed by each
task at the second stage.
+As a result, it determines the degree of the parallel processing of the
aggregation query.
+
+ * Property value type: Integer
+ * Unit: MB
+ * Default value: 64
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.dist-query.groupby.task-volume-mb</name>
+ <value>64</value>
+ </property>
+
+""""""""""""""""""""""""""""""""""""""""""""""""""""""""
+`tajo.executor.groupby.in-memory-hash-threshold-mb`
+""""""""""""""""""""""""""""""""""""""""""""""""""""""""
+
+This value provides the criterion to decide the algorithm to perform an
aggregation in a task.
+If the input data is smaller than this value, the aggregation is performed
with the in-memory hash aggregation.
+Otherwise, the sort-based aggregation is used.
+
+ * Property value type: Integer
+ * Unit: MB
+ * Default value: 64
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.executor.groupby.in-memory-hash-threshold-mb</name>
+ <value>64</value>
+ </property>
+
+.. warning::
+ This value is the size of the input stored on file systems. So, when the
input data is loaded into JVM heap,
+ its actual size is usually much larger than the configured value, which
means that too large threshold can cause unexpected OutOfMemory errors.
+ This value should be tuned carefully.
+
+""""""""""""""""""""""""""""""""""""""""""
+`tajo.executor.aggregate.hash-table.size`
+""""""""""""""""""""""""""""""""""""""""""
+
+The initial size of hash table for in-memory aggregation.
+
+ * Property value type: Integer
+ * Default value: 10000
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.executor.aggregate.hash-table.size</name>
+ <value>10000</value>
+ </property>
======================
Date/Time Settings
======================
-+--------------------------+----------------+--------------------------------------------------------+
-| Property Name | Property Value | Descriptions
|
-+==========================+================+========================================================+
-| tajo.timezone | Time zone id | Refer to :doc:`/time_zone`
|
-+--------------------------+----------------+--------------------------------------------------------+
-| tajo.datetime.date-order | Date order | Determine date order. It should
be one of YMD, DMY, MDY|
-+--------------------------+----------------+--------------------------------------------------------+
+"""""""""""""""""""
+`tajo.timezone`
+"""""""""""""""""""
+
+Refer to :doc:`/time_zone`.
+
+ * Property value type: Time zone id
+ * Default value: Default time zone of JVM
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.timezone</name>
+ <value>GMT+9</value>
+ </property>
+
+"""""""""""""""""""""""""""
+`tajo.datetime.date-order`
+"""""""""""""""""""""""""""
+
+Date order specification.
+
+ * Property value type: One of YMD, DMY, MDY.
+ * Default value: YMD
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.datetime.date-order</name>
+ <value>YMD</value>
+ </property>
+
+======================
+Table partitions
+======================
+
+""""""""""""""""""""""""""""""""""""""""""""""""""""
+`tajo.partition.overwrite.even-if-no-result`
+""""""""""""""""""""""""""""""""""""""""""""""""""""
+
+If this value is true, a partitioned table is overwritten even if a subquery
leads to no result. Otherwise, the table data will be kept if there is no
result.
+
+ * Property value type: Boolean
+ * Default value: false
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.partition.overwrite.even-if-no-result</name>
+ <value>false</value>
+ </property>
+
+""""""""""""""""""""""""""""""""""""""""""""""""""""
+`tajo.dist-query.table-partition.task-volume-mb`
+""""""""""""""""""""""""""""""""""""""""""""""""""""
+
+In Tajo, storing a partition table is executed in two stages.
+This value indicates the output size of a task of the former stage, which
determines the number of partitions to be shuffled between two stages.
+
+ * Property value type: Integer
+ * Unit: MB
+ * Default value: 256
+ * Example
+
+.. code-block:: xml
+
+ <property>
+ <name>tajo.dist-query.table-partition.task-volume-mb</name>
+ <value>256</value>
+ </property>
+
+======================
+Arithmetic Settings
+======================
+
+""""""""""""""""""""""""""""""""""""""""""""""""""""
+`tajo.behavior.arithmetic-abort`
+""""""""""""""""""""""""""""""""""""""""""""""""""""
+
+A flag to indicate how to handle the errors caused by invalid arithmetic
operations. If true, a running query will be terminated with an overflow or a
divide-by-zero.
+
+ * Property value type: Boolean
+ * Default value: false
+ * Example
+
+.. code-block:: xml
+ <property>
+ <name>tajo.behavior.arithmetic-abort</name>
+ <value>false</value>
+ </property>
\ No newline at end of file
Modified: tajo/site/docs/devel/_sources/getting_started.txt
URL:
http://svn.apache.org/viewvc/tajo/site/docs/devel/_sources/getting_started.txt?rev=1713460&r1=1713459&r2=1713460&view=diff
==============================================================================
--- tajo/site/docs/devel/_sources/getting_started.txt (original)
+++ tajo/site/docs/devel/_sources/getting_started.txt Mon Nov 9 15:32:06 2015
@@ -9,7 +9,7 @@ Prerequisites
======================
* Hadoop 2.3.0 or higher (up to 2.6.0)
- * Java 1.7 or higher
+ * Java 1.8 or higher
* Protocol buffer 2.5.0
===================================
Modified: tajo/site/docs/devel/_sources/sql_language/queries.txt
URL:
http://svn.apache.org/viewvc/tajo/site/docs/devel/_sources/sql_language/queries.txt?rev=1713460&r1=1713459&r2=1713460&view=diff
==============================================================================
--- tajo/site/docs/devel/_sources/sql_language/queries.txt (original)
+++ tajo/site/docs/devel/_sources/sql_language/queries.txt Mon Nov 9 15:32:06
2015
@@ -15,7 +15,7 @@ Overview
[WHERE <condition>]
[GROUP BY <expression> [, ...]]
[HAVING <condition>]
- [ORDER BY <expression> [ASC|DESC] [NULL FIRST|NULL LAST] [, ...]]
+ [ORDER BY <expression> [ASC|DESC] [NULLS (FIRST|LAST)] [, ...]]
@@ -189,7 +189,7 @@ Orderby and Limit Clauses
.. code-block:: sql
- FROM ... ORDER BY <sort_expr> [(ASC|DESC)] [NULL (FIRST|LAST) [,...]
+ FROM ... ORDER BY <sort_expr> [(ASC|DESC)] [NULLS (FIRST|LAST) [,...]
``sort_expr`` can be a column reference, aliased column reference, or a
complex expression.
``ASC`` indicates an ascending order of ``sort_expr`` values. ``DESC``
indicates a descending order of ``sort_expr`` values.
Modified: tajo/site/docs/devel/_sources/table_management/text.txt
URL:
http://svn.apache.org/viewvc/tajo/site/docs/devel/_sources/table_management/text.txt?rev=1713460&r1=1713459&r2=1713460&view=diff
==============================================================================
--- tajo/site/docs/devel/_sources/table_management/text.txt (original)
+++ tajo/site/docs/devel/_sources/table_management/text.txt Mon Nov 9 15:32:06
2015
@@ -42,6 +42,7 @@ The ``WITH`` clause in the CREATE TABLE
* ``text.serde``: custom (De)serializer class.
``org.apache.tajo.storage.text.CSVLineSerDe`` is the default (De)serializer
class.
* ``timezone``: the time zone that the table uses for writting. When table
rows are read or written, ```timestamp``` and ```time``` column values are
adjusted by this timezone if it is set. Time zone can be an abbreviation form
like 'PST' or 'DST'. Also, it accepts an offset-based form like 'UTC+9' or a
location-based form like 'Asia/Seoul'.
* ``text.error-tolerance.max-num``: the maximum number of permissible parsing
errors. This value should be an integer value. By default,
``text.error-tolerance.max-num`` is ``0``. According to the value, parsing
errors will be handled in different ways.
+* ``quote_char``: quote character. If this property is specified, the quote
characters in field values will be ignored. If the quote is incomplete, the
quote character will be remain in the value. It may cause invalid parsing,
probably leading to NULL value for some data types like INT and FLOAT.
* If ``text.error-tolerance.max-num < 0``, all parsing errors are ignored.
* If ``text.error-tolerance.max-num == 0``, any parsing error is not
allowed. If any error occurs, the query will be failed. (default)
Modified: tajo/site/docs/devel/_sources/time_zone.txt
URL:
http://svn.apache.org/viewvc/tajo/site/docs/devel/_sources/time_zone.txt?rev=1713460&r1=1713459&r2=1713460&view=diff
==============================================================================
--- tajo/site/docs/devel/_sources/time_zone.txt (original)
+++ tajo/site/docs/devel/_sources/time_zone.txt Mon Nov 9 15:32:06 2015
@@ -19,7 +19,7 @@ You can set the system time zone in *con
.. code-block:: xml
<name>tajo.timezone</name>
- <property>GMT+9</property>
+ <value>GMT+9</value>
==================
Modified: tajo/site/docs/devel/configuration/tajo-site-xml.html
URL:
http://svn.apache.org/viewvc/tajo/site/docs/devel/configuration/tajo-site-xml.html?rev=1713460&r1=1713459&r2=1713460&view=diff
==============================================================================
--- tajo/site/docs/devel/configuration/tajo-site-xml.html (original)
+++ tajo/site/docs/devel/configuration/tajo-site-xml.html Mon Nov 9 15:32:06
2015
@@ -209,35 +209,456 @@
<div class="section" id="the-tajo-site-xml-file">
<h1>The tajo-site.xml File<a class="headerlink" href="#the-tajo-site-xml-file"
title="Permalink to this headline">¶</a></h1>
-<p>To the <code class="docutils literal"><span
class="pre">core-site.xml</span></code> file on every host in your cluster, you
must add the following information:</p>
-<div class="section" id="system-config">
-<h2>System Config<a class="headerlink" href="#system-config" title="Permalink
to this headline">¶</a></h2>
+<p>You can add more configurations in the <code class="docutils literal"><span
class="pre">tajo-site.xml</span></code> file. Note that you should replicate
this file to the whole hosts in your cluster once you edited.
+If you are looking for the configurations for the master and the worker,
please refer to <a class="reference internal"
href="tajo_master_configuration.html"><em>Tajo Master Configuration</em></a>
and <a class="reference internal" href="worker_configuration.html"><em>Worker
Configuration</em></a>.
+Also, catalog configurations are found here <a class="reference internal"
href="catalog_configuration.html"><em>Catalog Configuration</em></a>.</p>
+<div class="section" id="join-query-settings">
+<h2>Join Query Settings<a class="headerlink" href="#join-query-settings"
title="Permalink to this headline">¶</a></h2>
+<div class="section" id="tajo-dist-query-join-auto-broadcast">
+<h3><cite>tajo.dist-query.join.auto-broadcast</cite><a class="headerlink"
href="#tajo-dist-query-join-auto-broadcast" title="Permalink to this
headline">¶</a></h3>
+<p>A flag to enable or disable the use of broadcast join.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Boolean</li>
+<li>Default value: true</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span class="nt"><name></span>tajo.dist-query.join.auto-broadcast<span
class="nt"></name></span>
+ <span class="nt"><value></span>true<span
class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
+<div class="section"
id="tajo-dist-query-broadcast-non-cross-join-threshold-kb">
+<h3><cite>tajo.dist-query.broadcast.non-cross-join.threshold-kb</cite><a
class="headerlink"
href="#tajo-dist-query-broadcast-non-cross-join-threshold-kb" title="Permalink
to this headline">¶</a></h3>
+<p>A threshold for non-cross joins. When a non-cross join query is executed
with the broadcast join, the whole size of broadcasted tables won’t
exceed this threshold.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Integer</li>
+<li>Unit: KB</li>
+<li>Default value: 5120</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span
class="nt"><name></span>tajo.dist-query.broadcast.non-cross-join.threshold-kb<span
class="nt"></name></span>
+ <span class="nt"><value></span>5120<span
class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
+<div class="section" id="tajo-dist-query-broadcast-cross-join-threshold-kb">
+<h3><cite>tajo.dist-query.broadcast.cross-join.threshold-kb</cite><a
class="headerlink" href="#tajo-dist-query-broadcast-cross-join-threshold-kb"
title="Permalink to this headline">¶</a></h3>
+<p>A threshold for cross joins. When a cross join query is executed, the whole
size of broadcasted tables won’t exceed this threshold.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Integer</li>
+<li>Unit: KB</li>
+<li>Default value: 1024</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span
class="nt"><name></span>tajo.dist-query.broadcast.cross-join.threshold-kb<span
class="nt"></name></span>
+ <span class="nt"><value></span>1024<span
class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+<div class="admonition warning">
+<p class="first admonition-title">Warning</p>
+<p class="last">In Tajo, the broadcast join is only the way to perform cross
joins. Since the cross join is a very expensive operation, this value need to
be tuned carefully.</p>
+</div>
+</div>
+<div class="section" id="tajo-dist-query-join-task-volume-mb">
+<h3><cite>tajo.dist-query.join.task-volume-mb</cite><a class="headerlink"
href="#tajo-dist-query-join-task-volume-mb" title="Permalink to this
headline">¶</a></h3>
+<p>The repartition join is executed in two stages. When a join query is
executed with the repartition join, this value indicates the amount of input
data processed by each task at the second stage.
+As a result, it determines the degree of the parallel processing of the join
query.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Integer</li>
+<li>Unit: MB</li>
+<li>Default value: 64</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span class="nt"><name></span>tajo.dist-query.join.task-volume-mb<span
class="nt"></name></span>
+ <span class="nt"><value></span>64<span class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
+<div class="section" id="tajo-dist-query-join-partition-volume-mb">
+<h3><cite>tajo.dist-query.join.partition-volume-mb</cite><a class="headerlink"
href="#tajo-dist-query-join-partition-volume-mb" title="Permalink to this
headline">¶</a></h3>
+<p>The repartition join is executed in two stages. When a join query is
executed with the repartition join,
+this value indicates the output size of each task at the first stage, which
determines the number of partitions to be shuffled between two stages.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Integer</li>
+<li>Unit: MB</li>
+<li>Default value: 128</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span
class="nt"><name></span>tajo.dist-query.join.partition-volume-mb<span
class="nt"></name></span>
+ <span class="nt"><value></span>128<span
class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
+<div class="section"
id="tajo-executor-join-common-in-memory-hash-threshold-mb">
+<h3><cite>tajo.executor.join.common.in-memory-hash-threshold-mb</cite><a
class="headerlink"
href="#tajo-executor-join-common-in-memory-hash-threshold-mb" title="Permalink
to this headline">¶</a></h3>
+<p>This value provides the criterion to decide the algorithm to perform a join
in a task.
+If the input data is smaller than this value, join is performed with the
in-memory hash join.
+Otherwise, the sort-merge join is used.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Integer</li>
+<li>Unit: MB</li>
+<li>Default value: 64</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span
class="nt"><name></span>tajo.executor.join.common.in-memory-hash-threshold-mb<span
class="nt"></name></span>
+ <span class="nt"><value></span>64<span class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+<div class="admonition warning">
+<p class="first admonition-title">Warning</p>
+<p class="last">This value is the size of the input stored on file systems.
So, when the input data is loaded into JVM heap,
+its actual size is usually much larger than the configured value, which means
that too large threshold can cause unexpected OutOfMemory errors.
+This value should be tuned carefully.</p>
+</div>
+</div>
+<div class="section" id="tajo-executor-join-inner-in-memory-hash-threshold-mb">
+<h3><cite>tajo.executor.join.inner.in-memory-hash-threshold-mb</cite><a
class="headerlink" href="#tajo-executor-join-inner-in-memory-hash-threshold-mb"
title="Permalink to this headline">¶</a></h3>
+<p>This value provides the criterion to decide the algorithm to perform an
inner join in a task.
+If the input data is smaller than this value, the inner join is performed with
the in-memory hash join.
+Otherwise, the sort-merge join is used.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Integer</li>
+<li>Unit: MB</li>
+<li>Default value: 64</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span
class="nt"><name></span>tajo.executor.join.inner.in-memory-hash-threshold-mb<span
class="nt"></name></span>
+ <span class="nt"><value></span>64<span class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+<div class="admonition warning">
+<p class="first admonition-title">Warning</p>
+<p class="last">This value is the size of the input stored on file systems.
So, when the input data is loaded into JVM heap,
+its actual size is usually much larger than the configured value, which means
that too large threshold can cause unexpected OutOfMemory errors.
+This value should be tuned carefully.</p>
+</div>
+</div>
+<div class="section" id="tajo-executor-join-outer-in-memory-hash-threshold-mb">
+<h3><cite>tajo.executor.join.outer.in-memory-hash-threshold-mb</cite><a
class="headerlink" href="#tajo-executor-join-outer-in-memory-hash-threshold-mb"
title="Permalink to this headline">¶</a></h3>
+<p>This value provides the criterion to decide the algorithm to perform an
outer join in a task.
+If the input data is smaller than this value, the outer join is performed with
the in-memory hash join.
+Otherwise, the sort-merge join is used.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Integer</li>
+<li>Unit: MB</li>
+<li>Default value: 64</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span
class="nt"><name></span>tajo.executor.join.outer.in-memory-hash-threshold-mb<span
class="nt"></name></span>
+ <span class="nt"><value></span>64<span class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+<div class="admonition warning">
+<p class="first admonition-title">Warning</p>
+<p class="last">This value is the size of the input stored on file systems.
So, when the input data is loaded into JVM heap,
+its actual size is usually much larger than the configured value, which means
that too large threshold can cause unexpected OutOfMemory errors.
+This value should be tuned carefully.</p>
+</div>
+</div>
+<div class="section" id="tajo-executor-join-hash-table-size">
+<h3><cite>tajo.executor.join.hash-table.size</cite><a class="headerlink"
href="#tajo-executor-join-hash-table-size" title="Permalink to this
headline">¶</a></h3>
+<p>The initial size of hash table for in-memory hash join.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Integer</li>
+<li>Default value: 100000</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span class="nt"><name></span>tajo.executor.join.hash-table.size<span
class="nt"></name></span>
+ <span class="nt"><value></span>100000<span
class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
+</div>
+<div class="section" id="sort-query-settings">
+<h2>Sort Query Settings<a class="headerlink" href="#sort-query-settings"
title="Permalink to this headline">¶</a></h2>
+<div class="section" id="tajo-dist-query-sort-task-volume-mb">
+<h3><cite>tajo.dist-query.sort.task-volume-mb</cite><a class="headerlink"
href="#tajo-dist-query-sort-task-volume-mb" title="Permalink to this
headline">¶</a></h3>
+<p>The sort operation is executed in two stages. When a sort query is
executed, this value indicates the amount of input data processed by each task
at the second stage.
+As a result, it determines the degree of the parallel processing of the sort
query.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Integer</li>
+<li>Unit: MB</li>
+<li>Default value: 64</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span class="nt"><name></span>tajo.dist-query.sort.task-volume-mb<span
class="nt"></name></span>
+ <span class="nt"><value></span>64<span class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
+<div class="section" id="tajo-executor-external-sort-buffer-mb">
+<h3><cite>tajo.executor.external-sort.buffer-mb</cite><a class="headerlink"
href="#tajo-executor-external-sort-buffer-mb" title="Permalink to this
headline">¶</a></h3>
+<p>A threshold to choose the sort algorithm. If the input data is larger than
this threshold, the external sort algorithm is used.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Integer</li>
+<li>Unit: MB</li>
+<li>Default value: 200</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span
class="nt"><name></span>tajo.executor.external-sort.buffer-mb<span
class="nt"></name></span>
+ <span class="nt"><value></span>200<span
class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
+<div class="section" id="tajo-executor-sort-list-size">
+<h3><cite>tajo.executor.sort.list.size</cite><a class="headerlink"
href="#tajo-executor-sort-list-size" title="Permalink to this
headline">¶</a></h3>
+<p>The initial size of list for in-memory sort.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Integer</li>
+<li>Default value: 100000</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span class="nt"><name></span>tajo.executor.sort.list.size<span
class="nt"></name></span>
+ <span class="nt"><value></span>100000<span
class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
+</div>
+<div class="section" id="group-by-query-settings">
+<h2>Group by Query Settings<a class="headerlink"
href="#group-by-query-settings" title="Permalink to this headline">¶</a></h2>
+<div class="section" id="tajo-dist-query-groupby-multi-level-aggr">
+<h3><cite>tajo.dist-query.groupby.multi-level-aggr</cite><a class="headerlink"
href="#tajo-dist-query-groupby-multi-level-aggr" title="Permalink to this
headline">¶</a></h3>
+<p>A flag to enable the multi-level algorithm for distinct aggregation. If
this value is set, 3-phase aggregation algorithm is used.
+Otherwise, 2-phase aggregation algorithm is used.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Boolean</li>
+<li>Default value: true</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span
class="nt"><name></span>tajo.dist-query.groupby.multi-level-aggr<span
class="nt"></name></span>
+ <span class="nt"><value></span>true<span
class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
+<div class="section" id="tajo-dist-query-groupby-partition-volume-mb">
+<h3><cite>tajo.dist-query.groupby.partition-volume-mb</cite><a
class="headerlink" href="#tajo-dist-query-groupby-partition-volume-mb"
title="Permalink to this headline">¶</a></h3>
+<p>The aggregation is executed in two stages. When an aggregation query is
executed,
+this value indicates the output size of each task at the first stage, which
determines the number of partitions to be shuffled between two stages.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Integer</li>
+<li>Unit: MB</li>
+<li>Default value: 256</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span
class="nt"><name></span>tajo.dist-query.groupby.partition-volume-mb<span
class="nt"></name></span>
+ <span class="nt"><value></span>256<span
class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
+<div class="section" id="tajo-dist-query-groupby-task-volume-mb">
+<h3><cite>tajo.dist-query.groupby.task-volume-mb</cite><a class="headerlink"
href="#tajo-dist-query-groupby-task-volume-mb" title="Permalink to this
headline">¶</a></h3>
+<p>The aggregation operation is executed in two stages. When an aggregation
query is executed, this value indicates the amount of input data processed by
each task at the second stage.
+As a result, it determines the degree of the parallel processing of the
aggregation query.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Integer</li>
+<li>Unit: MB</li>
+<li>Default value: 64</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span
class="nt"><name></span>tajo.dist-query.groupby.task-volume-mb<span
class="nt"></name></span>
+ <span class="nt"><value></span>64<span class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
+<div class="section" id="tajo-executor-groupby-in-memory-hash-threshold-mb">
+<h3><cite>tajo.executor.groupby.in-memory-hash-threshold-mb</cite><a
class="headerlink" href="#tajo-executor-groupby-in-memory-hash-threshold-mb"
title="Permalink to this headline">¶</a></h3>
+<p>This value provides the criterion to decide the algorithm to perform an
aggregation in a task.
+If the input data is smaller than this value, the aggregation is performed
with the in-memory hash aggregation.
+Otherwise, the sort-based aggregation is used.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Integer</li>
+<li>Unit: MB</li>
+<li>Default value: 64</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span
class="nt"><name></span>tajo.executor.groupby.in-memory-hash-threshold-mb<span
class="nt"></name></span>
+ <span class="nt"><value></span>64<span class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+<div class="admonition warning">
+<p class="first admonition-title">Warning</p>
+<p class="last">This value is the size of the input stored on file systems.
So, when the input data is loaded into JVM heap,
+its actual size is usually much larger than the configured value, which means
that too large threshold can cause unexpected OutOfMemory errors.
+This value should be tuned carefully.</p>
+</div>
+</div>
+<div class="section" id="tajo-executor-aggregate-hash-table-size">
+<h3><cite>tajo.executor.aggregate.hash-table.size</cite><a class="headerlink"
href="#tajo-executor-aggregate-hash-table-size" title="Permalink to this
headline">¶</a></h3>
+<p>The initial size of hash table for in-memory aggregation.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Integer</li>
+<li>Default value: 10000</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span
class="nt"><name></span>tajo.executor.aggregate.hash-table.size<span
class="nt"></name></span>
+ <span class="nt"><value></span>10000<span
class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
</div>
<div class="section" id="date-time-settings">
<h2>Date/Time Settings<a class="headerlink" href="#date-time-settings"
title="Permalink to this headline">¶</a></h2>
-<table border="1" class="docutils">
-<colgroup>
-<col width="27%" />
-<col width="16%" />
-<col width="57%" />
-</colgroup>
-<thead valign="bottom">
-<tr class="row-odd"><th class="head">Property Name</th>
-<th class="head">Property Value</th>
-<th class="head">Descriptions</th>
-</tr>
-</thead>
-<tbody valign="top">
-<tr class="row-even"><td>tajo.timezone</td>
-<td>Time zone id</td>
-<td>Refer to <a class="reference internal" href="../time_zone.html"><em>Time
Zone</em></a></td>
-</tr>
-<tr class="row-odd"><td>tajo.datetime.date-order</td>
-<td>Date order</td>
-<td>Determine date order. It should be one of YMD, DMY, MDY</td>
-</tr>
-</tbody>
-</table>
+<div class="section" id="tajo-timezone">
+<h3><cite>tajo.timezone</cite><a class="headerlink" href="#tajo-timezone"
title="Permalink to this headline">¶</a></h3>
+<p>Refer to <a class="reference internal" href="../time_zone.html"><em>Time
Zone</em></a>.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Time zone id</li>
+<li>Default value: Default time zone of JVM</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span class="nt"><name></span>tajo.timezone<span
class="nt"></name></span>
+ <span class="nt"><value></span>GMT+9<span
class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
+<div class="section" id="tajo-datetime-date-order">
+<h3><cite>tajo.datetime.date-order</cite><a class="headerlink"
href="#tajo-datetime-date-order" title="Permalink to this headline">¶</a></h3>
+<p>Date order specification.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: One of YMD, DMY, MDY.</li>
+<li>Default value: YMD</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span class="nt"><name></span>tajo.datetime.date-order<span
class="nt"></name></span>
+ <span class="nt"><value></span>YMD<span
class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
+</div>
+<div class="section" id="table-partitions">
+<h2>Table partitions<a class="headerlink" href="#table-partitions"
title="Permalink to this headline">¶</a></h2>
+<div class="section" id="tajo-partition-overwrite-even-if-no-result">
+<h3><cite>tajo.partition.overwrite.even-if-no-result</cite><a
class="headerlink" href="#tajo-partition-overwrite-even-if-no-result"
title="Permalink to this headline">¶</a></h3>
+<p>If this value is true, a partitioned table is overwritten even if a
subquery leads to no result. Otherwise, the table data will be kept if there is
no result.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Boolean</li>
+<li>Default value: false</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span
class="nt"><name></span>tajo.partition.overwrite.even-if-no-result<span
class="nt"></name></span>
+ <span class="nt"><value></span>false<span
class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
+<div class="section" id="tajo-dist-query-table-partition-task-volume-mb">
+<h3><cite>tajo.dist-query.table-partition.task-volume-mb</cite><a
class="headerlink" href="#tajo-dist-query-table-partition-task-volume-mb"
title="Permalink to this headline">¶</a></h3>
+<p>In Tajo, storing a partition table is executed in two stages.
+This value indicates the output size of a task of the former stage, which
determines the number of partitions to be shuffled between two stages.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Integer</li>
+<li>Unit: MB</li>
+<li>Default value: 256</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span
class="nt"><name></span>tajo.dist-query.table-partition.task-volume-mb<span
class="nt"></name></span>
+ <span class="nt"><value></span>256<span
class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
+</div>
+<div class="section" id="arithmetic-settings">
+<h2>Arithmetic Settings<a class="headerlink" href="#arithmetic-settings"
title="Permalink to this headline">¶</a></h2>
+<div class="section" id="tajo-behavior-arithmetic-abort">
+<h3><cite>tajo.behavior.arithmetic-abort</cite><a class="headerlink"
href="#tajo-behavior-arithmetic-abort" title="Permalink to this
headline">¶</a></h3>
+<p>A flag to indicate how to handle the errors caused by invalid arithmetic
operations. If true, a running query will be terminated with an overflow or a
divide-by-zero.</p>
+<blockquote>
+<div><ul class="simple">
+<li>Property value type: Boolean</li>
+<li>Default value: false</li>
+<li>Example</li>
+</ul>
+</div></blockquote>
+<div class="highlight-xml"><div class="highlight"><pre><span
class="nt"><property></span>
+ <span class="nt"><name></span>tajo.behavior.arithmetic-abort<span
class="nt"></name></span>
+ <span class="nt"><value></span>false<span
class="nt"></value></span>
+<span class="nt"></property></span>
+</pre></div>
+</div>
+</div>
</div>
</div>
Modified: tajo/site/docs/devel/getting_started.html
URL:
http://svn.apache.org/viewvc/tajo/site/docs/devel/getting_started.html?rev=1713460&r1=1713459&r2=1713460&view=diff
==============================================================================
--- tajo/site/docs/devel/getting_started.html (original)
+++ tajo/site/docs/devel/getting_started.html Mon Nov 9 15:32:06 2015
@@ -212,7 +212,7 @@
<blockquote>
<div><ul class="simple">
<li>Hadoop 2.3.0 or higher (up to 2.6.0)</li>
-<li>Java 1.7 or higher</li>
+<li>Java 1.8 or higher</li>
<li>Protocol buffer 2.5.0</li>
</ul>
</div></blockquote>
Modified: tajo/site/docs/devel/index.html
URL:
http://svn.apache.org/viewvc/tajo/site/docs/devel/index.html?rev=1713460&r1=1713459&r2=1713460&view=diff
==============================================================================
--- tajo/site/docs/devel/index.html (original)
+++ tajo/site/docs/devel/index.html Mon Nov 9 15:32:06 2015
@@ -268,8 +268,12 @@ As a result, some contents can be mismat
</ul>
</li>
<li class="toctree-l2"><a class="reference internal"
href="configuration/tajo-site-xml.html">The tajo-site.xml File</a><ul>
-<li class="toctree-l3"><a class="reference internal"
href="configuration/tajo-site-xml.html#system-config">System Config</a></li>
+<li class="toctree-l3"><a class="reference internal"
href="configuration/tajo-site-xml.html#join-query-settings">Join Query
Settings</a></li>
+<li class="toctree-l3"><a class="reference internal"
href="configuration/tajo-site-xml.html#sort-query-settings">Sort Query
Settings</a></li>
+<li class="toctree-l3"><a class="reference internal"
href="configuration/tajo-site-xml.html#group-by-query-settings">Group by Query
Settings</a></li>
<li class="toctree-l3"><a class="reference internal"
href="configuration/tajo-site-xml.html#date-time-settings">Date/Time
Settings</a></li>
+<li class="toctree-l3"><a class="reference internal"
href="configuration/tajo-site-xml.html#table-partitions">Table
partitions</a></li>
+<li class="toctree-l3"><a class="reference internal"
href="configuration/tajo-site-xml.html#arithmetic-settings">Arithmetic
Settings</a></li>
</ul>
</li>
<li class="toctree-l2"><a class="reference internal"
href="configuration/catalog-site-xml.html">The catalog-site.xml File</a></li>