[drill-site] branch asf-site updated: edits

bridgetb Wed, 05 Sep 2018 18:51:00 -0700

This is an automated email from the ASF dual-hosted git repository.

bridgetb pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/drill-site.git



The following commit(s) were added to refs/heads/asf-site by this push:
     new cc74e50  edits
cc74e50 is described below

commit cc74e5055233476bfb4445b4256b74285ab2095d
Author: Bridget Bevens <[email protected]>
AuthorDate: Wed Sep 5 18:50:36 2018 -0700

    edits
---
 .../index.html                                     | 22 ++++++++++------------
 feed.xml                                           |  4 ++--
 2 files changed, 12 insertions(+), 14 deletions(-)

diff --git 
a/docs/sort-based-and-hash-based-memory-constrained-operators/index.html 
b/docs/sort-based-and-hash-based-memory-constrained-operators/index.html
index 59cb98b..19c5885 100644
--- a/docs/sort-based-and-hash-based-memory-constrained-operators/index.html
+++ b/docs/sort-based-and-hash-based-memory-constrained-operators/index.html
@@ -1285,11 +1285,9 @@
 </ul>
 
 <p>Drill only uses the External Sort operator to sort data. Drill uses the 
Hash-Aggregate operator to aggregate data. Alternatively, Drill can sort the 
data and then use the (lightweight) Streaming-Aggregate operator to aggregate 
data.
-Drill uses the Hash-Join operator to join data. Alternatively, Drill can use 
the Nested-Loop-Join or sort the data and then use the (lightweight) 
Merge-Join. Drill typically uses Hash operators for joining and aggregation, as 
they perform better than the Sort operator (Hash - O(N) vs. Sort - O(N * 
log(N))). However, if the Hash operators are disabled, or the data is already 
sorted, Drill uses the alternative methods previously described.</p>
+Drill uses the Hash-Join operator to join data. Alternatively, Drill can use 
the Nested-Loop-Join or sort the data and then use the (lightweight) 
Merge-Join. Drill typically uses Hash operators for joining and aggregation, as 
they perform better than the Sort operator (Hash - O(N) vs. Sort - O(N * 
log(N))). However, if you disable the Hash operators, or the data is already 
sorted, Drill uses the alternative methods previously described.</p>
 
-<p>The memory configuration in Drill is specified as the memory limit 
per-query, per-node. The allocated memory is equally divided among all 
instances of the spillable operators (per query on each node). The number of 
instances is the number of spillable operators in the query plan multiplied by 
the maximal degree of parallelism. The maximal degree of parallelism is the 
number of minor fragments required to perform the work for each instance of a 
spillable operator. When an instance of a [...]
-
-<p>To see the difference in memory consumption between the operators, run a 
query and then view the query profile in the Drill Web UI. Optionally, you can 
disable the Hash operators, which forces Drill to use the Merge-Join and 
Streaming-Aggregate operators.   </p>
+<p>The memory configuration in Drill is specified as the memory limit 
per-query, per-node. The allocated memory is equally divided among all 
instances of the spillable operators (per query on each node). The number of 
instances is the number of spillable operators in the query plan multiplied by 
the maximal degree of parallelism. The maximal degree of parallelism is the 
number of minor fragments required to perform the work for each instance of a 
spillable operator. When an instance of a [...]
 
 <h2 id="spill-to-disk">Spill to Disk</h2>
 
@@ -1305,7 +1303,7 @@ Drill uses the Hash-Join operator to join data. 
Alternatively, Drill can use the
 
 <p>Spillable operators write data to a temporary work area on disk when they 
cannot process all of the data in memory. The default location of the temporary 
work area is <code>/tmp/drill/spill</code> on the local file system. </p>
 
-<p>The <code>/tmp/drill/spill</code> directory should suffice for small 
workloads or examples, however it is highly recommended that you redirect the 
default spill location to a location with enough disk space to support spilling 
for large workloads.</p>
+<p>The <code>/tmp/drill/spill</code> directory should suffice for small 
workloads or examples; however, you should redirect the default spill location 
to a location with enough disk space to support spilling for large 
workloads.</p>
 
 <p><strong>Note:</strong> Spilled data may require more space than the table 
referenced in the query that is spilling the data. For example, when the 
underlying table is compressed (Parquet), or when the operator received data 
joined from multiple tables.</p>
 
@@ -1313,15 +1311,15 @@ Drill uses the Hash-Join operator to join data. 
Alternatively, Drill can use the
 
 <p><strong>Configuring Spill to Disk</strong>  </p>
 
-<p>The <code>drill-override.conf</code> file, located in the 
<code>/conf</code> directory, contains options that set the spill locations for 
the Hash and Sort operators. An administrator can change the file system and 
directories into which the operators spill data. Refer to the 
<code>drill-override-example.conf</code> file included in the 
<code>/conf</code> directory for examples. </p>
+<p>The <code>drill-override.conf</code> file, located in the 
<code>/conf</code> directory, contains options that set the spill locations for 
the spillable operators. An administrator can change the file system and 
directories into which the operators spill data. Refer to the 
<code>drill-override-example.conf</code> file included in the 
<code>/conf</code> directory for examples. </p>
 
 <p>The following list describes the spill to disk configuration options:  </p>
 
 <ul>
-<li><strong>drill.exec.spill.fs</strong><br>
-Introduced in Drill 1.11. The default file system on the local machine into 
which the Sort, Hash Aggregate, and Hash Join operators spill data. You can 
configure this option so that data spills into a distributed file system, such 
as hdfs. For example, &quot;hdfs:///&quot;. The default setting is 
&quot;file:///&quot;.</li>
-<li><strong>drill.exec.spill.directories</strong><br>
-Introduced in Drill 1.11. The list of directories into which the Sort, Hash 
Aggregate, and Hash Join operators spill data. The list must be an array with 
directories separated by a comma, for example [&quot;/fs1/drill/spill&quot; , 
&quot;/fs2/drill/spill&quot; , &quot;/fs3/drill/spill&quot;]. The default 
setting is [&quot;/tmp/drill/spill&quot;].<br></li>
+<li><p><strong>drill.exec.spill.fs</strong><br>
+Introduced in Drill 1.11. The default file system on the local machine into 
which the spillable operators spill data. You can configure this option so that 
data spills into a distributed file system, such as hdfs. For example, 
&quot;hdfs:///&quot;. The default setting is &quot;file:///&quot;.  </p></li>
+<li><p><strong>drill.exec.spill.directories</strong><br>
+Introduced in Drill 1.11. The list of directories into which the spillable 
operators spill data. The list must be an array with directories separated by a 
comma, for example [&quot;/fs1/drill/spill&quot; , &quot;/fs2/drill/spill&quot; 
, &quot;/fs3/drill/spill&quot;]. The default setting is 
[&quot;/tmp/drill/spill&quot;].  </p></li>
 </ul>
 
 <p><strong>Note:</strong> The following options were available prior to Drill 
1.11, but have since been deprecated and replaced with the options described 
above:  </p>
@@ -1334,7 +1332,7 @@ Introduced in Drill 1.11. The list of directories into 
which the Sort, Hash Aggr
 
 <h2 id="memory-allocation">Memory Allocation</h2>
 
-<p>Drill evenly splits the available memory among all instances of the 
spillable operators. When a query is parallelized, the number of operators is 
multiplied, which reduces the amount of memory given to each instance of the 
operators during a query.  </p>
+<p>Drill evenly splits the available memory among all instances of the 
spillable operators. When a query is parallelized, the number of operators is 
multiplied, which reduces the amount of memory given to each instance of the 
operators during a query. To see the difference in memory consumption between 
the operators, you can run a query and then view the query profile in the Drill 
Web UI. Optionally, you can disable the Hash operators, which forces Drill to 
use the Merge-Join and Streami [...]
 
 <p><strong>Memory Allocation Configuration Options</strong>  </p>
 
@@ -1344,7 +1342,7 @@ Introduced in Drill 1.11. The list of directories into 
which the Sort, Hash Aggr
 <li><p><strong>planner.memory.max_query_memory_per_node</strong><br>
 The <code>planner.memory.max_query_memory_per_node</code> option is the 
minimum amount of memory available to Drill per query on a node. The default of 
2 GB typically allows between two and three concurrent queries to run when the 
JVM is configured to use 8 GB of direct memory (default). When the memory 
requirement for Drill increases, the default of 2 GB is constraining. You must 
increase the amount of memory for queries to complete, unless the setting for 
the <code>planner.memory.perce [...]
 <li><p><strong>planner.memory.percent_per_query</strong><br>
-Alternatively, the <code>planner.memory.percent_per_query</code> option sets 
the memory as a percentage of the total direct memory. The default is 5%. This 
value is only used when throttling is disabled. Setting the value to 0 disables 
the option. You can increase or decrease the value, however you should set the 
percentage well below the JVM direct memory to account for the cases where 
Drill does not manage memory, such as for the less memory intensive operators. 
</p>
+Alternatively, the <code>planner.memory.percent_per_query</code> option sets 
the memory as a percentage of the total direct memory. The default is 5%. This 
value is only used when throttling is disabled. Setting the value to 0 disables 
the option. You can increase or decrease the value; however, you should set the 
percentage well below the JVM direct memory to account for the cases where 
Drill does not manage memory, such as for the less memory intensive operators. 
</p>
 
 <ul>
 <li><p>The percentage is calculated using the following formula:    </p>
diff --git a/feed.xml b/feed.xml
index 3a87690..0e23b7d 100644
--- a/feed.xml
+++ b/feed.xml
@@ -6,8 +6,8 @@
 </description>
     <link>/</link>
     <atom:link href="/feed.xml" rel="self" type="application/rss+xml"/>
-    <pubDate>Wed, 05 Sep 2018 18:33:55 -0700</pubDate>
-    <lastBuildDate>Wed, 05 Sep 2018 18:33:55 -0700</lastBuildDate>
+    <pubDate>Wed, 05 Sep 2018 18:48:40 -0700</pubDate>
+    <lastBuildDate>Wed, 05 Sep 2018 18:48:40 -0700</lastBuildDate>
     <generator>Jekyll v2.5.2</generator>
     
       <item>

[drill-site] branch asf-site updated: edits

Reply via email to