This is an automated email from the ASF dual-hosted git repository.
blue pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/iceberg.git
The following commit(s) were added to refs/heads/asf-site by this push:
new b3ffb1a Deployed 4c98a0f64 with MkDocs version: 1.2.1
b3ffb1a is described below
commit b3ffb1a5307f43da1aa8c9f359e93fabc117d8c9
Author: Ryan Blue <[email protected]>
AuthorDate: Sun Nov 21 08:53:08 2021 -0800
Deployed 4c98a0f64 with MkDocs version: 1.2.1
---
index.html | 6 +--
maintenance/index.html | 36 ++++++++++-------
search/search_index.json | 2 +-
sitemap.xml | 80 +++++++++++++++++++-------------------
sitemap.xml.gz | Bin 505 -> 505 bytes
theme_customization/nav-item.html | 4 +-
6 files changed, 67 insertions(+), 61 deletions(-)
diff --git a/index.html b/index.html
index d6dfafd..5d4bb65 100644
--- a/index.html
+++ b/index.html
@@ -321,7 +321,7 @@ if (is_top_frame) { $('body').addClass('wm-top-page'); }
</li>
<li
class="wm-toc-li wm-toc-lev1 "
-><a href="https://trino.io/docs/current/connector/iceberg.html" class="block
wm-toc-text">
+><a href="https://trino.io/docs/current/connector/iceberg.html"
target="_blank" rel="noopener noreferrer" class="block wm-toc-text">
<span>
<img src="img/trino-logo.png" class="navigation-icon fa-fw"/> Trino
</span>
@@ -329,7 +329,7 @@ if (is_top_frame) { $('body').addClass('wm-top-page'); }
</li>
<li
class="wm-toc-li wm-toc-lev1 "
-><a href="https://prestodb.io/docs/current/connector/iceberg.html"
class="block wm-toc-text">
+><a href="https://prestodb.io/docs/current/connector/iceberg.html"
target="_blank" rel="noopener noreferrer" class="block wm-toc-text">
<span>
<img src="img/prestodb-logo.png" class="navigation-icon fa-fw"/> PrestoDB
</span>
@@ -579,5 +579,5 @@ if (is_top_frame) { $('body').addClass('wm-top-page'); }
</html>
<!--
MkDocs version : 1.2.1
-Build Date UTC : 2021-11-18 00:01:42.363031+00:00
+Build Date UTC : 2021-11-21 16:53:08.709778+00:00
-->
\ No newline at end of file
diff --git a/maintenance/index.html b/maintenance/index.html
index fea87e7..79a7e06 100644
--- a/maintenance/index.html
+++ b/maintenance/index.html
@@ -89,8 +89,10 @@ table.expireSnapshots()
</code></pre>
<p>See the <a
href="../javadoc/0.12.1/org/apache/iceberg/ExpireSnapshots.html"><code>ExpireSnapshots</code>
Javadoc</a> to see more configuration options.</p>
<p>There is also a Spark action that can run table expiration in parallel for
large tables:</p>
-<pre><code class="language-java">Actions.forTable(table)
- .expireSnapshots()
+<pre><code class="language-java">Table table = ...
+SparkActions
+ .get()
+ .expireSnapshots(table)
.expireOlderThan(tsToExpire)
.execute();
</code></pre>
@@ -123,15 +125,16 @@ Regularly expiring snapshots deletes unused data
files.</p>
</tbody>
</table>
<p>See <a href="/configuration/#write-properties">table write properties</a>
for more details.</p>
-<h3 id="remove-orphan-files">Remove orphan files<a class="headerlink"
href="#remove-orphan-files" title="Permanent link">¶</a></h3>
+<h3 id="delete-orphan-files">Delete orphan files<a class="headerlink"
href="#delete-orphan-files" title="Permanent link">¶</a></h3>
<p>In Spark and other distributed processing engines, task or job failures can
leave files that are not referenced by table metadata, and in some cases normal
snapshot expiration may not be able to determine a file is no longer needed and
delete it.</p>
-<p>To clean up these “orphan” files under a table location, use
the <code>removeOrphanFiles</code> action.</p>
+<p>To clean up these “orphan” files under a table location, use
the <code>deleteOrphanFiles</code> action.</p>
<pre><code class="language-java">Table table = ...
-Actions.forTable(table)
- .removeOrphanFiles()
+SparkActions
+ .get()
+ .deleteOrphanFiles(table)
.execute();
</code></pre>
-<p>See the <a
href="../javadoc/0.12.1/org/apache/iceberg/actions/RemoveOrphanFilesAction.html">RemoveOrphanFilesAction
Javadoc</a> to see more configuration options.</p>
+<p>See the <a
href="../javadoc/0.12.1/org/apache/iceberg/actions/DeleteOrphanFiles.html">DeleteOrphanFiles
Javadoc</a> to see more configuration options.</p>
<p>This action may take a long time to finish if you have lots of files in
data and metadata directories. It is recommended to execute this periodically,
but you may not need to execute this often.</p>
<div class="admonition note">
<p class="admonition-title">Note</p>
@@ -152,25 +155,28 @@ FileSystem API to avoid unintentional deletion. </p>
<p>Iceberg tracks each data file in a table. More data files leads to more
metadata stored in manifest files, and small data files causes an unnecessary
amount of metadata and less efficient queries from file open costs.</p>
<p>Iceberg can compact data files in parallel using Spark with the
<code>rewriteDataFiles</code> action. This will combine small files into larger
files to reduce metadata overhead and runtime file open cost.</p>
<pre><code class="language-java">Table table = ...
-Actions.forTable(table).rewriteDataFiles()
+SparkActions
+ .get()
+ .rewriteDataFiles(table)
.filter(Expressions.equal("date", "2020-08-18"))
- .targetSizeInBytes(500 * 1024 * 1024) // 500 MB
+ .option("target-file-size-bytes", Long.toString(500 * 1024 *
1024)) // 500 MB
.execute();
</code></pre>
-<p>The <code>files</code> metadata table is useful for inspecting data file
sizes and determining when to compact partitons.</p>
-<p>See the <a
href="../javadoc/0.12.1/org/apache/iceberg/actions/RewriteDataFilesAction.html"><code>RewriteDataFilesAction</code>
Javadoc</a> to see more configuration options.</p>
+<p>The <code>files</code> metadata table is useful for inspecting data file
sizes and determining when to compact partitions.</p>
+<p>See the <a
href="../javadoc/0.12.1/org/apache/iceberg/actions/RewriteDataFiles.html"><code>RewriteDataFiles</code>
Javadoc</a> to see more configuration options.</p>
<h3 id="rewrite-manifests">Rewrite manifests<a class="headerlink"
href="#rewrite-manifests" title="Permanent link">¶</a></h3>
<p>Iceberg uses metadata in its manifest list and manifest files speed up
query planning and to prune unnecessary data files. The metadata tree functions
as an index over a table’s data.</p>
<p>Manifests in the metadata tree are automatically compacted in the order
they are added, which makes queries faster when the write pattern aligns with
read filters. For example, writing hourly-partitioned data as it arrives is
aligned with time range query filters.</p>
<p>When a table’s write pattern doesn’t align with the query
pattern, metadata can be rewritten to re-group data files into manifests using
<code>rewriteManifests</code> or the <code>rewriteManifests</code> action (for
parallel rewrites using Spark).</p>
<p>This example rewrites small manifests and groups data files by the first
partition field.</p>
<pre><code class="language-java">Table table = ...
-table.rewriteManifests()
+SparkActions
+ .get()
+ .rewriteManifests(table)
.rewriteIf(file -> file.length() < 10 * 1024 * 1024) // 10 MB
- .clusterBy(file -> file.partition().get(0, Integer.class))
- .commit();
+ .execute();
</code></pre>
-<p>See the <a
href="../javadoc/0.12.1/org/apache/iceberg/actions/RewriteManifestsAction.html"><code>RewriteManifestsAction</code>
Javadoc</a> to see more configuration options.</p>
+<p>See the <a
href="../javadoc/0.12.1/org/apache/iceberg/actions/RewriteManifests.html"><code>RewriteManifests</code>
Javadoc</a> to see more configuration options.</p>
<br>
</div>
diff --git a/search/search_index.json b/search/search_index.json
index 4cd0f08..10f17ed 100644
--- a/search/search_index.json
+++ b/search/search_index.json
@@ -1 +1 @@
-{"config":{"indexing":"full","lang":["en"],"min_search_length":3,"prebuild_index":false,"separator":"[\\s\\-]+"},"docs":[{"location":"","text":"Apache
Iceberg is an open table format for huge analytic datasets. Iceberg adds
tables to compute engines including Spark, Trino, PrestoDB, Flink and Hive
using a high-performance table format that works just like a SQL table. User
experience \u00b6 Iceberg avoids unpleasant surprises. Schema evolution works
and won\u2019t inadvertently un-delete [...]
\ No newline at end of file
+{"config":{"indexing":"full","lang":["en"],"min_search_length":3,"prebuild_index":false,"separator":"[\\s\\-]+"},"docs":[{"location":"","text":"Apache
Iceberg is an open table format for huge analytic datasets. Iceberg adds
tables to compute engines including Spark, Trino, PrestoDB, Flink and Hive
using a high-performance table format that works just like a SQL table. User
experience \u00b6 Iceberg avoids unpleasant surprises. Schema evolution works
and won\u2019t inadvertently un-delete [...]
\ No newline at end of file
diff --git a/sitemap.xml b/sitemap.xml
index 85aa3b5..015a147 100644
--- a/sitemap.xml
+++ b/sitemap.xml
@@ -2,202 +2,202 @@
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
<url>
<loc>https://iceberg.apache.org/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/api/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/aws/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/benchmarks/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/blogs/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/community/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/configuration/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/custom-catalog/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/evolution/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/flink-connector/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/flink/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/getting-started/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/hive/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/how-to-release/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/java-api-quickstart/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/jdbc/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/maintenance/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/nessie/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/partitioning/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/performance/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/python-api-intro/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/python-feature-support/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/python-quickstart/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/releases/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/reliability/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/roadmap/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/schemas/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/security/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/snapshots/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/spark-configuration/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/spark-ddl/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/spark-procedures/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/spark-queries/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/spark-structured-streaming/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/spark-writes/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/spec/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/terms/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/trademarks/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/trino-prestodb/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
<url>
<loc>https://iceberg.apache.org/why-iceberg/</loc>
- <lastmod>2021-11-18</lastmod>
+ <lastmod>2021-11-21</lastmod>
<changefreq>daily</changefreq>
</url>
</urlset>
\ No newline at end of file
diff --git a/sitemap.xml.gz b/sitemap.xml.gz
index e0adefd..8c0c1c3 100644
Binary files a/sitemap.xml.gz and b/sitemap.xml.gz differ
diff --git a/theme_customization/nav-item.html
b/theme_customization/nav-item.html
index d1eb4db..884b1ec 100644
--- a/theme_customization/nav-item.html
+++ b/theme_customization/nav-item.html
@@ -10,13 +10,13 @@
</span>
</a>
{%- elif nav_item.title == "Trino" -%}
- <a href="{{ nav_item.url }}" class="block wm-toc-text">
+ <a href="{{ nav_item.url }}" target="_blank" rel="noopener noreferrer"
class="block wm-toc-text">
<span>
<img src="img/trino-logo.png" class="navigation-icon fa-fw"/> {{
nav_item.title }}
</span>
</a>
{%- elif nav_item.title == "PrestoDB" -%}
- <a href="{{ nav_item.url }}" class="block wm-toc-text">
+ <a href="{{ nav_item.url }}" target="_blank" rel="noopener noreferrer"
class="block wm-toc-text">
<span>
<img src="img/prestodb-logo.png" class="navigation-icon fa-fw"/> {{
nav_item.title }}
</span>