(iceberg) branch main updated: Docs: Enhance create_changelog_view usage (#9889)

yufei Wed, 13 Mar 2024 09:54:34 -0700

This is an automated email from the ASF dual-hosted git repository.

yufei pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/iceberg.git



The following commit(s) were added to refs/heads/main by this push:
     new 8b58277540 Docs: Enhance create_changelog_view usage (#9889)
8b58277540 is described below

commit 8b58277540fb9c5bc1936dcbc1a13793a72a6fee
Author: Manu Zhang <[email protected]>
AuthorDate: Thu Mar 14 00:54:20 2024 +0800

    Docs: Enhance create_changelog_view usage (#9889)
---
 docs/docs/spark-procedures.md | 9 ++++++---
 1 file changed, 6 insertions(+), 3 deletions(-)

diff --git a/docs/docs/spark-procedures.md b/docs/docs/spark-procedures.md
index ee8a8466c7..7dc0d1a2aa 100644
--- a/docs/docs/spark-procedures.md
+++ b/docs/docs/spark-procedures.md
@@ -756,8 +756,8 @@ Creates a view that contains the changes from a given table.
 | `table`              | ✔️         | string              | Name of the source 
table for the changelog                                                         
                                                                                
                  |
 | `changelog_view`     |           | string              | Name of the view to 
create                                                                          
                                                                                
                 |
 | `options`            |           | map<string, string> | A map of Spark read 
options to use                                                                  
                                                                                
                 |
-| `net_changes`        |           | boolean             | Whether to output 
net changes (see below for more information). Defaults to false.                
                                                                                
                   |
-| `compute_updates`    |           | boolean             | Whether to compute 
pre/post update images (see below for more information). Defaults to false.     
                                                                                
                  | 
+| `net_changes`        |           | boolean             | Whether to output 
net changes (see below for more information). Defaults to false. It must be 
false when `compute_updates` is true.                                           
                                                                       |
+| `compute_updates`    |           | boolean             | Whether to compute 
pre/post update images (see below for more information). Defaults to true if 
`identifer_columns` are provided; otherwise, defaults to false.                 
                                                                                
      | 
 | `identifier_columns` |           | array<string>       | The list of 
identifier columns to compute updates. If the argument `compute_updates` is set 
to true and `identifier_columns` are not provided, the table’s current 
identifier fields will be used.   |
 
 Here is a list of commonly used Spark read options:
@@ -823,7 +823,10 @@ second snapshot deleted 1 record.
 |2     | Bob      |INSERT      |0      |5390529835796506035|
 |1     | Alice  |DELETE        |1      |8764748981452218370|
 
-Create a changelog view that computes net changes. It removes intermediate 
changes and only outputs the net changes. 
+#### Net Changes
+
+The procedure can remove intermediate changes across multiple snapshots, and 
only outputs the net changes. Here is an example to create a changelog view 
that computes net changes. 
+
 ```sql
 CALL spark_catalog.system.create_changelog_view(
   table => 'db.tbl',

(iceberg) branch main updated: Docs: Enhance create_changelog_view usage (#9889)

Reply via email to