This is an automated email from the ASF dual-hosted git repository.

bhavanisudha pushed a commit to branch asf-site
in repository https://gitbox.apache.org/repos/asf/hudi.git

The following commit(s) were added to refs/heads/asf-site by this push:
     new de9dad49323 [DOCS] Minor doc fixes (#9603)
de9dad49323 is described below

commit de9dad49323dc0ddd08e5670fa5acf42f0b24fcd
Author: Bhavani Sudha Saktheeswaran <2179254+bhasu...@users.noreply.github.com>
AuthorDate: Tue Sep 5 10:07:48 2023 -0700

    [DOCS] Minor doc fixes (#9603)
---
 ...a-Delta-Lake-Apache-Iceberg-and-Apache-Hudi.mdx |  2 +-
 website/community/syncs.md                         | 15 ++++++---------
 website/docs/faq.md                                |  4 ++--
 website/docs/quick-start-guide.md                  |  3 ++-
 .../assets/images/upcoming-community-calls.png     | Bin 92461 -> 118827 bytes
 .../version-0.13.0/quick-start-guide.md            |  1 +
 .../version-0.13.1/quick-start-guide.md            |  1 +
 7 files changed, 13 insertions(+), 13 deletions(-)

diff --git a/website/blog/2023-08-09-Lakehouse-Trifecta-Delta-Lake-Apache-Iceberg-and-Apache-Hudi.mdx b/website/blog/2023-08-09-Lakehouse-Trifecta-Delta-Lake-Apache-Iceberg-and-Apache-Hudi.mdx
index 2cd79f8d0bf..6c756dec91e 100644
--- a/website/blog/2023-08-09-Lakehouse-Trifecta-Delta-Lake-Apache-Iceberg-and-Apache-Hudi.mdx
+++ b/website/blog/2023-08-09-Lakehouse-Trifecta-Delta-Lake-Apache-Iceberg-and-Apache-Hudi.mdx
@@ -1,5 +1,5 @@
 ---
-title: "Hoodie Timeline: Foundational pillar for ACID transactions"
+title: "Lakehouse Trifecta — Delta Lake, Apache Iceberg & Apache Hudi"
 authors:
 - name: Sandip Roy
 category: blog
diff --git a/website/community/syncs.md b/website/community/syncs.md
index c4b4db98de0..df6a0ee269c 100644
--- a/website/community/syncs.md
+++ b/website/community/syncs.md
@@ -12,23 +12,19 @@ Meetings will be recorded and made available, on a best-effort basis.

 See more informations about Chinese community sync call by changing the website language into Chinese.

-### Schedules
-
-Refer to the [Apache Hudi events calendar](https://calendar.google.com/calendar/embed?src=rgpb1ta2mgp5au38fr2834poa8%40group.calendar.google.com&ctz=America%2FLos_Angeles).
-
 ## Monthly Community Call

 **[ZOOM LINK TO JOIN](https://zoom.us/j/96743829685?pwd=S3pxRVVKTDljWVFVWW1IU1UvY0JEUT09)**

-Every month on the Last Wed, 09:00 AM Pacific Time (US and Canada)([translate to other time zones](https://www.worldtimebuddy.com/?qm=1&lid=5368361,2643743,1264527,1796236&h=2643743&date=2022-5-25&sln=17-18&hf=1))
+Every month on the **Fourth Wednesday**, at 09:00 AM Pacific Time (US and Canada)([translate to other time zones](https://www.worldtimebuddy.com/?qm=1&lid=5368361,2643743,1264527,1796236&h=2643743&date=2022-5-25&sln=17-18&hf=1))

-**Recordings of the community calls**
+### Recordings of the community calls

 Uploaded to [Apache Hudi youtube channel](https://www.youtube.com/channel/UCs7AhE0BWaEPZSChrBR-Muw) after every call.

 [LINK TO SLIDE DECKS](https://drive.google.com/drive/folders/1hsq-kerUsHDlJ3WDeysMQGnVTmttzHgB?usp=sharing)

-**Typical agenda**
+### Typical agenda

 * \[15 mins\] Progress updates & Plans (PMC member)
 * \[30 mins\] User presentations, Upcoming feature show cases, Demos
@@ -36,6 +32,7 @@ Uploaded to [Apache Hudi youtube channel](https://www.youtube.com/channel/UCs7Ah

 If you would like to present in one of the community calls, please fill out a [form here](https://forms.gle/aMkb93ViHhzRRXqV9) and someone from PMC will engage with you.

-Here are some upcoming calls for convenience.
-
+### Schedules
+Refer to the [Apache Hudi events calendar](https://calendar.google.com/calendar/embed?src=rgpb1ta2mgp5au38fr2834poa8%40group.calendar.google.com&ctz=America%2FLos_Angeles).
+Here are some upcoming calls for convenience.
 ![Upcoming calls](/assets/images/upcoming-community-calls.png)
diff --git a/website/docs/faq.md b/website/docs/faq.md
index f27dcd52a77..1c77aa4c390 100644
--- a/website/docs/faq.md
+++ b/website/docs/faq.md
@@ -210,7 +210,7 @@ No. Hudi removes all the copies of a record key when deletes are issued. Here is

 When issuing an `upsert` operation on a table and the batch of records provided contains multiple entries for a given key, then all of them are reduced into a single final value by repeatedly calling payload class's [preCombine()](https://github.com/apache/hudi/blob/d3edac4612bde2fa9deca9536801dbc48961fb95/hudi-common/src/main/java/org/apache/hudi/common/model/HoodieRecordPayload.java#L40) method . By default, we pick the record with the greatest value (determined by calling .compareTo() [...]

-For an insert or bulk\_insert operation, no such pre-combining is performed. Thus, if your input contains duplicates, the table would also contain duplicates. If you don't want duplicate records either issue an upsert or consider specifying option to de-duplicate input in either [datasource](https://hudi.apache.org/docs/configurations#hoodiedatasourcewriteinsertdropduplicates) or [deltastreamer](https://github.com/apache/hudi/blob/d3edac4612bde2fa9deca9536801dbc48961fb95/hudi-utilities/s [...]
+For an insert or bulk\_insert operation, no such pre-combining is performed. Thus, if your input contains duplicates, the table would also contain duplicates. If you don't want duplicate records either issue an **upsert** or consider specifying option to de-duplicate input in either datasource using [`hoodie.datasource.write.insert.drop.duplicates`](https://hudi.apache.org/docs/configurations#hoodiedatasourcewriteinsertdropduplicates) & [`hoodie.combine.before.insert`](https://hudi.apach [...]

 ### How can I pass hudi configurations to my spark writer job?

@@ -319,7 +319,7 @@ Hudi employs [optimistic concurrency control](https://hudi.apache.org/docs/concu

 ### Can single-writer inserts have duplicates?

-By default, Hudi turns off key based de-duplication for INSERT/BULK\_INSERT operations and thus the table could contain duplicates. If users believe, they have duplicates in inserts, they can either issue UPSERT or consider specifying the option to de-duplicate input in either [datasource](https://hudi.apache.org/docs/configurations#hoodiedatasourcewriteinsertdropduplicates) or [deltastreamer](https://github.com/apache/hudi/blob/d3edac4612bde2fa9deca9536801dbc48961fb95/hudi-utilities/src [...]
+By default, Hudi turns off key based de-duplication for INSERT/BULK\_INSERT operations and thus the table could contain duplicates. If users believe, they have duplicates in inserts, they can either issue UPSERT or consider specifying the option to de-duplicate input in either datasource using [`hoodie.datasource.write.insert.drop.duplicates`](https://hudi.apache.org/docs/configurations#hoodiedatasourcewriteinsertdropduplicates) & [`hoodie.combine.before.insert`](https://hudi.apache.org/ [...]

 ### Can concurrent inserts cause duplicates?

diff --git a/website/docs/quick-start-guide.md b/website/docs/quick-start-guide.md
index 17564d2508e..421f3497170 100644
--- a/website/docs/quick-start-guide.md
+++ b/website/docs/quick-start-guide.md
@@ -20,7 +20,8 @@ Hudi works with Spark-2.4.3+ & Spark 3.x versions. You can follow instructions [

 | Hudi            | Supported Spark 3 version                       |
 |:----------------|:------------------------------------------------|
-| 0.12.0 - 0.13.x | 3.3.x (default build), 3.2.x, 3.1.x             |
+| 0.13.x          | 3.3.x (default build), 3.2.x, 3.1.x             |
+| 0.12.x          | 3.3.x (default build), 3.2.x, 3.1.x             |
 | 0.11.x          | 3.2.x (default build, Spark bundle only), 3.1.x |
 | 0.10.x          | 3.1.x (default build), 3.0.x                    |
 | 0.7.0 - 0.9.0   | 3.0.x                                           |
diff --git a/website/static/assets/images/upcoming-community-calls.png b/website/static/assets/images/upcoming-community-calls.png
index f1cf55f2905..f04e684380c 100644
Binary files a/website/static/assets/images/upcoming-community-calls.png and b/website/static/assets/images/upcoming-community-calls.png differ
diff --git a/website/versioned_docs/version-0.13.0/quick-start-guide.md b/website/versioned_docs/version-0.13.0/quick-start-guide.md
index 453090dcdc0..7a71ae3c765 100644
--- a/website/versioned_docs/version-0.13.0/quick-start-guide.md
+++ b/website/versioned_docs/version-0.13.0/quick-start-guide.md
@@ -20,6 +20,7 @@ Hudi works with Spark-2.4.3+ & Spark 3.x versions. You can follow instructions [

 | Hudi            | Supported Spark 3 version                       |
 |:----------------|:------------------------------------------------|
+| 0.13.x          | 3.3.x (default build), 3.2.x, 3.1.x             |
 | 0.12.x          | 3.3.x (default build), 3.2.x, 3.1.x             |
 | 0.11.x          | 3.2.x (default build, Spark bundle only), 3.1.x |
 | 0.10.x          | 3.1.x (default build), 3.0.x                    |
diff --git a/website/versioned_docs/version-0.13.1/quick-start-guide.md b/website/versioned_docs/version-0.13.1/quick-start-guide.md
index 045707abf77..9c477c58641 100644
--- a/website/versioned_docs/version-0.13.1/quick-start-guide.md
+++ b/website/versioned_docs/version-0.13.1/quick-start-guide.md
@@ -20,6 +20,7 @@ Hudi works with Spark-2.4.3+ & Spark 3.x versions. You can follow instructions [

 | Hudi            | Supported Spark 3 version                       |
 |:----------------|:------------------------------------------------|
+| 0.13.x          | 3.3.x (default build), 3.2.x, 3.1.x             |
 | 0.12.x          | 3.3.x (default build), 3.2.x, 3.1.x             |
 | 0.11.x          | 3.2.x (default build, Spark bundle only), 3.1.x |
 | 0.10.x          | 3.1.x (default build), 3.0.x                    |