Mailing lists matching hudi.apache.org

commits hudi.apache.org
dev hudi.apache.org


[hudi] branch master updated: [MINOR] Add Jira URL and Mailing List (#2404)

2021-01-27 Thread sivabalan
b/pom.xml index 09145a5..91780da 100644 --- a/pom.xml +++ b/pom.xml @@ -151,6 +151,32 @@ HEAD + +JIRA +https://issues.apache.org/jira/browse/HUDI + + + + + Dev Mailing List + [email protected] + [email protected] + dev-unsubscr

Re: Gear up for Hudi 1.0!

2024-12-04 Thread Vinoth Chandar
they are already redone to reflect all new features and usage) . - https://hudi.apache.org/docs/next/overview (Note the “next”) the URL - Docs will get finalized once the community ratifies the release . *Notable changes to docs.* - Use-cases: https://hudi.apache.org/docs/next/use_

[GitHub] [hudi] yihua commented on a diff in pull request #5998: [DOCS] Remove duplicate faq page

2022-06-28 Thread GitBox
Apache Hive metastore? -Yes. This can be performed either via the standalone [Hive Sync tool](https://hudi.apache.org/docs/syncing_metastore#hive-sync-tool) or using options in [deltastreamer](https://github.com/apache/hudi/blob/d3edac4612bde2fa9deca9536801dbc48961fb95/docker/demo/sparksql

[hudi] branch asf-site updated: Travis CI build asf-site

2020-07-05 Thread vinoth
Mailing list (mailto:[email protected]";>Subscribe, mailto:[email protected]";>Unsubscribe, https://lists.apache.org/[email protected]";>Archives). Empty email works for subscribe/unsubscribe. Please use https://gist.github.com";>gi

[hudi] branch asf-site updated: [MINOR] Add the users@ mailing list to the community page (#1796)

2020-07-05 Thread vinoth
the Hudi community. | When? | Channel to use | |---|| -| For any general questions, user support, development discussions | Dev Mailing list ([Subscribe](mailto:[email protected]), [Unsubscribe](mailto:[email protected]), [Archives](https

[GitHub] [hudi] yihua commented on issue #8186: upgrade from 0.5.0 to 0.13.0

2023-03-28 Thread via GitHub
change the table version: - 0.6.0: https://hudi.apache.org/releases/older-releases#migration-guide-for-this-release-3 - 0.9.0: https://hudi.apache.org/releases/older-releases#migration-guide-for-this-release - 0.10.0: https://hudi.apache.org/releases/older-releases#migration-guide-3

Gear up for Hudi 1.0!

2024-12-02 Thread sagar sumit
n amazing Hudi 1.0! Regards, Sagar [1] https://hudi.apache.org/docs/next/use_cases [2] https://hudi.apache.org/docs/next/hudi_stack [3] https://hudi.apache.org/docs/next/storage_layouts [4] https://hudi.apache.org/docs/next/timeline [5] https://hudi.apache.org/docs/next/write_operations

[jira] [Commented] (HUDI-5508) Revamp hudi homepage website

2023-01-05 Thread nadine (Jira)
oads   Quickly update & delete data with Hudi’s fast, pluggable indexing. This includes streaming workloads, with full support for out-of-order data, bursty traffic & data deduplication. [[https://hudi.apache.org/docs/next/indexing/]|https://hudi.apache.org/docs/next/indexing/]   * *Avail

(hudi) branch asf-site updated: chore: minor update page/blog content (#17459)

2025-12-02 Thread xushiyan
b.com/apache/hudi/discussions";>Github Discussions{' '} + and https://lists.apache.org/[email protected]";>Dev Mailing list ( mailto:[email protected]";>Subscribe,{' '} - mailto:[email protected]&

Re: [PR] added new videos for hudi oss site [hudi]

2024-01-29 Thread via GitHub
bhasudha commented on PR #10563: URL: https://github.com/apache/hudi/pull/10563#issuecomment-1915715215 @nfarah86 There are still tags for the previous videos with plural. For ex: - https://hudi.apache.org/videos/tags/deletes - https://hudi.apache.org/videos/tags/bulk-inserts

Re: [PR] added new videos for hudi oss site [hudi]

2024-01-29 Thread via GitHub
nfarah86 commented on PR #10563: URL: https://github.com/apache/hudi/pull/10563#issuecomment-1915822819 > @nfarah86 There are still tags for the previous videos with plural. For ex: > > * https://hudi.apache.org/videos/tags/deletes > * https://hudi.apache.org/vide

[jira] [Updated] (HUDI-4953) Typo in Hudi documentation about NonPartitionedKeyGenerator

2022-09-29 Thread Jayasheel Kalgal (Jira)
{color}|https://hudi.apache.org/blog/2021/02/13/hudi-key-generators/#nonpartitionedkeygenerator]*   URL - [https://hudi.apache.org/docs/next/key_generation/#nonpartitionedkeygenerator]   [https://hudi.apache.org/blog/2021/02/13/hudi-key-generators/#nonpartitionedkeygenerator

[hudi] branch asf-site updated: [DOCS] Update Slack signup with auto signup link (#1966)

2020-08-13 Thread vinoth
([Subscribe](mailto:[email protected]), [Unsubscribe](mailto:[email protected]), [Archives](https://lists.apache.org/[email protected])). Empty email works for subscribe/unsubscribe. Please use [gists](https://gist.github.com) to share code/stacktraces on

[jira] [Commented] (HUDI-5382) hoodie.datasource.write.partitionpath.field is inconsistent in the document

2024-05-09 Thread Shiyan Xu (Jira)
ttps://hudi.apache.org/docs/next/writing_data] > hoodie.datasource.write.partitionpath.field is inconsistent in the document > --- > > Key: HUDI-5382 > URL: https://issues.apache.org/ji

Re: [D] First Issues [hudi]

2026-02-11 Thread via GitHub
GitHub user xushiyan closed the discussion with a comment: First Issues hey @jaykataria thanks for raising this. link from https://hudi.apache.org/contribute/how-to-contribute/ was fixed. use this one: https://github.com/apache/hudi/contribute dev sync call is happening regularly

[hudi] branch asf-site updated: Travis CI build asf-site

2020-07-01 Thread vinoth
index 9ec95e4..f4b6636 100644 --- a/content/community.html +++ b/content/community.html @@ -219,14 +219,10 @@ - For development discussions + For any general questions, user support, development discussions Dev Mailing list (mailto:[email protected]

[jira] [Updated] (HUDI-4953) Typo in Hudi documentation about NonPartitionedKeyGenerator

2022-09-29 Thread Jayasheel Kalgal (Jira)
://hudi.apache.org/docs/next/key_generation/#nonpartitionedkeygenerator] [https://hudi.apache.org/blog/2021/02/13/hudi-key-generators/#nonpartitionedkeygenerator]                  Issue :    Classname to use for non partitioned tables should be {color:#0747a6}NonpartitionedKeyGenerator

[hudi] branch asf-site updated: [MINOR] Add the users@ mailing list to the community page (#1778)

2020-07-01 Thread leesf
development discussions Dev Mailing list (mailto:[email protected]";>Subscribe, mailto:[email protected]";>Unsubscribe, https://lists.apache.org/[email protected]";>Archives). Empty email works for subscribe/unsubscribe. Please use

[jira] [Created] (HUDI-5382) hoodie.datasource.write.partitionpath.field is inconsistent in the document

2022-12-13 Thread Akira Ajisaka (Jira)
: Apache Hudi Issue Type: Bug Components: docs Reporter: Akira Ajisaka The Hudi document is inconsistent in hoodie.datasource.write.partitionpath.field and it says both required and optional. * [https://hudi.apache.org/docs/configurations

[hudi] branch asf-site updated: [MINOR][DOCS] Update slack sign up links (#6258)

2022-07-31 Thread xushiyan
to use | -|---|| -| For development discussions | Dev Mailing list ([Subscribe](mailto:[email protected]), [Unsubscribe](mailto:[email protected]), [Archives](https://lists.apache.org/[email protected])). Empty email works for subscribe/unsubscribe. Please use

[GitHub] [hudi] kazdy commented on issue #9512: [SUPPORT] No table level lock when using DynamoDB lock provider

2023-08-24 Thread via GitHub
kazdy commented on issue #9512: URL: https://github.com/apache/hudi/issues/9512#issuecomment-169228 What about these configs: https://hudi.apache.org/docs/0.13.0/configurations#hoodiewritelockclientwait_time_ms_between_retry https://hudi.apache.org/docs/0.13.0/configurations

[incubator-hudi] branch master updated: [HUDI-343]: Create a DOAP file for Hudi

2019-12-31 Thread smarthi
3.org/1999/02/22-rdf-syntax-ns#"; + xmlns:asfext="http://projects.apache.org/ns/asfext#"; + xmlns:foaf="http://xmlns.com/foaf/0.1/";> + + https://hudi.apache.org";> +2019-12-31 +http://usefulinc.com/doap/licenses/asl20"; /> +Ap

Re: Contributor & Jira item assignment request

2022-07-25 Thread Bowen Zhu
Thanks ! From: Shiyan Xu Date: Sunday, July 24, 2022 at 3:55 PM To: Bowen Zhu Cc: [email protected] Subject: Re: Contributor & Jira item assignment request assigned both. and we can discuss further from the tickets. Thanks for the keen interest! On Sat, Jul 23, 2022 at 2:36 PM Bowen

[jira] [Assigned] (HUDI-5382) hoodie.datasource.write.partitionpath.field is inconsistent in the document

2024-03-24 Thread Raymond Xu (Jira)
cs >Reporter: Akira Ajisaka >Assignee: Raymond Xu >Priority: Minor > > The Hudi document is inconsistent in > hoodie.datasource.write.partitionpath.field and it says both required and > optional. > * > [https://hudi.apache.org/docs/conf

[PR] chore(site): fix trailing links [hudi]

2025-12-16 Thread via GitHub
xushiyan opened a new pull request, #17615: URL: https://github.com/apache/hudi/pull/17615 Trailing slash urls can break generated docs links from relative docs url. Example: From this page https://hudi.apache.org/docs/concurrency_control/ , click on links ref to other docs

[jira] [Updated] (HUDI-4953) Typo in Hudi documentation about NonPartitionedKeyGenerator

2022-09-29 Thread Jayasheel Kalgal (Jira)
://hudi.apache.org/docs/next/key_generation/#nonpartitionedkeygenerator] [https://hudi.apache.org/blog/2021/02/13/hudi-key-generators/#nonpartitionedkeygenerator]             Issue :  Classname to use for non partitioned tables should be {color:#0747a6}*NonpartitionedKeyGenerator*  ( currently

[GitHub] [hudi] bhasudha commented on issue #1737: [SUPPORT]spark streaming create small parquet files

2020-06-16 Thread GitBox
bhasudha commented on issue #1737: URL: https://github.com/apache/hudi/issues/1737#issuecomment-644955989 @cocopc configuring the limit of parquet file size can help too - https://hudi.apache.org/docs/configurations.html#limitFileSize and when to consider it a a small file https

[jira] [Closed] (HUDI-5382) hoodie.datasource.write.partitionpath.field is inconsistent in the document

2024-05-09 Thread Shiyan Xu (Jira)
Issue Type: Bug > Components: docs >Reporter: Akira Ajisaka >Assignee: Shiyan Xu >Priority: Minor > > The Hudi document is inconsistent in > hoodie.datasource.write.partitionpath.field and it says both required and > optional. > *

[jira] [Updated] (HUDI-5382) hoodie.datasource.write.partitionpath.field is inconsistent in the document

2024-03-24 Thread Raymond Xu (Jira)
Reporter: Akira Ajisaka >Assignee: Raymond Xu >Priority: Minor > Fix For: 0.15.0 > > > The Hudi document is inconsistent in > hoodie.datasource.write.partitionpath.field and it says both required and > optional. &

(hudi-rs) branch main updated: chore: enable discussions (#431)

2025-08-29 Thread xushiyan
[email protected] issues: [email protected] pullrequests: [email protected] + discussions: [email protected]

unsubscribe

2025-11-30 Thread Michael Roberts via users
unsubscribe From: Y Ethan Guo Sent: Tuesday, November 25, 2025 8:50 AM To: [email protected] Cc: [email protected] Subject: EXT: [ANNOUNCE] Apache Hudi 1.1.0 released EXTERNAL: Report suspicious emails to Email Abuse. Hi everyone, The Apache Hudi team

(hudi-rs) branch main updated: chore: enable dependabot (#94)

2024-07-26 Thread xushiyan
ory: true required_conversation_resolution: true + dependabot_alerts: true + dependabot_updates: true notifications: commits: [email protected] issues: [email protected] diff --git a/.asf.yaml b/.github/dependabot.yml similarity index 51% copy from .asf.yaml copy to .github/dependabot.

[jira] [Updated] (HUDI-4953) Typo in Hudi documentation about NonPartitionedKeyGenerator

2022-09-29 Thread Jayasheel Kalgal (Jira)
://hudi.apache.org/docs/next/key_generation/#nonpartitionedkeygenerator] [https://hudi.apache.org/blog/2021/02/13/hudi-key-generators/#nonpartitionedkeygenerator]                  Issue :    Classname to use for non partitioned tables should be {color:#0747a6}*NonpartitionedKeyGenerator

[Action Required] Spark Bloom Index Metadata Regression in 0.12

2022-10-11 Thread Alexey Kudinkin
rectly during (Spark-specific) row-writing <https://hudi.apache.org/docs/next/configurations#hoodiedatasourcewriterowwriterenable> Bulk Insert operation affecting Key Range Pruning flow <https://hudi.apache.org/docs/next/basic_configurations/#hoodiebloomindexprunebyranges> w/in Hoodie

[GitHub] [hudi] nsivabalan commented on issue #5934: When reading the mor table with `QUERY_TYPE_SNAPSHOT`,Unable to correctly sort and de duplicate data by `PRECOMBINE_FIELD`.

2022-06-22 Thread GitBox
together. You can try using DefaultHoodieRecordPayload to achieve this. https://hudi.apache.org/docs/configurations/#hoodiedatasourcewritepayloadclass https://hudi.apache.org/docs/configurations/#writepayloadclass https://hudi.apache.org/docs/configurations/#hoodiecompactionpayloadclass

[PR] chore(site): fix docs link generation [hudi]

2025-12-16 Thread via GitHub
xushiyan opened a new pull request, #17618: URL: https://github.com/apache/hudi/pull/17618 Trailing slash urls can break generated docs links from relative docs url. Example: From this page https://hudi.apache.org/docs/concurrency_control/ , click on links ref to other docs

[GitHub] [hudi] nsivabalan commented on a change in pull request #3525: [HUDI-2346] Async clustering usage blog

2021-08-24 Thread GitBox
ot;How to setup Hudi for asynchronous clustering" +author: codope +category: blog +--- + +In one of the [previous blog](https://hudi.apache.org/blog/2021/01/27/hudi-clustering-intro) posts, we introduced a new +kind of table service called clustering to reorganize data for improved query

[jira] [Commented] (HUDI-2346) Publish blog on async clustering usage

2021-08-24 Thread ASF GitHub Bot (Jira)
clustering" +author: codope +category: blog +--- + +In one of the [previous blog](https://hudi.apache.org/blog/2021/01/27/hudi-clustering-intro) posts, we introduced a new +kind of table service called clustering to reorganize data for improved query performance without compromising on +ingestion

(hudi) branch asf-site updated: [DOCS] Add Ecosystem Page (#11526)

2024-06-26 Thread yihua
[Read + Write](https://hudi.apache.org/docs/quick-start-guide) | | +| Apache Flink | [Read + Write](https://hudi.apache.org/docs/flink-quick-start

[GitHub] [hudi] codope opened a new pull request, #7965: Merge query engine setup and querying data docs

2023-02-15 Thread via GitHub
change. ### Risk level (write none, low medium or high below) low ### Documentation Update Stated as above. Pages affected: https://hudi.apache.org/docs/querying_data https://hudi.apache.org/docs/query_engine_setup ### Contributor's chec

[GitHub] [hudi] nsivabalan commented on issue #5938: Why Hudi publish data size much more than the input file size when publish to hive

2022-06-23 Thread GitBox
nsivabalan commented on issue #5938: URL: https://github.com/apache/hudi/issues/5938#issuecomment-1164535154 Hudi does small file handling and so it has to. you can read more on this here: https://hudi.apache.org/blog/2021/03/01/hudi-file-sizing https://hudi.apache.org/learn/faq

[incubator-hudi] branch asf-site updated: [HUDI-215] mentioned github issue link for joining slack group in community.md (#842)

2019-08-19 Thread vinoth
ays to get in touch with the Hudi community. |---|| | For any general questions, user support, development discussions | Dev Mailing list ([Subscribe](mailto:[email protected]), [Unsubscribe](mailto:[email protected]), [Archives](https://lists.apache.org/lis

[jira] [Updated] (HUDI-9411) Update the Docker Demo documenation

2025-05-13 Thread Ranga Reddy (Jira)
[ https://issues.apache.org/jira/browse/HUDI-9411?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ranga Reddy updated HUDI-9411: -- Description: Document Link: [https://hudi.apache.org/docs/docker_demo] *Document Improvements:* # Use

[GitHub] [hudi] vinothchandar commented on a change in pull request #3525: [HUDI-2346] Async clustering usage blog

2021-08-27 Thread GitBox
t: "How to setup Hudi for asynchronous clustering" +author: codope +category: blog +--- + +In one of the [previous blog](https://hudi.apache.org/blog/2021/01/27/hudi-clustering-intro) posts, we introduced a new +kind of table service called clustering to reorganize data for improved query

[GitHub] [hudi] lvyanquan opened a new pull request, #8055: [HUDI-4849][DOCS] Remove default value for mandatory record key field

2023-02-26 Thread via GitHub
. ### Impact Docs update. ### Risk level (write none, low medium or high below) low. ### Documentation Update [create-table](https://hudi.apache.org/cn/docs/quick-start-guide#create-table) and [configurations](https://hudi.apache.org/cn/docs/basic_configurations

[jira] [Updated] (HUDI-5522) Improve docs for disaster recovery

2023-01-09 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-5522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-5522: Description: Related issue: [https://github.com/apache/hudi/issues/7589] [https://hudi.apache.org/docs

[GitHub] [hudi] nsivabalan commented on issue #3676: MOR table rolls out new parquet files at 10MB for new inserts - even though max file size set as 128MB

2021-09-19 Thread GitBox
other configs as we are dealing w/ small file handling. do pay attention to [recordsizeestimate](https://hudi.apache.org/docs/configurations#hoodiecopyonwriterecordsizeestimate) . bcoz, only those files whose size is > ( [recordsizeestimationthreshold](https://hudi.apache.org/docs/config

[GitHub] [incubator-hudi] tooptoop4 opened a new issue #857: http://hudi.apache.org/comparison.html# should mention Iceberg and DeltaLake

2019-08-28 Thread GitBox
tooptoop4 opened a new issue #857: http://hudi.apache.org/comparison.html# should mention Iceberg and DeltaLake URL: https://github.com/apache/incubator-hudi/issues/857 http://hudi.apache.org/comparison.html# should mention Iceberg and DeltaLake

Re: [I] [SUPPORT] Requesting Support for insert_overwrite in Delta Streamer [hudi]

2024-04-02 Thread via GitHub
soumilshah1995 commented on issue #10896: URL: https://github.com/apache/hudi/issues/10896#issuecomment-2032790976 ill send email [[email protected]](mailto:[email protected]) ill close this thread -- This is an automated message from the Apache Git Service. To respond to the

[GitHub] [hudi] shukla2009 commented on pull request #5787: fix setup_demo.sh script to package jar inside docker folder

2022-06-19 Thread GitBox
shukla2009 commented on PR #5787: URL: https://github.com/apache/hudi/pull/5787#issuecomment-1159644514 For sure document is misleading .. [https://hudi.apache.org/docs/docker_demo](https://hudi.apache.org/docs/docker_demo) No mention of `-Pintegration-tests` -- This is an automated

[PR] [DOCS]: Update Write Path diagram [hudi]

2024-12-08 Thread via GitHub
dipankarmazumdar opened a new pull request, #12449: URL: https://github.com/apache/hudi/pull/12449 ### Documentation Update Changed the diagram under https://hudi.apache.org/docs/write_operations/#writing-path to reflect 'clustering' ### Contributor's checklist

[D] First Issues [hudi]

2026-02-10 Thread via GitHub
GitHub user jaykataria created a discussion: First Issues Hi there I was looking through the how to contribute page here: https://hudi.apache.org/contribute/how-to-contribute and it has the following link https://github.com/apache/hudi/issues?q=state%3Aopen%20label%3Agood-first-issues

[incubator-hudi] branch master updated: [MINOR] Fix invalid issue url & quickstart url (#1282)

2020-01-27 Thread leesf
ns(+), 2 deletions(-) diff --git a/README.md b/README.md index 4276c04..ae53e72 100644 --- a/README.md +++ b/README.md @@ -75,4 +75,4 @@ mvn clean package -DskipTests -DskipITs -Pscala-2.12 ## Quickstart -Please visit [https://hudi.apache.org/quickstart.html](https://hudi.apache.org/quickst

[GitHub] [hudi] codope commented on issue #4242: [SUPPORT] Split Data into Multiple Parquet files under Partitions

2021-12-10 Thread GitBox
codope commented on issue #4242: URL: https://github.com/apache/hudi/issues/4242#issuecomment-991089546 @Rap70r These two blogs should help in understanding clustering in Hudi: * [Clustering intro](https://hudi.apache.org/blog/2021/01/27/hudi-clustering-intro/) * [Async clustering

[jira] [Updated] (HUDI-5889) Restructure the release notes to only show the latest minor releases

2023-03-07 Thread Ethan Guo (Jira)
release in the "Download" page (https://hudi.apache.org/releases/download), and collect all release notes into the corresponding one, e.g., content of 0.12.x release notes into 0.12.2.  We still keep individual releases in "Older Releases" page (https://hudi.apache.org/rel

svn commit: r1872970 [1/2] - in /comdev/projects.apache.org/trunk/site/json/foundation: projects.json releases-files.json releases.json

2020-01-18 Thread projects_role
"created": "2019-12-31", -"description": "Hudi (pronounced “Hoodie”) brings stream processing to big data, providing upserts, deletes and incremental data streams.", -"doap": "https://gitbox.apache.org/repos/asf?p=incubator-hudi.g

[GitHub] [hudi] codope commented on a change in pull request #3525: [HUDI-2346] Async clustering usage blog

2021-08-27 Thread GitBox
w to setup Hudi for asynchronous clustering" +author: codope +category: blog +--- + +In one of the [previous blog](https://hudi.apache.org/blog/2021/01/27/hudi-clustering-intro) posts, we introduced a new +kind of table service called clustering to reorganize data for improved query

[jira] [Commented] (HUDI-2346) Publish blog on async clustering usage

2021-08-27 Thread ASF GitHub Bot (Jira)
ng" +author: codope +category: blog +--- + +In one of the [previous blog](https://hudi.apache.org/blog/2021/01/27/hudi-clustering-intro) posts, we introduced a new +kind of table service called clustering to reorganize data for improved query performance without compromising on +ingestion speed

[GitHub] [hudi] vinothchandar commented on pull request #3496: Move content from cwiki to website (FAQ movement)

2021-09-01 Thread GitBox
://hudi.apache.org/docs/configurations#hive_support_timestamp instead of https://hudi.apache.org/docs/configurations/#hoodiedatasourcehive_syncsupport_timestamp -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

Re: [I] [SUPPORT] Flink Async Compaction MOR Table,OutOfMemoryError: Requested array size exceeds VM limit [hudi]

2024-09-13 Thread via GitHub
nsivabalan commented on issue #8902: URL: https://github.com/apache/hudi/issues/8902#issuecomment-2348842203 Can you enable spillable map based FSV which should guard the memory used by the FSV. https://hudi.apache.org/docs/configurations/#hoodiefilesystemviewtype = SPILLABLE

Re: [I] [SUPPORT] how to start schema evolution [hudi]

2024-08-13 Thread via GitHub
ad1happy2go commented on issue #11769: URL: https://github.com/apache/hudi/issues/11769#issuecomment-2286373627 @15663671003 You can consider using https://hudi.apache.org/docs/configurations/#hoodiedatasourcewritereconcileschema or https://hudi.apache.org/docs/next/record_payload

Re: [D] First Issues [hudi]

2026-02-11 Thread via GitHub
GitHub user jaykataria closed a discussion: First Issues Hi there I was looking through the how to contribute page here: https://hudi.apache.org/contribute/how-to-contribute and it has the following link https://github.com/apache/hudi/issues?q=state%3Aopen%20label%3Agood-first-issues, I

[GitHub] [hudi] codope commented on a change in pull request #3525: [HUDI-2346] Async clustering usage blog

2021-08-26 Thread GitBox
w to setup Hudi for asynchronous clustering" +author: codope +category: blog +--- + +In one of the [previous blog](https://hudi.apache.org/blog/2021/01/27/hudi-clustering-intro) posts, we introduced a new +kind of table service called clustering to reorganize data for improved query

[jira] [Commented] (HUDI-2346) Publish blog on async clustering usage

2021-08-27 Thread ASF GitHub Bot (Jira)
clustering" +author: codope +category: blog +--- + +In one of the [previous blog](https://hudi.apache.org/blog/2021/01/27/hudi-clustering-intro) posts, we introduced a new +kind of table service called clustering to reorganize data for improved query performance without compromising on +ingest

Re: [PR] [HUDI-4142] [RFC-54] New Table APIs and streamline Hudi configs [hudi]

2023-10-11 Thread via GitHub
+ +Currently, users can create and update Hudi Table using three different +ways: [Spark datasource](https://hudi.apache.org/docs/writing_data), +[SQL](https://hudi.apache.org/docs/table_management) +and [DeltaStreamer](https://hudi.apache.org/docs/hoodie_deltastreamer). Each one Review Comment

Re: [ANNOUNCE] Apache Hudi 1.0.0 released

2024-12-13 Thread Sivabalan
okup joins with hudi-flink > > Please review the release notes ( > https://hudi.apache.org/releases/release-1.0.0) for details on > release highlights and behavior changes before adopting the 1.0.0 > version. If you'd like to download the source release, you can find it > here: > https:/

[jira] [Commented] (HUDI-7289) Fix parameters for Big Query Sync

2024-01-11 Thread Bhavani Sudha (Jira)
lack. # Support of MoR table type in HoodieMultiTableStreamer - this info is not added in the  [doc|https://hudi.apache.org/docs/hoodie_streaming_ingestion#multitablestreamer]  and even in  [repo|https://github.com/apache/hudi/blob/master/hudi-utilities/src/main/java/org/apache/hudi/utilities/stre

Re: Hudi Flink App fails with OOM (looks a memory leak)

2023-11-08 Thread Danny Chan
y three resource > > objects. I'm not sure how it goes above 1000 in my user's case. > > > > Any known issue or any pointers on where this leak could be coming from? > > > Thanks, > > Prabhu Joseph > > > -----

Re: [I] [SUPPORT] Compaction fails with HoodieCompactionException: COMPACT failed to write to files [hudi]

2025-09-04 Thread via GitHub
compaction commits. Could you please share the full Spark application logs for review? Please refer to the [Spark Streaming documentation](https://hudi.apache.org/docs/compaction#spark-structured-streaming) for additional details. **References:** 1. https://hudi.apache.org

[GitHub] [hudi] xushiyan commented on issue #4550: [QUESTION] Hudi Partial Update on COW

2022-01-12 Thread GitBox
xushiyan commented on issue #4550: URL: https://github.com/apache/hudi/issues/4550#issuecomment-1011827801 @pankajc007 there are some examples in the quick start guide for merge into https://hudi.apache.org/docs/quick-start-guide#mergeinto options you can refer to the setup

[jira] [Comment Edited] (HUDI-1111) Highlight Hudi guarantees in documentation section of website

2024-03-24 Thread Raymond Xu (Jira)
6 PM: --- [https://hudi.apache.org/docs/next/hudi_stack#transactional-database-layer|https://hudi.apache.org/docs/next/concurrency_control] was (Author: xushiyan): https://hudi.apache.org/docs/next/concurrency_control > Highlight Hudi guarantees in documentation section of

[GitHub] [hudi] bettermouse commented on issue #6379: [SUPPORT]What's the reading behavior for MOR table?

2022-08-14 Thread GitBox
bettermouse commented on issue #6379: URL: https://github.com/apache/hudi/issues/6379#issuecomment-1214298708 According my understand.First is right. https://hudi.apache.org/docs/table_types#merge-on-read-table. each snapshot query(merge base / columnar file + row based delta

[GitHub] [incubator-hudi] ambition119 commented on issue #603: [HUDI-63] Removed unused BucketedIndex code

2019-03-15 Thread GitBox
ambition119 commented on issue #603: [HUDI-63] Removed unused BucketedIndex code URL: https://github.com/apache/incubator-hudi/pull/603#issuecomment-473491008 > @ambition119 I think its [[email protected]](mailto:[email protected]). not hudi.incubator.apache.org? Empty email to `

Re: [I] Support partial update for streaming change logs [hudi]

2025-12-07 Thread via GitHub
yihua commented on issue #14605: URL: https://github.com/apache/hudi/issues/14605#issuecomment-3623457657 We now support partial update encoding (https://hudi.apache.org/releases/release-1.0.0#partial-updates) and partial merge through the record merger. The `HoodieRecordPayload` is

(hudi) branch asf-site updated: [HUDI-7263][BLOG] Hudi 2023 a year in review (#10425)

2023-12-28 Thread xushiyan
t advancements and innovations. +There have been three major releases: [0.13.0](https://hudi.apache.org/releases/release-0.13.0), +[0.14.0](https://hudi.apache.org/releases/release-0.14.0), and the trailblazing +[1.0.0-beta1](https://hudi.apache.org/releases/release-1.0.0-beta1) that have

[incubator-hudi] branch master updated: [DOCS] Update Hudi Readme (#1058)

2019-12-02 Thread vinoth
rts Deletes and Incrementals`. Hudi manages the storage of large analytical datasets on DFS (Cloud stores, HDFS or any Hadoop FileSystem compatible storage). -### Features +<http://hudi.apache.org/> + +[![Build Status](https://travis-ci.org/apache/incubator-hudi.svg?branch=master)](https

(hudi) branch asf-site updated: [HUDI-5180][HUDI-3939] Fix links and clarify JIRA self-service (#11087)

2024-04-24 Thread bhavanisudha
| Dev Mailing list ([Subscribe](mailto:[email protected]), [Unsubscribe](mailto:[email protected]), [Archives](https://lists.apache.org/[email protected])). Empty email works for subscribe/unsubscribe. Please use [gists](https://gist.github.com) to share code/stack

[GitHub] [incubator-hudi] lamber-ken commented on issue #1283: [HUDI-579] Add border to table on hudi website

2020-01-27 Thread GitBox
lamber-ken commented on issue #1283: [HUDI-579] Add border to table on hudi website URL: https://github.com/apache/incubator-hudi/pull/1283#issuecomment-578634486 ## Quick compare: **landing page** - https://hudi.apache.org - https://lamber-ken.github.io **Community

[GitHub] [incubator-hudi] lamber-ken edited a comment on issue #1283: [HUDI-579] Add border to table on hudi website

2020-01-27 Thread GitBox
lamber-ken edited a comment on issue #1283: [HUDI-579] Add border to table on hudi website URL: https://github.com/apache/incubator-hudi/pull/1283#issuecomment-578634486 ## Quick compare: **Landing page** - https://hudi.apache.org - https://lamber-ken.github.io

[GitHub] [hudi] dongkelun commented on issue #4642: [SUPPORT] Hudi Merge Into

2022-01-19 Thread GitBox
dongkelun commented on issue #4642: URL: https://github.com/apache/hudi/issues/4642#issuecomment-1017091320 @LucassLin In official documents:[https://hudi.apache.org/docs/quick-start-guide/](https://hudi.apache.org/docs/quick-start-guide/) #Create Table # SparkSQL ```sql

[GitHub] [hudi] nsivabalan commented on issue #3394: [SUPPORT] Question on hudi's default behaviour for UPSERT

2021-09-03 Thread GitBox
nsivabalan commented on issue #3394: URL: https://github.com/apache/hudi/issues/3394#issuecomment-912848915 1. To dedup records within the same incoming batch, you need to enable these configs. https://hudi.apache.org/docs/configurations#hoodiecombinebeforeupsert https

[GitHub] [hudi] xushiyan edited a comment on issue #4550: [QUESTION] Hudi Partial Update on COW

2022-01-12 Thread GitBox
xushiyan edited a comment on issue #4550: URL: https://github.com/apache/hudi/issues/4550#issuecomment-1011827801 @pankajc007 there are some examples in the quick start guide for merge into https://hudi.apache.org/docs/quick-start-guide#mergeinto options you can refer to the setup

[GitHub] [hudi] Guanpx commented on issue #4550: [QUESTION] Hudi Partial Update on COW

2022-01-13 Thread GitBox
Guanpx commented on issue #4550: URL: https://github.com/apache/hudi/issues/4550#issuecomment-1011901170 > @pankajc007 there are some examples in the quick start guide for merge into https://hudi.apache.org/docs/quick-start-guide#mergeinto > > options you can refer to

[GitHub] [hudi] nsivabalan edited a comment on issue #3572: compatble version of hudi, hive and hadoop

2021-12-18 Thread GitBox
nsivabalan edited a comment on issue #3572: URL: https://github.com/apache/hudi/issues/3572#issuecomment-997308285 @niloo-sh : can you try using run_hive_sync tool which is the most common way : https://hudi.apache.org/docs/docker_demo#step-3-sync-with-hive and let us know how it goes

(hudi-rs) branch main updated: chore: fix asf notification (#11)

2024-05-05 Thread xushiyan
yaml b/.asf.yaml index ad964fa..a81270a 100644 --- a/.asf.yaml +++ b/.asf.yaml @@ -42,3 +42,7 @@ github: strict: true required_linear_history: true required_conversation_resolution: true +notifications: + commits: [email protected] + issues: [email protected]

Re: [I] Support CoW incremental query [hudi-rs]

2024-09-02 Thread via GitHub
gohalo commented on issue #9: URL: https://github.com/apache/hudi-rs/issues/9#issuecomment-2324381533 From the official docs, there are two ways to implement incremental queries. 1. Configuration passed by options, details [Spark Incremental Query](https://hudi.apache.org/docs/0.13.0

[jira] [Updated] (HUDI-6740) Add 0.13.x to Spark 3 matrix support doc

2023-08-23 Thread Akira Ajisaka (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Akira Ajisaka updated HUDI-6740: Description: Hudi 0.13.x is missing in the Spark 3 matrix support doc [https://hudi.apache.org/docs

[jira] [Updated] (HUDI-6529) Update docs on developer setup

2023-07-13 Thread Ethan Guo (Jira)
[ https://issues.apache.org/jira/browse/HUDI-6529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ethan Guo updated HUDI-6529: Description: The developer setup page misses a few important steps: [https://hudi.apache.org/contribute

[GitHub] [hudi] nsivabalan commented on issue #3324: [SUPPORT]Slow Performance With Spark Structured Streaming

2021-08-06 Thread GitBox
nsivabalan commented on issue #3324: URL: https://github.com/apache/hudi/issues/3324#issuecomment-894492641 with MOR, there are 3 types of queries that could be of benefit to you. Config : https://hudi.apache.org/docs/configurations#query_type_opt_key [Snapshot/Realtime read](https

[GitHub] [hudi] nsivabalan edited a comment on issue #3324: [SUPPORT]Slow Performance With Spark Structured Streaming

2021-08-06 Thread GitBox
nsivabalan edited a comment on issue #3324: URL: https://github.com/apache/hudi/issues/3324#issuecomment-894492641 with MOR, there are 3 types of queries that could be of benefit to you. Config : https://hudi.apache.org/docs/configurations#query_type_opt_key [Snapshot/Realtime read

[GitHub] [hudi] xushiyan commented on issue #6579: [SUPPORT] How to participate in HUDI code contribution

2022-09-14 Thread GitBox
xushiyan commented on issue #6579: URL: https://github.com/apache/hudi/issues/6579#issuecomment-1247465847 hi @azhsmesos thanks for your interests in contributing! please check out https://hudi.apache.org/docs/quick-start-guide for quick start examples (both spark and flink) and many more

[PR] DOCS-updated gcp config doc [hudi]

2024-02-15 Thread via GitHub
`hoodie.gcp.bigquery.sync.base_path` ### Impact none ### Risk level (write none, low medium or high below) none ### Documentation Update updated https://hudi.apache.org/docs/next/gcp_bigquery _Describe any necessary documentation update if there is any new feature, config
