Discussion for timestamp support

2021-03-31 Thread lrz
Hi, I want to discuss about the support for timestamp dataType. As we know, now Hudi save timestamp type as long, then this will lead to some problem when the table include timestamp datatype: 1) At bootstrap operation, if the origin parquet file was written by a spark application, then spark

Re: [DISCUSS] Incremental computation pipeline for HUDI

2021-03-31 Thread vino yang
>> Oops, the image crushes, for "change flags", i mean: insert, update(before and after) and delete. Yes, the image I attached is also about these flags. [image: image (3).png] +1 for the idea. Best, Vino Danny Chan 于2021年4月1日周四 上午10:03写道: > Oops, the image crushes, for "change flags", i

Re: [DISCUSS] Incremental computation pipeline for HUDI

2021-03-31 Thread Danny Chan
Oops, the image crushes, for "change flags", i mean: insert, update(before and after) and delete. The Flink engine can propagate the change flags internally between its operators, if HUDI can send the change flags to Flink, the incremental calculation of CDC would be very natural (almost

Re: [VOTE] Release 0.8.0, release candidate #1

2021-03-31 Thread Mehrotra, Udit
+1 - Release Validation Script [OK] - Compile with Spark 2/Spark 3 [OK] - Ran QuickStart with Spark 2/Spark 3 on EMR [OK] Thanks, Udit On 3/31/21, 5:16 PM, "Vinoth Chandar" wrote: CAUTION: This email originated from outside of the organization. Do not click links or open attachments

Re: [VOTE] Release 0.8.0, release candidate #1

2021-03-31 Thread Vinoth Chandar
+1 binding - Tested Optimistic Concurrency Control in EMR/Zookeeper setup - Ran some perf tests with bulk_insert/row writer, upserts - Cleaning, Archival all seem to be happening as expected - Hit one intermittent issue with the metadata table, but unable to repro again. So not blocking the

Re: [VOTE] Release 0.8.0, release candidate #1

2021-03-31 Thread vbal...@apache.org
+1 binding Compilation Succeeded.Release Validation Succeeded ```balaji-varadarajan--C02CV6A6MD6R:scripts balaji.varadarajan$ ./release/validate_staged_release.sh --release=0.8.0 --rc_num=1  /tmp/validation_scratch_dir_001 ~/code/oss/upstream_hudi/scriptsDownloading from svn co

Re: [VOTE] Release 0.8.0, release candidate #1

2021-03-31 Thread nishith agarwal
+1 binding 1. Compilation [OK] 2. Quick start (Spark 2.x, 3.x) [OK] 3. Signature [OK] Thanks, Nishith On Wed, Mar 31, 2021 at 8:35 AM vino yang wrote: > +1 binding > > - ran `mvn clean package -DskipTests` [OK] > - quick start (Spark 2.x, 3.x) [OK] > - checked signature [OK] > > Best, > Vino

Re: [VOTE] Release 0.8.0, release candidate #1

2021-03-31 Thread vino yang
+1 binding - ran `mvn clean package -DskipTests` [OK] - quick start (Spark 2.x, 3.x) [OK] - checked signature [OK] Best, Vino Sivabalan 于2021年3月31日周三 下午12:32写道: > +1 binding > > - Compilation Ok > - Quick start utils w/ spark3 Ok > - checksum Ok > - release validation script Ok > - Ran hudi

Re: [DISCUSS] Incremental computation pipeline for HUDI

2021-03-31 Thread vino yang
Hi Danny, Thanks for kicking off this discussion thread. Yes, incremental query( or says "incremental processing") has always been an important feature of the Hudi framework. If we can make this feature better, it will be even more exciting. In the data warehouse, in some complex calculations,

[DISCUSS] Incremental computation pipeline for HUDI

2021-03-31 Thread Danny Chan
Hi dear HUDI community ~ Here i want to fire a discuss about using HUDI as the unified storage/format for data warehouse/lake incremental computation. Usually people divide data warehouse production into several levels, such as the ODS(operation data store), DWD(data warehouse details), DWS(data

Re: Request to join Project Committer Group

2021-03-31 Thread Danny Chan
cc @vinoth Best, Danny Chan harshit mittal 于2021年3月31日周三 下午3:18写道: > Hi, > I'd like to be added to the project committer group. Could somebody help me > with this request?(jiraId: hmittal83, cwiki userId: hmittal83). > -- > Best, > Harshit >

Request to join Project Committer Group

2021-03-31 Thread harshit mittal
Hi, I'd like to be added to the project committer group. Could somebody help me with this request?(jiraId: hmittal83, cwiki userId: hmittal83). -- Best, Harshit