zdl11111 commented on issue #10017:
URL: https://github.com/apache/hudi/issues/10017#issuecomment-1803344542

   > @zdl11111 Can you also share sample code/data what you are trying if 
possible.
   
   Sure, actually, this is a random problem, we can also query the correct data 
from the table, I am just wondering whether it is reasonable. Here are the 
steps:
   1. create a hudi table:
   
   create table table1 (
   c1 char(6), c2 varchar(10), c3 boolean,
   c4 smallint, c5 integer , c6 bigint,
   c7 float, c8 double,
   c9 timestamp , c10 date, c11 decimal(10,4), updatecol int,
   primary key (c5, c10) not enforced) partitioned by (c10, updatecol)
   with ('connector' = 'hudi',
           'table.type' = 'MERGE_ON_READ',
           'path' = 'hdfs:/flink/table1'
           'index.type' = 'BUCKET',
           'hoodie.bucket.index.hash.field' = 'c5,c10',
           'hoodie.bucket.index.num.buckets' = '4');
   
   2. create a source table using 'datagen' connector
   
   create table t_integer_gen (
   c1 char(6), c2 varchar(10), c3 boolean,
   c4 smallint, c5 integer, c6 bigint,
   c7 float, c8 double,
   c10 timestamp, c11 date, c12 decimal(10,4))
       with (
           'connector' = 'datagen',
           'number-of-rows' = '100000',
           'rows-per-second' = '2000',
           'fields.c1.length' = '6',
           'fields.c2.length' = '10'
           );
   3. insert into table1 select * from t_integer_gen;


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to