Re: [Hudi Improvement]: Introduce secondary source-ordering-field for breaking ties while writing

2019-08-30 Thread Vinoth Chandar
Assigned to you. and also added you to the role for future tickets,, On Thu, Aug 29, 2019 at 11:57 PM Pratyaksh Sharma wrote: > Hi Vinoth, > > The jira is HUDI-207 . > > On Thu, Aug 29, 2019 at 10:17 PM Vinoth Chandar wrote: > > > HI, > > > >

Re: [Hudi Improvement]: Introduce secondary source-ordering-field for breaking ties while writing

2019-08-30 Thread Pratyaksh Sharma
Hi Vinoth, The jira is HUDI-207 . On Thu, Aug 29, 2019 at 10:17 PM Vinoth Chandar wrote: > HI, > > whats your JIRA id? if you could share that, will add you the contributors > role. > > On Thu, Aug 29, 2019 at 12:02 AM Pratyaksh Sharma > wrote:

Re: [Hudi Improvement]: Introduce secondary source-ordering-field for breaking ties while writing

2019-08-29 Thread Pratyaksh Sharma
Sure Balaji, Please give me permissions so I can assign this jira to me and start working on it. On Wed, Aug 28, 2019 at 7:23 PM vbal...@apache.org wrote: > Sure Pratyaksh, Whatever field works for your use-case is good enough. > You do have

Re: [Hudi Improvement]: Introduce secondary source-ordering-field for breaking ties while writing

2019-08-28 Thread vbal...@apache.org
Sure Pratyaksh, Whatever field works for your use-case is good enough. You do have the flexibility to generate a derived field or use one of the source fieldsĀ  Balaji.VOn Wednesday, August 28, 2019, 06:48:44 AM PDT, Pratyaksh Sharma wrote: Hi Balaji, Sure I can do that. However

Re: [Hudi Improvement]: Introduce secondary source-ordering-field for breaking ties while writing

2019-08-28 Thread Pratyaksh Sharma
Hi Balaji, Sure I can do that. However after a considerable amount of time, the bin-log position will get exhausted. To handle this, we can have secondary ordering field as the ingestion_timestamp (the time when I am pushing the event to Kafka to be consumed by DeltaStreamer) which will work