Re: [DISCUSS] [VOTE] JDBC incremental load with DeltaStreamer

2019-09-16 Thread Vinoth Chandar
It should work like any other source, and none of the others are aware of whether DeltaStreamer is running in continuous mode or not. Simplistically, it just needs a config to denote an incremental field, say `_last_modified_at`, and we use that as a checkpoint to tail that table by including a
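
A minimal sketch of the checkpoint idea described above, not Hudi's implementation. It uses Python's built-in `sqlite3` as a stand-in for a JDBC source; the `orders` table, the `fetch_incremental` helper and the sample timestamps are hypothetical, and only the `_last_modified_at` field name comes from the message.

```python
import sqlite3


def fetch_incremental(conn, table, incremental_field, last_checkpoint):
    """Return (rows, new_checkpoint) for rows modified after last_checkpoint.

    Hypothetical helper: table and field names are assumed to come from
    trusted config, so they are interpolated; the checkpoint value is bound.
    """
    query = f"SELECT * FROM {table}"
    params = ()
    if last_checkpoint is not None:
        # Tail the table: only rows newer than the stored checkpoint.
        query += f" WHERE {incremental_field} > ?"
        params = (last_checkpoint,)
    query += f" ORDER BY {incremental_field}"
    cur = conn.execute(query, params)
    columns = [d[0] for d in cur.description]
    rows = [dict(zip(columns, r)) for r in cur.fetchall()]
    # The largest value seen becomes the next checkpoint; keep the old one
    # when nothing new arrived.
    new_checkpoint = rows[-1][incremental_field] if rows else last_checkpoint
    return rows, new_checkpoint


# Stand-in source table (assumption: timestamps stored as sortable text).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE orders (id INTEGER PRIMARY KEY, amount REAL, _last_modified_at TEXT)"
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, 10.0, "2019-09-13 05:00:00"), (2, 20.0, "2019-09-14 05:00:00")],
)

# First run: no checkpoint yet, so the whole table is read.
first_batch, checkpoint = fetch_incremental(conn, "orders", "_last_modified_at", None)

# The source changes: one new row and one updated row.
conn.execute("INSERT INTO orders VALUES (3, 30.0, '2019-09-16 09:00:00')")
conn.execute(
    "UPDATE orders SET amount = 25.0, _last_modified_at = '2019-09-16 09:05:00' WHERE id = 2"
)

# Second run: only rows modified after the stored checkpoint come back.
second_batch, checkpoint = fetch_incremental(conn, "orders", "_last_modified_at", checkpoint)
```

One caveat with this sketch: the strict `>` comparison can miss rows that share the checkpoint's exact timestamp but are committed after the read, so a real source would need to decide how to handle ties.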

Re: [DISCUSS] [VOTE] JDBC incremental load with DeltaStreamer

2019-09-16 Thread Taher Koitawala
Will this be the same implementation as session.read.jdbc("") and then call this code continuously like how we are running HUDI in continuous mode? On Mon, Sep 16, 2019 at 9:09 PM Vinoth Chandar wrote: > Thanks, Taher! Any takers for driving this? This is something I would be > very interested
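
To make the question concrete, here is a rough sketch of what "calling the same read continuously" could look like. It is not DeltaStreamer's actual code: `run_sync`, `fetch_new_rows`, `max_rounds` and the in-memory `RECORDS` list are all hypothetical, and the fetch function stands in for whatever a `session.read.jdbc` based source would do.

```python
import time
from typing import Callable, List, Optional, Tuple

Batch = List[dict]
FetchFn = Callable[[Optional[str]], Tuple[Batch, Optional[str]]]


def run_sync(fetch: FetchFn, write: Callable[[Batch], None],
             continuous: bool = False, max_rounds: Optional[int] = None,
             min_sync_interval_secs: float = 0.0) -> Optional[str]:
    """Drive a source once, or repeatedly when continuous=True.

    The source only ever sees the last checkpoint; it has no idea whether
    it is being called once or in a loop.
    """
    checkpoint = None
    rounds = 0
    while True:
        batch, checkpoint = fetch(checkpoint)
        if batch:
            write(batch)
        rounds += 1
        # max_rounds exists only so this example terminates.
        if not continuous or (max_rounds is not None and rounds >= max_rounds):
            return checkpoint
        time.sleep(min_sync_interval_secs)


# Hypothetical in-memory "table" standing in for the JDBC source.
RECORDS = [{"id": 1, "ts": "2019-09-13"}, {"id": 2, "ts": "2019-09-14"}]


def fetch_new_rows(checkpoint: Optional[str]) -> Tuple[Batch, Optional[str]]:
    rows = sorted(
        (r for r in RECORDS if checkpoint is None or r["ts"] > checkpoint),
        key=lambda r: r["ts"],
    )
    return rows, (rows[-1]["ts"] if rows else checkpoint)


written: Batch = []
final_checkpoint = run_sync(fetch_new_rows, written.extend, continuous=True, max_rounds=3)
```

In this sketch the same `fetch_new_rows` function serves both modes; only the driver loop differs.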

Re: [DISCUSS] [VOTE] JDBC incremental load with DeltaStreamer

2019-09-16 Thread Vinoth Chandar
Thanks, Taher! Any takers for driving this? This is something I would be very interested in getting involved with. Don't have the bandwidth atm :/ On Sun, Sep 15, 2019 at 11:15 PM Taher Koitawala wrote: > Thank you all for your support. JIRA filed at >

Re: [DISCUSS] [VOTE] JDBC incremental load with DeltaStreamer

2019-09-16 Thread Taher Koitawala
Thank you all for your support. JIRA filed at https://issues.apache.org/jira/browse/HUDI-251 Regards, Taher Koitawala On Mon, Sep 16, 2019 at 11:34 AM Taher Koitawala wrote: > Since everyone is fully onboard, I am creating a JIRA to track this. > > On Sun, Sep 15, 2019 at 9:47 AM

Re: [DISCUSS] [VOTE] JDBC incremental load with DeltaStreamer

2019-09-16 Thread Taher Koitawala
Since everyone is fully onboard, I am creating a JIRA to track this. On Sun, Sep 15, 2019 at 9:47 AM vbal...@apache.org wrote: > > +1. Agree with everyone's point. Go for it Taher !! > Balaji.V On Saturday, September 14, 2019, 07:44:04 PM PDT, Bhavani > Sudha Saktheeswaran wrote: > > +1 I

Re: [DISCUSS] [VOTE] JDBC incremental load with DeltaStreamer

2019-09-14 Thread vbal...@apache.org
+1. Agree with everyone's point. Go for it Taher !! Balaji.V On Saturday, September 14, 2019, 07:44:04 PM PDT, Bhavani Sudha Saktheeswaran wrote: +1 I think adding new sources to DeltaStreamer is really valuable. Thanks, Sudha On Sat, Sep 14, 2019 at 7:52 AM vino yang wrote: > Hi

Re: [DISCUSS] [VOTE] JDBC incremental load with DeltaStreamer

2019-09-14 Thread Bhavani Sudha Saktheeswaran
+1 I think adding new sources to DeltaStreamer is really valuable. Thanks, Sudha On Sat, Sep 14, 2019 at 7:52 AM vino yang wrote: > Hi Taher, > > IMO, it's a good supplement to Hudi. > > So +1 from my side. > > On Sat, Sep 14, 2019 at 10:23 PM, Vinoth Chandar wrote: > > > Hi Taher, > > > > I am fully

Re: [DISCUSS] [VOTE] JDBC incremental load with DeltaStreamer

2019-09-14 Thread Vinoth Chandar
Hi Taher, I am fully onboard on this. This is such a frequently asked question and having it all doable with a simple DeltaStreamer command would be really powerful. +1 - Vinoth On 2019/09/14 05:51:05, Taher Koitawala wrote: > Hi All, > Currently, we are trying to pull data

[DISCUSS] [VOTE] JDBC incremental load with DeltaStreamer

2019-09-13 Thread Taher Koitawala
Hi All, Currently, we are trying to pull data incrementally from our RDBMS sources; however, the way we are doing this with HUDI is to create a Spark table on top of the JDBC source using [1], which writes raw data to an HDFS dir. We then use DeltaStreamer dfs-source to write that to a