Unfortunately, I don't have timestamps in those tables :( I only have a key on which I can check for the existence of a specific record.
But even with a timestamp, how would you perform the update? By "update" I mean overwriting an existing record. For example, suppose Table A contains:

key | field1 | field2
1   | a      | b

and Table B contains:

key | field1 | field2
1   | c      | d

After the update, I want Table B to contain:

1   | a      | b

I don't want to insert a new row in this case, just overwrite the existing one.

Thanks

On Thu 30 May 2019 at 05:10, Aakash Basu <aakash.spark....@gmail.com> wrote:

> Don't you have a date/timestamp to handle updates? So, you're talking
> about CDC? If you've Datestamp you can check if that/those key(s) exists,
> if exists then check if timestamp matches, if that matches, then ignore, if
> that doesn't then update.
>
> On Thu 30 May, 2019, 7:11 AM Genieliu, <feixiang...@163.com> wrote:
>
>> Isn't step1 and step2 producing the copy of Table A?
>>
>> --
>> Sent from: http://apache-spark-user-list.1001560.n3.nabble.com/

--
Tomasz Krol
patric...@gmail.com
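
P.S. To make the overwrite I describe above concrete, here is a minimal sketch in plain Python of the behaviour I am after, using only the key and no timestamp. The function name `overwrite_by_key`, the dict representation of the tables, and the extra row with key 2 are all hypothetical and only there for illustration. The sketch also assumes that a key present only in Table A would be added to Table B, which may or may not be what is wanted. In Spark, I assume the same idea could be expressed as a `left_anti` join of B against A on the key, followed by a union with A, but I have not verified that against these tables.

```python
from typing import Dict, Tuple

# A row is (field1, field2); each table is modelled as {key: row}.
Row = Tuple[str, str]


def overwrite_by_key(table_a: Dict[int, Row], table_b: Dict[int, Row]) -> Dict[int, Row]:
    """Return Table B with every row whose key exists in Table A replaced by A's row."""
    # Rows of B whose key does not appear in A stay as they are
    # (this plays the role of an anti join of B against A).
    untouched = {key: row for key, row in table_b.items() if key not in table_a}
    # A's rows win for every shared key. Assumption: keys that exist
    # only in A are added to the result as new rows.
    return {**untouched, **table_a}


# Tables from the example above, plus one hypothetical row (key 2)
# to show that rows without a match in A are left alone.
table_a = {1: ("a", "b")}
table_b = {1: ("c", "d"), 2: ("x", "y")}

result = overwrite_by_key(table_a, table_b)
for key in sorted(result):
    print(key, *result[key], sep=" | ")
```

The point of the sketch is that the existing row for key 1 is replaced rather than duplicated, so Table B still has exactly one row per key afterwards.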