RE: How to compare data in two tables?

2015-04-27 Thread Mich Talebzadeh
pient to ensure that this email is virus free, therefore neither Peridale Ltd, its subsidiaries nor their employees accept any responsibility. From: Alexander Pivovarov [mailto:apivova...@gmail.com] Sent: 27 April 2015 22:05 To: user@hive.apache.org Subject: RE: How to compare data in two tables?

RE: How to compare data in two tables?

2015-04-27 Thread Alexander Pivovarov
Golden source is Oracle DB. Ihave two cases: 1. Tables are overwritten completly every day. 2. Tables are incrementally loaded. PK is auto incremented number in Oracle. What you think if I concat all cells of a row to a string. Get int hashcode from the string. And then sum hashcodes to get a

RE: How to compare data in two tables?

2015-04-27 Thread Mich Talebzadeh
Hi Alex, Am I correct that the source of data resides in a relational table and that table has all the data already (the golden source) sent to both instances of Hive? Is the data in Hive added incrementally daily with “operation timestamp” for each record? Also do you have a unique identif