Here is an example of what Amr mentioned from one of my Hive scripts, returns the set of pages not in "daily_pagecounts_table"
select dt.page_id, dt.dates, dt.pageviews, dt.total_pageviews FROM daily_timelines dt LEFT OUTER JOIN daily_pagecounts_table dp ON (dt.page_id = dp.page_id) where dp.page_id is NULL On Mon, Jun 29, 2009 at 7:14 PM, Amr Awadallah <[email protected]> wrote: > > do an outer join on user and filter on name.user is null > > -- amr > > > Rakesh Setty wrote: > > Hi, > > > > I am new to Hive. I would like to know what is the easiest way > to get the difference between two sets. For example, how can I convert the > following SQL query to Hive? > > > > select user from page_views where user not in (select name from users); > > > > Thanks, > > Rakesh > > -- Peter N. Skomoroch 617.285.8348 http://www.datawrangling.com http://delicious.com/pskomoroch http://twitter.com/peteskomoroch
