*I read somewhere that hive does NOT support OR condition....that's what I was wondering as to how others handle the situation.* * *
On Fri, Aug 20, 2010 at 8:30 AM, Jacob R Rideout <apa...@jacobrideout.net>wrote: > On Fri, Aug 20, 2010 at 8:37 AM, Ronak Bhatt <ronakb...@gmail.com> wrote: > > Hi - I've a following SQL query. What is the way to convert it into HIVE > > runnable format - > > Select a.id, count(b.url) > > from a, b > > where a.id = b.id > > and (b.url like '%google.com%' or b.url like '%h.google.com%' or b.url > like > > '%bing%') > > and a.exec_date = date(now()) ; ----- get today's data > > group by a.id > > > > thanks, ronak > > > > I think something like: > > SELECT a.id, COUNT(distinct b.url) > FROM a > JOIN b on a.id = b.id > WHERE > (b.url LIKE '%google.com%' OR b.url LIKE '%h.google.com%' OR > b.url LIKE '%bing%') > AND a.exec_date = from_unixtime(unix_timestamp()) > GROUP BY a.id; > > should work, but I didn't test it. You will probably need to format > the data string to match your data - hive doesn't really have a date > type as such - but instead has date manipulation functions for > strings. > > Take a look at > http://wiki.apache.org/hadoop/Hive/LanguageManual/UDF#Date_Functions > > > Jacob Rideout >