Hello, I have finally written a program that uploads my data to Amazon S3, starts a cluster on Amazon EMR, and recovers my partitions, and I can now issue simple queries in Hive.
Now I would like to run:

    select count(*), itemSid from items group by itemSid

This gives me how many times each item was viewed. I also need a second query to extract unique views, which I do not know how to write yet.

How do I store the outputs of these queries in a form such as:

    itemSid, pageViews, uniquePageViews
    1, 10, 8

Common sense tells me to store the results of query A, then query B, and then combine them in a table. Is that correct, and if so, how can I accomplish this?

Best Regards,
C.B.
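
P.S. To make the target concrete, here is a rough sketch of the kind of result I am after. It is only a guess, not something I have verified on my cluster. It assumes the items table has a column identifying the viewer (called userId here, a made-up name), and the table name item_stats and the S3 path are placeholders as well.

```sql
-- Sketch only: userId, item_stats and the S3 path are hypothetical names.
-- Both counts are computed in a single pass, so no join of two result sets is needed.
CREATE TABLE item_stats AS
SELECT itemSid,
       COUNT(*)               AS pageViews,        -- every view of the item
       COUNT(DISTINCT userId) AS uniquePageViews   -- distinct viewers of the item
FROM items
GROUP BY itemSid;

-- Optionally write the same result out as files instead of a Hive table.
INSERT OVERWRITE DIRECTORY 's3://my-bucket/item-stats/'
SELECT itemSid, pageViews, uniquePageViews
FROM item_stats;
```

If something like this is valid, the separate "query A, then query B, then combine" approach might not be necessary, but I would appreciate confirmation.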
