Hello,

I have finally wrote a program to upload my data to amazon s3, start a
cluster on amazon emr, and recover my partitions, and can issue simple
queries on hive.

now I would like to:

select count(*),itemSid from items group by itemSid <- gives me how
many times an item as viewed

and another query to extract unique views that i dont know how to yet.

how do I store the outputs of these queries,

such as:

itemSid, pageViews, uniquePageViews
1            10             8

commonsense tells me, store the results of query A, then query B, and
then combine them in a table?

is that correct, and if so, how can i accomplish this?

Best Regards,
C.B.

Reply via email to