Why not use Spark SQL?

Mohammed
Author: Big Data Analytics with 
Spark<http://www.amazon.com/Big-Data-Analytics-Spark-Practitioners/dp/1484209656/>

From: Vikash Kumar [mailto:vikashsp...@gmail.com]
Sent: Wednesday, March 2, 2016 8:29 PM
To: user@spark.apache.org
Subject: convert SQL multiple Join in Spark


I have to write or convert below SQL query into spark/scala. Anybody can 
suggest how to implement this in Spark?

SELECT a.PERSON_ID as RETAINED_PERSON_ID,

                                a.PERSON_ID,

                                a.PERSONTYPE,

                                'y' as HOLDOUT,

                                d.LOCATION,

                                b.HHID,

                                a.AGE_OUTPUT as AGE,

                                a.FIRST_NAME,

                                d.STARTDATE,

                                d.ENDDATE,

                                'Not In Campaign' as HH_TYPE

                                FROM PERSON_MASTER_VIEW a

                                    INNER JOIN PERSON_ADDRESS_HH_KEYS b

                                        on a.PERSON_ID = b.PERSON_ID

                                    LEFT JOIN #Holdouts c

                                        on a.PERSON_ID = c.RETAINED_PERSON_ID

                                    INNER JOIN #Holdouts d

                                        on b.HHID = d.HHID

                                WHERE c.RETAINED_PERSON_ID IS NULL and 
a.PERSONTYPE IS NOT NULL

                                GROUP BY a.PERSON_ID, a.PERSONTYPE, b.HHID, 
a.AGE_OUTPUT, a.FIRST_NAME, d.LOCATION, d.STARTDATE, d.ENDDATE

Reply via email to