Hi, Does anyone has good architecture document/design principle for building warehouse application using Spark.
Is it better way of having Hive Context created with HQL and perform transformation or Directly loading files in dataframe and perform data transformation. We need to implement SCD 2 Type in Spark, Is there any better document/reference for building Type 2 warehouse object Thanks in advace /Mahender