And the usual hint when migrating - do not migrate only but also optimize the 
ETL process design - this brings the most benefit s

> On 5. Apr 2018, at 08:18, Jörn Franke <jornfra...@gmail.com> wrote:
> 
> Ok this is not much detail, but you are probably best off if you migrate them 
> to SparkSQL.
> 
> Depends also on the Hive version and Spark version. If you have a recent one 
> with TEZ+llap I would not expect so much difference. It can be also less 
> performant -Spark SQL got only recently some features suchst cost based 
> optimizer.
> 
>> On 5. Apr 2018, at 08:02, Pralabh Kumar <pralabhku...@gmail.com> wrote:
>> 
>> Hi 
>> 
>> I have lot of ETL jobs (complex ones) , since they are SLA critical , I am 
>> planning them to migrate to spark.
>> 
>>> On Thu, Apr 5, 2018 at 10:46 AM, Jörn Franke <jornfra...@gmail.com> wrote:
>>> You need to provide more context on what you do currently in Hive and what 
>>> do you expect from the migration.
>>> 
>>>> On 5. Apr 2018, at 05:43, Pralabh Kumar <pralabhku...@gmail.com> wrote:
>>>> 
>>>> Hi Spark group
>>>> 
>>>> What's the best way to Migrate Hive to Spark
>>>> 
>>>> 1) Use HiveContext of Spark
>>>> 2) Use Hive on Spark 
>>>> (https://cwiki.apache.org/confluence/display/Hive/Hive+on+Spark%3A+Getting+Started)
>>>> 3) Migrate Hive to Calcite to Spark SQL
>>>> 
>>>> 
>>>> Regards
>>>> 
>> 

Reply via email to