Re: Hive SKEWED feature supported in Spark SQL ?

Michael Armbrust Thu, 19 Feb 2015 12:01:11 -0800

>
> 1) is SKEWED BY honored ? If so, has anyone run into directories not being
> created ?
>


It is not. Spark SQL does not currently honor SKEWED BY.

> 2) if it is not honored, does it matter ? Hive introduced this feature to
> better handle joins where tables had a skewed distribution on keys joined
> on so that the single mapper handling one of the keys didn't hold up the
> whole process. Could that happen in Spark / Spark SQL?
>

It could matter for very skewed data, though I have not heard many
complaints.  We could consider adding it in the future if people are having
problems with skewed data.
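
To make the concern concrete, here is a minimal sketch in plain Python (not the Spark API) of how one hot key can dominate a hash-partitioned shuffle, and how a manual workaround often called key salting spreads it out. The row counts, the partition count, and the helper name partition_sizes are all made up for illustration; this is not something Spark SQL does for you.

```python
from collections import Counter

NUM_PARTITIONS = 8  # assumed number of shuffle partitions


def partition_sizes(partition_ids):
    """Return the number of rows that land in each partition."""
    counts = Counter(partition_ids)
    return [counts.get(p, 0) for p in range(NUM_PARTITIONS)]


# Hypothetical skewed table: key 0 has 9,000 rows, keys 1..1000 have one row each.
keys = [0] * 9000 + list(range(1, 1001))

# Plain hash partitioning: every row with the same key goes to the same
# partition, so the task that owns key 0 does almost all of the work.
plain = partition_sizes(k % NUM_PARTITIONS for k in keys)

# Salting: combine each key with a rotating salt so rows of the hot key are
# spread over several partitions instead of one.
salted = partition_sizes(
    (k + (i % NUM_PARTITIONS)) % NUM_PARTITIONS for i, k in enumerate(keys)
)

print("largest partition without salting:", max(plain))
print("largest partition with salting:   ", max(salted))
```

In a real join, the other table would also have to be replicated once per salt value so that matching rows still meet, which is the cost of this workaround.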
