> > 1) is SKEWED BY honored ? If so, has anyone run into directories not being > created ? >
It is not. 2) if it is not honored, does it matter ? Hive introduced this feature to > better handle joins where tables had a skewed distribution on keys joined > on so that the single mapper handling one of the keys didn't hold up the > whole process. Could that happen in Spark / Spark SQL? > It could matter for very skewed data, though I have not heard many complaints. We could consider adding it in the future if people are having problems with skewed data.