Spark is currently on a fairly dated version of Kryo 2.x; it lags behind the 
fixes in Hive and, as the APIs are incompatible, this resulted in the mutant 
spark-project/hive JAR needed for Hive 1.2.1 support.

But: updating it hasn't been an option, because Spark needs to be in sync with 
Twitter's Chill library.

There's now an offer from Twitter to help coordinate a Kryo update across 
Chill, Scalding and other things they use:

https://github.com/twitter/chill/pull/230

Given Kryo is "the Guava JAR of serialization", I doubt anyone is jumping up 
and down wanting this, but it is something to consider. Once Hive moves to the 
newer Kryo, all the Hive/Spark integration is probably going to break again; 
getting in sync with Hive (see SPARK-10793) would make Hive updates less 
traumatic.
