[GitHub] spark issue #20568: [SPARK-23381][CORE] Murmur3 hash generates a different v...

mrkm4ntr Mon, 12 Feb 2018 17:25:37 -0800

Github user mrkm4ntr commented on the issue:

    https://github.com/apache/spark/pull/20568
  
    @hvanhovell The main motivation is to support online prediction with 
parameters trained using FeatureHasher in MLlib. If the generated hash value 
differs from implementations in other languages, the indices of the 
coefficients do not match and predictions come out wrong.
    But I agree that backward compatibility is more important. Since 
FeatureHasher will be added in Spark 2.3.0, how about adding a new method with 
this change to Murmur3 and using it only from FeatureHasher?
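
    To illustrate the index mismatch, here is a minimal Python sketch. It is 
not Spark code. It assumes the divergence comes from how the trailing 1-3 bytes 
are mixed (one combined word in standard MurmurHash3 x86_32, one byte at a time 
in the variant), and that a feature's index is the signed 32-bit hash taken 
modulo the number of features. The names `murmur3_standard`, 
`murmur3_bytewise_tail`, and `feature_index`, the seed 42, and the 2^18 
feature count are illustrative choices.

```python
MASK32 = 0xFFFFFFFF

def _rotl32(x, r):
    return ((x << r) | (x >> (32 - r))) & MASK32

def _mix_k1(k1):
    k1 = (k1 * 0xCC9E2D51) & MASK32
    k1 = _rotl32(k1, 15)
    return (k1 * 0x1B873593) & MASK32

def _mix_h1(h1, k1):
    h1 = _rotl32(h1 ^ k1, 13)
    return (h1 * 5 + 0xE6546B64) & MASK32

def _fmix(h1, length):
    h1 ^= length
    h1 ^= h1 >> 16
    h1 = (h1 * 0x85EBCA6B) & MASK32
    h1 ^= h1 >> 13
    h1 = (h1 * 0xC2B2AE35) & MASK32
    return h1 ^ (h1 >> 16)

def _hash_aligned(data, seed):
    # Mix the 4-byte little-endian blocks; len(data) must be a multiple of 4.
    h1 = seed & MASK32
    for i in range(0, len(data), 4):
        h1 = _mix_h1(h1, _mix_k1(int.from_bytes(data[i:i + 4], "little")))
    return h1

def murmur3_standard(data, seed=42):
    # Standard MurmurHash3 x86_32: the tail bytes form ONE partial word,
    # which is XORed into h1 without the rotate/multiply step.
    n = len(data)
    aligned = n - n % 4
    h1 = _hash_aligned(data[:aligned], seed)
    tail = data[aligned:]
    if tail:
        h1 ^= _mix_k1(int.from_bytes(tail, "little"))
    return _fmix(h1, n)

def murmur3_bytewise_tail(data, seed=42):
    # Assumed variant: each tail byte is sign-extended and mixed as if it
    # were a full block, so results diverge when len(data) % 4 != 0.
    n = len(data)
    aligned = n - n % 4
    h1 = _hash_aligned(data[:aligned], seed)
    for b in data[aligned:]:
        signed_byte = b - 256 if b >= 128 else b
        h1 = _mix_h1(h1, _mix_k1(signed_byte & MASK32))
    return _fmix(h1, n)

def feature_index(hash32, num_features=1 << 18):
    # Interpret the hash as a signed 32-bit int, then take a non-negative mod.
    signed = hash32 - (1 << 32) if hash32 >= (1 << 31) else hash32
    return signed % num_features

if __name__ == "__main__":
    for name in ("abcd", "hello"):
        raw = name.encode("utf-8")
        print(name,
              feature_index(murmur3_standard(raw)),
              feature_index(murmur3_bytewise_tail(raw)))
```

    Inputs whose byte length is a multiple of 4 hash identically under both 
functions. Other inputs will generally hash differently, so a coefficient 
learned at one index would be looked up at another by a scorer that uses the 
other variant.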


