Github user mrkm4ntr commented on the issue:
https://github.com/apache/spark/pull/20568
@hvanhovell The main motivation is making the online prediction of trained
parameters using FeatureHasher in MLLib. If the generated hash value is
different from the implementations in another language, indices of coefficients
do not match and can not predict correctly.
But I agree backward compatibility is more important. Since FeatureHasher
will be added from Spark 2.3.0, how about adding a new method of this content
to Murmur 3 and using it only from FeatureHasher?
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]