[jira] [Commented] (SPARK-23381) Murmur3 hash generates a different value from other implementations
[ https://issues.apache.org/jira/browse/SPARK-23381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16359690#comment-16359690 ]

Shintaro Murakami commented on SPARK-23381:
---

FeatureHasher in MLlib uses Murmur3 to hash feature indices. If I run online prediction in another environment, such as a C++ prediction server, the hashed indices do not match and the model cannot predict correctly.

> Murmur3 hash generates a different value from other implementations
> ---
>
> Key: SPARK-23381
> URL: https://issues.apache.org/jira/browse/SPARK-23381
> Project: Spark
> Issue Type: Bug
> Components: Spark Core
> Affects Versions: 2.2.1
> Reporter: Shintaro Murakami
> Priority: Major
>
> Murmur3 hash generates a different value from the original and other
> implementations (such as the Scala standard library and Guava) when the
> length of the byte array is not a multiple of 4.

--
This message was sent by Atlassian JIRA (v7.6.3#76005)

To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Updated] (SPARK-23381) Murmur3 hash generates a different value from other implementations
[ https://issues.apache.org/jira/browse/SPARK-23381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Shintaro Murakami updated SPARK-23381:
---
Summary: Murmur3 hash generates a different value from other implementations (was: Murmur3 hash generates a different value)

> Murmur3 hash generates a different value from other implementations
> ---
>
> Key: SPARK-23381
> URL: https://issues.apache.org/jira/browse/SPARK-23381
> Project: Spark
> Issue Type: Bug
> Components: Spark Core
> Affects Versions: 2.2.1
> Reporter: Shintaro Murakami
> Priority: Major
>
> Murmur3 hash generates a different value from the original and other
> implementations (such as the Scala standard library and Guava) when the
> length of the byte array is not a multiple of 4.
[jira] [Created] (SPARK-23381) Murmur3 hash generates a different value
Shintaro Murakami created SPARK-23381:
---

Summary: Murmur3 hash generates a different value
Key: SPARK-23381
URL: https://issues.apache.org/jira/browse/SPARK-23381
Project: Spark
Issue Type: Bug
Components: Spark Core
Affects Versions: 2.2.1
Reporter: Shintaro Murakami

Murmur3 hash generates a different value from the original and other implementations (such as the Scala standard library and Guava) when the length of the byte array is not a multiple of 4.
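For readers who want to reproduce the kind of mismatch reported in SPARK-23381, here is a minimal Python sketch. It is not Spark's code. `murmur3_reference` follows the standard 32-bit Murmur3 (x86_32) algorithm. `murmur3_bytewise_tail` is an assumption about how a variant might diverge: it pushes each leftover byte through a full mixing round as a sign-extended integer instead of folding the tail into one partial block. Both function names are hypothetical.

```python
# Sketch only: a reference Murmur3 x86_32 and a hypothetical variant that
# handles the trailing 1-3 bytes differently. Neither is Spark's source.

C1 = 0xCC9E2D51
C2 = 0x1B873593
MASK = 0xFFFFFFFF


def _rotl32(x, r):
    """Rotate a 32-bit value left by r bits."""
    return ((x << r) | (x >> (32 - r))) & MASK


def _mix_k1(k1):
    k1 = (k1 * C1) & MASK
    k1 = _rotl32(k1, 15)
    return (k1 * C2) & MASK


def _mix_h1(h1, k1):
    h1 ^= k1
    h1 = _rotl32(h1, 13)
    return (h1 * 5 + 0xE6546B64) & MASK


def _fmix(h1, length):
    """Final avalanche step; length is the input length in bytes."""
    h1 ^= length
    h1 ^= h1 >> 16
    h1 = (h1 * 0x85EBCA6B) & MASK
    h1 ^= h1 >> 13
    h1 = (h1 * 0xC2B2AE35) & MASK
    h1 ^= h1 >> 16
    return h1


def _hash_aligned_blocks(data, seed):
    """Mix every complete 4-byte little-endian block; return (h1, bytes used)."""
    h1 = seed & MASK
    aligned = len(data) - len(data) % 4
    for i in range(0, aligned, 4):
        k1 = int.from_bytes(data[i:i + 4], "little")
        h1 = _mix_h1(h1, _mix_k1(k1))
    return h1, aligned


def murmur3_reference(data, seed=0):
    """Standard tail handling: combine leftover bytes into one partial block."""
    h1, aligned = _hash_aligned_blocks(data, seed)
    tail = data[aligned:]
    if tail:
        h1 ^= _mix_k1(int.from_bytes(tail, "little"))
    return _fmix(h1, len(data))


def murmur3_bytewise_tail(data, seed=0):
    """Assumed variant: each leftover byte gets its own full mixing round."""
    h1, aligned = _hash_aligned_blocks(data, seed)
    for b in data[aligned:]:
        signed = b - 256 if b > 127 else b  # sign-extend like a JVM byte
        h1 = _mix_h1(h1, _mix_k1(signed & MASK))
    return _fmix(h1, len(data))
```

Calling both functions on inputs such as `b"abcd"` and `b"abcde"` shows the pattern from the report: they agree when the length is a multiple of 4, because the tail loop never runs, and can disagree otherwise. A hashed feature index computed with one scheme would then point at a different slot than the index computed with the other.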
[jira] [Created] (SPARK-22795) Raise error when line search in FirstOrderMinimizer failed
Shintaro Murakami created SPARK-22795:
---

Summary: Raise error when line search in FirstOrderMinimizer failed
Key: SPARK-22795
URL: https://issues.apache.org/jira/browse/SPARK-22795
Project: Spark
Issue Type: Improvement
Components: ML
Affects Versions: 2.2.1
Reporter: Shintaro Murakami
Priority: Minor

When the line search in FirstOrderMinimizer (LBFGS, OWLQN, and so on) fails, the estimator's fit method does not fail, and a meaningless transformer is returned. The error should be surfaced early.

--
This message was sent by Atlassian JIRA (v6.4.14#64029)
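To illustrate the fail-fast behavior requested in SPARK-22795, here is a small Python sketch of a one-dimensional gradient descent with a backtracking (Armijo) line search. It is not Breeze's FirstOrderMinimizer or Spark's estimator code. `LineSearchFailed`, `backtracking_step`, and `minimize` are hypothetical names, and the parameter defaults are arbitrary assumptions.

```python
# Sketch only: raise when the line search cannot find an acceptable step,
# instead of silently returning the current (possibly useless) iterate.

class LineSearchFailed(RuntimeError):
    """Hypothetical error type; not part of Spark or Breeze."""


def backtracking_step(f, grad, x, step=1.0, shrink=0.5, c=1e-4, max_tries=30):
    """Return a step size satisfying the Armijo condition, or raise."""
    fx, g = f(x), grad(x)
    for _ in range(max_tries):
        # Armijo sufficient-decrease test along the direction -g.
        if f(x - step * g) <= fx - c * step * g * g:
            return step
        step *= shrink
    raise LineSearchFailed(
        f"no acceptable step after {max_tries} tries at x={x}"
    )


def minimize(f, grad, x0, iters=100, tol=1e-8):
    """Gradient descent on a scalar; propagates LineSearchFailed to the caller."""
    x = x0
    for _ in range(iters):
        g = grad(x)
        if abs(g) < tol:
            break
        x -= backtracking_step(f, grad, x) * g
    return x
```

With a correct gradient, `minimize(lambda x: (x - 3) ** 2, lambda x: 2 * (x - 3), 0.0)` converges normally. With a gradient of the wrong sign, no step size gives a decrease, so the caller sees `LineSearchFailed` rather than receiving a result that only looks like a fitted model.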