[jira] [Commented] (SPARK-23381) Murmur3 hash generates a different value from other implementations

2018-02-10 Thread Shintaro Murakami (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-23381?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16359690#comment-16359690
 ] 

Shintaro Murakami commented on SPARK-23381:
---

FeatureHasher in MLlib uses Murmur3 to hash feature indices. If I serve online 
predictions from another environment, such as a C++ prediction server, the 
indices do not match and the model cannot predict correctly.

> Murmur3 hash generates a different value from other implementations
> ---
>
> Key: SPARK-23381
> URL: https://issues.apache.org/jira/browse/SPARK-23381
> Project: Spark
> Issue Type: Bug
> Components: Spark Core
> Affects Versions: 2.2.1
> Reporter: Shintaro Murakami
> Priority: Major
>
> Murmur3 hash generates a different value from the original implementation 
> and from other implementations (such as the Scala standard library and Guava) 
> when the length of a byte array is not a multiple of 4.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-23381) Murmur3 hash generates a different value from other implementations

2018-02-10 Thread Shintaro Murakami (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-23381?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shintaro Murakami updated SPARK-23381:
--
Summary: Murmur3 hash generates a different value from other 
implementations (was: Murmur3 hash generates a different value)







[jira] [Created] (SPARK-23381) Murmur3 hash generates a different value

2018-02-10 Thread Shintaro Murakami (JIRA)
Shintaro Murakami created SPARK-23381:
-

Summary: Murmur3 hash generates a different value
Key: SPARK-23381
URL: https://issues.apache.org/jira/browse/SPARK-23381
Project: Spark
Issue Type: Bug
Components: Spark Core
Affects Versions: 2.2.1
Reporter: Shintaro Murakami


Murmur3 hash generates a different value from the original implementation and 
from other implementations (such as the Scala standard library and Guava) when 
the length of a byte array is not a multiple of 4.
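
To make the report concrete, here is a minimal Python sketch of how two 
Murmur3 (x86, 32-bit) variants can agree on aligned input and diverge on the 
rest. The issue text does not say where the divergence comes from; the sketch 
assumes it lies in how the trailing 1 to 3 bytes are mixed. 
murmur3_reference follows the original algorithm's tail handling, while 
murmur3_bytewise_tail is a hypothetical variant that mixes each trailing byte 
as its own block. Neither function is Spark's actual code.

```python
MASK = 0xFFFFFFFF
C1 = 0xCC9E2D51
C2 = 0x1B873593


def _rotl(x, r):
    # 32-bit left rotation.
    return ((x << r) | (x >> (32 - r))) & MASK


def _mix_k1(k1):
    k1 = (k1 * C1) & MASK
    k1 = _rotl(k1, 15)
    return (k1 * C2) & MASK


def _mix_h1(h1, k1):
    h1 ^= k1
    h1 = _rotl(h1, 13)
    return (h1 * 5 + 0xE6546B64) & MASK


def _fmix(h1, length):
    # Final avalanche step, including the input length.
    h1 ^= length
    h1 ^= h1 >> 16
    h1 = (h1 * 0x85EBCA6B) & MASK
    h1 ^= h1 >> 13
    h1 = (h1 * 0xC2B2AE35) & MASK
    h1 ^= h1 >> 16
    return h1


def _hash_blocks(data, seed):
    # Mix all complete 4-byte little-endian blocks; shared by both variants.
    h1 = seed & MASK
    aligned = len(data) - len(data) % 4
    for i in range(0, aligned, 4):
        k1 = int.from_bytes(data[i:i + 4], "little")
        h1 = _mix_h1(h1, _mix_k1(k1))
    return h1, aligned


def murmur3_reference(data, seed=0):
    # Original tail handling: pack the 1-3 leftover bytes into one word,
    # mix it once, and XOR it into h1 without the h1 rotate/multiply step.
    h1, aligned = _hash_blocks(data, seed)
    tail = data[aligned:]
    if tail:
        h1 ^= _mix_k1(int.from_bytes(tail, "little"))
    return _fmix(h1, len(data))


def murmur3_bytewise_tail(data, seed=0):
    # ASSUMPTION (hypothetical variant): treat every leftover byte as a
    # full block of its own, sign-extended as a Java byte would be.
    h1, aligned = _hash_blocks(data, seed)
    for b in data[aligned:]:
        signed = b - 256 if b >= 128 else b
        h1 = _mix_h1(h1, _mix_k1(signed & MASK))
    return _fmix(h1, len(data))


if __name__ == "__main__":
    for sample in (b"test", b"foo"):
        ref = murmur3_reference(sample)
        alt = murmur3_bytewise_tail(sample)
        print(sample, len(sample) % 4 == 0, hex(ref), hex(alt), ref == alt)
```

Under this assumption, any consumer that recomputes hashed indices outside 
Spark would have to reproduce the same tail handling to get matching values 
for inputs whose length is not a multiple of 4.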






[jira] [Created] (SPARK-22795) Raise error when line search in FirstOrderMinimizer failed

2017-12-15 Thread Shintaro Murakami (JIRA)
Shintaro Murakami created SPARK-22795:
-

Summary: Raise error when line search in FirstOrderMinimizer failed
Key: SPARK-22795
URL: https://issues.apache.org/jira/browse/SPARK-22795
Project: Spark
Issue Type: Improvement
Components: ML
Affects Versions: 2.2.1
Reporter: Shintaro Murakami
Priority: Minor


When the line search in FirstOrderMinimizer (LBFGS, OWLQN, and so on) fails, 
the estimator's fit method does not fail, and a meaningless transformer is 
returned. The error should be surfaced early.
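
The requested behaviour can be sketched in a few lines of Python. This is not 
Breeze's FirstOrderMinimizer or Spark's fit method; it is a hypothetical 
scalar gradient descent with a backtracking (Armijo) line search, and the 
names LineSearchFailed, backtracking_line_search and minimize are invented 
for illustration. The point is only that a failed line search raises an 
error instead of letting the caller receive the current iterate as if it were 
a fitted result.

```python
class LineSearchFailed(RuntimeError):
    """Raised when no step size gives a sufficient decrease."""


def backtracking_line_search(f, x, fx, grad, direction,
                             step=1.0, shrink=0.5, c=1e-4, max_tries=30):
    # Armijo condition: f(x + t*d) <= f(x) + c * t * grad * d.
    slope = grad * direction
    for _ in range(max_tries):
        candidate = x + step * direction
        if f(candidate) <= fx + c * step * slope:
            return candidate
        step *= shrink
    # Fail loudly instead of returning the unchanged point.
    raise LineSearchFailed(
        f"no acceptable step found after {max_tries} tries at x={x!r}")


def minimize(f, grad_f, x0, max_iter=100, tol=1e-8):
    # Hypothetical scalar gradient descent; any LineSearchFailed propagates
    # to the caller, so a bad fit is noticed immediately.
    x = x0
    for _ in range(max_iter):
        g = grad_f(x)
        if abs(g) < tol:
            break
        x = backtracking_line_search(f, x, f(x), g, -g)
    return x


if __name__ == "__main__":
    f = lambda x: (x - 3.0) ** 2
    print(minimize(f, lambda x: 2.0 * (x - 3.0), 0.0))
    try:
        # A gradient with the wrong sign makes every step an ascent step.
        minimize(f, lambda x: -2.0 * (x - 3.0), 0.0)
    except LineSearchFailed as err:
        print("fit aborted:", err)
```

Whether an exception, a warning, or a status flag on the returned model is the 
right mechanism is a design choice this sketch does not settle.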



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
