[ 
https://issues.apache.org/jira/browse/MAHOUT-1888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15567471#comment-15567471
 ] 

ASF GitHub Bot commented on MAHOUT-1888:
----------------------------------------

Github user dlyubimov commented on a diff in the pull request:

    https://github.com/apache/mahout/pull/260#discussion_r82932511
  
    --- Diff: 
spark/src/test/scala/org/apache/mahout/sparkbindings/test/DistributedSparkSuite.scala
 ---
    @@ -45,6 +45,7 @@ trait DistributedSparkSuite extends 
DistributedMahoutSuite with LoggerConfigurat
               .set("spark.akka.frameSize", "30")
               .set("spark.default.parallelism", "10")
               .set("spark.executor.memory", "2G")
    +          .set("spark.kryo.registrationRequired", "true")
    --- End diff --
    
    This is not needed, this is why it is failing. we can enable that to see 
what else is left, but we don't have to patch every class out there that is 
used in tests. I think the classes that still have that have something to do 
with IndexedDataSet which is (in my view) not part of algebra engine, so we can 
ignore the rest.


> Performance Bug with Mahout Vector Serialization
> ------------------------------------------------
>
>                 Key: MAHOUT-1888
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1888
>             Project: Mahout
>          Issue Type: Bug
>          Components: spark
>    Affects Versions: 0.12.2
>            Reporter: Suneel Marthi
>            Assignee: Suneel Marthi
>             Fix For: 0.13.0
>
>
> Identified a performance bug with Mahout Vector serialization in 
> DistributedSparkSuite.
> Add the following
> {Code}
> .set("spark.kryo.registrationRequired", "true")
> {Code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to