hi, 

i want to run a multiclass classification with 390 classes on120k label
points(tf-idf vectors). but i get the following exception. If i reduce the
number of classes to ~20 everythings work fine. How can i fix this?

 i use the LogisticRegressionWithLBFGS class for my classification on a 8
Node Cluster with 


total-executor-cores = 30

executor-memory = 20g

My Exception:

15/07/02 15:55:00 INFO DAGScheduler: Job 11 finished: count at
LBFGS.scala:170, took 0,521823 s
15/07/02 15:55:02 INFO MemoryStore: ensureFreeSpace(-1069858488) called with
curMem=308280107, maxMem=11113699737
15/07/02 15:55:02 INFO MemoryStore: Block broadcast_22 stored as values in
memory (estimated size -1069858488.0 B, free 11.1 GB)
Exception in thread "main" java.lang.reflect.InvocationTargetException
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
        at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
        at java.lang.reflect.Method.invoke(Method.java:606)
        at
org.apache.spark.deploy.worker.DriverWrapper$.main(DriverWrapper.scala:58)
        at 
org.apache.spark.deploy.worker.DriverWrapper.main(DriverWrapper.scala)
Caused by: java.lang.IllegalArgumentException: requirement failed:
sizeInBytes was negative: -1069858488
        at scala.Predef$.require(Predef.scala:233)
        at org.apache.spark.storage.BlockInfo.markReady(BlockInfo.scala:55)
        at org.apache.spark.storage.BlockManager.doPut(BlockManager.scala:812)
        at
org.apache.spark.storage.BlockManager.putIterator(BlockManager.scala:635)
        at 
org.apache.spark.storage.BlockManager.putSingle(BlockManager.scala:993)
        at
org.apache.spark.broadcast.TorrentBroadcast.writeBlocks(TorrentBroadcast.scala:99)
        at
org.apache.spark.broadcast.TorrentBroadcast.<init>(TorrentBroadcast.scala:85)
        at
org.apache.spark.broadcast.TorrentBroadcastFactory.newBroadcast(TorrentBroadcastFactory.scala:34)
        at
org.apache.spark.broadcast.BroadcastManager.newBroadcast(BroadcastManager.scala:62)
        at org.apache.spark.SparkContext.broadcast(SparkContext.scala:1289)
        at
org.apache.spark.mllib.optimization.LBFGS$CostFun.calculate(LBFGS.scala:215)
        at
org.apache.spark.mllib.optimization.LBFGS$CostFun.calculate(LBFGS.scala:204)
        at
breeze.optimize.CachedDiffFunction.calculate(CachedDiffFunction.scala:23)
        at
breeze.optimize.FirstOrderMinimizer.calculateObjective(FirstOrderMinimizer.scala:108)
        at
breeze.optimize.FirstOrderMinimizer.initialState(FirstOrderMinimizer.scala:101)
        at
breeze.optimize.FirstOrderMinimizer.iterations(FirstOrderMinimizer.scala:146)
        at org.apache.spark.mllib.optimization.LBFGS$.runLBFGS(LBFGS.scala:178)
        at org.apache.spark.mllib.optimization.LBFGS.optimize(LBFGS.scala:117)
        at
org.apache.spark.mllib.regression.GeneralizedLinearAlgorithm.run(GeneralizedLinearAlgorithm.scala:282)
        at
org.apache.spark.mllib.regression.GeneralizedLinearAlgorithm.run(GeneralizedLinearAlgorithm.scala:205)
        at
com.test.spark.SVMSimpleAppEC2$.createNaiveBayesModel(SVMSimpleAppEC2.scala:150)
        at com.test.spark.SVMSimpleAppEC2$.main(SVMSimpleAppEC2.scala:48)
        at com.test.spark.SVMSimpleAppEC2.main(SVMSimpleAppEC2.scala)
        ... 6 more
15/07/02 15:55:02 INFO SparkContext: Invoking stop() from shutdown hook



--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/Spark-1-4-MLLib-Bug-Multiclass-Classification-requirement-failed-sizeInBytes-was-negative-tp23610.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

Reply via email to