Re: ML Linear and Logistic Regression - Poor Performance

2018-05-02 Thread Irving Duran
May want to think about reducing the number of iterations. Right now you have it set at 500. Thank You, Irving Duran On Fri, Apr 27, 2018 at 7:15 PM Thodoris Zois wrote: > I am in CentOS 7 and I use Spark 2.3.0. Below I have posted my code. > Logistic regression took 85

Re: ML Linear and Logistic Regression - Poor Performance

2018-04-27 Thread Thodoris Zois
I am in CentOS 7 and I use Spark 2.3.0. Below I have posted my code. Logistic regression took 85 minutes and linear regression 127 seconds… My dataset as I said is 128 MB and contains: 1000 features and ~100 classes. #SparkSession ss = SparkSession.builder.getOrCreate() start = time.time()

Re: ML Linear and Logistic Regression - Poor Performance

2018-04-27 Thread Irving Duran
Are you reformatting the data correctly for logistic (meaning 0 & 1's) before modeling? What are OS and spark version you using? Thank You, Irving Duran On Fri, Apr 27, 2018 at 2:34 PM Thodoris Zois wrote: > Hello, > > I am running an experiment to test logistic and

ML Linear and Logistic Regression - Poor Performance

2018-04-27 Thread Thodoris Zois
Hello, I am running an experiment to test logistic and linear regression on spark using MLlib. My dataset is only 128MB and something weird happens. Linear regression takes about 127 seconds either with 1 or 500 iterations. On the other hand, logistic regression most of the times does not