Re: [MLLib] Logistic Regression and standardization

2018-04-20 Thread Valeriy Avanesov
Hi all. Filipp, do you use L1/L2/elastic-net penalization? I believe in this case standardization matters. Best, Valeriy. On 04/17/2018 11:40 AM, Weichen Xu wrote: Not a bug. When disabling standardization, mllib LR will still do standardization for features, but it will scale the

Re: [MLLib] Logistic Regression and standardization

2018-04-20 Thread Weichen Xu
Right. If the regularization term isn't zero, then enabling or disabling standardization will give different results. But when comparing results between R's glmnet and mllib, if we set the same parameters for regularization, standardization, and so on, then we should get the same result. If not, then maybe there's a
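
For readers following the thread, the point about the regularization term can be illustrated with a small pure-Python sketch. This is not MLlib or glmnet code: the toy data and the helpers `fit_logistic` and `slope_on_original_scale` are hypothetical. It assumes a one-feature model with an unpenalized intercept and an L2 penalty of `(lam / 2) * w**2`, fitted by Newton's method. The slope is fitted once on the raw feature and once on the feature divided by its standard deviation, then mapped back to the original scale.

```python
import math


def fit_logistic(xs, ys, lam, iters=50):
    """Newton's method for one-feature logistic regression with an intercept.

    Minimizes mean log-loss + (lam / 2) * w**2; the intercept is not penalized.
    Returns (intercept, slope).
    """
    b, w = 0.0, 0.0
    n = len(xs)
    for _ in range(iters):
        gb = gw = hbb = hbw = hww = 0.0
        for x, y in zip(xs, ys):
            p = 1.0 / (1.0 + math.exp(-(b + w * x)))
            s = p * (1.0 - p)
            gb += (p - y) / n          # gradient w.r.t. intercept
            gw += (p - y) * x / n      # gradient w.r.t. slope
            hbb += s / n               # Hessian entries
            hbw += s * x / n
            hww += s * x * x / n
        gw += lam * w                  # L2 penalty acts on the slope only
        hww += lam
        det = hbb * hww - hbw * hbw
        # Newton step: subtract H^{-1} g, using the closed-form 2x2 inverse
        b -= (hww * gb - hbw * gw) / det
        w -= (hbb * gw - hbw * gb) / det
    return b, w


def slope_on_original_scale(xs, ys, lam, standardize):
    """Fit on raw or std-scaled features; report the slope in raw units."""
    if not standardize:
        return fit_logistic(xs, ys, lam)[1]
    n = len(xs)
    mean = sum(xs) / n
    sd = math.sqrt(sum((x - mean) ** 2 for x in xs) / n)
    zs = [x / sd for x in xs]
    _, wz = fit_logistic(zs, ys, lam)
    return wz / sd                     # undo the scaling


# Hypothetical, non-separable toy data with a large feature scale
xs = [10.0, 20.0, 30.0, 40.0, 50.0, 60.0, 70.0, 80.0]
ys = [0, 0, 1, 0, 1, 0, 1, 1]

no_reg_raw = slope_on_original_scale(xs, ys, lam=0.0, standardize=False)
no_reg_std = slope_on_original_scale(xs, ys, lam=0.0, standardize=True)
reg_raw = slope_on_original_scale(xs, ys, lam=0.1, standardize=False)
reg_std = slope_on_original_scale(xs, ys, lam=0.1, standardize=True)

print("lam=0.0 raw vs standardized:", no_reg_raw, no_reg_std)
print("lam=0.1 raw vs standardized:", reg_raw, reg_std)
```

Under these assumptions the two unpenalized fits agree up to numerical error, while the penalized fits differ, because the penalty is applied to the coefficient in whichever scale the optimizer sees.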

[SQL] Number of buckets in metrics of FileSourceScanExec?

2018-04-20 Thread Jacek Laskowski
Hi, With bucketing support enabled by default in 2.3, I think that the number of buckets should be included in the metrics of FileSourceScanExec. WDYT? Shall I report an enhancement in JIRA? Regards, Jacek Laskowski https://about.me/JacekLaskowski Mastering Spark SQL