[Scikit-learn-general] failure in SGD classifier for large range in x

Jacob VanderPlas Tue, 22 May 2012 13:13:41 -0700

Hi,
I've been playing with the SGD classifier (using the current master 
branch), and found that when there is a large range in x, the classifier 
fails.  For example, when running the example script found here:
  http://scikit-learn.org/0.10/auto_examples/linear_model/plot_sgd_ols.html
If I replace the line:
   xmin, xmax = -5, 5
with:
   xmin, xmax = 0, 100
then the regression fails (full traceback copied below).  If the range 
is set to (0, 50), the call does not produce an error, but leads to a 
line which clearly does not fit the data (intercept on order -2E30).  
Any ideas about what is going on here?
   Jake


 
"/usr/local/lib/python2.6/dist-packages/scikit_learn-0.12_git-py2.6-linux-i686.egg/sklearn/linear_model/stochastic_gradient.py",
 
line 737, in fit
    coef_init, intercept_init)
  File 
"/usr/local/lib/python2.6/dist-packages/scikit_learn-0.12_git-py2.6-linux-i686.egg/sklearn/linear_model/stochastic_gradient.py",
 
line 672, in _partial_fit
    self._fit_regressor(X, y, sample_weight, n_iter)
  File 
"/usr/local/lib/python2.6/dist-packages/scikit_learn-0.12_git-py2.6-linux-i686.egg/sklearn/linear_model/stochastic_gradient.py",
 
line 786, in _fit_regressor
    intercept_decay)
  File "sgd_fast.pyx", line 423, in 
sklearn.linear_model.sgd_fast.plain_sgd 
(sklearn/linear_model/sgd_fast.c:6713)
ValueError: floating-point under-/overflow occured.


------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

[Scikit-learn-general] failure in SGD classifier for large range in x

Reply via email to