Hi all - 

I am relatively new to machine learning, and I am having a little 
difficulty interpreting the output of a support vector regression problem. 
For simplicity, let's say I have 2 variables and 100 subjects.  Both variables 
in my model are continuous.

To make matters a little more complicated, the data were collected at four 
"sites", and I want to "leave one label out", with labels corresponding to 
sites, in order to assess whether site has an influence on the predictive 
model.
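
To make the splitting scheme concrete, here is a minimal plain-Python sketch 
of what "leave one label out" means: each site is held out once while the 
remaining sites form the training set. The helper name leave_one_label_out 
and the site labels are made up for illustration; this is not the library's 
implementation.

```python
def leave_one_label_out(labels):
    """Yield (held_out_label, train_idx, test_idx), one split per distinct label."""
    for held_out in sorted(set(labels)):
        # Every subject from the held-out site goes to the test set ...
        test_idx = [i for i, lab in enumerate(labels) if lab == held_out]
        # ... and every subject from the other sites goes to the training set.
        train_idx = [i for i, lab in enumerate(labels) if lab != held_out]
        yield held_out, train_idx, test_idx


# Made-up site labels for 8 subjects at 4 sites (illustrative only).
site_labels = [1, 1, 2, 2, 3, 3, 4, 4]
splits = list(leave_one_label_out(site_labels))
for held_out, train_idx, test_idx in splits:
    print("held-out site %s: train=%s test=%s" % (held_out, train_idx, test_idx))
```

With four sites this gives four splits, so a loop over the splits produces 
four sets of scores, one per held-out site.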

Here is the code so far. 



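from sklearn import svm, cross_validation
from sklearn.cross_validation import LeaveOneLabelOut

# X, y: NumPy arrays of predictors and outcome; labels: site of each subject
lolo = LeaveOneLabelOut(labels)

for train_index, test_index in lolo:
    # Train on three sites and hold out the fourth
    X_train, X_test = X[train_index], X[test_index]
    y_train, y_test = y[train_index], y[test_index]

    clf = svm.SVR()
    clf = clf.fit(X_train, y_train)
    # Score of the fitted model on the held-out site
    s = clf.score(X_test, y_test)
    print(s)

    # Separate cross-validation run within the held-out site only
    scores = cross_validation.cross_val_score(clf, X_test, y_test)

    print("Accuracy: %0.2f (+/- %0.2f)" % (scores.mean(), scores.std() / 2))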


It produces the following output:
0.0343889480748
Accuracy: -0.05 (+/- 0.05)
-0.0786771792262
Accuracy: -0.25 (+/- 0.07)
-0.0871562121791
Accuracy: -0.12 (+/- 0.05)
-0.0496675695436
Accuracy: -0.16 (+/- 0.03)



        


Could someone help me interpret the substantive meaning of the 'score' in an 
SVR problem?  Thanks in advance.
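
For reference, the scikit-learn documentation describes the score method of 
a regressor such as SVR as the coefficient of determination, R^2 = 1 - 
SS_res / SS_tot, and cross_val_score falls back on that same method when no 
other scoring is specified. The sketch below computes that quantity in plain 
Python on made-up numbers; the helper name r2_manual is hypothetical and not 
part of the library.

```python
def r2_manual(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / float(len(y_true))
    # Squared error of the model's predictions
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Squared error of always predicting the mean of y_true
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


y_true = [1.0, 2.0, 3.0, 4.0]                      # made-up targets
perfect = r2_manual(y_true, y_true)                # exact predictions
mean_only = r2_manual(y_true, [2.5] * 4)           # always predict the mean
worse = r2_manual(y_true, [4.0, 3.0, 2.0, 1.0])    # reversed predictions

print(perfect, mean_only, worse)
```

Under that reading, 1 is a perfect fit, 0 is no better than always predicting 
the mean of the test targets, and negative values are worse than that 
baseline. Scores near or below zero, like those above, would then suggest the 
model explains essentially none of the variance at the held-out site.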



