Re: [Scikit-learn-general] The scale_C fiasco

bthirion Mon, 30 Apr 2012 05:41:02 -0700

On 04/30/2012 02:30 PM, Alexandre Gramfort wrote:
>> But you're not breaking independence here: you're drawing iid from a
>> finite population. However, as pointed by Olivier, this may create some
>> artefact when used with some classifiers.
>> (the scores across folds are not independent but this is true for most
>> cv techniques, and this is another matter anyway).
> if you look at my tests on scale_C you'll see that I test that if you 
> duplicate
> each sample and C is fixed, you don't change the solution. It seemed at this
> time a very valid concern and a good argument for having the scale_C.
> I do believe now that it's not a valid argument.
yes, bootstrap is reallt different
> When you bootstrap with
> replacement then you're in the "middle“ as you have duplicated samples.
>
> does it make sense?
Not sure about what you consider the "middle" but it might. (a topic for 
next coffee break)


@Olivier: no good reference in mind.

B

------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Re: [Scikit-learn-general] The scale_C fiasco

Reply via email to