[Scikit-learn-general] What to do when the user's Gram matrix is likely wrong?

Vlad Niculae Tue, 26 Jun 2012 10:37:06 -0700

This is a consistency question. I found that enet_path has a clever behaviour 
for this:


https://github.com/scikit-learn/scikit-learn/tree/master/sklearn/linear_model/coordinate_descent.py#L561

The logic here is:
> if center_data changes X, then X wasn't centered.
> If this is the case, and the user supplied a precomputed Gram (and maybe an 
> Xy),
> then we can assume that the user is wrong.
>
> If the user is actually not wrong, that would mean that he took the effort to
> build the Gram matrix from a centered, standardized X, but then passed the 
> uncentered
> X to the enet_path method. This is unlikely.

The problem is that LassoLars for example 
(https://github.com/scikit-learn/scikit-learn/blob/master/sklearn/linear_model/least_angle.py#L427),
 and probably many others don't behave like this.

I find it a bit impolite to ignore the Gram parameter that a user supplies, 
just because it's wrong, even though it's helpful. I see a couple of clean ways 
out of this:

1. Always use the Gram, if the user passed it. Under the assumption that a user 
that passes his own Gram is in pretty deep swamps already, and he knows what 
he's doing.

2. Discard the Gram everywhere in this situation, but also notify the user.

2'. Maybe pass the Gram and the Xy to the _center_data method so we only have 
one point of entry? And then, maybe we can code a clever way to update the Gram 
too, instead of recomputing? (is it possible)

Cheers,
------------------
Vlad N.
http://vene.ro





------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Scikit-learn-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

[Scikit-learn-general] What to do when the user's Gram matrix is likely wrong?

Reply via email to