Hi,
I am writing a fault detection system using OCSVM.
I start with a huge matrix shape=(1006, 300000) and  I reduce its dimension
with IncrementalPCA.

If I use 10 - 100 pca components I get a very good AUC score around 0.97
while with 1000 components it drops to 0.5.

Is it possible that incremental PCA becomes unstable when too many
components are used?
I can not find another explanation for the drop in performances. As far as
I know OCSVM should be able to scale well to high dimensional datasets.

The code is here http://jpst.it/CpqB

Let me know.
Thanks!
-- 

Sent by mobile phone
------------------------------------------------------------------------------
_______________________________________________
Scikit-learn-general mailing list
Scikit-learn-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/scikit-learn-general

Reply via email to