[scikit-learn] Rapid Outlier Detection via Sampling

Orges Leka Sat, 25 Nov 2017 10:43:36 -0800

Dear scikit-learn Developers,

My Name is Orges Leka and I would like to implement
"Rapid Outlier Detection via Sampling" [1] in scikit-learn.
In R this method is already available [2] by the authors of the method.


In Python I have not seen any implementation yet. The method is very simple
yet effective as the authors show. First one selects say 20 points. Then
computes the shortest distance of all other points to these 20 points. This
is the outlier-score for one specific point.

It would be nice to implement this with different metrics / distances
(euclid, manhattan or other metrics) .

How would I start the implementation? I have already git-cloned
scikit-learn on my pc. Do I need to write object oriented or are functions
also ok?

If this succeeds, I would also like to extend the "example-outliers" doc
with the above method.

Kind regards
Dipl. Math. Orges Leka

[1]
https://papers.nips.cc/paper/5127-rapid-distance-based-outlier-detection-via-sampling.pdf
[2] https://github.com/mahito-sugiyama/sampling-outlier-detection

_______________________________________________
scikit-learn mailing list
scikit-learn@python.org
https://mail.python.org/mailman/listinfo/scikit-learn

[scikit-learn] Rapid Outlier Detection via Sampling

Reply via email to