Hi.
The dprep library has at least three different methods for outlier
detection: baysout, mahaout, robout.
I wanted to test them on a very simple data set:
vrmat-cbind((1:22),c(8,14,14,17,21,20,27,23,25,33,31,32,30,36,37,40,42,44,52,61,81,265))
As you can see by eyeballing this, the last
Hi, there:
I am wondering what packages are available in R which can do outlier
detection in large-scale dataset.
Thanks for sharing info,
weiwei
--
Weiwei Shi, Ph.D
Did you always know?
No, I did not. But I believed...
---Matrix III
__
Random forest can do the job.
HTH.
On 8/3/05, Weiwei Shi [EMAIL PROTECTED] wrote:
Hi, there:
I am wondering what packages are available in R which can do outlier
detection in large-scale dataset.
Thanks for sharing info,
weiwei
--
Weiwei Shi, Ph.D
Did you always know?
No, I did
Thanks.
I knew rf based on proximity can detect the outlier but when the data
size goes to 1 million and the features go around 200, I guess I
probably do not have enough memory to proceed. Do u have some
experience of outlier detection in this kind of data size?
regards,
weiwei
On 8/3/05,