Hi;
    You could use some multivariate outliers detection algorithm. e.g. BACON
algorithm of HADI,2000 is very fast, acurate and you can contact him to get
a copy.
It can handle very large data very fast.
Dr Osama Hussien
Alexandria Univ.
----- Original Message -----
From: "saisat" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Tuesday, February 18, 2003 10:05 PM
Subject: Removing suspicious data


> Hello all,
>
>
> I have a large quantity of data for which i have the mean and Standard
> deviation. But in this data, quite of few of the values are really
> inaccurate. How can I get the "accurate" mean of the above sample
> data.
>
> For example if I have 100, 100, 100, 500 as the sample data
> the mean is 200. But in my case 500 is obviously an erroneous value
> for the data. The actual mean should be close to 100. How can I get
> this value bearing in mind that my sample in quite large (in millions)
> and i need to somehow "remove" these inaccuracies
>
> Thanks
> Sat
> .
> .
> =================================================================
> Instructions for joining and leaving this list, remarks about the
> problem of INAPPROPRIATE MESSAGES, and archives are available at:
> .                  http://jse.stat.ncsu.edu/                    .
> =================================================================
>


.
.
=================================================================
Instructions for joining and leaving this list, remarks about the
problem of INAPPROPRIATE MESSAGES, and archives are available at:
.                  http://jse.stat.ncsu.edu/                    .
=================================================================

Reply via email to