Publication Date
8-2016
Abstract
Traditional approach to eliminating outliers is that we compute the sample mean μ and the sample standard deviation σ, and then, for an appropriate value k0 = 2, 3, 6, etc., we eliminate all data points outside the interval [μ − k0 * σ, μ + k0 * σ] as outliers. Then, we repeat this procedure with the remaining data, eliminate new outliers, etc., until on some iteration, no new outliers are eliminated. In many applications, this procedure works well. However, in this paper, we provide a realistic example in which this procedure, instead of eliminating all outliers and leaving adequate data points intact, eliminates all the data points. This example shows that one needs to be careful when applying the standard outlier-eliminating procedure.
Comments
Technical Report: UTEP-CS-16-54