Consider the (relative distance) K-means scheme for outlier detection de- scribed in Section 10.5 and the accompanying figure, Figure 10.10.

(a) The points at the bottom of the compact cluster shown in Figure 10.10
have a somewhat higher outlier score than those points at the top of
the compact cluster. Why?
(a) The points at the bottom of the compact cluster shown in Figure 10.10
have a somewhat higher outlier score than those points at the top of
the compact cluster. Why?
(c) The use of relative distance adjusts for differences in density. Give an
example of where such an approach might lead to the wrong conclusion.


(a) The mean of the points is pulled somewhat upward from the center of
the compact cluster by point D.
(b) No. This point would become a cluster by itself.
(c) If absolute distances are important. For example, consider heart rate
monitors for patients. If the heart rate goes above or below a specified
range of values, then this has an physical meaning. It would be incorrect
not to identify any patient outside that range as abnormal, even though
there may be a group of patients that are relatively similar to each other
and all have abnormal heart rates.

Computer Science & Information Technology

You might also like to view...

Information displayed at the top of every page that appears only when you print the form by default is part of the ____.

A. Form header B. Form footer C. Page header D. Page footer

Computer Science & Information Technology

The ________ function retrieves the current system date and time

A) DatePart B) Date C) DateSerial D) Now

Computer Science & Information Technology

The ________ attribute aligns the content of a cell vertically

Fill in the blank(s) with correct word

Computer Science & Information Technology

Which of the following is a way to prevent intrusions?

a. Directed broadcasts b. Access lists c. no ip directed-broadcast command d. Both b and c

Computer Science & Information Technology