Would the cosine measure be the appropriate similarity measure to use with K-means clustering for time series data? Why or why not? If not, what similarity measure would be more appropriate?

What will be an ideal response?


Time series data is dense high-dimensional data, and thus, the cosine measure
would not be appropriate since the cosine measure is appropriate for sparse
data. If the magnitude of a time series is important, then Euclidean distance
would be appropriate. If only the shapes of the time series are important,
then correlation would be appropriate. Note that if the comparison of the
time series needs to take in account that one time series might lead or lag
another or only be related to another during specific time periods, then more
sophisticated approaches to modeling time series similarity must be used.

Computer Science & Information Technology

You might also like to view...

Use of DHCPv6 for address assignment will result in a locally unique host identifier that changes when you move to a different network.

Answer the following statement true (T) or false (F)

Computer Science & Information Technology

When editing a hyperlink, it is enclosed in ________, which Access uses to activate the link when clicked

A) quotation marks (") B) asterisks (*) C) ampersands (&) D) pound signs (#)

Computer Science & Information Technology

A ____ is a collection of Web pages that are linked logically, and located together on a hosting computer.

a. site map b. Web site c. Web gallery d. Web portfolio

Computer Science & Information Technology

The read-only property cannot be set while the file is:

A) open. B) deleted. C) saved. D) read.

Computer Science & Information Technology