Microsoft interview question

How to measure distance between data point?

Interview Answers

Anonymous

29 Mar 2016

It depends on whether the data point contains numeric, categorical or mixed data, and also on the purpose of the distance function. For numeric, Euclidian is simple, but cosine is robust to scaling and dtw is good for time series. For categorical, a basic one will be Jaccard similarity, which local sensitive hashing might apply.

5

Anonymous

12 June 2015

Euclidean distance, manhattan distance

1