In k-means or kNN, we use Euclidean distance to calculate the distance between nearest neighbors. Why not Manhattan distance?
What is a document database?
Briefly explain the common methods for calculating distance.
What is the difference between Bayesian estimation and maximum likelihood estimation (MLE)?
How can you select k for k-means?
Why is data cleaning important for analysis?
How can you compute the inverse of a matrix faster using computational tricks?
You have two tables: the first has data about users and their friends, and the second has data about users and the pages they have liked. Write an SQL query that recommends pages to a user based on the pages their friends have liked. The result should not recommend pages that the user has already liked.
What is the common perception of visualization?
What is pylab?
How would you prove that the square root of 2 is irrational?
What is an RDBMS? Name some examples of RDBMSs. What is CRUD?
What is A/B testing in data science?
How is data science different from data analytics?
Please explain the role of data cleaning in data analysis.