What is logistic and linear regression? How do you treat multicollinearity and heteroscedasticity in regression? Name some packages in r and python for building regression models.
274What is nosql? Name some examples of nosql databases. What is a key value store? What is column storage? What is a document database?
263What is the central limit theorem? How is a normal distribution different from chi square distribution?
305Post New Data Science Questions
What is logistic and linear regression? How do you treat multicollinearity and heteroscedasticity in regression? Name some packages in r and python for building regression models.
Explain Data Science Vs Machine Learning?
How do you check for data quality?
Which method in pandas.tools.plotting is used to create scatter plot matrix?
Differentiate between type I and type ii error?
What skills do you need to be a data analyst?
What are the factors used to produce "People You May Know" data product on LinkedIn?
How do you do for loops in python and r?
A certain couple tells you that they have two children, at least one of whom is a girl. What is the probability that they have two girls?
How can you solve a problem that has no solution?
Why is Python used in data science?
Explain the difference between data science and data analytics?
What is the difference between Stack and Queue
Describe the data analysis process.
In k-means or knn, we use euclidean distance to calculate the distance between nearest neighbors. Why not manhattan distance?