Data Science Interview Questions
Questions Answers Views Company eMail

What do you mean by p-value?

174

What is the 80/20 rule?

193

How decision tree algorithm is different from the random forest algorithm?

202

What is the roc curve?

207

What is the aim of a/b testing?

152

Define naive bayes?

171

How is data science different from data analytics?

199

What is the 80/20 rule? Explain its importance in model validation?

175

Explain box cox transformation?

240

Explain the decision tree algorithm, and how is it different from the random forest algorithm?

177

What is the svm algorithm?

202

Explain the importance of 80/20 rule in model validation?

200

Explain the term "data warehouse".

167

What do you understand by ensemble learning?

196

What do you understand by confusion matrix?

185


Post New Data Science Questions

Un-Answered Questions { Data Science }

What are numpy, scipy, and spark essential datatypes?

189


How to solve multi-collinearity?

229


Write a function that takes in two sorted lists and outputs a sorted list that is their union?

245


Explain each data visualization in detail?

193


Explain principal components analysis with equations.

185


Define linear regression?

188


How do you take millions of users with 100's of transactions each, amongst 10000's of products and group the users together in a meaningful segments?

286


What is the role of activation function?

249


You are given 50 cards with five different colors- 10 Green cards, 10 Red Cards, 10 Orange Cards, 10 Blue cards, and 10 Yellow cards. The cards of each colors are numbered from one to ten. Two cards are picked at random. Find out the probability that the cards picked are not of same number and same color.

317


What is Survival Analysis?

201


Define a sql query? What is the difference between select and update query?

220


You can roll a dice three times. You will be given $X where X is the highest roll you get. You can choose to stop rolling at any time (example, if you roll a 6 on the first roll, you can stop). What is your expected pay-out?

198


Which is the best suitable language among python and r for text analytics?

182


In k-means or knn, we use euclidean distance to calculate the distance between nearest neighbors. Why not manhattan distance?

267


Explain how to write a table to a file?

224