Data Science Interview Questions
Questions Answers Views Company eMail

What will you do if removing missing values from a dataset cause bias?

Airbnb,

195

How can you reduce bias in a given data set?

Airbnb,

191

How will you impute missing information in a dataset?

Airbnb,

193

Estimate the probability of a disease in a particular city given that the probability of the disease on a national level is low.

Amazon,

187

How will inspect missing data and when are they important for your analysis?

Amazon,

222

How will you decide whether a customer will buy a product today or not given the income of the customer, location where the customer lives, profession and gender? Define a machine learning algorithm for this.

Amazon,

165

From a long sorted list and a short 4 element sorted list, which algorithm will you use to search the long sorted list for 4 elements.

Amazon,

196

How can you compare a neural network that has one layer, one input and output to a logistic regression model?

Amazon,

176

How do you treat colinearity?

Amazon,

175

How will you deal with unbalanced data where the ratio of negative and positive is huge?

Amazon,

174

What is the difference between Stack and Queue

Amazon,

210

What is the difference between Linkedin and Array

Amazon,

190

You are about to get on a plane to Seattle, you want to know whether you have to bring an umbrella or not. You call three of your random friends and as each one of them if it's raining. The probability that your friend is telling the truth is 2/3 and the probability that they are playing a prank on you by lying is 1/3. If all 3 of them tell that it is raining, then what is the probability that it is actually raining in Seattle.

Facebook,

176

You have been given the data on Facebook user's friending or defriending each other. How will you determine whether a given pair of Facebook users are friends or not?

Facebook,

193

Estimate the number of square feet pizza's eaten in US each year.

Goldman Sachs,

190


Post New Data Science Questions

Un-Answered Questions { Data Science }

Name three disadvantages of using a linear model?

178


Which methods are defined for a class of iterators?

189


What is back propagation?

148


What is collaborative filtering?

167


What is systematic sampling?

171


What is data science? How would you say it is similar or different to business analytics and business intelligence?

163


Write a program to segment a long string into a group of valid words using Dictionary. The result should return false if the string cannot be segmented. Also explain about the complexity of the devised solution.

282


How do you treat multicollinearity and heteroscedasticity in regression? Name some packages in r and python for building regression models.

182


What makes the difference between “long” and “wide” format data?

162


Write a function to check whether a particular word is a palindrome or not.

166


What prior knowledge is required to become data scientist?

181


Explain difference between binomial and Poisson distribution?

177


Pycharm has a debugger?

217


Where is association analysis used?

208


What is the importance of selection bias?

142