How can you generate a random number between 1 and 7 using only a die?
Explain the benefits of statistics for data scientists.
While working on a data set, how can you select important variables? Explain.
What is cross-validation?
How can you measure engagement with given Twitter data?
How can you compute a matrix inverse faster by using computational tricks?
Which language is best for text analytics: R or Python?
How would you design a heatmap that recommends where Uber drivers should wait for passengers? How would you approach this?
Why do we need ggplot?
How are confidence intervals constructed and how will you interpret them?
What exactly does a data scientist do?
A test has a true positive rate of 100% and a false-positive rate of 5%. There is a population with a 1/1000 rate of having the condition the test identifies. Considering a positive test, what is the probability of having that condition?
There are 8 balls that look identical, but one of the balls is slightly heavier than the others. You are given a balance scale to find the heavier ball. What is the least number of times you have to use the balance scale to find the heavier ball?
Which technique will you use to compare the performance of two back-end engines that generate automatic friend recommendations on Facebook?
Estimate the probability of a disease in a particular city given that the probability of the disease on a national level is low.
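
For the question about generating a random number between 1 and 7 with a die, one common approach is rejection sampling. The sketch below assumes a fair six-sided die and that "random" means uniform over the integers 1 to 7; the names roll_die and random_1_to_7 are illustrative only. Two rolls give 36 equally likely outcomes; discarding one of them leaves 35, which divides evenly into 7 groups of 5.

```python
import random
from collections import Counter


def roll_die(rng):
    """Simulate one roll of a fair six-sided die (assumption: the die is fair)."""
    return rng.randint(1, 6)


def random_1_to_7(rng):
    """Return a uniform integer in 1..7 using only die rolls."""
    while True:
        # Two rolls combine into a uniform value on 1..36.
        value = 6 * (roll_die(rng) - 1) + roll_die(rng)
        # Keep 1..35 (35 = 7 * 5) and re-roll on 36 so every result is equally likely.
        if value <= 35:
            return (value - 1) % 7 + 1


rng = random.Random(0)  # fixed seed so the check is repeatable
counts = Counter(random_1_to_7(rng) for _ in range(70_000))
for face in sorted(counts):
    print(face, counts[face])
```

Each value should appear roughly 10,000 times. The loop re-rolls with probability 1/36, so on average slightly more than two rolls are needed per number.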
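
For the diagnostic-test question (true positive rate 100%, false-positive rate 5%, prevalence 1/1000), Bayes' theorem gives the answer directly. The following is a minimal sketch using exact fractions; the function name posterior_given_positive is a hypothetical helper, and the numbers are the ones stated in the question.

```python
from fractions import Fraction


def posterior_given_positive(prevalence, true_positive_rate, false_positive_rate):
    """P(condition | positive test) by Bayes' theorem."""
    # Probability of a positive test from people who have the condition.
    true_positives = true_positive_rate * prevalence
    # Probability of a positive test from people who do not have it.
    false_positives = false_positive_rate * (1 - prevalence)
    return true_positives / (true_positives + false_positives)


posterior = posterior_given_positive(
    prevalence=Fraction(1, 1000),
    true_positive_rate=Fraction(1, 1),
    false_positive_rate=Fraction(5, 100),
)
print(posterior, float(posterior))
```

The result is 20/1019, a little under 2%. Although the test never misses a true case, the false positives from the much larger healthy population far outnumber the true positives.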
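
For the puzzle with 8 balls and a balance scale, one weighing has only three outcomes (left heavier, right heavier, balanced), so it cannot separate 8 candidates, while two weighings give up to 9 outcomes. The sketch below is one possible two-weighing strategy, checked against every position of the heavy ball; the Scale class and find_heavy function are illustrative names, not part of the original question.

```python
class Scale:
    """A balance scale that counts how many times it is used."""

    def __init__(self, weights):
        self.weights = weights
        self.uses = 0

    def weigh(self, left, right):
        """Return 1 if the left pan is heavier, -1 if the right is, 0 if balanced."""
        self.uses += 1
        left_total = sum(self.weights[i] for i in left)
        right_total = sum(self.weights[i] for i in right)
        return (left_total > right_total) - (left_total < right_total)


def find_heavy(scale):
    """Locate the heavy ball among indices 0..7 in exactly two weighings."""
    # First weighing: three balls against three, two left aside.
    first = scale.weigh([0, 1, 2], [3, 4, 5])
    if first == 0:
        # Balanced, so the heavy ball is one of the two set aside.
        return 6 if scale.weigh([6], [7]) > 0 else 7
    group = [0, 1, 2] if first > 0 else [3, 4, 5]
    # Second weighing: two balls from the heavier group, one left aside.
    second = scale.weigh([group[0]], [group[1]])
    if second == 0:
        return group[2]
    return group[0] if second > 0 else group[1]


for heavy in range(8):
    weights = [10] * 8
    weights[heavy] = 11  # assumed weights; only the difference matters
    scale = Scale(weights)
    print(heavy, find_heavy(scale), scale.uses)
```

Every case is resolved in two uses of the scale, which matches the lower bound from counting outcomes.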