You are creating a report for user content uploads every month and observe a sudden increase in the number of upload for the month of November. The increase in uploads is particularly in image uploads. What do you think will be the cause for this and how will you test this sudden spike?
353You have a bag with 6 marbles. One marble is white. You reach the bag 100 times. After taking out a marble, it is placed back in the bag. What is the probability of drawing a white marble at least once?
277Suppose that American Express has 1 million card members along with their transaction details. They also have 10,000 restaurants and 1000 food coupons. Suggest a method which can be used to pass the food coupons to users given that some users have already received the food coupons so far.
298You are given a training dataset of users that contain their demographic details, the pages on Facebook they have liked so far and results of psychology test based on their personality i.e. their openness to like FB pages or not. How will you predict the age, gender and other demographics of unseen data?
299Burn two ropes, one needs 60 minutes of time to burn and the other needs 30 minutes of time. How will you achieve this in 45 minutes of time ?
300How can you build and test a metric to compare ranked list of TV shows or Movies for two Netflix users?
523How do you take millions of users with 100's of transactions each, amongst 10000's of products and group the users together in a meaningful segments?
397Post New Data Science Questions
What is chi square test?
What do you mean by graphics devices in data visualization?
What is the central limit theorem and why is it important?
Estimate the number of square feet pizza's eaten in US each year.
You can roll a dice three times. You will be given $X where X is the highest roll you get. You can choose to stop rolling at any time (example, if you roll a 6 on the first roll, you can stop). What is your expected pay-out?
What is the good measure of influence of a Twitter user?
How can you deal with different types of seasonality in time series modelling?
How is true positive rate and recall related? Write the equation.
What is the return type of the function ID?
what is prior probability and likelihood?
Discuss linear regression?
Why we need ggplot?
What is an example of a data set with a non-gaussian distribution?
What are various steps involved in an analytics project?
Which one would you prefer for text analytics python or r?