What are big data and data science?
Different between overfitting and underfitting?
Is macbook good for data science?
How many big Macs does McDonald sell every year in US?
What are the feature vectors?
What is nosql? Name some examples of nosql databases. What is a key value store? What is column storage? What is a document database?
Explain data cleansing?
Who can be scientific data there are three general steps to becoming a data scientist?
Could you draw a comparison between overfitting and underfitting?
Explain how to define the number of clusters in a clustering algorithm?
What will be your expected earnings with the two roll strategy?
Explain OLS in brief?
Pick up a coin C1 given C1+C2 with probability of trials p (h1) =.7, p (h2) =.6 and doing 10 trials. And what is the probability that the given coin you picked is C1 given you have 7 heads and 3 tails?
What does a data scientist do?
What is the best programming language to use in data science?