What is the supervised learning?
What are Random Forests?
When do you need to update the algorithm in data science?
What is meant by selection bias?
Case Study based questions - Cars are implanted with speed tracker so that the insurance companies can track about our driving state. Based on this new scheme what kind of business questions can be answered?
If you had to choose between the programming languages r and python, which one would you use for text analytics?
Can you use machine learning for time series analysis?
How will inspect missing data and when are they important for your analysis?
Explain the term "data warehouse".
Now companies are heavily investing their money and time to make the dashboards. Why?
Differentiate between skewed and uniform distribution?
How do you use sql in sas, python, r languages?
What is random forests?
How do you treat missing values during analysis?
What are outlier values?