What is the procedure to check the cumulative frequency distribution of any categorical variable?
What are recommender systems?
When do you need to update the algorithm in data science?
How do you build a custom function in python or r?
What are the types of business decisions?
What is logistic and linear regression? Name some packages in r and python for building regression models.
How do you treat outlier data?
What is a z test?
What is meant by Random Forest Classifier?
What is an Eigenvalue and Eigenvector?
You are given two tables- friend_request and request_accepted. Friend_request contains requester_id, time and sent_to_id and request_accepted table contains time, acceptor_id and requestor_id. How will you determine the overall acceptance rate of requests?
How is bagging different from boosting?
Explain the differences between bivariate and multivariate analysis?
What is the goal of the clustering?
Could you draw a comparison between overfitting and underfitting?