How can you check if a data set or time series is random?
How to create survival object in R?
Define a sql query? How do you use sql in sas, python, r languages?
Please explain gradient descent?
What do you understand by parametric and non-parametric methods? Explain with examples.
Explain supervised learning?
Why is resampling done?
What is logistic and linear regression? How do you treat multicollinearity and heteroscedasticity in regression?
What is missing value imputation?
What is the difference between a cluster and systematic sampling?
You have 2 dices. What is the probability of getting at least one 4? Also find out the probability of getting at least one 4 if you have n dices.
What is a z test, chi square test, f test and t test?
Explain why data cleansing is essential and which method you use to maintain clean data?
What cross validation technique would you use on time series data set? Is it k-fold or loocv?
How will you test that there is increased probability of a user to stay active after 6 months given that a user has more friends now?