What is an RDBMS? Name some examples of RDBMSs. What is CRUD?
How do data management procedures like missing data handling make selection bias worse?
Explain the two main components of the Hadoop framework.
What is data science? Give an example.
What factors are used to produce the "People You May Know" data product on LinkedIn?
What does __init__.py do?
What is the science of data in simple words?
How do you use Goal Seek in Excel?
What are some common time series algorithms?
In k-means or KNN, we use Euclidean distance to calculate the distance between nearest neighbors. Why not Manhattan distance?
What are the variants of backpropagation?
You have two dice. What is the probability of getting at least one 4? Also find the probability of getting at least one 4 if you have n dice.
What is an outlier?
Why do you need a for loop? How do you write for loops in Python and R?
Explain data preparation.