If a highly positively skewed variable has missing values and we replace them with mean, do we underestimate or overestimate the values?
What kind problems are solved by regularization?
Why is harmonic mean used to calculate f1 score and not the arithmetic mean?
What is the baseline in machine learning?
Why is it important for the royal society to be doing a project about machine learning?
You are given a data set. The data set has missing values which spread along 1 standard deviation from the median. What percentage of data would remain unaffected? Why ?
What is naive bayes classifier?
What is precision and recall?
What is the convex hull?
What are standardization and normalisation? Give one advantage of each over the other?
Why is manhattan distance not used in knn machine learning algorithm to calculate the distance between nearest neighbors?
What is the batch in machine learning?
How to choose notable variables while working on a data set?
How will you know which machine learning algorithm to choose for your classification problem?
What is supervised and unsupervised machine learning?