Big Data Interview Questions
Questions Answers Views Company eMail

What is difference between hive and hdfs?

385

What is skew data in hive?

430

Is kafka an etl tool?

259

What language is apache kafka written in?

282

What is zookeeper server?

1

What is the difference between map and reduce?

348

What is optimal size of a file for distributed cache?

374

What can skew the mean?

187

What is vectorized query execution?

216

What is map side join?

187

What does dag stand for?

199

What is data ingestion pipeline?

185

What is the difference between reducebykey and groupbykey?

201

What is data skew and how do you fix it?

212

Is databricks a database?

212


Un-Answered Questions { Big Data }

List various commonly used machine learning algorithm?

192


Can I do insert … select * into a partitioned table?

34


Why is Kafka technology significant to use?

308


How hbase handles the write failure?

164


What is the default level of parallelism in apache spark?

235






Differentiate between static and dynamic cql tables.

47


What is a udf?

224


What Mapper does?

666


Explain hbasestorage function?

311


What is rdd map?

205


Is it necessary to learn hadoop for spark?

185


What is the key- value pair in MapReduce?

384


What is dataframe api?

202


Is apache spark in demand?

181


What is use of tools command?

121