Hadoop Interview Questions
Questions Answers Views Company eMail

What is pair rdd?

208

What is data pipeline in spark?

203

What is a spark rdd?

223

What are the optimization techniques in spark?

184

Can you run spark on windows?

201

Why is spark good?

199

Do I need to know hadoop to learn spark?

206

Is a distributed machine learning framework on top of spark?

196

How does hadoop achieve fault tolerance?

220

Is hadoop still in demand?

224

What is winutils hadoop?

239

Is hive a nosql database?

383

Is hive similar to sql?

428

What is difference between hive and hdfs?

387

What is skew data in hive?

432


Un-Answered Questions { Hadoop }

What are the 2 modes used to run pig scripts?

295


What is the default value of map and reduce max attempts?

657


Is it possible to iterate through the rows of HBase table in reverse order?

155


What are the different types of Znodes?

697


Why scala is used in spark?

204






Mention some important components of cassandra data models?

45


What is hinted handoff?

56


what Hive is composed of ?

392


Explain the hdfs architecture and list the various hdfs daemons in hdfs cluster?

30


What are the features of apache cassandra?

46


What are the three layers where the hadoop components are actually supported by ambari?

51


What are the data manipulation commands of hbase?

141


What is the use of “resultset execute” method?

70


Mention some use cases of apache mahout?

41


Do I need to learn scala for spark?

184