Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is throughput? How does HDFS provide good throughput?
Name some features of Apache Cassandra?
What is difference between coalesce and repartition?
What is apache spark architecture?
What a task tracker is in hadoop?
Explain Hadoop Archives?
Suppose Hadoop spawned 100 tasks for a job and one of the task failed. What will Hadoop do?
Explain the core benefits for hadoop users by using the apache ambari?
Explain the role of offset in kafka?
What is pseudo-distributed mode?
What are the parameters of mappers and reducers?
What is difference between secondary namenode, checkpoint namenode & backupnode?
Mention what happens if the preferred replica is not in the ISR?
Describe Accumulator in detail in Apache Spark?
What is Hadoop Distributed File System- HDFS?