Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Can the balancer be run while Hadoop is in use?
Explain the input and output data formats of the Hadoop framework.
What are the executor and the driver in Spark?
What are the methods in the Mapper interface?
What is a graph database? Explain with an example.
List some use cases of Apache Kafka.
Can any Impala query also be executed in Hive?
What does the file hadoop-metrics.properties do?
What is Scala Spark?
Which file controls reporting in Hadoop?
What is the sequence file input format?
Explain how to tune Kafka for optimal performance.
Can you explain Spark Streaming?
What happens if a block in HDFS is corrupted?
Describe the different consistency levels for read operations in Cassandra.