Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is NameNode and DataNode in HDFS?
Elaborate on cassandra - cql?
Can we change the file cached by distributed cache
what is hadoop archive?
Explain InputSplit in Hadoop?
Explain what is cassandra?
What are the different parts of Hive ?
How can we see all the hosts that are available in Ambari?
How do I know if flume agent is running?
what is gossip protocol?
What do you understand by standalone (or local) mode?
What does reduce action do?
What is the communication channel between client and namenode/datanode?
What is the need of key-value pair to process the data in MapReduce?
how will you implement SQL in Spark?