Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What are nodes and ephemeral nodes?
What is a single point of failure in Hadoop 1 and how is it resolved in Hadoop 2?
How does spark work with python?
Is Namenode machine same as DataNode machine as in terms of hardware in Hadoop?
Explain about the major libraries that constitute the Spark Ecosystem?
What are partitions and tokens in cassandra?
How to insert records in apache tajo?
Is there a dual table?
Is hadoop obsolete?
What is an "Accumulator"?
How does cassandra perform write operations?
How many Reducers run for a MapReduce job?
In which directory hadoop is installed?
What is catalyst framework in spark?
Explain what is jobtracker in hadoop? What are the actions followed by hadoop?