Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain the processing speed difference between Hadoop and Apache Spark?
Clarify what a task tracker is in hadoop?
What are different modes of metastore deployment in Hive?
What are the similarities and differences between Apache Flume and Apache Kafka?
What are the core apis in kafka?
Replication causes data redundancy then why is is pursued in HDFS?
What is apache spark written in?
What do you understand by schemardd in apache spark rdd?
What is spark code?
Give some points of pig for hadoop ?
What are the different modes in which we can configure/install Hadoop?
Explain cassandra data model?
Why do we need sparkcontext?
Are Cassandra, Hadoop, Hbase and Cassandra are the same in nature? Specify.
What is safe mode in Hadoop?