Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Define the level of parallelism and its need in spark streaming?
State some disadvantages of impala?
Who uses Cassandra?
Define SSTable?
How Facebook Uses Hadoop, Hive and Hbase ?
Tell any two feature Flume?
What is the difference between coalesce and repartition in spark?
Explain what is a column family in cassandra?
What is difference between a MapReduce InputSplit and HDFS block
What are Guarantees provided by Kafka?
Hadoop sqoop is which type of tool?
What is pagerank in graphx?
What is Replication Factor in Cassandra?
Explain the terms memtable, commitlog and sstables.
Does spark need hadoop?