Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the bookkeeper elements and concepts?
Explain the benefits of block transfer?
What is a "Spark Driver"?
What is the number of default partitioner in hadoop?
How many ways we can create rdd in spark?
What is the maximum recommended cell size?
What is Sparse Vector?
What do you understand by data center in cassandra?
When is the reducers are started in a MapReduce job?
What is spark databricks?
What is the row key?
What size is recommended for each node?
When to use –target-dir and when to use –warehouse-dir while importing data?
On which port does ssh work?
Is hadoop mandatory for spark?