Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is driver and executor in spark?
What do you understand by an inner bag and outer bag in Pig?
What is the maximum recommended cell size?
What are the key differences between cassandra and traditional rdbms?
What square measure the options of apache mahout?
What are the tools you need to build Ambari?
What are the parameters of mappers and reducers?
Can you explain commodity hardware?
What is the difference between an RDBMS and Hadoop?
What is spark execution engine?
What do you mean by a bag in Pig?
Explain about trformations and actions in the context of rdds?
What is Partition table in Hive?
Should the region server be located on all DataNodes?
Explain about transformations and actions in the context of RDDs.