Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are Flume core components?
What are the execution modes in the apache pig?
What is stage and task in spark?
What are the data components used by Hadoop?
What is difference the between sqoop and distcp?
What is data skew and how do you fix it?
List out the commands that are used to start, check the progress and stop the ambari server?
What is dag – directed acyclic graph?
Is there a date data type in Hive?
What do you mean by column family in Cassandra?
If a particular file is 50 mb, will the hdfs block still consume 64 mb as the default size?
Define a metadata?
Name three data source available in SparkSQL
What is the use of foreach operation in Pig scripts?
Can you explain edge nodes in hadoop?