Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What do you understand by compute and storage nodes?
Why do we need MapReduce during Pig programming?
What are the main components of hadoop?
What is Block in HDFS?
What is a hive on spark?
What is lineage graph in spark?
How client application interacts with the NameNode?
What is the usage of "cqlsh-version" command?
How HCatalog helps to capture processing states to enable sharing?
What does rdd mean?
What is spark reducebykey?
Is the hdfs block size reduced to achieve faster query results?
Explain what is a column family in cassandra?
What is the primary purpose of flume in the hadoop architecture?
List some commonly used Machine Learning Algorithm Apache Spark?