Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain about the core components of Flume?
What is the difference between Input Split and an HDFS Block?
What is the difference between the ZooKeeper ensemble and ZooKeeper quorum?
Write the command to copy a file from linux to hdfs?
What is the function of UNION and SPLIT operators? Give examples?
What are the most common InputFormats in Hadoop?
On what all basis can you differentiate rdd, dataframe, and dataset?
Can You Use Apache Spark To Analyze and Access Data Stored In Cassandra Databases?
Mention what are the three modes in which hadoop can be run?
How can we create RDD in Apache Spark?
Developing a MapReduce Application?
Explain the process that overwrites the replication factors in HDFS?
Why do we need Hadoop?
What is a primary key? And what are it’s different types?
how Cassandra writes changed data into commitlog?