Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What do you mean by Free Form Import in Sqoop?
What happen on the namenode when a client tries to read a data file?
did you maintain the hadoop cluster in-house or used hadoop in the cloud?
Explain how is data partitioned before it is sent to the reducer if no custom partitioner is defined in hadoop?
What are the methods to set up the local repository in different methods?
How multi-hop agent can be setup in Flume?
What is spark code?
How do users interact with the shell in apache pig?
Which is the best hadoop certification?
Why HDFS performs replication, although it results in data redundancy in Hadoop?
What is the difference between Gen1 and Gen2 Hadoop with regards to the Namenode?
What is the difference between hadoop and spark?
What does dag stand for?
What is tungsten in spark?
Mention what is the difference between an rdbms and hadoop?