Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How we can check hadoop sqoop installed or not in a system?
What is a task tracker?
What is the sequence of execution of Mapper, Combiner, and Partitioner in MapReduce?
Why HDFS stores data using commodity hardware despite the higher chance of failures in hadoop?
What is shuffle read and shuffle write in spark?
Is there any difference between HBase datamodel and RDBMS datamodel?
When to choose "Internal Table" in Hive?
What is hdfs spark?
What are the different Data Types available in Hive?
What if a namenode has no data?
What is DistributedCache and its purpose?
What are different String functions available in PIG?
How does reducebykey work in spark?
Explain about the major libraries that constitute the Spark Ecosystem?
What is Zookeeper Cluster?