Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
when do reducers play their role in a mapreduce task?
Explain different queries performed by apache tajo?
What is spark accreditation?
Highlight the key differences between MapReduce and Apache Pig?
What are shared variables in Apache Spark?
Do I need to know scala to learn spark?
When you point a partition of a hive table to a new directory, what happens to the data?
What is the process for starting a Kafka server?
Explain what is kafka?
Give key features of any NoSQL database?
What are the advantages of using map side join in mapreduce?
What is the jobtracker?
what is NameNode in Hadoop?
What are combiners? When should I use a combiner in my MapReduce Job?
Why HDFS stores data using commodity hardware despite the higher chance of failures in hadoop?