Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Are there any problems which can only be solved by MapReduce and cannot be solved by PIG? In which kind of scenarios MR jobs will be more useful than PIG?
What are the consistency levels for read operations in Cassandra?
In what ways sparksession different from sparkcontext?
What is the need for custom serde?
what are the three modes in which Hadoop can be run?
Explain about Hadoop file system and processing framework?
What are the advantages of using map side join in mapreduce?
What are different hdfs dfs shell commands to perform copy operation?
What is atom in pig?
Can spark work without hadoop?
What is a task instance in hadoop? Where does it run?
What is a dstream in apache spark?
What is a bag in pig?
How can data transfer be minimized when working with Apache Spark?
What bit version that ambari needs and also list out the operating systems that are compatible?