Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the difference between rdbms and hadoop?
What is accumulators and broadcast variables in spark?
What is sqoop and flume?
Why is Cassandra popular? Clarify.
What is Hive Data Definition language?
How do I download spark?
Is it possible to have hadoop job output in multiple directories?
What are the types of System tools?
How HDFS client divide the file into the block while storing inside HDFS?
explain the key features of Apache Spark?
When is it not recommended to use MapReduce paradigm for large
Explain why to use hbase?
How to create an rdd?
How do you stop a spark?
What is the need of key-value pair to process the data in MapReduce?