Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Is the following approach correct? Is the sqrt Of Sum Of Sq a valid reducer?
Can Hadoop be compared to NOSQL database like Cassandra?
Compare MapReduce and Spark?
how JobTracker schedules a task ?
What are the various components in kafka.
What are the common mistakes developers make when running Spark applications?
Define standalone mode in hbase?
What is decorating filters?
Define compaction in HBase?
If you omit the overwrite clause while creating a hive table,what happens to file which are new and files which already exist?
What is a combiner and where you should use it?
how Cassandra writes data?
what daemons run on a master node and slave nodes?
Explain how do ‘map’ and ‘reduce’ work?
Explain the operations of Apache Spark RDD?