Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are clusters in Cassandra?
Can you give some examples of how Hadoop is used in a real-time environment?
Can you explain accumulators in Apache Spark?
How do you change the replication factor for the cases below?
What is the difference between map and flatMap?
What are the various options available with the Hive command?
What does block mean?
What is the role of the Spark driver, and where is it executed on the cluster?
Why is Apache Spark faster than Hadoop MapReduce?
What are the various types of transformations on a DStream?
What is BloomMapFile?
What is speculative execution?
Is it necessary to install Spark on all nodes when running a Spark application on YARN?
What is the NameNode port number?
Can you explain the HBaseStorage function?