What is the difference between Cassandra, Pig and Hive?
What is the difference between Pig and MapReduce?
What is a Resilient Distributed Dataset (RDD) in Apache Spark? How does it make Spark operator-rich?
What is the difference between a Column and a Super Column?
How can you send messages in Kafka?
Explain the replicating and multiplexing channel selectors in Flume.
What is the difference between Internal Table and External Table in Hive?
What is the difference between HDFS and NAS?
What do you mean by commit log in Cassandra?
What are common uses of Apache Spark?
Explain the popular use cases of Apache Spark.
Can you define RDD lineage?
What tasks can we perform to manage services using the Ambari Services tab?
How can we create RDDs in Apache Spark?
What are file permissions in HDFS? How does HDFS check permissions for files and directories?