Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) What is Resilient Distributed Dataset (RDD) in Apache Spark? How does it make spark operator rich?
What are spark stages?
What is spark written?
Define primary key in Apache Cassandra?
Give me the examples of Columnar database ?
What are the all tasks we can perform for managing services using the ambari service tab?
Why do we use persist () on links rdd?
What are the the issues associated with the map and reduce slots based mechanism in mapReduce?
Differentiate between describe and describe extended?
What are the usage of different consistency levels for write operations ?
What is hadoop technology?
What types of costs are associated in creating index on hive tables?
Explain the commit log?
What happens if you get a ‘connection refused java exception’ when you type hadoop fsck /?
What is the need of MapReduce in Hadoop?