Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
explain the concept of RDD (Resilient Distributed Dataset). Also, state how you can create RDDs in Apache Spark.
What is the use of ZooKeeper?
what is the maximum size of the message does Kafka server can receive?
what are the key components of hbase?
Can you tell us how many daemon processes run on a hadoop system?
What is the Use of SSH in Hadoop ?
Why is output file name in Hadoop MapReduce part-r-00000?
How is anti-entropy associated with merkel tree?
When and how to create hadoop archive?
Explain how can apache spark be used alongside hadoop?
What is hive on spark?
How does pig work?
Is rdd type safe?
What bit version that ambari needs and also list out the operating systems that are compatible?
What is the no. Of threads created by impala?