Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
How to create an rdd?
What are components of ambari tjat are important for automation and integration?
How is fault tolerance achieved in Apache Spark?
What file systems Spark support?
Explain MemStore?
What is used to store data generally?
Explain the top() and takeordered() operation?
Explain about the execution plans of a Pig Script? Or Differentiate between the logical and physical plan of an Apache Pig script?
What do you mean by Persistence?
What is the utility of using Writable Comparable Custom Class in Map Reduce code?
Why is spark so fast?
Kafka has written in which languages?
Which Sorting algorithm is used in Hadoop MapReduce?
What are the main configuration parameters in a MapReduce program?
what factors the block size takes before creation?