Un-Answered Questions { All }

How to create RDD?

361


Does Apache Spark provide check pointing?

313


Explain about the popular use cases of Apache Spark

340


Do you need to install Spark on all nodes of Yarn cluster while running Spark on Yarn?

418


What are the different String functions available in pig?

435


Differentiate between the physical plan and logical plan in Pig script?

502


What are the use cases of Apache Pig?

497


What do you understand by an inner bag and outer bag in Pig?

574


Explain different execution modes available in Pig?

526


How do users interact with HDFS in Apache Pig ?

521


what are the basic parameters of a Mapper?

504


What is a MapReduce Combiner?

508


Where is Mapper output stored?

502


what is JobTracker in Hadoop? What are the actions followed by Hadoop?

520


Is it possible to split 100 lines of input as a single split in MapReduce?

590