Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is amazon spark?
What is rdd lineage graph? How is it useful in achieving fault tolerance?
How to restart NameNode or all the daemons in Hadoop HDFS?
What is data skew in spark?
Where sorting is done on mapper node or reducer node in MapReduce?
What type of data hadoop can handle ?
How data or a file is written into hdfs?
If datanodes increase, then do we need to upgrade namenode?
Which is the best spark certification?
Define role of velocity in big data?
What are Pig Execution modes?
What is partitioning in MapReduce?
What is a Consumer Group?
How is the processing of streaming data achieved in Apache Spark? Explain.
What do you understand by compute and storage nodes?