Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) What is spark ml?
What is throughput in HDFS?
Explain HBase Architecture in brief?
Define a worker node?
How do I start a spark master?
What is Apache Zookeeper Meant For?
What happens to job tracker when namenode is down?
In HDFS, how Name node determines which data node to write on?
How does hadoop achieve fault tolerance?
What is an rdd?
What are the various modes in which Spark runs on YARN? (Local vs Client vs Cluster Mode)
What are the main properties of hdfs-site.xml file?
what is a sequence file in Hadoop?
What are the actions in spark?
What is spark reducebykey?