Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) How to restart NameNode or all the daemons in Hadoop HDFS?
What is hbase in hadoop?
When should you use a reducer?
How is dag created in spark?
Explain about the execution pl of a pig script?
or
differentiate between the logical and physical plan of an apache pig script?
What is a "Spark Executor"?
Explain how indexing in hdfs is done?
What is a combiner in hadoop?
In mapreduce what is a scarce system resource? Explain?
What is catalyst query optimizer in apache spark?
How HDFS helps NameNode in scaling in Hadoop?
Why the output of map tasks are stored (spilled ) into local disc and not in hdfs?
How is Flume-NG different from Flume 0.9?
Explain what is the role of the zookeeper?
What is a kafka cluster?