Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Whether the output of mapper or output of partitioner written on local disk?
What is kafka?
What is Rack Awareness? What is its need in Hadoop?
What is the relationship between hdfs, hbase, pig, hive and azkaban?
What do you understand by Thrift?
What does block mean?
Are spark dataframes immutable?
Explain some Disadvantages of Avro?
What is coalesce in spark sql?
How do I change hive execution engine to spark?
Explain how cassandra delete data?
Why is Transformation lazy in Spark?
If datanodes increase, then do we need to upgrade namenode?
Does mapreduce programming model provide a way for reducers to communicate with each other? In a mapreduce job can a reducer communicate with another reducer?
After the Map phase finishes, the Hadoop framework does 'Partitioning, Shuffle and sort'. Explain what happens in this phase?