What is Data Locality in Hadoop?
What is Hadoop Sqoop?
How is data partitioned before it is sent to the reducer if no custom partitioner is defined in Hadoop?
What is replication factor?
What is the importance of the split-by clause in running parallel import tasks in Sqoop?
What do you know about speculative execution?
What is the latest available version of Ambari, and what features were added in it?
Explain the origin of the name ‘Hadoop’.
How does Cassandra perform a read operation? Explain.
What is the roadmap for Apache driver version 1.0?
How can you identify whether a given operation is a transformation or an action?
What is faster than Apache Spark?
What is the relation between a job and a task in Hadoop?
Explain the sum(), max(), and min() operations in Apache Spark.
What is SSTable?