How will you write a custom partitioner for a Hadoop job?
What mechanism does the Hadoop framework provide to synchronize changes made to the Distributed Cache while the application is running?
What do you mean by metadata in HDFS?
What do you understand by standalone (or local) mode?
What is speculative execution in Spark?
What is the Job interface in the MapReduce framework?
How will you make changes to the default configuration files?
In how many ways can we use Spark over Hadoop?
What are the different IDEs available for Hive development?
What is throughput? How does HDFS provide good throughput?
In the Producer, when does QueueFullException occur?
What is Hadoop MapReduce?
Explain Erasure Coding in Apache Hadoop.
What is the HDFS block size, and what did you choose in your project?
Can you explain what a recommendation engine is?