How is a DAG created in Spark?
What is the optimum value for the number of reducers?
Define Cassandra.
When does a QueueFullException occur?
Which HBase table design approach would you recommend: tall-narrow or flat-wide?
What is reduceByKey in Spark?
How can we view only the top 15 of the 100 records in student.txt in an HDFS directory?
Explain the flatMap operation on an Apache Spark RDD.
Is it possible to rename the output file, and if so, how?
Can you explain how ‘map’ and ‘reduce’ work?
What are sink processors?
What is the communication channel between the client and the NameNode/DataNode?
What is ChainMapper?
Why does Hadoop perform replication, although it results in data redundancy?
Should the region server be located on all DataNodes?