Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What do you understand by node in cassandra?
Explain catalyst query optimizer in Apache Spark?
What is oozie in hadoop?
What is the use of flatmap in spark?
Difference between hive and impala?
What is the Repository?
I have a relation r. How can I get the top 10 tuples from the relation r?
Describe Replication Factor?
Explain Accumulator in Spark?
What is the use of checkpoints in spark?
Explain some Disadvantages of Avro?
What is a pipelinedrdd?
How can we change the split size if our commodity hardware has less storage space?
Command to format the NameNode?
Explain a common use case for Flume?