Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
How can you check whether the NameNode is running, besides using the jps command?
On what basis does the NameNode distribute blocks across the DataNodes in HDFS?
What is the difference between ORDER BY and SORT BY in Hive?
List the files associated with metadata in HDFS.
What is OutputFormat in MapReduce?
What is the most widely used API to write data to Cassandra?
Define actions in Spark.
Differentiate between DESCRIBE and DESCRIBE EXTENDED.
Why is space not freed up when I issue DROP TABLE?
Why do people use Spark?
What are the nodes in a Hadoop cluster?
Differentiate between the terms node, cluster, and data center in Cassandra.
What is a Dataset? What are its advantages over DataFrame and RDD?
What do you understand by a Bloom filter in Cassandra?
What is anti-entropy?