Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) How will you calculate the number of executors required to do real-time processing using Apache Spark? What factors need to be considered for deciding on the number of nodes for real-time processing?
291In a given spark program, how will you identify whether a given operation is Transformation or Action ?
347
How much is flume worth?
Can you use spark to access and analyze data stored in cassandra databases?
What is a MapFile?
How NameNode tackle Datanode failures in Hadoop?
Can you explain bloommapfile.
Mention what is the difference between Hbase and Hive?
What is a column family?
What is JMX?
How to create an rdd?
Explain the core components of hadoop?
What is apache ambari?
Can we have different replication factor of the existing files in hdfs?
What do you mean by meta data in hdfs? List the files associated with metadata.
What are the components of apache ambari architecture?
What are the various types of shared variable in apache spark?