Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What are the exception handling operators in Pig script?
Compare RabbitMQ vs Apache Kafka?
What are spark stages?
What is apache spark architecture?
What is throughput? How does HDFS provide good throughput?
Differentiate between static and dynamic cql tables.
Is it possible to use the same metastore by multiple users, in case of the embedded hive?
Where does the data of a Hive table gets stored?
What is Apache Avro?
Why is there a need for broadcast variables when working with Apache Spark?
What are the main key structures of hbase?
What happens when the node running the map task fails before the map output has been sent to the reducer?
Why Ambari?
Some of the most notable applications of Kafka?
Does spark use tez?