How do I change the Hive execution engine to Spark?
What is Hadoop? Name the main components of a Hadoop application.
Which command is used to start the cqlsh prompt?
Can I run an ensemble cluster behind a load balancer?
Define replication strategy.
What is the default extension of the files produced by a Sqoop import using the --compress parameter?
Explain the cogroup() operation in Spark.
In which location does the NameNode store its metadata, and why?
What are good use cases for Impala as opposed to Hive or MapReduce?
How can we launch a Spark application on YARN?
What do you mean by meta information in HDFS?
Is it possible to add 100 more nodes when we already have 100 nodes in Hive?
Can we deploy the JobTracker on a node other than the NameNode?
Which one is the master node in HDFS? Can it be commodity hardware?
What is Spark in big data?