Does Impala performance improve as it is deployed to more hosts in a cluster, in much the same way that Hadoop performance does?
What is an RDD in Apache Spark? How are RDDs computed in Spark? What are the various ways in which an RDD can be created?
What role does a worker node play in an Apache Spark cluster? And why does a worker node need to register with the driver program?
What is the reason behind transformations being lazy operations on Apache Spark RDDs? How is this useful?
What are the different tools used for Ambari monitoring?
What is HiveServer2 (HS2)?
Does Impala use caching?
Compare Hadoop and Spark.
What is a cluster manager in Spark?
Explain partitions.
What is SuperColumn in Cassandra?
What are the port numbers of the TaskTracker?
What is a seed node in Cassandra?
Does Cassandra work on Windows?
What do you understand by Cassandra?
How does Apache Flume work?
Define SimpleStrategy.
What do you mean by Stream Processing in Kafka?
Can you explain commodity hardware?