Does Impala performance improve as it is deployed to more hosts in a cluster, in much the same way that Hadoop performance does?
What is an RDD in Apache Spark? How are RDDs computed in Spark? What are the various ways in which an RDD can be created?
What role does a worker node play in an Apache Spark cluster? And what is the need to register a worker node with the driver program?
What is the reason behind transformations being lazy operations on Apache Spark RDDs? How is this useful?
What is Hadoop Pig?
What is crontab? Explain with a suitable example.
List all Apache Kafka operations.
What is Federation?
How can we control which reducer a particular key goes to?
What are the components of Apache Spark Ecosystem?
What are the four modules that make up the Apache Hadoop framework?
Explain the benefits of big data.
Explain Apache Ambari.
Explain the Apache Ambari architecture.
What guarantees does Kafka provide?
What is throughput?
How do you deal with sparse data?
Where does Big Data come from?
How do you stop a partition from being queried?