What does rdd stand for?
Answer / Sachin Bajpai
RDD stands for Resilient Distributed Dataset. It is a distributed collection of data that can be processed in parallel within the Apache Spark ecosystem.
| Is This Answer Correct ? | 0 Yes | 0 No |
What are the various types of shared variable in apache spark?
Explain briefly what is Action in Apache Spark? How is final result generated using an action?
Explain Spark Executor
Does spark sql use hive?
List down the languages supported by Apache Spark?
What is difference between client and cluster mode in spark?
How spark is used in hadoop?
What are Actions?
Where is apache spark used?
Why are spark transformations lazy?
Explain reduceByKey() Spark operation?
explain the key features of Apache Spark?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)