Where is Spark RDD?
Answer / Sonika Dubey
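Resilient Distributed Datasets (RDDs) are the fundamental data structure in Apache Spark. An RDD is an immutable collection of objects, split into partitions that are distributed across the nodes of a cluster and can be kept in memory or spilled to disk. RDDs are fault tolerant: each one records the lineage of transformations that produced it, so a lost partition can be recomputed from its source data rather than restored from a replica.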
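To make the idea of partitions and lineage more concrete, here is a minimal pure-Python sketch. It is not Spark's implementation or API: the class `MiniRDD` and its `compute` method are hypothetical names invented for illustration, and the round-robin partitioning is an assumption chosen for simplicity. In PySpark the roughly corresponding calls would be `SparkContext.parallelize`, `RDD.map`, `RDD.filter` and `RDD.collect`.

```python
from typing import Callable, Iterable, List, Optional


class MiniRDD:
    """Toy stand-in for an RDD (hypothetical, not Spark code):
    immutable, partitioned, and rebuilt on demand from its lineage."""

    def __init__(
        self,
        source: Optional[Iterable] = None,
        num_partitions: int = 1,
        parent: Optional["MiniRDD"] = None,
        fn: Optional[Callable[[List], Iterable]] = None,
    ) -> None:
        # A base dataset holds source data; a derived one holds only
        # a pointer to its parent plus the function to apply (its lineage).
        self._source = tuple(source) if source is not None else None
        self._num_partitions = num_partitions
        self._parent = parent
        self._fn = fn

    def compute(self, index: int) -> List:
        """Recompute one partition from scratch by walking the lineage."""
        if self._parent is None:
            # Assumption: elements are assigned to partitions round-robin.
            return list(self._source[index::self._num_partitions])
        return list(self._fn(self._parent.compute(index)))

    def map(self, f: Callable) -> "MiniRDD":
        # Transformations never modify self; they return a new MiniRDD.
        return MiniRDD(
            num_partitions=self._num_partitions,
            parent=self,
            fn=lambda part: (f(x) for x in part),
        )

    def filter(self, pred: Callable) -> "MiniRDD":
        return MiniRDD(
            num_partitions=self._num_partitions,
            parent=self,
            fn=lambda part: (x for x in part if pred(x)),
        )

    def collect(self) -> List:
        """Gather every partition, in partition order, into one list."""
        out: List = []
        for i in range(self._num_partitions):
            out.extend(self.compute(i))
        return out


numbers = MiniRDD(source=range(10), num_partitions=3)
even_squares = numbers.map(lambda x: x * x).filter(lambda x: x % 2 == 0)

print(even_squares.collect())   # [0, 36, 16, 4, 64]
# If partition 1 were "lost", only that partition is rebuilt from lineage:
print(even_squares.compute(1))  # [16]
```

Because `map` and `filter` return new objects and leave `numbers` untouched, any partition can be recomputed at any time with the same result, which is the property the lineage-based recovery relies on.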
Can aluminum cause a spark?
What is the difference between DataFrame and Dataset in Spark?
What are shared variables?
How many types of Transformation are there?
Explain leftOuterJoin() and rightOuterJoin() operation in Apache Spark?
What is a starvation scenario in Spark Streaming?
What are the limitations of Spark?
What are the ways to launch Apache Spark over YARN?
Why is Spark RDD immutable?
Define Partition and Partitioner in Apache Spark?
Explain the write-ahead log (journaling) in Spark?
When was Spark introduced?