What is meant by rdd in spark?
Answer / Amit Ranjan
RDD stands for Resilient Distributed Dataset, which is an immutable distributed collection of objects that can be operated on in parallel across a cluster.
| Is This Answer Correct ? | 0 Yes | 0 No |
How do you set up a spark?
What are the various levels of persistence in Apache Spark?
Name three data source available in SparkSQL
How Spark uses Akka?
What is project tungsten in spark?
State the difference between Spark SQL and Hql
Does Apache Spark provide checkpoints?
What is spark sqlcontext?
What is the reason behind Transformation being a lazy operation in Apache Spark RDD? How is it useful?
What is spark yarn executor memoryoverhead?
Is Apache Spark a good fit for Reinforcement learning?
How do I clear my spark cache?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)