What is a Spark RDD?
Answer / Renu Sirohi
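A Resilient Distributed Dataset (RDD) is the basic building block of Spark. An RDD is an immutable, distributed collection of objects that is split into partitions spread across the nodes of a cluster. It is called "resilient" because Spark records the lineage of transformations used to build it and can recompute a lost partition from that lineage instead of keeping replicated copies of the data.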
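The properties above (immutable, partitioned, rebuilt from recorded transformations) can be sketched in plain Python. This is only a toy model and not Spark's API: `ToyRDD`, `compute_partition` and `num_partitions` are hypothetical names made up for illustration. In PySpark the rough equivalents would be `sc.parallelize(data, numSlices)`, `rdd.map(f)` and `rdd.collect()`.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass(frozen=True)  # frozen: fields cannot be reassigned after creation
class ToyRDD:
    """Hypothetical toy stand-in for an RDD (not part of Spark)."""
    source: Tuple[Tuple[int, ...], ...] = ()      # partitions of a base dataset
    parent: Optional["ToyRDD"] = None             # lineage: where the data came from
    fn: Optional[Callable[[int], int]] = None     # lineage: how it was derived

    @staticmethod
    def parallelize(data, num_partitions):
        # Split the input into contiguous slices, one per partition.
        items = list(data)
        n = len(items)
        parts = tuple(
            tuple(items[i * n // num_partitions:(i + 1) * n // num_partitions])
            for i in range(num_partitions)
        )
        return ToyRDD(source=parts)

    def map(self, fn):
        # A transformation returns a new ToyRDD; the parent is left unchanged.
        return ToyRDD(parent=self, fn=fn)

    def num_partitions(self):
        if self.parent is None:
            return len(self.source)
        return self.parent.num_partitions()

    def compute_partition(self, index):
        # Any single partition can be rebuilt on demand by replaying the lineage.
        if self.parent is None:
            return list(self.source[index])
        return [self.fn(x) for x in self.parent.compute_partition(index)]

    def collect(self):
        # Gather every partition into one local list.
        out = []
        for i in range(self.num_partitions()):
            out.extend(self.compute_partition(i))
        return out


numbers = ToyRDD.parallelize(range(1, 7), num_partitions=3)
squares = numbers.map(lambda x: x * x)

print(numbers.collect())             # [1, 2, 3, 4, 5, 6]
print(squares.collect())             # [1, 4, 9, 16, 25, 36]
print(squares.compute_partition(1))  # [9, 16]
```

Note that `map` does not modify `numbers`; it creates a second object that remembers its parent and the function applied. Recomputing one partition from that record is, in simplified form, how a real RDD recovers data lost when a node fails.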