How many types of RDD are there in Spark?
Answer / Vikas Kumar Tiwari
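Spark does not define a fixed list of RDD types; Resilient Distributed Datasets (RDDs) are usually grouped by how they are created. On that basis there are three main kinds: parallelized collections (built from an in-memory collection in the driver program with parallelize()), Hadoop datasets (read from external storage through a Hadoop InputFormat, for example with textFile()), and RDDs derived from existing RDDs through transformations such as map() or filter().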
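The three creation routes can be sketched in PySpark as follows. This is only an illustration: the function name create_rdds, the sample file name, and its contents are made up for the example, and sc is assumed to be an already running SparkContext (the pyspark shell provides one under that name).

```python
import os
import tempfile


def create_rdds(sc):
    """Build one RDD per creation route. `sc` is assumed to be a live SparkContext."""
    # 1. Parallelized collection: distribute a driver-side list across the cluster.
    numbers = sc.parallelize([1, 2, 3, 4, 5])

    # 2. Hadoop dataset: textFile() reads through a Hadoop InputFormat.
    #    A small local file (hypothetical name) stands in for HDFS or S3 here.
    path = os.path.join(tempfile.mkdtemp(), "sample.txt")
    with open(path, "w") as f:
        f.write("spark\nrdd\n")
    lines = sc.textFile("file://" + path)

    # 3. Derived RDD: a transformation on an existing RDD returns a new RDD.
    squares = numbers.map(lambda x: x * x)

    return numbers, lines, squares


# In the pyspark shell, where `sc` is predefined:
#   numbers, lines, squares = create_rdds(sc)
#   squares.collect()   # [1, 4, 9, 16, 25]
```

Nothing is computed when the three RDDs are defined; Spark runs the work only when an action such as collect() is called.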