What is the difference between rdd and dataframe?
Answer / Kirti Verma
RDD (Resilient Distributed Datasets) in Apache Spark are immutable distributed collections of objects, while DataFrames provide a programming interface for structured data with a schema. DataFrames can be created from RDDs, and vice versa.
| Is This Answer Correct ? | 0 Yes | 0 No |
Does diesel engine have spark plug?
What is off heap memory in spark?
Do I need scala for spark?
Please enumerate the various components of the Spark Ecosystem.
What are the transformations in spark?
Which the fundamental data structure of Spark
What do you know about schemardd?
What operations does the "RDD" support?
How can I improve my spark performance?
Which language is not supported by spark?
What is a "Parquet" in Spark?
What is the default spark executor memory?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)