What is difference between dataset and dataframe in spark?

Golgappa.net | Golgappa.org | BagIndia.net | BodyIndia.Com | CabIndia.net | CarsBikes.net | CarsBikes.org | CashIndia.net | ConsumerIndia.net | CookingIndia.net | DataIndia.net | DealIndia.net | EmailIndia.net | FirstTablet.com | FirstTourist.com | ForsaleIndia.net | IndiaBody.Com | IndiaCab.net | IndiaCash.net | IndiaModel.net | KidForum.net | OfficeIndia.net | PaysIndia.com | RestaurantIndia.net | RestaurantsIndia.net | SaleForum.net | SellForum.net | SoldIndia.com | StarIndia.net | TomatoCab.com | TomatoCabs.com | TownIndia.com
Interested to Buy Any Domain ? << Click Here >> for more details...

What is difference between dataset and dataframe in spark?

Question Posted / pawan gill

1 Answers
331 Views
I also Faced
E-Mail Answers

What is difference between dataset and dataframe in spark?..

Answer / Pradeep Prasad

"A Dataset is a distributed collection of data with a strong type, which means that each column has a specified Java or Scala data type. It provides the benefits of both RDDs (Resilient Distributed Datasets) and DataFrames, with the added advantage of static typing. On the other hand, DataFrame is a distributed collection of data organized into named columns, but it does not have strong types for each column."

Is This Answer Correct ?

0 Yes

0 No

Post New Answer

More Apache Spark Interview Questions

What are transformations in spark?

Can copper cause a spark?

What is apache spark core?

What is external shuffle service in spark?

Please enumerate the various components of the Spark Ecosystem.

On what all basis can you differentiate rdd, dataframe, and dataset?

How do you integrate spark and hive?

What is spark vs hadoop?

What is master node in spark?

What is spark databricks?

What is apache spark good for?

Is a distributed machine learning framework on top of spark?

For more Apache Spark Interview Questions Click Here

Categories

Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)