What is difference between hadoop and spark?
Answer / Gireesh Chandra
Hadoop is a framework that provides a distributed file system (HDFS) and a programming model (MapReduce) for processing large data sets, while Spark is an open-source distributed computing system that uses in-memory processing to achieve faster performance. Spark can run on Hadoop but also supports other distributed systems like Mesos.
| Is This Answer Correct ? | 0 Yes | 0 No |
What is spark code?
Does rdd have schema?
What are spark jobs?
Explain a scenario where you will be using spark streaming.
What happens to rdd when one of the nodes on which it is distributed goes down?
What is big data spark?
What is spark shuffle?
Define sparksession in apache spark? Why is it needed?
What is the difference between dataset and dataframe in spark?
What is data skew and how do you fix it?
What is the difference between Caching and Persistence in Apache Spark?
Where does spark plug get power?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)