How is Hadoop different from Spark?
Answer / Kamlesh Bhumarkar
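Hadoop is a distributed framework that pairs HDFS storage with MapReduce, a batch-processing engine that writes intermediate results to disk between stages. Spark is a general-purpose cluster computing engine with high-level APIs for batch processing, stream processing, machine learning, and graph processing. Spark is usually faster than Hadoop MapReduce, especially for iterative workloads, because it can keep intermediate data in memory. Spark does not replace HDFS and is often run on top of Hadoop, using HDFS for storage and YARN for resource management.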
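To make the MapReduce model concrete, here is a minimal plain-Python sketch of a word count split into map, shuffle, and reduce phases. It is an illustration only, not Hadoop or Spark code: the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) and the sample sentences are made up for this example, and everything runs in a single process.

```python
from collections import defaultdict
from functools import reduce


def map_phase(lines):
    """Emit a (word, 1) pair for every word, as a MapReduce mapper would."""
    for line in lines:
        for word in line.lower().split():
            yield (word, 1)


def shuffle_phase(pairs):
    """Group values by key. In Hadoop MapReduce the map output is written
    to disk before this step; here it simply stays in memory."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(groups):
    """Sum the grouped counts for each key, as a reducer would."""
    return {key: reduce(lambda a, b: a + b, values)
            for key, values in groups.items()}


def word_count(lines):
    # Hypothetical helper that chains the three phases together.
    return reduce_phase(shuffle_phase(map_phase(lines)))


if __name__ == "__main__":
    sample = ["Spark is fast", "Hadoop is batch", "Spark is general"]
    print(word_count(sample))
```

In Spark the same job is typically written as a chain of RDD transformations such as `flatMap`, `map`, and `reduceByKey`, and the intermediate results can stay in memory between steps instead of being written out after each stage.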
Explain the operation reduce() in Spark?
What is spark machine learning?
Explain the different types of transformations on DStreams?
When creating an RDD, what goes on internally?
Difference between groupByKey vs reduceByKey in Apache Spark?
Name some sources from which the Spark Streaming component can process real-time data?
Compare Hadoop & Spark?
What happens when we submit a Spark job?
What is DStream in Apache Spark Streaming?
Can you explain Spark MLlib?
How do you set up Spark?
Is Hadoop mandatory for Spark?