What is the difference between reduceByKey and groupByKey?
Answer / Neha Rani
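reduceByKey merges all values that share a key into a single value by applying a function you supply, which must be associative and commutative. Spark first combines values within each partition and then shuffles only the partial results, so much less data crosses the network. groupByKey, on the other hand, shuffles every key-value pair and returns each key with an iterable of all its values; it performs no reduction itself, so any aggregation has to be applied afterwards. For aggregations such as sums or counts, reduceByKey is usually the better choice because it moves less data and avoids holding every value for a key in memory at once.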
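The difference in shuffled data can be illustrated with a small sketch. This is plain Python that only mimics the behavior and is not the Spark API; the `partitions` data and the helper functions `reduce_by_key` and `group_by_key` are hypothetical names made up for the illustration. In PySpark the corresponding calls would be `rdd.reduceByKey(lambda a, b: a + b)` and `rdd.groupByKey().mapValues(sum)`.

```python
from collections import defaultdict
from operator import add

# Two hypothetical partitions of (key, value) pairs, standing in for an RDD.
partitions = [
    [("a", 1), ("b", 1), ("a", 1)],
    [("a", 1), ("b", 1), ("b", 1)],
]


def reduce_by_key(parts, func):
    """Mimic reduceByKey: combine inside each partition, then merge across partitions."""
    shuffled = []
    for part in parts:
        local = {}
        for key, value in part:
            local[key] = func(local[key], value) if key in local else value
        # Only one partial result per key leaves each partition.
        shuffled.extend(local.items())
    result = {}
    for key, value in shuffled:
        result[key] = func(result[key], value) if key in result else value
    return result, len(shuffled)


def group_by_key(parts):
    """Mimic groupByKey: every pair is shuffled, values are collected but not reduced."""
    shuffled = [pair for part in parts for pair in part]
    groups = defaultdict(list)
    for key, value in shuffled:
        groups[key].append(value)
    return dict(groups), len(shuffled)


reduced, reduced_shuffled = reduce_by_key(partitions, add)
grouped, grouped_shuffled = group_by_key(partitions)

# With groupByKey the aggregation is a separate step applied to each group.
summed = {key: sum(values) for key, values in grouped.items()}

print(reduced, "records shuffled:", reduced_shuffled)
print(summed, "records shuffled:", grouped_shuffled)
```

Both approaches give the same per-key totals, but in this sketch the reduce path shuffles one record per key per partition while the group path shuffles every record.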