The difference between GROUP and COGROUP operators in Pig?
Answer / Sudhanshu Shekhar
In Pig, the GROUP operator groups tuples based on a specified field or set of fields. The resulting output is a Bag for each group key, containing all the original tuples that belong to the respective group. On the other hand, COGROUP operates on multiple relations and groups them together using the same grouping expressions. It returns a relation where the bag of one data stream is associated with the bags of other data streams based on common grouping fields.
| Is This Answer Correct ? | 0 Yes | 0 No |
How Pig differs from MapReduce?
What is UDF?
Explain about the execution plans of a pig script?<Br> or<br> differentiate between the logical and physical plan of an apache pig script?
How do you run pig scripts on kerberos secured cluster?
What is UDF in Pig?
Explain plucktuple?
How do you run the pig scripts in local mode?
What do you understand by an inner bag and outer bag in Pig?
What is the difference between store and dumps commands?
What is Pig Latin?
How do you handle compression in pig?
How should 'load' keyword is useful in pig scripts?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)