What is Shark?
Answer / Mukta Pandey
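Shark was a SQL query engine built on top of Apache Spark, developed at UC Berkeley's AMPLab. It was essentially a port of Apache Hive that ran HiveQL queries on Spark instead of MapReduce, staying compatible with the Hive metastore, queries, and user-defined functions while using Spark's in-memory execution to answer queries faster. Shark development ended in 2014, and Spark SQL replaced it.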
What is big data spark?
What is a "worker node"?
What is rdd lineage graph? How is it useful in achieving fault tolerance?
Do you need to install Spark on all nodes of Yarn cluster while running Spark on Yarn?
Which are the methods to create rdd in spark?
Can you define yarn?
Why is there a need for broadcast variables when working with Apache Spark?
What is Sparse Vector?
What is amazon spark?
What is spark in big data?
What is lazy evaluation and how is it useful?
What is apache spark engine?