Explain the Machine Learning library in Spark.
Answer / Mohit Singhal
MLlib (Machine Learning Library) is Apache Spark's machine learning library. It provides scalable algorithms for classification, regression, clustering, collaborative filtering, and dimensionality reduction, plus utilities such as feature extraction and pipelines. These algorithms are designed to run on large datasets distributed across a cluster.
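To make "designed to run on datasets distributed across a cluster" concrete, here is a minimal sketch in plain Python. It is not MLlib's API and is not taken from its source; it only illustrates the general pattern where each partition computes a small summary locally and the summaries are merged to fit a model. The function names (`partial_stats`, `merge_stats`, `fit_line`) and the sample data are hypothetical.

```python
from functools import reduce

def partial_stats(partition):
    """Summarize one partition of (x, y) pairs as (n, sum x, sum y, sum x*x, sum x*y)."""
    n = len(partition)
    sx = sum(x for x, _ in partition)
    sy = sum(y for _, y in partition)
    sxx = sum(x * x for x, _ in partition)
    sxy = sum(x * y for x, y in partition)
    return (n, sx, sy, sxx, sxy)

def merge_stats(a, b):
    """Combine two partition summaries by element-wise addition."""
    return tuple(u + v for u, v in zip(a, b))

def fit_line(partitions):
    """Fit y = slope * x + intercept by least squares using only merged summaries."""
    n, sx, sy, sxx, sxy = reduce(merge_stats, map(partial_stats, partitions))
    slope = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    intercept = (sy - slope * sx) / n
    return slope, intercept

# Hypothetical data following y = 2x + 1, split across three "partitions".
partitions = [
    [(0.0, 1.0), (1.0, 3.0)],
    [(2.0, 5.0)],
    [(3.0, 7.0), (4.0, 9.0)],
]

slope, intercept = fit_line(partitions)
print(slope, intercept)
```

Because only the five-number summary leaves each partition, the amount of data moved between machines stays constant no matter how many rows a partition holds. That is the property that lets this style of algorithm scale with the cluster.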