Translations:Large Scale Machine Learning (Big Data)/68/en

From Alliance Doc
Revision as of 22:22, 1 February 2022 by FuzzyBot (talk | contribs) (Importing a new version from external source)
Jump to navigation Jump to search

Spark ML is a Machine Learning library built on top of Apache Spark. It enables users to scale out many Machine Learning methods to massive amounts of data, over multiple nodes, without worrying about distributing datasets or explicitly writing distributed/parallel code. The library also includes many useful tools for distributed Linear Algebra and Statistics. Please see our tutorial on submitting Spark jobs before trying out the examples on the official Spark ML documentation.