Jump to content

Apache Mahout

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by 184.153.92.85 (talk) at 21:31, 11 April 2016 (mahout 0.12.0 release). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Apache Mahout
Developer(s)Apache Software Foundation
Stable release
0.12.0 / 11 April 2016 (2016-04-11)
Repository
Written inJava, Scala
Operating systemCross-platform
TypeMachine Learning
LicenseApache 2.0 Licence
Websitemahout.apache.org

Apache Mahout is a project of the Apache Software Foundation to produce free implementations of distributed or otherwise scalable machine learning algorithms focused primarily in the areas of collaborative filtering, clustering and classification. Many of the implementations use the Apache Hadoop platform.[1][2] Mahout also provides Java libraries for common maths operations (focused on linear algebra and statistics) and primitive Java collections. Mahout is a work in progress; the number of implemented algorithms has grown quickly,[3] but various algorithms are still missing.

While Mahout's core algorithms for clustering, classification and batch based collaborative filtering are implemented on top of Apache Hadoop using the map/reduce paradigm, it does not restrict contributions to Hadoop-based implementations. Contributions that run on a single node or on a non-Hadoop cluster are also welcomed. For example, the 'Taste' collaborative-filtering recommender component of Mahout was originally a separate project and can run stand-alone without Hadoop.

Starting with the release 0.10.0, the project shifts its focus to building backend-independent programming environment, code named "Samsara".[4][5][6] The environment consists of an algebraic backend-independent optimizer and an algebraic Scala DSL unifying in-memory and distributed algebraic operators. At the time of this writing supported algebraic platforms are Apache Spark and H20, and Apache Flink. Support for MapReduce algorithms is being gradually phased out.[7]

References

  1. ^ "Introducing Apache Mahout". ibm.com. 2011. Retrieved 13 September 2011.
  2. ^ "InfoQ: Apache Mahout: Highly Scalable Machine Learning Algorithms". infoq.com. 2011. Retrieved 13 September 2011.
  3. ^ "Algorithms - Apache Mahout - Apache Software Foundation". cwiki.apache.org. 2011. Retrieved 13 September 2011.
  4. ^ "Mahout-Samsara's In-Core Linear Algebra DSL Reference".
  5. ^ "Mahout-Samsara's Distributed Linear Algebra DSL Reference".
  6. ^ "Mahout 0.10.x: first Mahout release as a programming environment". www.weatheringthroughtechdays.com. Retrieved 2016-02-29.
  7. ^ "MAHOUT-1510 ("Good-bye MapReduce")".

Resources