Apache Mahout

Developer(s)	Apache Software Foundation
Stable release	0.7 / 16 June 2012
Repository	github.com/apache/mahout
Written in	Java
Operating system	Cross-platform
Type	machine learning
License	Apache License 2.0
Website	mahout.apache.org

Apache Mahout is an Apache Software Foundation project to produce free implementations of distributed or otherwise scalable machine learning algorithms on the Hadoop platform.^[1]^[2] Mahout is a work in progress: the number of implemented algorithms has grown quickly,^[3] but various algorithms are still missing.

While Mahout's core algorithms for clustering, classification and batch-based collaborative filtering are implemented on top of Apache Hadoop using the map/reduce paradigm, the project does not restrict contributions to Hadoop-based implementations. Contributions that run on a single node or on a non-Hadoop cluster are also welcome. For example, the 'Taste' collaborative-filtering recommender component of Mahout was originally a separate project and can run stand-alone without Hadoop. Integration with initiatives such as the Pregel-like Giraph is actively under discussion.
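
As a rough illustration of the map/reduce paradigm in a collaborative-filtering setting, the following single-process Python sketch counts how often pairs of items occur together in users' preference lists. It is not Mahout code and does not use the Mahout or Hadoop APIs; the function names (`map_phase`, `reduce_phase`, `cooccurrence`) and the sample data are hypothetical, chosen only for this example.

```python
from collections import defaultdict
from itertools import combinations


def map_phase(user_items):
    """Map step: emit ((item_a, item_b), 1) for each item pair in one user's list."""
    for items in user_items.values():
        # Sort and de-duplicate so each unordered pair gets one canonical key.
        for a, b in combinations(sorted(set(items)), 2):
            yield (a, b), 1


def reduce_phase(pairs):
    """Reduce step: sum the emitted counts for each key."""
    counts = defaultdict(int)
    for key, value in pairs:
        counts[key] += value
    return dict(counts)


def cooccurrence(user_items):
    """Run the map step and then the reduce step in a single process."""
    return reduce_phase(map_phase(user_items))


# Hypothetical preference data: user -> items the user interacted with.
prefs = {
    "alice": ["book", "film", "game"],
    "bob": ["book", "film"],
    "carol": ["film", "game"],
}
counts = cooccurrence(prefs)
```

In a Hadoop job the map and reduce steps would run on different machines, with the framework grouping the emitted keys between the two phases; here both steps run in one process, which is closer to the stand-alone, single-node case.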

References

[1] "Introducing Apache Mahout". ibm.com. 2011. Retrieved 13 September 2011.
[2] "InfoQ: Apache Mahout: Highly Scalable Machine Learning Algorithms". infoq.com. 2011. Retrieved 13 September 2011.
[3] "Algorithms - Apache Mahout - Apache Software Foundation". cwiki.apache.org. 2011. Retrieved 13 September 2011.

External links

Official website
EC2 AMI with Hadoop and Mahout
Giraph - a graph-processing infrastructure that runs on Hadoop (see Pregel).
Pregel - Google's internal graph-processing platform, described in an ACM paper.

