Stable release: 5.2 (1 February 2012)
RapidMiner, formerly YALE (Yet Another Learning Environment), is an environment for machine learning, data mining, text mining, predictive analytics, and business analytics. It is used for research, education, training, rapid prototyping, application development, and industrial applications. In polls by KDnuggets, a data mining news site, RapidMiner ranked second among data mining and analytic tools used for real projects in 2009 and first in 2010. It is distributed under the AGPL open source license and has been hosted by SourceForge since 2004.
The RapidMiner project was started in 2001 by Ralf Klinkenberg, Ingo Mierswa, and Simon Fischer at the Artificial Intelligence Unit of the Dortmund University of Technology.
RapidMiner provides data mining and machine learning procedures, including data loading and transformation (extract, transform, load, or ETL), data preprocessing and visualization, modelling, evaluation, and deployment. RapidMiner is written in the Java programming language. It uses learning schemes and attribute evaluators from the Weka machine learning environment and statistical modelling schemes from the R project.
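The stages listed above can be illustrated with a minimal sketch in plain Python. It does not use RapidMiner or any of its operators; the toy data, the function names, and the one-threshold "model" are all assumptions chosen only to show how loading, preprocessing, modelling, and evaluation chain together.

```python
from statistics import mean

# Toy records standing in for loaded data: (feature as text, label).
# Hypothetical values, not taken from any RapidMiner sample.
RAW = [("1.0", "no"), ("2.0", "no"), ("8.0", "yes"), ("9.0", "yes"),
       ("1.5", "no"), ("8.5", "yes")]

def load(raw):
    """Loading and transformation: parse text fields into typed values."""
    return [(float(x), label) for x, label in raw]

def preprocess(rows):
    """Preprocessing: rescale the feature to [0, 1] (min-max scaling)."""
    xs = [x for x, _ in rows]
    lo, hi = min(xs), max(xs)
    return [((x - lo) / (hi - lo), label) for x, label in rows]

def train(rows):
    """Modelling: a threshold halfway between the two class means."""
    yes = mean(x for x, label in rows if label == "yes")
    no = mean(x for x, label in rows if label == "no")
    return (yes + no) / 2

def predict(threshold, x):
    """Apply the model to a single feature value."""
    return "yes" if x > threshold else "no"

def evaluate(threshold, rows):
    """Evaluation: fraction of rows classified correctly."""
    return sum(predict(threshold, x) == label for x, label in rows) / len(rows)

# Rescaling before the split is a simplification for this sketch.
rows = preprocess(load(RAW))
train_rows, test_rows = rows[:4], rows[4:]
model = train(train_rows)
accuracy = evaluate(model, test_rows)
print(f"threshold={model}, accuracy={accuracy}")
```

Each function takes the output of the previous one, which is roughly the shape a chained analytical process has regardless of the tool used to build it.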
RapidMiner lets users define sequences of analytical steps, much as R does, and can analyze data generated by high-throughput instruments such as those used in genotyping, proteomics, and mass spectrometry. It can also be used for text mining, multimedia mining, feature engineering, data stream mining, development of ensemble methods, and distributed data mining. Its functionality can be extended with additional plugins.
RapidMiner provides a GUI for designing an analytical pipeline (the "operator tree"). The GUI generates an XML (Extensible Markup Language) file that defines the analytical processes the user wishes to apply to the data. Alternatively, the engine can be called from other programs or used as an API, and individual functions can be called from the command line.
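To make the idea of an XML-encoded operator tree concrete, the sketch below parses a small process description with Python's standard library and prints the nesting of its operators. The XML layout, the operator names, and the `class` values are hypothetical simplifications, not RapidMiner's actual file schema, and the helper functions `walk_operators` and `describe` are illustrative names only.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified process description. The element and attribute
# names are illustrative and do not reproduce RapidMiner's real schema.
PROCESS_XML = """
<process>
  <operator name="Root" class="Process">
    <operator name="Load" class="ReadCSV"/>
    <operator name="Validate" class="CrossValidation">
      <operator name="Train" class="DecisionTree"/>
      <operator name="Apply" class="ApplyModel"/>
    </operator>
  </operator>
</process>
"""

def walk_operators(element, depth=0):
    """Yield (depth, name, class) for each nested <operator>, depth-first."""
    for op in element.findall("operator"):
        yield depth, op.get("name"), op.get("class")
        yield from walk_operators(op, depth + 1)

def describe(xml_text):
    """Return one indented line per operator in document order."""
    root = ET.fromstring(xml_text)
    return [f"{'  ' * d}{name} ({cls})" for d, name, cls in walk_operators(root)]

if __name__ == "__main__":
    print("\n".join(describe(PROCESS_XML)))
```

Because the whole process lives in a single declarative file, a program other than the GUI can read, generate, or modify it, which is what makes calling the engine from other software practical.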
RapidMiner is open-source and is offered free of charge as a Community Edition released under the GNU AGPL. There is also an Enterprise Edition offered under a commercial license for integration into closed-source projects.
- "Data Mining Tools Used Poll" (May 2009). KDnuggets. Retrieved 4 July 2012.
- "Data Mining / Analytic Tools Used Poll" (May 2010). KDnuggets. Retrieved 4 July 2012.
- "RapidMiner -- Data Mining, ETL, OLAP, BI". SourceForge. Geeknet, Inc. Retrieved 4 July 2012.
- "RapidMiner". Rapid-I. Retrieved 7 March 2011.
- Ingo Mierswa, Michael Wurst, Ralf Klinkenberg, Martin Scholz, and Timm Euler: YALE: Rapid Prototyping for Complex Data Mining Tasks, in Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-06), 2006.
- YALE becomes RapidMiner