ELKI: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
translate from de.wikipedia
(No difference)

Revision as of 18:39, 2 May 2010

Environment for DeveLoping KDD-Applications Supported by Index-Structures
Developer(s)Ludwig Maximilian University of Munich
Stable release
0.3.0 / March 30, 2010
Repository
Operating systemCross-platform
TypeData Mining
Licenseunknown
Websitehttp://www.dbs.ifi.lmu.de/research/KDD/ELKI

Environment for DeveLoping KDD-Applications Supported by Index-Structures (ELKI) is a data mining software developed for use in research and teaching at the Ludwig Maximilian University of Munich, Germany. It aims at allowing the development and evaluation of advanced data mining algorithms and their interaction with database index structures.

Description

The ELKI framework is written in Java and built around a modular architecture. Most included algorithms belong to clustering, outlier detection[1] and database indexes. A key concept of ELKI is to allow the combination of arbitrary algorithms, data types, distance functions and indexes and evaluate these combinations. When developing new algorithms or index structures, the existing components can be reused and combined.

The university project is developed for use in teaching and research. The source code is written with extensability, readability and reusability in mind, but it is not optimized for performance. It currently does not offer intergration with business intelligence applications or even an interface to common database management systems via SQL (albeit this could be added). The application of the algorithm requires knowledge about their use and study of documentation. The audience are students, researchers and software engineers

Included algorithms

Some of the included algorithms:

Licensing

The website or source code does not give an explicit license, it should therefore be considered copyrighted. The authors have stated that research use is acceptable but attribution is required. For commercial use, an explicit license is required.

Version history

Version 0.1 (July 2008) contained several Algorithms from cluster analysis and anomaly detection, as well as some Index structures such as the R*-tree. The focus of the first release was on on subspace clustering algorithms.[2]

Version 0.2 (July 2009) added functionality for time series analysis, in particular distance functions for this. [3]

Version 0.3 (March 2010) extended the choice of anomaly detection algorithms and visualization modules [4]

Related applications

External links

Official web page of ELKI

References

  1. ^ Hans-Peter Kriegel, Peer Kröger, Arthur Zimek (2009). "Outlier Detection Techniques (Tutorial)" (PDF). 13th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2009). Bangkok, Thailand. Retrieved 2010-03-26.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  2. ^ Elke Achtert, Hans-Peter Kriegel, Arthur Zimek (2008). "ELKI: A Software System for Evaluation of Subspace Clustering Algorithms" (PDF). Proceedings of the 20th international conference on Scientific and Statistical Database Management (SSDBM 08). Hong Kong, China: Springer. doi:10.1007/978-3-540-69497-7_41.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  3. ^ Elke Achtert, Thomas Bernecker, Hans-Peter Kriegel, Erich Schubert, Arthur Zimek (2009). "ELKI in time: ELKI 0.2 for the performance evaluation of distance measures for time series" (PDF). Proceedings of the 11th International Symposium on Advances in Spatial and Temporal Databases (SSTD 2010). Aalborg, Dänemark: Springer. doi:10.1007/978-3-642-02982-0_35.{{cite journal}}: CS1 maint: multiple names: authors list (link)
  4. ^ Elke Achtert, Hans-Peter Kriegel, Lisa Reichert, Erich Schubert, Remigius Wojdanowski, Arthur Zimek (2010). "Visual Evaluation of Outlier Detection Models". 15th International Conference on Database Systems for Advanced Applications (DASFAA 2010). Tsukuba, Japan: Spinger. doi:10.1007/978-3-642-12098-5_34.{{cite journal}}: CS1 maint: multiple names: authors list (link)