Mining Software Repositories

From Wikipedia, the free encyclopedia
Jump to: navigation, search

The Mining Software Repositories (MSR) field analyzes the rich data available in software repositories, such as version control repositories, mailing list archives, bug tracking systems, issue tracking systems, etc. to uncover interesting and actionable information about software systems, projects and software engineering.

Definition[edit]

Herzing and Zeller define ”mining software archives” as a process to ”obtain lots of initial evidence” by extracting data from software repositories. Further they define ”data sources” as product-based artefacts like source code, requirement artefacts or version archives and claim that these sources are unbiased, but noisy and incomplete. [1]

Data Repositories[edit]

Metrics[edit]

Defect Prediction[edit]

  • Promise Software Repository [2]

Collection of Open Source Code.[edit]

Techniques[edit]

Tools[edit]

Experimentation Tools[edit]

Trace lab.

Metric Extraction Tools[edit]

Mining Tools[edit]

  • rapidminer [7]

Contradictory Findings[edit]

Software Metrics[edit]

See also[edit]

References[edit]

  1. ^ K. S. Herzig and A. Zeller, “Mining your own evidence,” in Making Software, pp. 517–529, Sebastopol, Calif., USA: O’Reilly, 2011.

External links[edit]