Jump to content

Overlap coefficient

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Narky Blert (talk | contribs) at 10:46, 18 November 2019 (Link to DAB page repaired). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

The overlap coefficient,[1] or Szymkiewicz–Simpson coefficient, is a similarity measure that measures the overlap between two finite sets. It is related to the Jaccard index and is defined as the size of the intersection divided by the smaller of the size of the two sets:

If set X is a subset of Y or the converse then the overlap coefficient is equal to 1.

References

  1. ^ Vijaymeena, M. K.; Kavitha, K. (March 2016). "A Survey on Similarity Measures in Text Mining" (PDF). Machine Learning and Applications: An International Journal. 3 (1): 19–28. doi:10.5121/mlaij.2016.3103.