Overlap coefficient

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

The overlap coefficient,[1] or Szymkiewicz–Simpson coefficient, is a similarity measure that measures the overlap between two sets. It is related to the Jaccard index and is defined as the size of the intersection divided by the smaller of the size of the two sets:

If set X is a subset of Y or the converse then the overlap coefficient is equal to one.

References[edit]

  1. ^ Vijaymeena, M. K.; Kavitha, K. (March 2016). "A Survey on Similariy Measures in Text Mining" (PDF). Machine Learning and Applications: An International Journal. 3 (1): 19–28. doi:10.5121/mlaij.2016.3103.