In PU learning, two sets of samples are assumed to be available for training: the positive set and a mixed set , which is assumed to contain both positive and negative samples, but without these being labeled as such. This contrasts with other forms of semisupervised learning, where it is assumed that a labeled set containing examples of both classes is available. A variety of techniques exist to adapt supervised classifiers to the PU learning setting. PU learning successfully been applied to text classification  and Bioinformatics tasks.
- Liu, Bing (2007). Web Data Mining. Springer. pp. 165−178.
- Bing Liu, Wee Sun Lee, Philip S. Yu and Xiao-Li Li (2002). "Partially supervised classification of text documents". ICML. pp. 8–12.
- Hwanjo Yu, Jiawei Han, Kevin Chen-Chuan Chang (2002). "PEBL: positive example based learning for web page classification using SVM". ACM SIGKDD.
- Xiao-Li Li and Bing Liu (2003). "Learning to classify text using positive and unlabeled data". IJCAI.
- Peng Yang, Xiao-Li Li, Jian-Ping Mei, Chee-Keong Kwoh and See-Kiong Ng (2012). "Positive-Unlabeled Learning for Disease Gene Identification". Bioinformatics, Vol 28(20).
|This computer science article is a stub. You can help Wikipedia by expanding it.|