|WikiProject Libraries||(Rated Start-class, Top-importance)|
|WikiProject Linguistics / Applied Linguistics||(Rated Start-class)|
Most of the content of this article is a direct copy of this scholarly article: http://www.academia.edu/1612159/Is_classification_necessary_after_Google 126.96.36.199 (talk) 21:31, 24 May 2014 (UTC)
I think this page should be moved to Text Classification, and Document Classification should redirect to Text Classification. Why? Text Classification better describes what the page is about: such classification techniques are not limited to whole documents of text; they can also be applied to portions of text within a document. I also think this page belongs in WikiProject Computer science. Jeff Kubina (talk) 15:25, 9 April 2009 (UTC)
Classification versus clustering
SemMac (talk) 06:07, 8 November 2009 (UTC) It would be useful if the article made the distinction between classification and clustering. Classification applies to supervised learning, where a teacher supplies the categories, while clustering applies to the unsupervised situation, where no categories are given in advance.
The title (Document classification) implies this article is about supervised learning, but the body makes it clear that it covers both supervised and unsupervised learning, and the algorithms listed also cover both situations. Unfortunately I can't think of a better title. The alternative word "categorization" normally also implies supervised learning. For example, see http://www.scholarpedia.org/article/Text_categorization
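To make the classification versus clustering distinction above concrete, here is a rough sketch in Python. It is only an illustration: the function names, the sample texts, and the word-overlap similarity are all my own assumptions, not any of the algorithms listed in the article. The point is just that the classifier is handed labels by a "teacher", while the clustering routine never sees any.

```python
from collections import Counter


def bag_of_words(text):
    """Lowercased word counts for a piece of text."""
    return Counter(text.lower().split())


def overlap(a, b):
    """Toy similarity (an assumption): word occurrences shared by two bags."""
    return sum((a & b).values())


# Supervised (classification): a teacher supplies the category labels.
def train_classifier(labeled_texts):
    """Merge the words of each labeled text into one profile per label."""
    profiles = {}
    for text, label in labeled_texts:
        profiles.setdefault(label, Counter()).update(bag_of_words(text))
    return profiles


def classify(profiles, text):
    """Return the teacher-given label whose profile overlaps the text most."""
    bag = bag_of_words(text)
    return max(profiles, key=lambda label: overlap(profiles[label], bag))


# Unsupervised (clustering): no labels; groups emerge from the texts alone.
def cluster(texts, threshold=1):
    """Greedy single pass: join the first cluster that shares at least
    `threshold` word occurrences, otherwise start a new cluster."""
    clusters = []  # list of (merged bag of words, member texts)
    for text in texts:
        bag = bag_of_words(text)
        for profile, members in clusters:
            if overlap(profile, bag) >= threshold:
                profile.update(bag)
                members.append(text)
                break
        else:
            clusters.append((bag, [text]))
    return [members for _, members in clusters]


if __name__ == "__main__":
    # Hypothetical sample data, invented for this illustration.
    profiles = train_classifier([
        ("library catalog shelf books", "libraries"),
        ("syntax grammar phonology", "linguistics"),
    ])
    print(classify(profiles, "books on the shelf"))
    print(cluster(["library books", "grammar syntax",
                   "books shelf", "syntax phonology"]))
```

Note that `classify` can only ever answer with a label it was given during training, whereas `cluster` returns unnamed groups. That is the difference the article title glosses over.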