Talk:Document classification

WikiProject Libraries (Rated Start-class, Top-importance)
This article is within the scope of WikiProject Libraries, a collaborative effort to improve the coverage of Libraries on Wikipedia. It has been rated as Start-Class on the project's quality scale and as Top-importance on the project's importance scale.

WikiProject Linguistics / Applied Linguistics (Rated Start-class)
This article is within the scope of WikiProject Linguistics, a collaborative effort to improve the coverage of Linguistics on Wikipedia. It has been rated as Start-Class on the project's quality scale and has not yet received a rating on the project's importance scale. This article is supported by the Applied Linguistics Task Force.
Note: This article has been automatically rated by a bot or other tool because one or more other projects use this class. Please ensure the assessment is correct before removing the |auto= parameter.

Copyright Infringement

Most of the content of this article is a direct copy of this scholarly article: http://www.academia.edu/1612159/Is_classification_necessary_after_Google 190.140.148.104 (talk) 21:31, 24 May 2014 (UTC)

I think this page should be moved to Text Classification, and Document Classification should redirect to it. Why? Text Classification better describes what the page is about: such classification techniques are not limited to whole documents; they can also be applied to portions of text within a document. I also think this page belongs in WikiProject Computer science. Jeff Kubina (talk) 15:25, 9 April 2009 (UTC)

I agree that Text Classification is the better name for this page. Johnhenrylawyer (talk) 20:54, 2 July 2013 (UTC)

Classification versus clustering

SemMac (talk) 06:07, 8 November 2009 (UTC) It would be useful if the article made the distinction between classification and clustering. Classification applies to supervised learning, where a teacher supplies the categories, while clustering applies to the unsupervised situation, where no categories are given in advance.

The title (Document classification) implies that this article is about supervised learning, but the body makes it clear that it covers both supervised and unsupervised learning, and the algorithms listed also cover both situations. Unfortunately, I can't think of a better title. The alternative word "categorization" normally also implies supervised learning. For example, see http://www.scholarpedia.org/article/Text_categorization
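
To make the distinction raised in this thread concrete, here is a minimal sketch in Python (standard library only). It is not taken from the article: the function names `classify` and `cluster`, the word-overlap similarity, and the naive seeding are all assumptions chosen for illustration. The point is only that the supervised function needs labels supplied by a "teacher", while the unsupervised one receives documents and a group count and nothing else.

```python
from collections import Counter

def bag_of_words(text):
    """Turn a text into a word-count vector (illustrative tokenizer: split on whitespace)."""
    return Counter(text.lower().split())

def overlap(a, b):
    """Similarity used in this sketch: number of word occurrences two vectors share."""
    return sum((a & b).values())

def merge(vectors):
    """Sum several word-count vectors into one."""
    total = Counter()
    for v in vectors:
        total += v
    return total

def classify(text, labeled_docs):
    """Supervised: the categories come from the labels in labeled_docs."""
    profiles = {}
    for doc, label in labeled_docs:
        profiles.setdefault(label, Counter()).update(bag_of_words(doc))
    vec = bag_of_words(text)
    return max(profiles, key=lambda label: overlap(vec, profiles[label]))

def cluster(docs, k, rounds=5):
    """Unsupervised: no labels; group docs around k centers (a k-means-like loop)."""
    vecs = [bag_of_words(d) for d in docs]
    centers = vecs[:k]  # assumption: seed the centers with the first k documents
    assign = [0] * len(vecs)
    for _ in range(rounds):
        # Assign each document to the center it overlaps with most.
        assign = [max(range(k), key=lambda c: overlap(v, centers[c])) for v in vecs]
        # Recompute each center from its members; keep the old center if it has none.
        centers = [merge(v for v, a in zip(vecs, assign) if a == c) or centers[c]
                   for c in range(k)]
    return assign
```

Note that `classify` returns one of the category names it was given, whereas `cluster` can only return anonymous group indices; any name for a cluster has to be attached afterwards by a person or a separate step.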