Talk:Bag-of-words model

From Wikipedia, the free encyclopedia
Jump to: navigation, search
WikiProject Robotics (Rated Start-class, Mid-importance)
WikiProject icon Bag-of-words model is within the scope of WikiProject Robotics, which aims to build a comprehensive and detailed guide to Robotics on Wikipedia. If you would like to participate, you can choose to edit this article, or visit the project page (Talk), where you can join the project and see a list of open tasks.
Start-Class article Start  This article has been rated as Start-Class on the project's quality scale.
 Mid  This article has been rated as Mid-importance on the project's importance scale.
 
WikiProject Linguistics / Applied Linguistics  (Rated Start-class)
WikiProject icon This article is within the scope of WikiProject Linguistics, a collaborative effort to improve the coverage of Linguistics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.
Start-Class article Start  This article has been rated as Start-Class on the project's quality scale.
 ???  This article has not yet received a rating on the project's importance scale.
Taskforce icon
This article is supported by the Applied Linguistics Task Force.
 
Note icon
This article has been automatically rated by a bot or other tool because one or more other projects use this class. Please ensure the assessment is correct before removing the |auto= parameter.

Merge Document-term matrix here[edit]

That article just describes the kind of matrices that are produced by any algorithm that assumes a BoW model. Qwertyus (talk) 13:50, 31 August 2012 (UTC)


Possible mistake in matrix[edit]

  • Sentence: John likes to watch movies. Mary likes too.
  • Dictionary: [...] "likes": 2, [...]
  • Matrix: [1, 1, 1, 1, 1, 0, 0, 0, 1, 1]
  • Algorithm: [...] each entry of the vectors refers to count of the corresponding entry in the dictionary [...]

Shouldn't matrix look like this: [1, 2, 1, 1, 1, 0, 0, 0, 1, 1]?

89.76.114.24 (talk) 21:25, 24 November 2013 (UTC)MP

Indeed, I've corrected this. Compvis (talk) 22:28, 29 November 2013 (UTC)

Examples' grammar.[edit]

It seems strange to me that

"John likes to watch movies. Mary likes too"

is a commonly used text sample, because

"Mary likes too"

isn't a sentence, or a sentence fragment. I was going to replace it with a grammatically correct sentence, but I won't because googling a bit convinces me it's probably intentional, though I don't understand it.--Elvey (talk) 02:53, 3 December 2013 (UTC)