Jump to content

Sparse binary polynomial hashing

From Wikipedia, the free encyclopedia

This is an old revision of this page, as edited by Bryanrutherford0 (talk | contribs) at 15:12, 18 December 2015 (Disambiguating wikilink). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Sparse binary polynomial hashing (SBPH) is a generalization of Bayesian spam filtering that can match mutating phrases as well as single words. SBPH is a way of generating a large number of features from an incoming text automatically, and then using statistics to determine the weights for each of those features in terms of their predictive values for spam/nonspam evaluation.