TrustRank: Difference between revisions

Content deleted Content added

Inline

Revision as of 13:17, 20 April 2011

TrustRank is a link analysis technique described in a paper by Stanford University and Yahoo! researchers for semi-automatically separating useful webpages from spam.^[1]

Many Web spam pages are created only with the intention of misleading search engines. These pages, chiefly created for commercial reasons, use various techniques to achieve higher-than-deserved rankings on the search engines' result pages. While human experts can easily identify spam, it is too expensive to manually evaluate a large number of pages.

One popular method for improving rankings is to increase artificially the perceived importance of a document through complex linking schemes. Google's PageRank and similar methods for determining the relative importance of Web documents have been subjected to manipulation.

TrustRank method calls for selecting a small set of seed pages to be evaluated by an expert. Once the reputable seed pages are manually identified, a crawl extending outward from the seed set seeks out similarly reliable and trustworthy pages. TrustRank's reliability diminishes as documents become further removed from the seed set.

The researchers who proposed the TrustRank methodology have continued to refine their work by evaluating related topics, such as measuring spam mass.

References

^ Gyöngyi, Zoltán (2004). "Combating Web Spam with TrustRank" (PDF). Proceedings of the International Conference on Very Large Data Bases. 30: 576. Retrieved 2007-10-26. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)

External links

Z. Gyöngyi, H. Garcia-Molina, J. Pedersen: Combating Web Spam with TrustRank
Link-based spam detection Yahoo! assigned patent application using Trustrank
Content Policy
TrustRank algorithm explained

This World Wide Web–related article is a stub. You can help Wikipedia by expanding it.

[1] Gyöngyi, Zoltán (2004). "Combating Web Spam with TrustRank" (PDF). Proceedings of the International Conference on Very Large Data Bases. 30: 576. Retrieved 2007-10-26. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)

[1]

@@ Line 38: / Line 38: @@
 * [http://www.vldb.org/conf/2004/RS15P3.PDF Z. Gyöngyi, H. Garcia-Molina, J. Pedersen: ''Combating Web Spam with TrustRank'']
 * [http://appft1.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2=HITOFF&d=PG01&p=1&u=%2Fnetahtml%2FPTO%2Fsrchnum.html&r=1&f=G&l=50&s1=%2220060095416%22.PGNR.&OS=DN/20060095416&RS=DN/20060095416 Link-based spam detection] Yahoo! assigned patent application using Trustrank
+* [http://knol.google.com/k/content-policy# Content Policy]
 * [http://pagerank.suchmaschinen-doktor.de/trustrank.html TrustRank algorithm explained]

Revision as of 13:17, 20 April 2011

See also

References

External links