Degree-preserving randomization: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
DGaffney (talk | contribs)
No edit summary
DGaffney (talk | contribs)
No edit summary
Line 18: Line 18:


Additionally, the method is similar in nature to the broadly used [[Exponential Random Graph models]] popularized in social science<ref>{{cite |last1= Snijders |first1= Tom AB. |title= Markov chain Monte Carlo estimation of exponential random graph models |journal= Journal of Social Structure 3.2 |year= 2002 |url= http://www.researchgate.net/publication/2544726_Markov_Chain_Monte_Carlo_Estimation_of_Exponential_Random_Graph_Models/file/5046351cb0c36a4796.pdf}}</ref> <ref>{{cite |last1= Robins |first1= Garry |last2= Patterson |first2= Pip |last3= Kalish |first3= Yuval |last4= Lusher |first4= Dean |title= An introduction to exponential random graph models for social networks |journal= Social networks 29.2 |year= 2007 |url= http://www.sciencedirect.com/science/article/pii/S0378873306000372}}</ref>, and indeed the various forms of modeling networks against observed networks in order to identify and theorize about the differences expressed in real networks. Importantly, Degree Preserving Randomization provides a simple algorithmic design for those familiar with programming to apply a model to an available observed network.
Additionally, the method is similar in nature to the broadly used [[Exponential Random Graph models]] popularized in social science<ref>{{cite |last1= Snijders |first1= Tom AB. |title= Markov chain Monte Carlo estimation of exponential random graph models |journal= Journal of Social Structure 3.2 |year= 2002 |url= http://www.researchgate.net/publication/2544726_Markov_Chain_Monte_Carlo_Estimation_of_Exponential_Random_Graph_Models/file/5046351cb0c36a4796.pdf}}</ref> <ref>{{cite |last1= Robins |first1= Garry |last2= Patterson |first2= Pip |last3= Kalish |first3= Yuval |last4= Lusher |first4= Dean |title= An introduction to exponential random graph models for social networks |journal= Social networks 29.2 |year= 2007 |url= http://www.sciencedirect.com/science/article/pii/S0378873306000372}}</ref>, and indeed the various forms of modeling networks against observed networks in order to identify and theorize about the differences expressed in real networks. Importantly, Degree Preserving Randomization provides a simple algorithmic design for those familiar with programming to apply a model to an available observed network.

==Example==

Revision as of 06:44, 6 November 2014

Degree Preserving Randomization is a technique used in Network Science that aims to assess whether or not variations observed in a given graph could simply be an artifact of the graph's inherent structural properties rather than properties unique to the nodes, in an observed network.

Background

Cataloged as early as 1996 [1], the most simple implementation of degree preserving randomization relies on a Monte Carlo algorithm that rearranges, or "rewires" the network at random such that, with a sufficient number of rewires, the network's degree distribution is identical to the initial degree distribution of the network, though the topological structure of the network has become completely distinct from the original network.

File:Rewire.png
A demonstration of a single iteration of the Degree Preserving Randomization algorithm.

Degree preserving randomization, while it has many different forms, typically takes on the form of a relatively simple approach: for any network consisting of nodes with edges, select two dyadically tied nodes. For each of these dyadic pairs, switch the edges such that the new dyadic pairs are mismatched. After a sufficient number of these mismatches, the network increasingly loses its original observed topography.

As is common with algorithms based on Markov Chains, the number of iterations, or individual rewires, must occur on a given graph such that the graph is sufficiently random and distinct from the original graph is unknown, though Espinoza[2] asserts that a safe minimum threshold is , where "is at least 100" (Espinoza). Others have provided input for this issue, including one author who states that a safe minimum may instead be at least , where epsilon lies in a range between and , though ultimately the correct number is not presently known [3] [4].

Uses

There are several cases in which published research have explicitly employed degree preserving randomization in order to analyze network properties. Dekker[5] used rewiring in order to more accurately model observed social networks by adding a secondary variable, , which introduces a high-degree attachment bias. Liu et al. [6] have additionally employed degree preserving randomization to assert that the Control Centrality, a metric they identify, alters little when compared to the Control Centrality of an Erdős–Rényi model containing the same number of nodes in their simulations.

Additionally, some work has been done in investigating how Degree Preserving Randomization may be used in addressing considerations of anonymity in networked data research, which has been shown to be a cause for concern in Social Network Analysis, as in the case of a study by Lewis et al[7] [8]. Ultimately the work conducted by Ying and Wu, starting from a foundation of Degree Preserving Randomization, and then forwarding several modifications, has showed moderate advances in protecting anonymity without compromising the integrity of the underlying utility of the observed network[9].

Additionally, the method is similar in nature to the broadly used Exponential Random Graph models popularized in social science[10] [11], and indeed the various forms of modeling networks against observed networks in order to identify and theorize about the differences expressed in real networks. Importantly, Degree Preserving Randomization provides a simple algorithmic design for those familiar with programming to apply a model to an available observed network.

  1. ^ Rao, A Ramachandra; Jana, Rabindranath; Bandyopadhyay, Suraj (1996). "A Markov chain Monte Carlo method for generating random (0, 1)-matrices with given marginals" (PDF). Indian Journal of Statistics Series A. Retrieved November 5, 2014.
  2. ^ Espinoza, Max. "On Network Randomization Methods: A Negative Control Study" (PDF). {{cite journal}}: Cite journal requires |journal= (help)
  3. ^ Re: [igraph] Degree-preserving rewiring of a large graph
  4. ^ Pinar, Ali; Ray, Jaideep; Seshadri, S., Are we there yet? When to stop a Markov chain while generating random graphs (PDF)
  5. ^ Dekker, A.H. (2007), "Realistic Social Networks for Simulation using Network Rewiring" (PDF), Proceedings MODSIM 2007 {{citation}}: line feed character in |journal= at position 20 (help)
  6. ^ Liu, Y-Y.; Slotine, J-J; Barabási, A-L (2012), "Control Centrality and Hierarchical Structure in Complex Networks", PLoS ONE 7(9), doi:10.1371/journal.pone.0044459{{citation}}: CS1 maint: unflagged free DOI (link)
  7. ^ Parry, Marc (July 10, 2011), Harvard Researchers Accused of Breaching Students' Privacy, retrieved November 5, 2014
  8. ^ Lewis, Kevin; Kaufman, Jason; Gonzalez, Marco; Wimmer, Andreas; Christakis, Nicholas (2008), "Tastes, ties, and time: A new social network dataset using Facebook.com" (PDF), Social Networks 30
  9. ^ Ying, Xiaowei; Wu, Xintao (2008), "Randomizing Social Networks: a Spectrum Preserving Approach", SDM
  10. ^ Snijders, Tom AB. (2002), "Markov chain Monte Carlo estimation of exponential random graph models" (PDF), Journal of Social Structure 3.2
  11. ^ Robins, Garry; Patterson, Pip; Kalish, Yuval; Lusher, Dean (2007), "An introduction to exponential random graph models for social networks", Social networks 29.2