Scale-free network

A scale-free network is a network whose degree distribution follows a power law, at least asymptotically. That is, the fraction P(k) of nodes in the network having k connections to other nodes goes for large values of k as

$P(k) \ \sim \ k^\boldsymbol{-\gamma}$

where $\gamma$ is a parameter whose value is typically in the range 2 < $\gamma$ < 3, although occasionally it may lie outside these bounds.[1][2]

Many networks have been reported to be scale-free, although statistical analysis has refuted many of these claims and seriously questioned others.[3] Preferential attachment and the fitness model have been proposed as mechanisms to explain conjectured power law degree distributions in real networks.

History

In studies of the networks of citations between scientific papers, Derek de Solla Price showed in 1965 that the number of links to papers—i.e., the number of citations they receive—had a heavy-tailed distribution following a Pareto distribution or power law, and thus that the citation network is scale-free. He did not however use the term "scale-free network", which was not coined until some decades later. In a later paper in 1976, Price also proposed a mechanism to explain the occurrence of power laws in citation networks, which he called "cumulative advantage" but which is today more commonly known under the name preferential attachment.

Recent interest in scale-free networks started in 1999 with work by Albert-László Barabási and colleagues at the University of Notre Dame who mapped the topology of a portion of the World Wide Web,[4] finding that some nodes, which they called "hubs", had many more connections than others and that the network as a whole had a power-law distribution of the number of links connecting to a node. After finding that a few other networks, including some social and biological networks, also had heavy-tailed degree distributions, Barabási and collaborators coined the term "scale-free network" to describe the class of networks that exhibit a power-law degree distribution. Amaral et al. showed that most of the real-world networks can be classified into two large categories according to the decay of degree distribution P(k) for large k.

Barabási and Albert proposed a generative mechanism to explain the appearance of power-law distributions, which they called "preferential attachment" and which is essentially the same as that proposed by Price. Analytic solutions for this mechanism (also similar to the solution of Price) were presented in 2000 by Dorogovtsev, Mendes and Samukhin [5] and independently by Krapivsky, Redner, and Leyvraz, and later rigorously proved by mathematician Béla Bollobás.[6] Notably, however, this mechanism only produces a specific subset of networks in the scale-free class, and many alternative mechanisms have been discovered since.[7]

The history of scale-free networks also includes some disagreement. On an empirical level, the scale-free nature of several networks has been called into question. For instance, the three brothers Faloutsos believed that the Internet had a power law degree distribution on the basis of traceroute data; however, it has been suggested that this is a layer 3 illusion created by routers, which appear as high-degree nodes while concealing the internal layer 2 structure of the ASes they interconnect. [8] On a theoretical level, refinements to the abstract definition of scale-free have been proposed. For example, Li et al. (2005) recently offered a potentially more precise "scale-free metric". Briefly, let G be a graph with edge set E, and denote the degree of a vertex $v$ (that is, the number of edges incident to $v$) by $\deg(v)$. Define

$s(G) = \sum_{(u,v) \in E} \deg(u) \cdot \deg(v).$

This is maximized when high-degree nodes are connected to other high-degree nodes. Now define

$S(G) = \frac{s(G)}{s_\max},$

where smax is the maximum value of s(H) for H in the set of all graphs with degree distribution identical to G. This gives a metric between 0 and 1, where a graph G with small S(G) is "scale-rich", and a graph G with S(G) close to 1 is "scale-free". This definition captures the notion of self-similarity implied in the name "scale-free".

Characteristics

Random network (a) and scale-free network (b). In the scale-free network, the larger hubs are highlighted.
Complex network degree distribution of random and scale-free

The most notable characteristic in a scale-free network is the relative commonness of vertices with a degree that greatly exceeds the average. The highest-degree nodes are often called "hubs", and are thought to serve specific purposes in their networks, although this depends greatly on the domain.

The scale-free property strongly correlates with the network's robustness to failure. It turns out that the major hubs are closely followed by smaller ones. These smaller hubs, in turn, are followed by other nodes with an even smaller degree and so on. This hierarchy allows for a fault tolerant behavior. If failures occur at random and the vast majority of nodes are those with small degree, the likelihood that a hub would be affected is almost negligible. Even if a hub-failure occurs, the network will generally not lose its connectedness, due to the remaining hubs. On the other hand, if we choose a few major hubs and take them out of the network, the network is turned into a set of rather isolated graphs. Thus, hubs are both a strength and a weakness of scale-free networks. These properties have been studied analytically using percolation theory by Cohen et al.[9][10] and by Callaway et al.[11]

Another important characteristic of scale-free networks is the clustering coefficient distribution, which decreases as the node degree increases. This distribution also follows a power law. This implies that the low-degree nodes belong to very dense sub-graphs and those sub-graphs are connected to each other through hubs. Consider a social network in which nodes are people and links are acquaintance relationships between people. It is easy to see that people tend to form communities, i.e., small groups in which everyone knows everyone (one can think of such community as a complete graph). In addition, the members of a community also have a few acquaintance relationships to people outside that community. Some people, however, are connected to a large number of communities (e.g., celebrities, politicians). Those people may be considered the hubs responsible for the small-world phenomenon.

At present, the more specific characteristics of scale-free networks vary with the generative mechanism used to create them. For instance, networks generated by preferential attachment typically place the high-degree vertices in the middle of the network, connecting them together to form a core, with progressively lower-degree nodes making up the regions between the core and the periphery. The random removal of even a large fraction of vertices impacts the overall connectedness of the network very little, suggesting that such topologies could be useful for security, while targeted attacks destroys the connectedness very quickly. Other scale-free networks, which place the high-degree vertices at the periphery, do not exhibit these properties. Similarly, the clustering coefficient of scale-free networks can vary significantly depending on other topological details.

A final characteristic concerns the average distance between two vertices in a network. As with most disordered networks, such as the small world network model, this distance is very small relative to a highly ordered network such as a lattice graph. Notably, an uncorrelated power-law graph having 2 < γ < 3 will have ultrasmall diameter d ~ ln ln N where N is the number of nodes in the network, as proved by Cohen and Havlin. The diameter of a growing scale-free network might be considered almost constant in practice.

Examples

Although many real-world networks are thought to be scale-free, the evidence often remains inconclusive, primarily due to the developing awareness of more rigorous data analysis techniques.[3] As such, the scale-free nature of many networks is still being debated by the scientific community. A few examples of networks claimed to be scale-free include:

Scale free topology has been also found in high temperature superconductors.[15] The qualities of a high-temperature superconductor — a compound in which electrons obey the laws of quantum physics, and flow in perfect synchrony, without friction — appear linked to the fractal arrangements of seemingly random oxygen atoms and lattice distortion.[16]

Generative models

These scale-free networks do not arise by chance alone. Erdős and Rényi (1960) studied a model of growth for graphs in which, at each step, two nodes are chosen uniformly at random and a link is inserted between them. The properties of these random graphs are different from the properties found in scale-free networks, and therefore a model for this growth process is needed.

The mostly widely known generative model for a subset of scale-free networks is Barabási and Albert's (1999) rich get richer generative model in which each new Web page creates links to existing Web pages with a probability distribution which is not uniform, but proportional to the current in-degree of Web pages. This model was originally discovered by Derek J. de Solla Price in 1965 under the term cumulative advantage, but did not reach popularity until Barabási rediscovered the results under its current name (BA Model). According to this process, a page with many in-links will attract more in-links than a regular page. This generates a power-law but the resulting graph differs from the actual Web graph in other properties such as the presence of small tightly connected communities. More general models and networks characteristics have been proposed and studied (for a review see the book by Dorogovtsev and Mendes).

A somewhat different generative model for Web links has been suggested by Pennock et al. (2002). They examined communities with interests in a specific topic such as the home pages of universities, public companies, newspapers or scientists, and discarded the major hubs of the Web. In this case, the distribution of links was no longer a power law but resembled a normal distribution. Based on these observations, the authors proposed a generative model that mixes preferential attachment with a baseline probability of gaining a link.

Another generative model is the copy model studied by Kumar et al. (2000), in which new nodes choose an existent node at random and copy a fraction of the links of the existent node. This also generates a power law.

Interestingly, the growth of the networks (adding new nodes) is not a necessary condition for creating a scale-free network. Dangalchev[17] (2004) gives examples of generating static scale-free networks. Another possibility (Caldarelli et al. 2002) is to consider the structure as static and draw a link between vertices according to a particular property of the two vertices involved. Once specified the statistical distribution for these vertices properties (fitnesses), it turns out that in some circumstances also static networks develop scale-free properties.

Kasthurirathna and Piraveenan [18] have shown that in socio-ecological systems, the drive towards improved rationality on average might be an evolutionary reason for the emergence of scale-free properties. They did this by simulating a number of strategic games on an initially random network with distributed bounded rationality, then re-wiring the network so that the network on average converged towards Nash equilibria, despite the bounded rationality of nodes. They observed that this re-wiring process results in scale-free networks.

Generalized scale-free model

There has been a burst of activity in the modeling of scale-free complex networks. The recipe of Barabási and Albert[19] has been followed by several variations and generalizations[20][21][22][23] and the revamping of previous mathematical works.[24] As long as there is a power law distribution in a model, it is a scale-free network, and a model of that network is a scale-free model.

Features

Many real networks are scale-free networks, which require scale-free models to describe them. There are two ingredients needed to build up a scale-free model:

1. Adding or removing nodes. Usually we concentrate on growing the network, i.e. adding nodes.

2. Preferential attachment: The probability $\Pi$ that new nodes will be connected to the "old" node.

Note that Fitness models (see below) could work also statically, without changing the number of nodes

Examples

There have been several attempts to generate scale-free network properties. Here are some examples:

The Barabási–Albert model

For example, the first scale-free model, the Barabási–Albert model, has a linear preferential attachment $\Pi(k_i)=\frac{k_i}{\sum_j k_j}$ and adds one new node at every time step.

(Note, another general feature of $\Pi(k)$ in real networks is that $\Pi(0)\neq 0$, i.e. there is a nonzero probability that a new node attaches to an isolated node. Thus in general $\Pi(k)$ has the form $\Pi(k)=A +k^\alpha$, where $A$ is the initial attractiveness of the node.)

Two-level network model

Dangalchev[17] builds a 2-L model by adding a second-order preferential attachment. The attractiveness of a node in the 2-L model depends not only on the number of nodes linked to it but also on the number of links in each of these nodes.

$\Pi(k_i)=\frac{k_i + C \sum_{(i,j)} k_j}{\sum_j k_j + C \sum_j k_j^2}$, where C is a coefficient between 0 and 1.

Non-linear preferential attachment

The Barabási–Albert model assumes that the probability $\Pi(k)$ that a node attaches to node $i$ is proportional to the degree $k$ of node $i$. This assumption involves two hypotheses: first, that $\Pi(k)$ depends on $k$, in contrast to random graphs in which $\Pi(k) = p$, and second, that the functional form of $\Pi(k)$ is linear in $k$. The precise form of $\Pi(k)$ is not necessarily linear, and recent studies have demonstrated that the degree distribution depends strongly on $\Pi(k)$

Krapivsky, Redner, and Leyvraz[22] demonstrate that the scale-free nature of the network is destroyed for nonlinear preferential attachment. The only case in which the topology of the network is scale free is that in which the preferential attachment is asymptotically linear, i.e. $\Pi(k_i) \sim a_\infty k_i$ as $k_i \to \infty$. In this case the rate equation leads to

$P(k) \sim k^{-\gamma}\text{ with }\gamma = 1 + \frac{\mu}{a_\infty}.$

This way the exponent of the degree distribution can be tuned to any value between 2 and $\infty$.

Hierarchical network model

There is another kind of scale-free model, which grows according to some patterns, such as the hierarchical network model.[25]

The iterative construction leading to a hierarchical network. Starting from a fully connected cluster of five nodes, we create four identical replicas connecting the peripheral nodes of each cluster to the central node of the original cluster. From this, we get a network of 25 nodes (N = 25). Repeating the same process, we can create four more replicas of the original cluster - the four peripheral nodes of each one connect to the central node of the nodes created in the first step. This gives N = 125, and the process can continue indefinitely.

Fitness model

The idea is that the link between two vertices is assigned not randomly with a probability p equal for all the couple of vertices. Rather, for every vertex j there is an intrinsic fitness xj and a link between vertex i and j is created with a probability $p(x_i,x_j)$ .[26] Note that the model is both In the case of World Trade Web it is possible to reconstruct all the properties by using as fitnesses of the country their GDP, and taking

$p(x_i,x_j)=\frac {\delta x_ix_j}{1+\delta x_ix_j}$.[27]

Hyperbolic geometric graphs

Assuming that a network has an underlying hyperbolic geometry, one can use the framework of spatial networks to generate scale-free degree distributions. This heterogeneous degree distribution then simply reflects the negative curvature and metric properties of the underlying hyperbolic geometry.[28]

Scale-free ideal network

In the context of network theory a scale-free ideal network is a random network with a degree distribution following the scale-free ideal gas density distribution. These networks are able to reproduce city-size distributions and electoral results by unraveling the size distribution of social groups with information theory on complex networks when a competitive cluster growth process is applied to the network.[29][30] In models of scale-free ideal networks it is possible to demonstrate that Dunbar's number is the cause of the phenomenon known as the 'six degrees of separation' .

Notes

1. ^ Onnela, J. -P.; Saramaki, J.; Hyvonen, J.; Szabo, G.; Lazer, D.; Kaski, K.; Kertesz, J.; Barabasi, A. -L. (2007). "Structure and tie strengths in mobile communication networks". Proceedings of the National Academy of Sciences 104 (18): 7332–7336. arXiv:physics/0610104. Bibcode:2007PNAS..104.7332O. doi:10.1073/pnas.0610245104. PMC 1863470. PMID 17456605.
2. ^ Choromański, K.; Matuszak, M.; MiȩKisz, J. (2013). "Scale-Free Graph with Preferential Attachment and Evolving Internal Vertex Structure". Journal of Statistical Physics 151 (6): 1175. Bibcode:2013JSP...151.1175C. doi:10.1007/s10955-013-0749-1.
3. ^ a b Clauset, Aaron; Cosma Rohilla Shalizi; M. E. J Newman (2007-06-07). "Power-law distributions in empirical data". 0706.1062. arXiv:0706.1062. Bibcode:2009SIAMR..51..661C. doi:10.1137/070710111.
4. ^ Barabási, Albert-László; Albert, Réka. (October 15, 1999). "Emergence of scaling in random networks". Science 286 (5439): 509–512. arXiv:cond-mat/9910332. Bibcode:1999Sci...286..509B. doi:10.1126/science.286.5439.509. MR 2091634. PMID 10521342.
5. ^ Dorogovtsev, S.; Mendes, J.; Samukhin, A. (2000). "Structure of Growing Networks with Preferential Linking". Physical Review Letters 85 (21): 4633–4636. arXiv:cond-mat/0004434. Bibcode:2000PhRvL..85.4633D. doi:10.1103/PhysRevLett.85.4633. PMID 11082614.
6. ^ Bollobás, B.; Riordan, O.; Spencer, J.; Tusn�Dy, G. (2001). "The degree sequence of a scale-free random graph process". Random Structures and Algorithms 18 (3): 279–290. doi:10.1002/rsa.1009. MR 1824277.
7. ^ Dorogovtsev, S. N.; Mendes, J. F. F. (2002). "Evolution of networks". Advances in Physics 51 (4): 1079. doi:10.1080/00018730110112519.
8. ^ Willinger, Walter; David Alderson; John C. Doyle (May 2009). "Mathematics and the Internet: A Source of Enormous Confusion and Great Potential" (PDF). Notices of the AMS (American Mathematical Society) 56 (5): 586–599. Retrieved 2011-02-03.
9. ^ Cohen, Reoven; Erez, K.; ben-Avraham, D.; Havlin, S. (2000). "Resilience of the Internet to Random Breakdowns". Phys. Rev. Lett. 85: 4626–8. arXiv:cond-mat/0007048. Bibcode:2000PhRvL..85.4626C. doi:10.1103/PhysRevLett.85.4626.
10. ^ Cohen, Reoven; Erez, K.; ben-Avraham, D.; Havlin, S. (2001). "Breakdown of the Internet under Intentional Attack". Phys. Rev. Lett. 86: 3682–5. arXiv:cond-mat/0010251. Bibcode:2001PhRvL..86.3682C. doi:10.1103/PhysRevLett.86.3682. PMID 11328053.
11. ^ Callaway, Duncan S.; Newman, M. E. J.; Strogatz, S. H.; Watts, D. J. (2000). "Network Robustness and Fragility: Percolation on Random Graphs". Phys. Rev. Lett. 85: 5468–71. arXiv:cond-mat/0007300. Bibcode:2000PhRvL..85.5468C. doi:10.1103/PhysRevLett.85.5468.
12. ^ De Masi, Giulia; et. al (2006). "Fitness model for the Italian interbank money market". Physical Review E 74: 066112. doi:10.1103/PhysRevE.74.066112.
13. ^ Soramäki, Kimmo; et. al (2007). "The topology of interbank payment flows". Physica A: Statistical Mechanics and its Applications 379 (1): 317–333. Bibcode:2007PhyA..379..317S. doi:10.1016/j.physa.2006.11.093.
14. ^ Steyvers, Mark; Joshua B. Tenenbaum (2005). "The Large-Scale Structure of Semantic Networks: Statistical Analyses and a Model of Semantic Growth". Cognitive Science 29 (1): 41–78. doi:10.1207/s15516709cog2901_3.
15. ^ Fratini, Michela, Poccia, Nicola, Ricci, Alessandro, Campi, Gaetano, Burghammer, Manfred, Aeppli, Gabriel Bianconi, Antonio (2010). "Scale-free structural organization of oxygen interstitials in La2CuO4+y". Nature 466 (7308): 841–4. arXiv:1008.2015. Bibcode:2010Natur.466..841F. doi:10.1038/nature09260. PMID 20703301.
16. ^ Poccia, Nicola, Ricci, Alessandro, Campi, Gaetano, Fratini, Michela, Puri, Alessandro, Di Gioacchino, Daniele, Marcelli, Augusto, Reynolds, Michael, Burghammer, Manfred, Saini, Naurang L., Aeppli, Gabriel Bianconi, Antonio, (2012). "Optimum inhomogeneity of local lattice distortions in La2CuO4+y". Proc. Natl. Acad. Sci. U.S.A. 109 (39): 15685–15690. arXiv:1208.0101. doi:10.1073/pnas.1208492109.
17. ^ a b Dangalchev Ch., Generation models for scale-free networks, Physica A 338, 659 (2004).
18. ^ Kasthurirathna, Dharshana; Piraveenan, Mahendra. (2015). "Emergence of scale-free characteristics in socioecological systems with bounded rationality". Scientific Reports. In Press.
19. ^ Barabási, A.-L. and R. Albert, Science 286, 509 (1999).
20. ^ R. Albert, and A.L. Barabási, Phys. Rev. Lett. 85, 5234(2000).
21. ^ S. N. Dorogovtsev, J. F. F. Mendes, and A. N. Samukhim, cond-mat/0011115.
22. ^ a b P.L. Krapivsky, S. Redner, and F. Leyvraz, Phys. Rev. Lett. 85, 4629 (2000).
23. ^ B. Tadic, Physica A 293, 273(2001).
24. ^ S. Bomholdt and H. Ebel, cond-mat/0008465; H.A. Simon, Bimetrika 42, 425(1955).
25. ^ E. Ravasz and Barabási Phys. Rev. E 67, 026112 (2003).
26. ^ G. Caldarelli et al. Phys. Rev. Lett. 89, 258702 (2002).
27. ^ D. Garlaschelli et al. Phys. Rev. Lett. 93 , 188701 (2004).
28. ^ Krioukov, Dmitri; Papadopoulos, Fragkiskos; Kitsak, Maksim; Vahdat, Amin; Boguñá, Marián. "Hyperbolic geometry of complex networks". Physical Review E 82 (3). doi:10.1103/PhysRevE.82.036106.
29. ^ A. Hernando, D. Villuendas, C. Vesperinas, M. Abad, A. Plastino (2009). "Unravelling the size distribution of social groups with information theory on complex networks". arXiv:0905.3704 [physics.soc-ph]., submitted to European Physics Journal B
30. ^ André A. Moreira, Demétrius R. Paula, Raimundo N. Costa Filho, José S. Andrade, Jr. (2006). "Competitive cluster growth in complex networks". arXiv:cond-mat/0603272 [cond-mat.dis-nn].