Hub (network science)
In network science, a hub is a node with a number of links that greatly exceeds the average. Emergence of hubs is a consequence of a scale-free property of networks. While hubs cannot be observed in a random network, they are expected to emerge in scale-free networks. The uprise of hubs in scale-free networks is associated with power-law distribution. Hubs have a significant impact on the network topology. Hubs can be found in many real networks, such as Brain Network or Internet.
A hub is a component of a network with a high-degree node. Hubs have a significantly larger number of links in comparison with other nodes in the network. The number of links (degrees) for a hub in a scale-free network is much higher than for the biggest node in a random network, keeping the size N of the network and average degree <k> constant. The existence of hubs is the biggest difference between random networks and scale-free networks. In random networks, the degree k is comparable for every node; it is therefore not possible for hubs to emerge. In scale-free networks, a few nodes (hubs) have a high degree k while the other nodes have a small number of links.
Emergence of hubs can be explained by the difference between scale-free networks and random networks. Scale-free networks (Barabási–Albert model) are different from random networks (Erdős–Rényi model) in two aspects: (a) growth, (b) preferential attachment.
- (a) Scale-free networks assume a continuous growth of the number of nodes N, compared to random networks which assume a fixed number of nodes. In scale-free networks the degree of the largest hub rises polynomially with the size of the network. Therefore, the degree of a hub can be high in a scale-free network. In random networks the degree of the largest node rises logaritmically (or slower) with N, thus the hub number will be small even in a very large network.
- (b) A new node in a scale-free network has a tendency to link to a node with a higher degree, compared to a new node in a random network which links itself to a random node. This process is called preferential attachment. The tendency of a new node to link to a node with a high degree k is characterized by power-law distribution (also known as rich-gets-richer process). This idea was introduced by Vilfredo Pareto and it explained why a small percentage of the population earns most of the money. This process is present in networks as well, for example 80 percent of web links point to 15 percent of webpages. The emergence of scale-free networks is not typical only of networks created by human action, but also of such networks as metabolic networks or illness networks. This phenomenon may be explained by the example of hubs on the World Wide Web such as Facebook or Google. These webpages are very well known and therefore the tendency of other webpages pointing to them is much higher than linking to random small webpages.
The mathematical explanation for Barabási–Albert model:
The network begins with an initial connected network of nodes.
New nodes are added to the network one at a time. Each new node is connected to existing nodes with a probability that is proportional to the number of links that the existing nodes already have. Formally, the probability that the new node is connected to node is
where is the degree of the node and the sum is taken over all pre-existing nodes (i.e. the denominator results in twice the current number of edges in the network).
Emergence of hubs in networks is also related to time. In scale-free networks, nodes which emerged earlier have a higher chance of becoming a hub than latecomers. This phenomenon is called first-mover advantage and it explains why some nodes become hubs and some do not. However, in a real network, the time of emergence is not the only factor that influences the size of the hub. For example, Facebook emerged 8 years later after Google became the largest hub on the World Wide Web and yet in 2011 Facebook became the largest hub of WWW. Therefore, in real networks the growth and the size of a hub depends also on various attributes such as popularity, quality or the aging of a node.
There are several attributes of Hubs in a scale-free networks
Shortening the path lengths in a network
The more observable hubs are in a network, the more they shrink a distances between nodes. In a scale-free networks hubs serve as bridges between the small degree nodes. Since the distance of two random nodes in a scale-free networks is small, we refer to scale-free networks as "small" or "ultra small". While a difference between path distance in a various small networks may not be noticeable, the difference in a path distance between large random network and scale-free network is remarkable.
Average path length in scale-free networks:
Aging of hubs (nodes)
The phenomenon present in a real networks, when older hubs are shadowed in a network. This phenomenon is responsible for changes in evolution and topology of networks. The example of aging phenomenon may be the case of Facebook overtaking the position of the largest hub on the Web where Google was the largest node since 2000.
Robustness and Attack Tolerance
During the random failure of nodes or targeted attack hubs are key components of the network. During the random failure of nodes in network hubs are responsible for exceptional robustness of network. The chance that a random failure would delete the hub is very small, because hubs coexists with a large number of small degree nodes. The removal of small degree nodes does not have a large effect on integrity of network. Even though the random removal would hit the hub, the chance of fragmantation of network is very small because the remaining hubs would hold the network together. In this case, hubs are the strength of a scale-free networks.
During the targeted attack on hubs, the integrity of network would fall apart relatively fast. Since small nodes are predominantly linked to hubs the targeted attack on the largest hubs would result in destruction of network in a short period of time. The financial market meltdown in 2008 is an example of such a network failure, when bankrupt of the largest players (hubs) led to a continuous breakdown of the whole system. On the other hand, it may has a possitive effect when removing hubs in a terrorist network may destroy the whole terrorist group. The attack tolerance of network may be increased by connecting its peripheral nodes, however it requires to double the number of links.
The perfect degree correlation means that each degree-k node is connected only to the same degree-k nodes. Such connectivity of nodes decide the topology of networks, which has an effect on robustness of network, the attribute discussed above. If the number of links between the hubs is the same as would be expected by chance, we refer to this network as Neutral Network. If hubs tend to connected to each other while avoiding linking to small-degree nodes we refer to this network as Assortative Network. This network is relatively resistant against attacks, because hubs form a core group, which is more reduntant against hub removal. If hubs avoid connecting to each other while linking to small-degree nodes, we refer to this network as Disassortative Network. This network has a hub-and-spoke character. Therefore, if we remove the hub in this type of network, it may damage or destroy the whole network.
The hubs are also responsible for effective spreading of material on network. In an analysis of disease spreading or information flow, hubs are referred to as super-spreaders. Super-spreaders may have a positive impact, such as effective information flow, but also devastating in a case of epidemic spreading such as H1N1 or AIDS. The mathematical models such as model of H1H1 Epidemic prediction  may allow us to predict the spread of diseases based on human mobility networks, infectiousness, or social interactions among humans. Hubs are also important in the eradication of disease. In a scale-free network hubs are most likely to be infected, because of the large number of connections they have. After the hub is infected, it broadcasts the disease to the nodes it is linked to. Therefore, the selective immunization of hubs may be the cost-effective strategy in eradication of spreading disease.
- Barabási, Albert-László. Network Science: Graph Theory., p. 27
- Albert, Réka; Barabási, Albert-László (2002). "Statistical mechanics of complex networks" (PDF). Reviews of Modern Physics. 74: 47–97. arXiv: . Bibcode:2002RvMP...74...47A. doi:10.1103/RevModPhys.74.47.
- Barabási, Albert-László. Network Science: The Scale-Free Property., p. 8.
- Barabási, Albert-László. Network Science: The Scale-Free Property., p. 23.
- Barabási, Albert-László. Network Science: Evolving Networks., p. 3.
- Cohen, Reoven; Erez, K.; ben-Avraham, D.; Havlin, S. (2000). "Resilience of the Internet to Random Breakdowns". Phys. Rev. Lett. 85: 4626–8. arXiv: . Bibcode:2000PhRvL..85.4626C. doi:10.1103/PhysRevLett.85.4626. PMID 11082612.
- S. V. Buldyrev; R. Parshani; G. Paul; H. E. Stanley; S. Havlin (2010). "Catastrophic cascade of failures in interdependent networks". Nature. 464 (7291): 1025–28. arXiv: . Bibcode:2010Natur.464.1025B. doi:10.1038/nature08932. PMID 20393559.
- Balcan, Duygu; Hu, Hao; Goncalves, Bruno; Bajardi, Paolo; Poletto, Chiara; Ramasco, Jose J.; Paolotti, Daniela; Perra, Nicola; Tizzoni, Michele; Van den Broeck, Wouter; Colizza, Vittoria; Vespignani, Alessandro (14 September 2009). "Seasonal transmission potential and activity peaks of the new influenza A(H1N1): a Monte Carlo likelihood analysis based on human mobility". BMC Medicine. 7 (45): 29. arXiv: . doi:10.1186/1741-7015-7-45. PMC . PMID 19744314.