Force-directed graph drawing
Force-based or force-directed algorithms are a class of algorithms for drawing graphs in an aesthetically pleasing way. Their purpose is to position the nodes of a graph in two-dimensional or three-dimensional space so that all the edges are of more or less equal length and there are as few crossing edges as possible. The idea originated in the 1980s with the spring-embedder models of Eades and of Kamada and Kawai; the term "force-directed" comes from Fruchterman and Reingold's 1990 University of Illinois technical report (UIUCDCS-R-90-1609).
Force-directed algorithms achieve this by assigning forces among the set of edges and the set of nodes; the most straightforward method is to assign forces as if the edges were springs (see Hooke's law) and the nodes were electrically charged particles (see Coulomb's law). The entire graph is then simulated as if it were a physical system. The forces are applied to the nodes, pulling them closer together or pushing them further apart. This is repeated iteratively until the system comes to an equilibrium state, i.e., until the nodes' relative positions no longer change from one iteration to the next. At that moment, the graph is drawn. Physically, this equilibrium state is one in which the net force on every node is zero.
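As an illustration of the spring-and-charge model described above, the two forces can be sketched in Python as follows. The function names, the constants `charge`, `stiffness` and `rest_length`, and the choice of an inverse-square repulsion are assumptions made for this sketch; published algorithms use a variety of force laws.

```python
import math

def coulomb_repulsion(p, q, charge=1.0):
    """Force on point p pushing it away from point q (inverse-square law)."""
    dx, dy = p[0] - q[0], p[1] - q[1]
    dist = math.hypot(dx, dy)
    if dist == 0.0:
        # Coincident nodes have no well-defined direction of repulsion.
        raise ValueError("nodes must not share the same position")
    magnitude = charge / dist ** 2
    return (magnitude * dx / dist, magnitude * dy / dist)

def hooke_attraction(p, q, stiffness=1.0, rest_length=1.0):
    """Force on p from a linear spring joining p and q."""
    dx, dy = q[0] - p[0], q[1] - p[1]
    dist = math.hypot(dx, dy)
    if dist == 0.0:
        # Direction is undefined here; this sketch simply applies no force.
        return (0.0, 0.0)
    # Positive when the spring is stretched (pulls p toward q),
    # negative when it is compressed (pushes p away from q).
    magnitude = stiffness * (dist - rest_length)
    return (magnitude * dx / dist, magnitude * dy / dist)
```

With these definitions, a spring longer than its rest length pulls its endpoints together, a shorter one pushes them apart, and every pair of nodes repels regardless of whether an edge joins them.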
An alternative model considers a spring-like force for every pair of nodes (i, j), where the ideal length of each spring is proportional to the graph-theoretic distance between nodes i and j. In this model, there is no need for a separate repulsive force. Note that minimizing the difference (usually the squared difference) between Euclidean and ideal distances between nodes is then equivalent to a metric multidimensional scaling problem. Stress majorization gives a very well-behaved (i.e., monotonically convergent) and mathematically elegant way to minimize these differences and, hence, find a good layout for the graph.
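The quantity minimized in this alternative model, usually called stress, can be sketched as below. The helper names `graph_distances` and `stress` are hypothetical, ideal distances are taken to be hop counts, and the weighting of each pair by the inverse square of its ideal distance is one common choice assumed here rather than the only possibility.

```python
import math
from collections import deque

def graph_distances(n, edges):
    """All-pairs shortest path lengths (in hops) via BFS from every node."""
    adj = [[] for _ in range(n)]
    for u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    dist = [[math.inf] * n for _ in range(n)]
    for s in range(n):
        dist[s][s] = 0
        queue = deque([s])
        while queue:
            u = queue.popleft()
            for v in adj[u]:
                if dist[s][v] == math.inf:
                    dist[s][v] = dist[s][u] + 1
                    queue.append(v)
    return dist

def stress(positions, dist):
    """Weighted squared difference between drawn and ideal distances."""
    total = 0.0
    n = len(positions)
    for i in range(n):
        for j in range(i + 1, n):
            ideal = dist[i][j]
            if ideal == math.inf:
                continue  # nodes in different components: no spring
            drawn = math.dist(positions[i], positions[j])
            total += (drawn - ideal) ** 2 / ideal ** 2
    return total
```

A layout that reproduces every graph-theoretic distance exactly has zero stress; in general no such layout exists in the plane, and the aim is to make the value as small as possible.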
A force-directed graph can involve forces other than mechanical springs and electrical repulsion; examples include logarithmic springs (as opposed to linear springs), gravitational forces (which aggregate connected components in graphs that are not singly connected), magnetic fields (so as to obtain good layouts for directed graphs), and electrically charged springs (in order to avoid overlap or near-overlap in the final drawing). In the case of spring-and-charged-particle graphs, the edges tend to have uniform length (because of the spring forces), and nodes that are not connected by an edge tend to be drawn further apart (because of the electrical repulsion).
While graph drawing is a difficult problem, force-directed algorithms, being physical simulations, usually require no special knowledge about graph theory such as planarity.
It is also possible to employ mechanisms that search more directly for energy minima, either instead of or in conjunction with physical simulation. Such mechanisms, which are examples of general global optimization methods, include simulated annealing and genetic algorithms.
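A minimal sketch of the simulated-annealing variant follows. The energy function (spring energy on edges plus an inverse-distance repulsion between all pairs), the Gaussian move, the cooling schedule and every parameter name are assumptions chosen for illustration, not a reproduction of any published method.

```python
import math
import random

def layout_energy(pos, edges, rest_length=1.0):
    """Spring energy on edges plus pairwise inverse-distance repulsion."""
    energy = 0.0
    for u, v in edges:
        energy += 0.5 * (math.dist(pos[u], pos[v]) - rest_length) ** 2
    n = len(pos)
    for i in range(n):
        for j in range(i + 1, n):
            # Clamp the distance so coincident nodes give a large finite energy.
            energy += 1.0 / max(math.dist(pos[i], pos[j]), 1e-9)
    return energy

def anneal(pos, edges, steps=5000, start_temp=1.0, cooling=0.999, seed=0):
    """Return the lowest-energy layout seen and its energy."""
    rng = random.Random(seed)
    current = [tuple(p) for p in pos]
    current_e = layout_energy(current, edges)
    best, best_e = list(current), current_e
    temp = start_temp
    for _ in range(steps):
        # Propose moving one randomly chosen node by a temperature-scaled step.
        i = rng.randrange(len(current))
        x, y = current[i]
        candidate = list(current)
        candidate[i] = (x + rng.gauss(0, temp), y + rng.gauss(0, temp))
        cand_e = layout_energy(candidate, edges)
        delta = cand_e - current_e
        # Metropolis rule: always accept downhill moves, sometimes uphill ones.
        if delta <= 0 or rng.random() < math.exp(-delta / temp):
            current, current_e = candidate, cand_e
            if current_e < best_e:
                best, best_e = list(current), current_e
        temp *= cooling
    return best, best_e
```

Because uphill moves are occasionally accepted while the temperature is high, the search can escape configurations in which a purely downhill simulation would remain stuck; recomputing the full energy at every step keeps the sketch short but is not efficient.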
Advantages
The following are among the most important advantages of force-directed algorithms:
- Good-quality results: At least for graphs of medium size (up to 50-100 vertices), the layouts obtained are usually very good with respect to the following criteria: uniform edge length, uniform vertex distribution, and display of symmetry. This last criterion is among the most important ones and is hard to achieve with any other type of algorithm.
- Flexibility: Force-directed algorithms can be easily adapted and extended to fulfill additional aesthetic criteria. This makes them the most versatile class of graph drawing algorithms. Examples of existing extensions include the ones for directed graphs, 3D graph drawing[1], cluster graph drawing, constrained graph drawing, and dynamic graph drawing.
- Intuitive: Since they are based on physical analogies of common objects, like springs, the behavior of the algorithms is relatively easy to predict and understand. This is not the case with other types of graph-drawing algorithms.
- Simplicity: Typical force-directed algorithms are simple and can be implemented in a few lines of code. Other classes of graph-drawing algorithms, like the ones for orthogonal layouts, are usually much more involved.
- Interactivity: Another advantage of this class of algorithm is the interactive aspect. By drawing the intermediate stages of the graph, the user can follow how the graph evolves, seeing it unfold from a tangled mess into a good-looking configuration. In some interactive graph drawing tools, the user can pull one or more nodes out of their equilibrium state and watch them migrate back into position. This makes them a preferred choice for dynamic and online graph-drawing systems.
- Strong theoretical foundations: While simple ad-hoc force-directed algorithms (such as the one given in pseudo-code in this article) often appear in the literature and in practice (because they are relatively easy to understand), more reasoned approaches are starting to gain traction. Statisticians have been solving similar problems in multidimensional scaling (MDS) since the 1930s, and physicists also have a long history of working with related n-body problems, so extremely mature approaches exist. As an example, the stress majorization approach to metric MDS can be applied to graph drawing as described above. This has been proven to converge monotonically.[2] Monotonic convergence, the property that the algorithm never increases the stress or cost of the layout from one iteration to the next, is important because it guarantees that the layout will eventually reach a local minimum and stop. Damping schedules such as the one used in the pseudo-code below cause the algorithm to stop, but cannot guarantee that a true local minimum is reached.
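The monotonic behaviour of stress majorization mentioned in the last item can be observed with a small sketch. It assumes the simplest setting, in which every pair of nodes has unit weight and a finite ideal distance, so that one majorization step reduces to the Guttman transform from the MDS literature; the function names and the example distance matrix are hypothetical.

```python
import math

def raw_stress(pos, d):
    """Unweighted stress: squared error between drawn and ideal distances."""
    n = len(pos)
    return sum((math.dist(pos[i], pos[j]) - d[i][j]) ** 2
               for i in range(n) for j in range(i + 1, n))

def majorization_step(pos, d):
    """One Guttman-transform update, valid when all pair weights equal 1."""
    n = len(pos)
    new_pos = []
    for i in range(n):
        sx = sy = 0.0
        for j in range(n):
            if i == j:
                continue
            dist = math.dist(pos[i], pos[j])
            if dist > 0.0:  # coincident pairs contribute nothing
                sx += d[i][j] * (pos[i][0] - pos[j][0]) / dist
                sy += d[i][j] * (pos[i][1] - pos[j][1]) / dist
        new_pos.append((sx / n, sy / n))
    return new_pos

# Ideal distances for a 4-cycle (hop counts), and an arbitrary start layout.
cycle_d = [[0, 1, 2, 1], [1, 0, 1, 2], [2, 1, 0, 1], [1, 2, 1, 0]]
layout = [(0.0, 0.0), (0.3, 0.1), (0.2, 0.9), (1.5, 0.4)]
history = [raw_stress(layout, cycle_d)]
for _ in range(30):
    layout = majorization_step(layout, cycle_d)
    history.append(raw_stress(layout, cycle_d))
```

The values in `history` never increase from one entry to the next, which is the property the text refers to. Handling non-uniform weights requires solving a linear system in each step and is outside the scope of this sketch.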
Disadvantages
The main disadvantages of force-directed algorithms include the following:
- High running time: Typical force-directed algorithms are generally considered to have a running time of O(n³), where n is the number of nodes of the input graph. This is because the number of iterations is estimated to be O(n), and in every iteration all pairs of nodes need to be visited and their mutual repulsive forces computed, which costs O(n²). This is related to the N-body problem in physics. However, since repulsive forces are local in nature, the graph can be partitioned such that only neighboring vertices are considered. Common techniques used by algorithms for determining the layout of large graphs include high-dimensional embedding,[3] multi-layer drawing and other methods related to N-body simulation. For example, the Barnes–Hut simulation-based method FADE[4] can improve the running time to O(n log n) per iteration. As a rough guide, in a few seconds one can expect to draw at most 1,000 nodes with a standard O(n²) per iteration technique, and 100,000 with an O(n log n) per iteration technique.[4]
- Poor local minima: Force-directed algorithms produce a layout whose total energy is a minimum, but in general only a local one. The local minimum found can be, in many cases, considerably worse than a global minimum, which translates into a low-quality drawing. For many algorithms, especially the ones that allow only down-hill moves of the vertices, the final result can be strongly influenced by the initial layout, which in most cases is randomly generated. The problem of poor local minima becomes more important as the number of vertices of the graph increases. Combining different algorithms helps to address this problem. For example, the Kamada-Kawai algorithm[5] can be used to quickly generate a reasonable initial layout, and the Fruchterman-Reingold algorithm[6] can then improve the placement of neighbouring nodes.
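The partitioning idea mentioned under running time can be illustrated with a uniform grid: if repulsion is ignored beyond some cutoff radius, only nodes in the same or adjacent grid cells need to be compared. The sketch below is an assumption-laden simplification (the function name `nearby_pairs` and the cutoff `radius` are hypothetical) and is much simpler than the Barnes–Hut scheme used by FADE.

```python
import math
from collections import defaultdict

def nearby_pairs(positions, radius):
    """Return index pairs (i, j), i < j, whose distance is below radius."""
    # Bucket every node into a square cell whose side equals the cutoff radius.
    grid = defaultdict(list)
    for idx, (x, y) in enumerate(positions):
        grid[(math.floor(x / radius), math.floor(y / radius))].append(idx)
    pairs = set()
    for (cx, cy), members in grid.items():
        # Two nodes closer than radius always lie in the same or adjacent cells.
        for dx in (-1, 0, 1):
            for dy in (-1, 0, 1):
                for j in grid.get((cx + dx, cy + dy), ()):
                    for i in members:
                        if i < j and math.dist(positions[i], positions[j]) < radius:
                            pairs.add((i, j))
    return pairs
```

When the nodes are spread roughly evenly, each cell holds only a few nodes, so the number of distance tests per iteration grows about linearly with the number of nodes instead of quadratically. The price is that long-range repulsion is dropped entirely.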
Pseudocode
Each node has a position (x, y), a velocity (dx, dy) and a mass m. There is usually a spring constant, s, and a damping factor with 0 < damping < 1. The attractive and repulsive forces on each node are calculated according to Hooke's law and Coulomb's law, or similar, as discussed above. The example extends directly to 3D by adding a z coordinate.
set up initial node velocities to (0,0)
set up initial node positions randomly // make sure no 2 nodes are in exactly the same position
loop
    total_kinetic_energy := 0 // running sum of total kinetic energy over all particles
    for each node
        net-force := (0, 0) // running sum of total force on this particular node

        for each other node
            net-force := net-force + Coulomb_repulsion( this_node, other_node )
        next node

        for each spring connected to this node
            net-force := net-force + Hooke_attraction( this_node, spring )
        next spring

        // without damping, it moves forever
        this_node.velocity := (this_node.velocity + timestep * net-force) * damping
        this_node.position := this_node.position + timestep * this_node.velocity
        total_kinetic_energy := total_kinetic_energy + this_node.mass * (this_node.velocity)^2
    next node
until total_kinetic_energy is less than some small number // the simulation has stopped moving
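One possible runnable rendering of the pseudocode above is sketched here in Python. It assumes unit mass for every node, an inverse-square repulsion, linear springs with a rest length, and specific default constants, none of which are fixed by the pseudocode; it also computes all forces before moving any node, a small departure from the loop as written.

```python
import math
import random

def force_directed_layout(n, edges, spring=1.0, rest_length=1.0, charge=1.0,
                          damping=0.85, timestep=0.05, tolerance=1e-4,
                          max_iterations=10000, seed=0):
    """Damped spring-and-charge simulation on nodes 0..n-1 (unit masses)."""
    rng = random.Random(seed)
    pos = [[rng.random(), rng.random()] for _ in range(n)]
    vel = [[0.0, 0.0] for _ in range(n)]
    neighbours = [[] for _ in range(n)]
    for u, v in edges:
        neighbours[u].append(v)
        neighbours[v].append(u)

    for _ in range(max_iterations):
        forces = []
        for i in range(n):
            fx = fy = 0.0
            for j in range(n):  # Coulomb-like repulsion from every other node
                if i == j:
                    continue
                dx, dy = pos[i][0] - pos[j][0], pos[i][1] - pos[j][1]
                dist = max(math.hypot(dx, dy), 1e-9)  # guard against overlap
                f = charge / dist ** 2
                fx += f * dx / dist
                fy += f * dy / dist
            for j in neighbours[i]:  # Hooke-like attraction along each edge
                dx, dy = pos[j][0] - pos[i][0], pos[j][1] - pos[i][1]
                dist = max(math.hypot(dx, dy), 1e-9)
                f = spring * (dist - rest_length)
                fx += f * dx / dist
                fy += f * dy / dist
            forces.append((fx, fy))

        kinetic = 0.0
        for i, (fx, fy) in enumerate(forces):
            # Without damping the system would keep oscillating.
            vel[i][0] = (vel[i][0] + timestep * fx) * damping
            vel[i][1] = (vel[i][1] + timestep * fy) * damping
            pos[i][0] += timestep * vel[i][0]
            pos[i][1] += timestep * vel[i][1]
            kinetic += vel[i][0] ** 2 + vel[i][1] ** 2  # mass taken as 1
        if kinetic < tolerance:
            break  # the simulation has effectively stopped moving
    return [tuple(p) for p in pos]

# Example: lay out a 4-cycle.
square_edges = [(0, 1), (1, 2), (2, 3), (3, 0)]
square_layout = force_directed_layout(4, square_edges)
```

The `max_iterations` cap is a safeguard added for the sketch so that the loop always terminates, even with constants for which the kinetic energy does not fall below the tolerance.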
See also
- Tulip, software that implements most of the force-directed layout algorithms (GEM, LGL, GRIP, FM³)
- Cytoscape, software for visualising biological networks. The base package includes force-directed layouts as one of the built-in methods.
- Gephi, an interactive visualization and exploration platform for all kinds of networks and complex systems, dynamic and hierarchical graphs.
References
Further reading
- di Battista, Giuseppe (1999). Graph Drawing: Algorithms for the Visualization of Graphs. Prentice Hall. ISBN 978-0-13-301615-4.
- Eades, Peter (1984). "A Heuristic for Graph Drawing". Congressus Numerantium. 42 (11): 149–160.
- Fruchterman, Thomas M. J.; Reingold, Edward M. (1991). "Graph Drawing by Force-Directed Placement". Software – Practice & Experience. 21 (11). Wiley: 1129–1164. doi:10.1002/spe.4380211102.
- Harel, David; Koren, Yehuda (2002). "Graph Drawing by High-Dimensional Embedding". Proceedings of the 9th International Symposium on Graph Drawing. pp. 207–219. ISBN 3-540-00158-1.
- Kamada, Tomihisa; Kawai, Satoru (1989). "An algorithm for drawing general undirected graphs". Information Processing Letters. 31 (1). Elsevier: 7–15. doi:10.1016/0020-0190(89)90102-6.
- Kaufmann, Michael; Wagner, Dorothea, eds. (2001). Drawing graphs: methods and models. Lecture Notes in Computer Science 2025. Springer. doi:10.1007/3-540-44969-8. ISBN 978-3-540-42062-0.
- de Leeuw, Jan (1988). "Convergence of the majorization method for multidimensional scaling". Journal of Classification. 5 (2). Springer: 163–180. doi:10.1007/BF01897162.
- Quigley, Aaron; Eades, Peter (2001). "FADE: Graph Drawing, Clustering, and Visual Abstraction" (PDF). Proceedings of the 8th International Symposium on Graph Drawing. pp. 197–210. ISBN 3-540-41554-8.
External links
- aiSee's force-directed layout
- Video of Spring Algorithm
- Live visualisation in flash + source code and description
- Short explanation of the Kamada-Kawai spring-based graph layout algorithm featuring a picture
- Short explanation of Fruchterman-Reingold algorithm. The algorithm implements a variable step width (“temperature”) to guarantee that the system reaches equilibrium state
- Daniel Tunkelang's dissertation (with source code and demonstration applet) on force-directed graph layout
- Hyperassociative Map Algorithm
- Implementation of a Force Directed Graph with C# including video demonstration
- Interactive and real-time force directed graphing algorithms used in an online database modeling tool