In the mathematical area of graph theory, an induced path in an undirected graph G is a path that is an induced subgraph of G. That is, it is a sequence of vertices in G such that each two adjacent vertices in the sequence are connected by an edge in G, and each two nonadjacent vertices in the sequence are not connected by any edge in G. An induced path is sometimes called a snake, and the problem of finding long induced paths in hypercube graphs is known as the snake-in-the-box problem.
Similarly, an induced cycle is a cycle that is an induced subgraph of G; induced cycles are also called chordless cycles or (when the length of the cycle is four or more) holes. An antihole is a hole in the complement of G, i.e., an antihole is a complement of a hole.
The length of the longest induced path in a graph has sometimes been called the detour number of the graph; for sparse graphs, having bounded detour number is equivalent to having bounded tree-depth. The induced path number of a graph G is the smallest number of induced paths into which the vertices of the graph may be partitioned, and the closely related path cover number of G is the smallest number of induced paths that together include all vertices of G. The girth of a graph is the length of its shortest cycle, but this cycle must be an induced cycle as any chord could be used to produce a shorter cycle; for similar reasons the odd girth of a graph is also the length of its shortest odd induced cycle.
The illustration shows a cube, a graph with eight vertices and twelve edges, and an induced path of length four in this graph. A straightforward case analysis shows that there can be no longer induced path in the cube, although it has an induced cycle of length six. The problem of finding the longest induced path or cycle in a hypercube, first posed by Kautz (1958), is known as the snake-in-the-box problem, and it has been studied extensively due to its applications in coding theory and engineering.
Characterization of graph families
Many important graph families can be characterized in terms of the induced paths or cycles of the graphs in the family.
- Trivially, the connected graphs with no induced path of length two are the complete graphs, and the connected graphs with no induced cycle are the trees.
- A triangle-free graph is a graph with no induced cycle of length three.
- The cographs are exactly the graphs with no induced path of length three.
- The chordal graphs are the graphs with no induced cycle of length four or more.
- The even-hole-free graphs are the graphs in containing no induced cycles with an even number of vertices.
- The trivially perfect graphs are the graphs that have neither an induced path of length three nor an induced cycle of length four.
- By the strong perfect graph theorem, the perfect graphs are the graphs with no odd hole and no odd antihole.
- The distance-hereditary graphs are the graphs in which every induced path is a shortest path, and the graphs in which every two induced paths between the same two vertices have the same length.
- The block graphs are the graphs in which there is at most one induced path between any two vertices, and the connected block graphs are the graphs in which there is exactly one induced path between every two vertices.
Algorithms and complexity
It is NP-complete to determine, for a graph G and parameter k, whether the graph has an induced path of length at least k. Garey & Johnson (1979) credit this result to an unpublished communication of Mihalis Yannakakis. However, this problem can be solved in polynomial time for certain graph families, such as asteroidal-triple-free graphs or graphs with no long holes.
It is also NP-complete to determine whether the vertices of a graph can be partitioned into two induced paths, or two induced cycles. As a consequence, determining the induced path number of a graph is NP-hard.
The complexity of approximating the longest induced path or cycle problems can be related to that of finding large independent sets in graphs, by the following reduction. From any graph G with n vertices, form another graph H with twice as many vertices as G, by adding to G n(n − 1)/2 vertices having two neighbors each, one for each pair of vertices in G. Then if G has an independent set of size k, H must have an induced path and an induced cycle of length 2k, formed by alternating vertices of the independent set in G with vertices of I. Conversely, if H has an induced path or cycle of length k, any maximal set of nonadjacent vertices in G from this path or cycle forms an independent set in G of size at least k/3. Thus, the size of the maximum independent set in G is within a constant factor of the size of the longest induced path and the longest induced cycle in H. Therefore, by the results of Håstad (1996) on inapproximability of independent sets, unless NP=ZPP, there does not exist a polynomial time algorithm for approximating the longest induced path or the longest induced cycle to within a factor of O(n1/2-ε) of the optimal solution.
Holes (and antiholes in graphs without chordless cycles of length 5) in a graph with n vertices and m edges may be detected in time (n+m2).
Atomic cycles are a generalization of chordless cycles, that contain no n-chords. Given some cycle, an n-chord is defined as a path of length n connecting two points on the cycle, where n is less than the length of the shortest path on the cycle connecting those points. If a cycle has no n-chords, it is called an atomic cycle, because it cannot be decomposed into smaller cycles. In the worst case, the atomic cycles in a graph can be enumerated in O(m2) time, where m is the number of edges in the graph.
- Buckley & Harary (1988).
- Nešetřil & Ossona de Mendez (2012), Proposition 6.4, p. 122.
- Chartrand et al. (1994).
- Barioli, Fallat & Hogben (2004).
- Kratsch, Müller & Todinca (2003).
- Gavril (2002).
- Le, Le & Müller (2003).
- Berman & Schnitger (1992).
- Nikolopoulos & Palios (2004).
- Gashler & Martinez (2012).
- Barioli, Francesco; Fallat, Shaun; Hogben, Leslie (2004). "Computation of minimal rank and path cover number for certain graphs" (PDF). Linear Algebra Appl. 392: 289–303. doi:10.1016/j.laa.2004.06.019.
- Berman, Piotr; Schnitger, Georg (1992). "On the complexity of approximating the independent set problem". Information and Computation 96 (1): 77–94. doi:10.1016/0890-5401(92)90056-L.
- Buckley, Fred; Harary, Frank (1988). "On longest induced paths in graphs". Chinese Quart. J. Math. 3 (3): 61–65.
- Garey, Michael R.; Johnson, David S. (1979). Computers and Intractability: A Guide to the Theory of NP-Completeness. W. H. Freeman. p. 196.
- Gashler, Michael; Martinez, Tony (2012). "Robust manifold learning with CycleCut" (PDF). Connection Science 24 (1): 57–69. doi:10.1080/09540091.2012.664122.
- Gavril, Fănică (2002). "Algorithms for maximum weight induced paths". Information Processing Letters 81 (4): 203–208. doi:10.1016/S0020-0190(01)00222-8.
- Håstad, Johan (1996). "Clique is hard to approximate within n1-ε". Proceedings of the 37th Annual IEEE Symposium on Foundations of Computer Science. pp. 627–636. doi:10.1109/SFCS.1996.548522.
- Kautz, W. H. (1958). "Unit-distance error-checking codes". IRE Trans. Elect. Comput. 7: 177–180.
- Kratsch, Dieter; Müller, Haiko; Todinca, Ioan (2003). "Feedback vertex set and longest induced path on AT-free graphs". Graph-theoretic concepts in computer science. Berlin: Lecture Notes in Computer Science, Vol. 2880, Springer-Verlag. pp. 309–321. doi:10.1007/b93953.
- Le, Hoàng-Oanh; Le, Van Bang; Müller, Haiko (2003). "Splitting a graph into disjoint induced paths or cycles" (The Second International Colloquium "Journées de l'Informatique Messine", Metz, 2000). Discrete Appl. Math. 131 (1): 199–212. doi:10.1016/S0166-218X(02)00425-0.
- Nešetřil, Jaroslav; Ossona de Mendez, Patrice (2012). "Chapter 6. Bounded height trees and tree-depth". Sparsity: Graphs, Structures, and Algorithms. Algorithms and Combinatorics 28. Heidelberg: Springer. pp. 115–144. doi:10.1007/978-3-642-27875-4. ISBN 978-3-642-27874-7. MR 2920058.
- Nikolopoulos, Stavros D.; Palios, Leonidas (2004). "Hole and antihole detection in graphs". Proc. 15th ACM-SIAM Symposium on Discrete Algorithms. pp. 850–859..