The Floyd–Warshall algorithm is an example of dynamic programming, and was published in its currently recognized form by Robert Floyd in 1962.3 However, it is essentially the same as algorithms previously published by Bernard Roy in 19594 and also by Stephen Warshall in 19625 for finding the transitive closure of a graph,6 and is closely related to Kleene's algorithm (published in 1956) for converting a deterministic finite automaton into a regular expression, with the difference being the use of a min-plus semiring.7 The modern formulation of the algorithm as three nested for-loops was first described by Peter Ingerman, also in 1962.8
The Floyd–Warshall algorithm compares many possible paths through the graph between each pair of vertices. It is guaranteed to find all shortest paths and is able to do this with Θ ( | V | 3 ) {\displaystyle \Theta (|V|^{3})} comparisons in a graph,910 even though there may be Θ ( | V | 2 ) {\displaystyle \Theta (|V|^{2})} edges in the graph. It does so by incrementally improving an estimate on the shortest path between two vertices, until the estimate is optimal.
Consider a graph G {\displaystyle G} with vertices V {\displaystyle V} numbered 1 through N {\displaystyle N} . Further consider a function s h o r t e s t P a t h ( i , j , k ) {\displaystyle \mathrm {shortestPath} (i,j,k)} that returns the length of the shortest possible path (if one exists) from i {\displaystyle i} to j {\displaystyle j} using vertices only from the set { 1 , 2 , … , k } {\displaystyle \{1,2,\ldots ,k\}} as intermediate points along the way. Now, given this function, our goal is to find the length of the shortest path from each i {\displaystyle i} to each j {\displaystyle j} using any vertex in { 1 , 2 , … , N } {\displaystyle \{1,2,\ldots ,N\}} . By definition, this is the value s h o r t e s t P a t h ( i , j , N ) {\displaystyle \mathrm {shortestPath} (i,j,N)} , which we will find recursively.
Observe that s h o r t e s t P a t h ( i , j , k ) {\displaystyle \mathrm {shortestPath} (i,j,k)} must be less than or equal to s h o r t e s t P a t h ( i , j , k − 1 ) {\displaystyle \mathrm {shortestPath} (i,j,k-1)} : we have more flexibility if we are allowed to use the vertex k {\displaystyle k} . If s h o r t e s t P a t h ( i , j , k ) {\displaystyle \mathrm {shortestPath} (i,j,k)} is in fact less than s h o r t e s t P a t h ( i , j , k − 1 ) {\displaystyle \mathrm {shortestPath} (i,j,k-1)} , then there must be a path from i {\displaystyle i} to j {\displaystyle j} using the vertices { 1 , 2 , … , k } {\displaystyle \{1,2,\ldots ,k\}} that is shorter than any such path that does not use the vertex k {\displaystyle k} . Since there are no negative cycles this path can be decomposed as:
And of course, these must be a shortest such path (or several of them), otherwise we could further decrease the length. In other words, we have arrived at the recursive formula:
The base case is given by
where w ( i , j ) {\displaystyle w(i,j)} denotes the weight of the edge from i {\displaystyle i} to j {\displaystyle j} if one exists and ∞ (infinity) otherwise.
These formulas are the heart of the Floyd–Warshall algorithm. The algorithm works by first computing s h o r t e s t P a t h ( i , j , k ) {\displaystyle \mathrm {shortestPath} (i,j,k)} for all ( i , j ) {\displaystyle (i,j)} pairs for k = 0 {\displaystyle k=0} , then k = 1 {\displaystyle k=1} , then k = 2 {\displaystyle k=2} , and so on. This process continues until k = N {\displaystyle k=N} , and we have found the shortest path for all ( i , j ) {\displaystyle (i,j)} pairs using any intermediate vertices. Pseudocode for this basic version follows.
The algorithm above is executed on the graph on the left below:
Prior to the first recursion of the outer loop, labeled k = 0 above, the only known paths correspond to the single edges in the graph. At k = 1, paths that go through the vertex 1 are found: in particular, the path [2,1,3] is found, replacing the path [2,3] which has fewer edges but is longer (in terms of weight). At k = 2, paths going through the vertices {1,2} are found. The red and blue boxes show how the path [4,2,1,3] is assembled from the two known paths [4,2] and [2,1,3] encountered in previous iterations, with 2 in the intersection. The path [4,2,3] is not considered, because [2,1,3] is the shortest path encountered so far from 2 to 3. At k = 3, paths going through the vertices {1,2,3} are found. Finally, at k = 4, all shortest paths are found.
The distance matrix at each iteration of k, with the updated distances in bold, will be:
A negative cycle is a cycle whose edges sum to a negative value. There is no shortest path between any pair of vertices i {\displaystyle i} , j {\displaystyle j} which form part of a negative cycle, because path-lengths from i {\displaystyle i} to j {\displaystyle j} can be arbitrarily small (negative). For numerically meaningful output, the Floyd–Warshall algorithm assumes that there are no negative cycles. Nevertheless, if there are negative cycles, the Floyd–Warshall algorithm can be used to detect them. The intuition is as follows:
Hence, to detect negative cycles using the Floyd–Warshall algorithm, one can inspect the diagonal of the path matrix, and the presence of a negative number indicates that the graph contains at least one negative cycle.11 During the execution of the algorithm, if there is a negative cycle, exponentially large numbers can appear, as large as Ω ( ⋅ 6 n − 1 w m a x ) {\displaystyle \Omega (\cdot 6^{n-1}w_{max})} , where w m a x {\displaystyle w_{max}} is the largest absolute value of a negative edge in the graph. To avoid overflow/underflow problems one should check for negative numbers on the diagonal of the path matrix within the inner for loop of the algorithm.12 Obviously, in an undirected graph a negative edge creates a negative cycle (i.e., a closed walk) involving its incident vertices. Considering all edges of the above example graph as undirected, e.g. the vertex sequence 4 – 2 – 4 is a cycle with weight sum −2.
The Floyd–Warshall algorithm typically only provides the lengths of the paths between all pairs of vertices. With simple modifications, it is possible to create a method to reconstruct the actual path between any two endpoint vertices. While one may be inclined to store the actual path from each vertex to each other vertex, this is not necessary, and in fact, is very costly in terms of memory. Instead, we can use the shortest-path tree, which can be calculated for each node in Θ ( | E | ) {\displaystyle \Theta (|E|)} time using Θ ( | V | ) {\displaystyle \Theta (|V|)} memory, and allows us to efficiently reconstruct a directed path between any two connected vertices.
The array prev[u][v] holds the penultimate vertex on the path from u to v (except in the case of prev[v][v], where it always contains v even if there is no self-loop on v):13
Let n {\displaystyle n} be | V | {\displaystyle |V|} , the number of vertices. To find all n 2 {\displaystyle n^{2}} of s h o r t e s t P a t h ( i , j , k ) {\displaystyle \mathrm {shortestPath} (i,j,k)} (for all i {\displaystyle i} and j {\displaystyle j} ) from those of s h o r t e s t P a t h ( i , j , k − 1 ) {\displaystyle \mathrm {shortestPath} (i,j,k-1)} requires Θ ( n 2 ) {\displaystyle \Theta (n^{2})} operations. Since we begin with s h o r t e s t P a t h ( i , j , 0 ) = e d g e C o s t ( i , j ) {\displaystyle \mathrm {shortestPath} (i,j,0)=\mathrm {edgeCost} (i,j)} and compute the sequence of n {\displaystyle n} matrices s h o r t e s t P a t h ( i , j , 1 ) {\displaystyle \mathrm {shortestPath} (i,j,1)} , s h o r t e s t P a t h ( i , j , 2 ) {\displaystyle \mathrm {shortestPath} (i,j,2)} , … {\displaystyle \ldots } , s h o r t e s t P a t h ( i , j , n ) {\displaystyle \mathrm {shortestPath} (i,j,n)} , each having a cost of Θ ( n 2 ) {\displaystyle \Theta (n^{2})} , the total time complexity of the algorithm is n ⋅ Θ ( n 2 ) = Θ ( n 3 ) {\displaystyle n\cdot \Theta (n^{2})=\Theta (n^{3})} .1415
The Floyd–Warshall algorithm can be used to solve the following problems, among others:
Implementations are available for many programming languages.
For graphs with non-negative edge weights, Dijkstra's algorithm can be used to find all shortest paths from a single vertex with running time Θ ( | E | + | V | log | V | ) {\displaystyle \Theta (|E|+|V|\log |V|)} . Thus, running Dijkstra starting at each vertex takes time Θ ( | E | | V | + | V | 2 log | V | ) {\displaystyle \Theta (|E||V|+|V|^{2}\log |V|)} . Since | E | = O ( | V | 2 ) {\displaystyle |E|=O(|V|^{2})} , this yields a worst-case running time of repeated Dijkstra of O ( | V | 3 ) {\displaystyle O(|V|^{3})} . While this matches the asymptotic worst-case running time of the Floyd-Warshall algorithm, the constants involved matter quite a lot. When a graph is dense (i.e., | E | ≈ | V | 2 {\displaystyle |E|\approx |V|^{2}} ), the Floyd-Warshall algorithm tends to perform better in practice. When the graph is sparse (i.e., | E | {\displaystyle |E|} is significantly smaller than | V | 2 {\displaystyle |V|^{2}} ), Dijkstra tends to dominate.
For sparse graphs with negative edges but no negative cycles, Johnson's algorithm can be used, with the same asymptotic running time as the repeated Dijkstra approach.
There are also known algorithms using fast matrix multiplication to speed up all-pairs shortest path computation in dense graphs, but these typically make extra assumptions on the edge weights (such as requiring them to be small integers).1920 In addition, because of the high constant factors in their running time, they would only provide a speedup over the Floyd–Warshall algorithm for very large graphs.
Cormen, Thomas H.; Leiserson, Charles E.; Rivest, Ronald L. (1990). Introduction to Algorithms (1st ed.). MIT Press and McGraw-Hill. ISBN 0-262-03141-8. See in particular Section 26.2, "The Floyd–Warshall algorithm", pp. 558–565 and Section 26.4, "A general framework for solving path problems in directed graphs", pp. 570–576. 0-262-03141-8 ↩
Kenneth H. Rosen (2003). Discrete Mathematics and Its Applications, 5th Edition. Addison Wesley. ISBN 978-0-07-119881-3. 978-0-07-119881-3 ↩
Floyd, Robert W. (June 1962). "Algorithm 97: Shortest Path". Communications of the ACM. 5 (6): 345. doi:10.1145/367766.368168. S2CID 2003382. /wiki/Robert_W._Floyd ↩
Roy, Bernard (1959). "Transitivité et connexité". C. R. Acad. Sci. Paris (in French). 249: 216–218. /wiki/Bernard_Roy ↩
Warshall, Stephen (January 1962). "A theorem on Boolean matrices". Journal of the ACM. 9 (1): 11–12. doi:10.1145/321105.321107. S2CID 33763989. https://doi.org/10.1145%2F321105.321107 ↩
Weisstein, Eric W. "Floyd-Warshall Algorithm". MathWorld. /wiki/Eric_W._Weisstein ↩
Kleene, S. C. (1956). "Representation of events in nerve nets and finite automata". In C. E. Shannon and J. McCarthy (ed.). Automata Studies. Princeton University Press. pp. 3–42. /wiki/Stephen_Cole_Kleene ↩
Ingerman, Peter Z. (November 1962). "Algorithm 141: Path Matrix". Communications of the ACM. 5 (11): 556. doi:10.1145/368996.369016. S2CID 29010500. https://doi.org/10.1145%2F368996.369016 ↩
Hochbaum, Dorit (2014). "Section 8.9: Floyd-Warshall algorithm for all pairs shortest paths" (PDF). Lecture Notes for IEOR 266: Graph Algorithms and Network Flows. Department of Industrial Engineering and Operations Research, University of California, Berkeley. pp. 41–42. /wiki/Dorit_S._Hochbaum ↩
Stefan Hougardy (April 2010). "The Floyd–Warshall algorithm on graphs with negative cycles". Information Processing Letters. 110 (8–9): 279–281. doi:10.1016/j.ipl.2010.02.001. /wiki/Doi_(identifier) ↩
"Free Algorithms Book". https://books.goalkicker.com/AlgorithmsBook/ ↩
Baras, John; Theodorakopoulos, George (2022). Path Problems in Networks. Springer International Publishing. ISBN 9783031799839. 9783031799839 ↩
Gross, Jonathan L.; Yellen, Jay (2003). Handbook of Graph Theory. Discrete Mathematics and Its Applications. CRC Press. p. 65. ISBN 9780203490204.. 9780203490204 ↩
Peñaloza, Rafael. "Algebraic Structures for Transitive Closure". Seminar "Graph Algorithms". Dresden University of Technology, Department of Computer Science, Institute for Theoretical Computer Science. Archived from the original on 2009-10-22. https://web.archive.org/web/20091022202353/http://geocities.com/rpenalozan/ps/graphalg2005.ps ↩
Gillies, Donald (1993). Scheduling Tasks with AND/OR precedence contraints (PhD Thesis, Appendix B) (PDF) (Report). http://www.ece.ubc.ca/~gillies/download/Donald_W_Gillies_PhD_1993_Scheduling_With_AND_OR_Precedence.pdf ↩
Zwick, Uri (May 2002). "All pairs shortest paths using bridging sets and rectangular matrix multiplication". Journal of the ACM. 49 (3): 289–317. arXiv:cs/0008011. doi:10.1145/567112.567114. S2CID 1065901.. /wiki/Uri_Zwick ↩
Chan, Timothy M. (January 2010). "More algorithms for all-pairs shortest paths in weighted graphs". SIAM Journal on Computing. 39 (5): 2075–2089. CiteSeerX 10.1.1.153.6864. doi:10.1137/08071990x.. /wiki/Timothy_M._Chan ↩