A* algorithm

written by @marc_lelarge after this thread.

We always assume that the graph we consider is connected (and finite) so that there is always a shortest path between

s

and

t

Recap: Dijkstra's shortest-path algorithm

See Chapter 4 of Algorithms - Sanjoy Dasgupta, Christos H. Papadimitriou, and Umesh V. Vazirani

Dijkstra's shortest-path algorithm
Input: Graph

G = (V, E)

; positive edge lengths

{ℓ_{e}, e \in E}

; vertex

s \in V

Output: For all vertices

u

reachable from

s

, the distance from

s

u

dist (u)

for all

u \in V

dist (u) = \infty

prev (u) = nil

dist (s) = 0

H = makequeue (V, dist)

(using dist-values as keys)
while

H

is not empty:

u = deletemin (H, dist)

for all edges

(u, v) \in E

:
if

dist (v) > dist (u) + ℓ (u, v)

dist (v) = dist (u) + ℓ (u, v)

prev (v) = u

decreasekey (H, v)

At the end of the algorithm,

prev

will hold for each node

u

the identity of the node immediately before it on the shortest path from

s

u

.
The right data structure for

H

is a priority queue which maintians a set of elements (nodes) with associated numeric key values (

dist

) and supports the following operations:

Insert. Add a new alement to the set
Decrease-key. Accommodate the decrease in key value of a particular element.
Delete-min. Return the element with the smallest key, and remove it from the set.
Make-queue. Build a priority queue out of the given elements, with the given key values.

Note that each node is added once in

H

before the while loop and removed from

H

during an iteration of the while loop. For each node

u

dist (u)

can only decrease in this algorithm and is always an upper bound for the true distance from

s

u

Property : when

u

is removed from

H

, then

dist (u)

is the correct distance from

s

u

and

prev (u)

is the correct node before

u

in the shortest path from

s

u

Proof : This can be proved by induction with the following inductive hypothesis: at the end of each iteration of the while loop, the following conditions hold with

u = deletemin (H, dist)

H^{'} = H ∖ {u}

and

B = V ∖ H^{'}

: (1) all nodes in

H^{'}

are at distance

\geq dist (u)

from

s

and all nodes in

B

are at distance

\leq dist (u)

from

s

, and (2) for every node

v

, the value

dist (v)

is the length of the shortest path from

s

v

whose intermediate nodes are constrained to be in

B

(if no such path exists, the value is

\infty

The base case is straightforward since

s

is the first removed element from

H

so that

B = {s}

.
We denote by

δ (v)

the true distance between

s

and

v

.
Points (1) and (2) of the inductive hypothesis imply that all nodes

v \in B

are at

dist (v)

from

s

\forall v \in B, δ (v) = dist (v)

. If this is not the case, by (2), there exists

w \in H^{'}

on the shortest path from

s

v

. Since the edge lenghts are positive, the partial path from

s

w

is the shortest path between

s

and

w

and its lenght

δ (w)

is strictly less than the distance

δ (v)

between

s

and

v

but by (1)

δ (v) \leq dist (u)

contradicting the fact that

δ (w) \geq dist (u)

. As a result, nodes

v \in B

have been removed from the queue in increasing order of

dist (v) = δ (v)

.
Assume now that the inductive hypothesis is correct and let

u = deletemin (H)

H^{'} = H ∖ {u}

and

B = V ∖ H^{'}

. Let

v

be the last node in

B

on the shortest path from

s

u

and

w

the node on this path neighbor of

v

and closer to

u

s ⇝ v - w ⇝ u

. We have

δ (w) = δ (v) + ℓ (v, w)

. By the inductive hypothesis:

δ (v) = dist (v)

and

dist (w) \leq dist (v) + ℓ (v, w) = δ (v) + ℓ (v, w)

hence

dist (w) = δ (w)

. But since

u = deletemin (H)

, we have

δ (u) \leq dist (u) \leq dist (w) = δ (w)

, so that

u = w

and we proved

dist (u) = δ (u) .

Clearly all nodes in

B

are at distance

\leq dist (u)

and (2) is correct. Suppose there exists

x \in H^{'}

with

δ (x) < dist (u) = δ (u)

. We can apply the same argument as above by considering the shortest pasth form

s

x

and the last node

v

B

s ⇝ v - w ⇝ x

so that

dist (v) = δ (w)

and since

w \in H^{'}

dist (u) = δ (u) \leq dist (w) = δ (w) \leq δ (x)

, a contradiction. Hence we proved (1).

Modifying Dijkstra's shortest-path algorithm to get A* algorithm

Instead of computing all the shortest paths from

s

, we now have a target

t

and want to compute efficiently the shortest path from

s

t

We have access to a heuristic fucntion

h (u)

that estimates the length of the path from any node

u

to the target

t

The A* algorithm is a simple variation of the Dijkstra's algorithm with a single modification: the priority in the queue is now the function:

dist (u) + h (u)

A* algorithm
Input: Graph

G = (V, E)

; positive edge lengths

{ℓ_{e}, e \in E}

; vertex

s \in V

; a vertex

t \in V

and an associated heuristic function

h (u)

estimating the distance from

u

t

.
Output: under some conditions on the heuristic

h

, a shortest path from

s

t

for all

u \in V

dist (u) = \infty

prev (u) = nil

dist (s) = 0

H = makequeue (V, dist + h)

(using keys:

dist (u) + h (u)

)
while

H

is not empty:

u = deletemin (H, dist + h)

u = t

:
break
for all edges

(u, v) \in E

:
if

dist (v) > dist (u) + ℓ (u, v)

dist (v) = dist (u) + ℓ (u, v)

prev (v) = u

v \notin H

insert (H, v)

decreasekey (H, v)

We still have that for each node

u

dist (u)

can only decrease and is always an upper bound for the true distance from

s

u

. Note that

s

is the first node removed from

H

(and nerver added back to

H

If the heuristic function is zero:

h (u) = 0

, A* reduces to Dijkstra's algorithm.
In this case, it follows from previous proof that when

dist (v)

is decreased,

v

is always in

H

, hence there is no insertion in

H

and all steps are the same in both algorithms.

A comprehensive path-finding library in javascript.

Accessible here: Demo and GitHub code

Image Not Showing Possible Reasons

The image was uploaded to a note which you don't have access to
The note which the image was originally uploaded to has been deleted

Learn More →

Properties of A* algorithm

Lemma : For an optimal path

s = v_{0}, v_{1}, \dots, v_{k} = t

, if

v_{i}

is removed from

H

and all the

v_{j}

's for

j < i

have been removed from

H

then

dist (v_{i})

is the true distance between

s

and

v_{i}

and

v_{0}, \dots, v_{i}

will never be inserted back into

H

Proof : the claim is clearly true for

v_{0} = s

. Assume it is true for all

v_{0}, v_{1}, \dots, v_{i - 1}

and

v_{i}

is removed from

H

. Between the removal of

v_{i - 1}

and

v_{i}

some other nodes might have been removed from

H

but since

dist (v_{j}) = δ (s, v_{j})

for all

j \leq i - 1

, these removals will not modify any of the

dist (v_{j})

(and not add them back to

H

). Now after removal of

v_{i - 1}

, since

v_{i}

is a neighbor, we have

dist (v_{i}) = dist (v_{i - 1}) + ℓ (v_{i - 1}, v_{i}) = δ (s, v_{i})

Consider a variant of A*, where we remove the break. This variant of A* terminates, i.e.

H

will become empty after a finite number of iterations since the number of possible values for all the

dist (u)

is finite. Moreover, thanks to previous Lemma, in this variant of A*, when

v_{i}

is last removed from

H

then

dist (v_{i})

is the true distance between

s

and

v_{i}

(and all the

v_{j}

for

j \leq i

have been removed from

H

). In particular, the only way for A* to fail to return an optimal path is because it removes the target

t

from

H

too early. Under some conditions on the heuristic, we can guarantee that this will nerver happen.

A heuristic function is admissible if it never overestimates the actual cost to get to the goal:

h (u) \leq δ (u, t)

, where

δ (u, t)

is the distance between

u

and the goal

t

A heuristic is consistent (or monotone) if for each edge

(u, v) \in E

h (u) \leq ℓ (u, v) + h (v)

Property : every consistent heuristic is also admissible.

Proof : By definition, we have

h (u) \leq ℓ (u, v) + h (v)

and we need to prove that

h (u) \leq δ (u, t)

.
Consider a shortest path from

u

t

denoted

(u, v_{1}, \dots, v_{k}, t)

. We have

h (v_{k}) \leq ℓ (v_{k}, t) + h (t) = ℓ (v_{k}, t)

h (t) = 0

. Then

h (v_{k - 1}) \leq ℓ (v_{k - 1}, v_{k}) + h (v_{k}) \leq ℓ (v_{k - 1}, v_{k}) + ℓ (v_{k}, t)

and so on so that we get

h (u) \leq ℓ (u, v_{1}) + \dots + ℓ (v_{k}, t) = δ (u, t)

Property : A* is equivalent to Dijkstra’s algorithm on a graph with reduced lengths

c (u, v) = ℓ (u, v) - h (u) + h (v)

. Note that, since Dijkstra’s algorithm requires arc costs to be nonnegative, the heuristic needs to be consistent.

Running Dijkstra's algorithm with the reduced lengths, if

π

is the current path from

s

v

, we have:

\begin{array}{rcl} {dist}_{Dijkstra} (v) & = & \sum_{(u, w) \in π} c (u, w) \\ = & \sum_{(u, w) \in π} ℓ (u, w) + h (v) - h (s) \\ = & {dist}_{A *} (v) + h (v) - h (s) . \end{array}

The result follows since

h (s)

is a constant and the fact that if

{dist}_{Dijkstra} (v)

is updated then

v \in H

As a consequence A* returns an optimal path with reduced costs but since

h (t) = 0

, this optimal path is also optimal for the original cost.

It turns out, we only need to have an admissible heuristic for A* to find shortest paths:

Property : if the heuristic is admissible, then A* always finds a shortest path.

This can be proved by contradiction. Assume that the path retruned by A* between

s

and

t

is not optimal, i.e.

dist (t) > δ (s, t)

when

t

is removed from

H

. Consider

H

just before

t

is chosen. Denote by

s = v_{0}, v_{1}, \dots, v_{k} = t

an optimal path from

s

t

. Since

s \notin H

dist (s) = 0

and

t \in H

, by previous Lemma, there exists a node

v_{i}

such that for all

j \leq i

dist (v_{j}) = δ (s, v_{j})

and

v_{i + 1} \in H

. But since

t

has been chosen and

h (t) = 0

, we have

dist (t) \leq dist (v_{i + 1}) + h (v_{i + 1})

. But we have

dist (v_{i + 1}) = δ (s, v_{i + 1})

(since

v_{i}

has been removed from

H

and

dist (v_{i}) = δ (s, v_{i})

) and

h (v_{i + 1}) \leq δ (v_{i + 1}, t)

(by admissibility). Hence, we get

δ (s, t) \leq dist (t) \leq δ (s, v_{i + 1}) + δ (v_{i + 1}, t) = δ (s, t)

, a contradiction.

A* algorithm

Recap: Dijkstra's shortest-path algorithm

Modifying Dijkstra's shortest-path algorithm to get A* algorithm

A comprehensive path-finding library in javascript.

Properties of A* algorithm

tags: public agreginfo

Read more

Broadcasting in Python: K-means algorithm

Basics about probability distribution and Gaussians

Autodiff and Backpropagation

Transformers using Named Tensor Notation

tags: `public` `agreginfo`