Try   HackMD

Extremal Density Matrices

Last update: 8/30/2021, Isaac Kim

Unfortunately, this line of approach seems to be too restrictive; see the last section for the underlying reason. Therefore, the notion of extremality, at least the one defined in this document, does not seem like a promising approach.

Nevertheless, this document is left as is because there are some interesting concepts introduced here. Some of those concepts may be migrated to a separate note in the future.

Consider a quantum spin system arranged on

R2. We can partition
R2
into a set of
2
-cells,
1
-cells, and
0
-cells. Each
2
-cell and
1
-cell is homeomorphic to a disk and an open interval, respectively. The
0
-cells are points. In the figure below, a
2
-cell is represented by a red cell, a
1
-cell is represented by a blue line, and a
0
-cell is represented by a green dot.

Image Not Showing Possible Reasons
  • The image file may be corrupted
  • The server hosting the image is unavailable
  • The image path is incorrect
  • The image format is not supported
Learn More →

(Physically, we can think of a

2-cell as a coarse-grained degree of freedom representing a collection of microscopic degrees of freedom.)

Primer on two-dimensional topology

The set of

0-,
1
-, and the
2
-cells form a CW-complex, complex for short. We shall denote the set of
k
-cells as
Ck
. A subset
CC2
of
2
-cells induces a complex defined by
Ck:={c:cCk,cc¯for some cC}

for
k<2
. (Here
c¯
means a closure of
c
in
R2
.)

Definition 1.
Let

cC. A minimal neighborhood of
c
is
NminC(c):={cC:c¯c¯}.

Similarly, a minimal neighborhood of
CC
is
NminC(C)=cCNminC(c)

Definition 2.
A path from

CC2 to
CC2
is a sequence of sub complexes induced by a sequence of
2
-cells
(C2)i=1n
,
(C2)1=C
,
(C2)n=C
s.t.
|(C2)i+1(C2)i|=1or|(C2)i(C2)i+1|=1

Definition 3.
A path is increasing if

|(C2)i+1(C2)i|=1 for all
i
. A path is decreasing if
|(C2)i(C2)i+1|=1
for all i.

Definition 4.
Let

CC2.
C:=cCc¯
.

Definition 5.
A path

(C2)i=1n is smooth if
(C2)i(C2)i+1
for all
i
.

Lemma 1.
Let

CCC2 and
CCD2
. There exists a
cCC
s.t.
CC{c}
.

Proof of Lemma 1.
Let us prove the contrapositive statement. First, we show that there must exist a

cCC such that
cNminC(C)
. If this is not the case, one can partition
C
into
CCC
so that
CC
has no points in common with
C
. Therefore,
C
is not connected and cannot be homeomorphic to
D2
. Thus,
cNminC(C)
.

Next, we show that there exists

cNminC(C) s.t.
CC{c}
. If not, for all
cNminC(C)
,
C{c}
is not homeomorphic to
D2
. Since
C{c}
is a connected subset of
R2
,
C{c}
must be homeomorphic to a closed disk with
k>0
open punctures. In particular, the interior of at least one of the punctures is not a subset of
C
. Since
C{c}C
,
C
must have a puncture and thus cannot be homeomorphic to
D2
. Thus, there exists
cNminC(C)
s.t.
CC{c}
.

Theorem 1.
Let

CCC2 and
CCD2
. There is a smooth increasing path from
C
to
C
.

The proof follows straightforwardly from Lemma 1.

Let us also prove an intuitive fact which will prove to be useful later.

Proposition 1.
Let

cC2. Let
C
be a subset of
Nmin(c)
which includes
c
. Then
CD2
.

Proof of Proposition 1.

If

C=Nmin(c),
CD2
by definition. Note that, for
A,BD2
,
BA
, s.t.
ABD1
,
ABD2
. Thus, if
C
is a strict subset of
Nmin(c)
s.t.
|C|=|Nmin(c)|+1
, since
cC Nmin(c)
,
CD2
. This argument can be used recursively to prove
CD2
for
|C|=|Nmin(c)|+k
for any
k>1
.

Extremal density matrices

Now we move onto the realm of quantum mechanics. Consider a Hilbert space which has a tensor product structure over the

2-cells:
H=cC2Hc,

where
dim(Hc)<
.

Definition 6.
Let

(C2)i=1n is a smooth increasing path and
ρ
be a density matrix supported on
(C2)n
, where
|(C2)1|=1
.
MED(C2)i=1n(ρ)=MED(C2)i=1n1(ρ)+S(δn|(C2)n1Nmin(δn)),

where
δn=(C2)n(C2)n1
.

Definition 7.
A density matrix

ρ over
C
is extremal if for all smooth increasing path
P
from a
2
-cell to
C
,
MEDP(ρ)=S(ρ).

  • Remark: By SSA,
    MEDP(ρ)S(ρ)
    for general quantum states.

Definition 8.
A density matrix

ρ over
C
is locally extremal if for all
cC
, the reduced density matrix of
ρ
on
CNmin(c)
is extremal.

Lemma 2.
Let

ρA be extremal. If
BA
and
BA
,
ρB
is extremal. Note: This is wrong. See the k-point function section.

Proof or Lemma 2.
Since

BA, there is a smooth increasing path from
B
to
A
. For any smooth increasing path from a single
2
-cell to
B
, we can concatenate this path with a path from
B
to
A
, yielding a path from a single
2
-cell to
A
. Let us denote this path as
(P1,P2)
, where
P1
is a path from a
2
-cell to
B
and
P2
is a path from
B
to
A
.

MED(P1,P2)(ρA)S(ρA)=MEDP1(ρB)S(ρB)+,
where the ellipsis represents a sum of
S(ρB)
conditional entropies which altogether is nonnegative because of SSA. The left-hand-side is
0
by definition and the right-hand-side is a sum of nonnegative terms. Therefore, both those terms must be zero. Thus,
MEDP1(ρB)=S(ρB)
for any path from a
2
-cell to
B
.

BK Entropy: Key to efficient characterization

Naively, verifying the extremality condition seems like a formidable task because the number of paths grows very quickly with

|C|. Note that, for a connected set of size, its boundary size is
Ω(n)
. So, for the first
k=O(|C|)
steps, the number of distinct paths is
Ω(k!)
. This naive counting argument thus leads to a number of conditons that scales faster than exponentially with
|C|
.

Remarkably, there is a sufficient condition for verifying the extremality which reduces this cost dramatically. This is the main subject of this section. The main protagnist of this story is the Bethe-Kikuchi entropy -BK entropy for short - which we shall denote as

SBK(ρA). The BK entropy depends implicitly on the notion of elementary clusters. In our context, this is simply a family of sets of
2
-cells, wherein each set is representing a local patch of the underlying physical system supported on a ball of bounded radius. Let
EC
be an ordered set of elementary clusters. Define
ECA
, where
A
is an ordered set of
2
-cells, as
ECA=(Ci:CiEC where cC,cA).

The BK entropy is defined as
SBKEC(ρA)=1inS(ρCi)1i<jnS(ρCiCj)+1i<j<knS(ρCiCjCk).

BK entropy satisfies the following removal lemma.

Lemma 3. (Removal lemma) Let

EC=EC{C} where
CC
for some
CEC
. Then
SBKEC(ρA)=SBKEC(ρA).

Proof of Lemma 3.

If

CECA, the statement is trivial. Thus, without loss of generality, suppose there are
n
CiECA
s.t.
CCi
. Define two sets:
IA={C:CECA,CC},OA=ECAIA.

Note that
|IA|=n
.

Consider the decomposition of

SBKEC(ρA)SBKEC(ρA) in terms of
S(ρC)
,
S(ρCC1)
,
S(ρCC1Ck)
, etc, where
CiOA
. Without loss of generality, consider the term
S(ρCC1Ck)
. This term appears in the entropy of an intersection of
C1,,CkOA
,
C
and elements in
IA
. Let
IA,i
be the set of elements in
IA
lying between
Ci
and
Ci+1
; similarly, let
IA,0
and
IA,k
be the set of elements in
IA
appearing before and after
C1
and
Ck
, respectively. The integer coefficient of
S(ρCC1Ck)
can be determined by summing over all possible subsets of
IA,i
- denoted as
I~A,i
- and weighing them by
(1)i=0k|I~A,i|
.

This sum is exactly zero. To see why, fix all but one

I~A,i and take the sum. Let
I~A,k
be the variable subset. The corresponding sum is proportional to
j=0mk(mki)(1)j=0,

where
mk=|I~A,k|
because the sum is equal to
(1+(1))mk
by the binomial expansion.

Moreover,

Lemma 4. (Inclusion-Exclusion Principle) Suppose every element in

EC is contained in
AB,BC,
or
AC
. Then
SBKEC(ρABC)=SBKEC(ρAB)+SBKEC(ρBC)+SBKEC(ρAC)SBKEC(ρA)SBKEC(ρB)SBKEC(ρC).

The proof follows straightforwardly from the assumption. On a similar note, if every element of

EC is contained in
A
or
B
,
SBKEC(ρAB)=SBKEC(ρA)+SBKEC(ρB).

Extremality and BK entropy.

Now we establish an equivalence of extremal density matrices and locally extremal density matrices whose entropies are equal to their respective BK entropies. In this discussion, we shall choose

EC using the following rule. Consider a graph obtained by assigning
2
-cells to the vertices and an edge between
c
and
c
if
cc
. This is a planar graph, so we can naturally define a notion of face. We choose
EC
to be the union of the set of vertices, edges, and faces on the newly defined planar graph. (Note that this is not exactly a dual graph of the original planar graph.) Lastly, we will focus on a hexagonal lattice. (Its dual lattice is thus triangular.) While we suspect the proofs in this section will apply to more general planar graphs, we leave this as an open problem.

Image Not Showing Possible Reasons
  • The image file may be corrupted
  • The server hosting the image is unavailable
  • The image path is incorrect
  • The image format is not supported
Learn More →

In the figure above, the dual
0
-,
1
-, and
2
-cell is colored in red, green, and blue, respectively. Since
EC
will be fixed throughout this section, we drop it in
SBKEC
, instead using
SBK
as a short-hand notation.

Lemma 5.
Let

ANmin(c) for some
cC2
. A density matrix
ρA
is extremal iff it is (i) locally extremal and (ii)
S(ρA)=SBK(ρA)
.

Proof of Lemma 5.

Simply consider all possibilities and use induction.

Lemma 6.
Suppose a density matrix

ρA is locally extremal and
AD2
. Then
MEDP(ρA)=SBK(ρA)
for any path
P
from a
2
-cell to
A
.

Proof of Lemma 6.
The proof is based on induction. Suppose we have proven this statement for all disks

BA.
MEDP(ρA)=MEDP(ρB)+S(AB|Nmin(AB)B),

where
P
is a subsequence of
P
obtained by removing the last element. By our hypothesis,
MEDP(ρA)=SBK(ρB)+S(AB|Nmin(AB)B).

Now we relate the conditional entropy to the BK entropy. Note that, the set we are focused on is a subset of a minimal neighborhood. By the local extremality, the conditional entropy can be further decomposed into a BK-entropy. Further, using the inclusion-exclusion principle, we get

MEDP(ρA)=SBK(ρA).

Theorem 2.
A density matrix

ρA s.t.
AD2
is extremal iff it is (i) locally extremal and (ii)
S(ρA)=SBK(ρA)
.

Proof of Theorem 2.

If

ρA is extremal, by Lemma 2,
ρB
is extremal for every
BA
which is a subset of
A
. Since
ANmin(c)D2
for every
cA
(see Proposition 1),
ρA
is locally extremal.

Next, we show that extremality of

ρA implies
S(ρA)=SBK(ρA)
, by induction. Suppose we have proved this statement for any
BA
s.t.
BA
. By the extremality of
ρA
,
S(ρA)=MEDP(ρB)+S(AB|Nmin(AB)B)=SBK(ρB)+S(AB|Nmin(AB)B)

for any path
P
from a
2
-cell to
B
. By the local extremality and our hypothesis,
S(ρA)=SBK(ρB)+SBK(ρNmin(AB)A)+SBK(ρNmin(AB)B).

Now we are in a position to use the inclusion-exclusion principle of the BK entropy; see Lemma 4 and a remark below it. Note that every element in

ECA is either in
B
or
Nmin(AB)A
. Moreover, since there is no cell with nontrivial overlap with an element in
BNmin(AB)
and an element in
AB
, we get, using the inclusion-exclusion principle,
S(ρA)=SBK(ρA).

Note that the base case of our induction argument follow from Lemma 5. Thus we have proved the "only if" part of the statement.

Now let us prove the converse statement. Given

A, pick a
BA
s.t.
|B|=|A|1
and
BA
. Suppose we have proven that, for all those
B
, that if
ρB
is locally extremal and
SBK(ρB)=S(ρB)
then
ρB
is extremal. If
S(ρA)=SBK(ρA)
. Then by the local extremality and the inclusion-exclusion principle of the BK entropy,
S(ρA)SBK(ρB)=S(AB|Nmin(AB)B).

By the local extremality,
SBK(ρB)=MEDP(ρB)
for any path
P
from a
2
-cell to
B
(Lemma 6). Because any path from a
2
-cell to
A
is a path from the same
2
-cell to some disk
BA
followed by
AB
,
ρA
is extremal.

Thus, now we have a better way of certifying extremality. Simply check the local extremality and calculate the BK entropy and check if it is equal to its global entropy. Alas, while this reduces the number of constraints that need to be checked to something that scales linearly with the number of

2-cells, this is still a difficult condition to verify. Caculating the von Neumann entropy of a large system is not easy!

Merging extremal density matrices

Now we show that extremal density matrices form a "nice" set. What we mean by this is that given two extremal density matrices, we can "merge" them together form a new extremal density matrix. The only extra condition needed is local consistency. That is, the two density matrices must yield the same reduced density matrix on their overlapping support. Obviously, if this condition is not satisfied, no such density matrix can exist (let alone something extremal).

Lemma 7.
(Merging lemma) Let

ρABC and
σBCD
be density matrices such that
ρABC=cσBCD
and
I(A:C|B)ρ=I(B:D|C)σ=0
. Then there exists a density matrix
τABCD
such that
τABC=ρABC
,
τBCD=σBCD
, and
I(AB:D|C)=I(A:CD|B)=0
.

By the uniqueness of the maximal entropy state, the state

τABCD that satisfies the above-mentioned conditions is unique. We shall represent this as
τABCD=ρABCσBCD
and
τABCD
as the merged density matrix.

Lemma 8.
(Extremal merging lemma) Let

ρA and
σB
be extremal density matrices and
ρA=cσB
. If there is a partition of
AB=CD
for disjoint sets
C
and
D
such that
AD
,
BC
,
CD
are disks and
d(AB,BC),d(BA,AC)>0
, there exists an extremal density matrix
τAB=cρA,σB
.

Again, by the uniquenes of the maximal entropy state,

τAB is unique. So we can simply say that, under the conditions described above,
τAB=ρAσB
is extremal.

Applications

Solution to the quantum marginal problem

Using the following merging sequence, given a set of locally consistent extremal density matrics on

3×3 patches, we can infer the existence of a global state consistent with those density matrices. Note that at all stage all the density matrices remain extremal. For each diagram, the the red and the blue regions are the supports of the respective density matrices.

  • Step 1: Merging the first three rows.
  • Step 2: Merging the fourth row.

  • Step 3: Repeat Step 2.

Free energy bound

Using the exrtremality of the merged density matrix, we can write down an exact expression for the global entropy; this is precisely the BK-entropy.

k
-point functions

k-point functions can be computed efficiently as long as the
k
points all lie on a single line.


Here the colored region forms a one-dimensional Markov chain. These don't work because there is no path from them to a "regular" large disk, technically speaking. However, this idea does work if we thicken the chain sufficiently.

This kind of flexibility shows that the notion of extremal density matrix is too restrictive. One can argue that, based on the consistency of the

2-point correlation functions over two different paths (specifically, one which is short and another which is long), that correlation ought to vanish strictly outside some bounded region. Let's discuss more about this in detail below.

Why extremality is too much.

Extremality is quite restrictive because it requires

S(ρA)=MEDP(ρA) for all path from a
2
-cell to
A
. For example, consider a "trivial" example which has a Markov chain along the
x
-direction on a single
y
coordinate and a trivial product state on the rest. Markov chains can have nonzero correlation and one can show that the extremality condition does not apply to this example. The basic reason is that if one considers a path in which, if we consider the subsequence formed by the points with
y
-coordinates on which the Markov chain exists, the ordering is not strictly increasing or decreasing, in order for the extremality condition to hold, the Markov chain must have zero correlation.