## Thursday, May 3, 2012

### Density Matrices and Density Operators

 John von Neumann at Los Alamos in the 1940's
John von Neumann wrote: "The part of my work I consider most essential is that on quantum mechanics, which developed in Göttingen in 1926, and subsequently in Berlin in 1927–1929. Also, my work on various forms of operator theory, Berlin 1930 and Princeton 1935–1939; on the ergodic theorem, Princeton, 1931–1932."

In Phyisics graduate school, 1st year 2nd semester, when they students take Quantum Mechanics, they are taught the important subject of Density Operators and Matrices. This may be over the head of most of you my dear readers, it is a bit over mine (simply because I haven't studied it in detail ... yet), but we're basically talking about linear algebra. At a minimum to understand this important subject that describes our real world, 5 semesters of College Math (Calculus I-IV + Transforms), Linear Algebra, and Wavefunctions should have been studied first. Then it's easy. :-)

The operative sentence is this:

"Just as the
Schrödinger equation describes how pure states evolve in time, the von Neumann equation (also known as Liouville-von Neumann equation) describes how a density operator evolves in time. In fact, the two equations are equivalent, in the sense that either can be derived from the other."

from Wikipedia:

density matrix is a matrix that describes a quantum system in a mixed state, a statistical ensemble of several quantum states. (In contrast, a pure state is described by a singlestate vector). The density matrix is the quantum-mechanical analogue to a phase-space probability measure (probability distribution of position and momentum) in classicalstatistical mechanics.
Mixed states arise in situations where there is classical uncertainty, i.e., when the experimenter does not know which particular states are being manipulated. (This should not be confused with quantum uncertainty, which dictates that even if the experimenter knows which states are being manipulated, the results of some measurements cannot be predicted.) Examples include a system in thermal equilibrium (at finite temperatures) or a system with an uncertain or randomly-varying preparation history (so one does not know which pure state the system is in). Also, if a quantum system has two or more subsystems that are entangled, then each subsystem must be treated as a mixed state even if the complete system is in a pure state. The density matrix is also a crucial tool in quantum decoherence theory.
The density matrix is a representation of a linear operator called the density operator. (The close relationship between matrices and operators is a basic concept in linear algebra.) In practice, the terms "density matrix" and "density operator" are often used interchangeably. Both matrix and operator are self-adjoint (or Hermitian), positive semi-definite, of traceone, and may be infinite-dimensional.[1] The formalism was introduced by John von Neumann[2] (and independently but less systematically by Lev Landau and Felix Bloch in 1927).[3][4][non-primary source needed]

[hide

## Pure and mixed states

In quantum mechanics, a quantum system is represented by a state vector (or ket$| \psi \rangle$. A quantum system with a state vector $| \psi \rangle$ is called a pure state. However, it is also possible for a system to be in a statistical ensemble of different state vectors: For example, there may be a 50% probability that the state vector is $| \psi_1 \rangle$ and a 50% chance that the state vector is $| \psi_2 \rangle$. This system would be in a mixed state. The density matrix is especially useful for mixed states, because any state, pure or mixed, can be characterized by a single density matrix.
A mixed state is different from a quantum superposition. In fact, a quantum superposition of pure states is another pure state, for example $| \psi \rangle = (| \psi_1 \rangle + | \psi_2 \rangle)/\sqrt{2}$.

### Example: Light polarization

An example of pure and mixed states is light polarization. Photons can have two helicities, corresponding to two orthogonal quantum states, $|R\rangle$ (right circular polarization) and $|L\rangle$(left circular polarization). A photon can also be in a superposition state, such as $(|R\rangle+|L\rangle)/\sqrt{2}$ (vertical polarization) or $(|R\rangle-|L\rangle)/\sqrt{2}$ (horizontal polarization). More generally, it can be in any state $\alpha|R\rangle+\beta|L\rangle$, corresponding to linearcircular, or elliptical polarization. If we pass $(|R\rangle+|L\rangle)/\sqrt{2}$ polarized light through a circular polarizer which allows either only $|R\rangle$ polarized light, or only $|L\rangle$ polarized light, intensity would be reduced by half in both cases. This may make it seem like half of the photons are in state $|R\rangle$ and the other half in state $|L\rangle$. But this is not correct: Both $|R\rangle$ and $|L\rangle$ photons are partly absorbed by a vertical linear polarizer, but the $(|R\rangle+|L\rangle)/\sqrt{2}$ light will pass through that polarizer with no absorption whatsoever.
However, unpolarized light (such as the light from an incandescent light bulb) is different from any state like $\alpha|R\rangle+\beta|L\rangle$ (linear, circular, or elliptical polarization). Unlike linearly or elliptically polarized light, it passes through a polarizer with 50% intensity loss whatever the orientation of the polarizer; and unlike circularly polarized light, it cannot be made linearly polarized with any wave plate. Indeed, unpolarized light cannot be described as any state of the form $\alpha|R\rangle+\beta|L\rangle$. However, unpolarized light can be described perfectly by assuming that each photon is either $| R \rangle$ with 50% probability or $| L \rangle$ with 50% probability. The same behavior would occur if each photon was either vertically polarized with 50% probability or horizontally polarized with 50% probability.
Therefore, unpolarized light cannot be described by any pure state, but can be described as a statistical ensemble of pure states in at least two ways (the ensemble of half left and half right circularly polarized, or the ensemble of half vertically and half horizontally linearly polarized). These two ensembles are completely indistinguishable experimentally, and therefore they are considered the same mixed state. One of the advantages of the density matrix is that there is just one density matrix for each mixed state, whereas there are many statistical ensembles of pure states for each mixed state. Nevertheless, the density matrix contains all the information necessary to calculate any measurable property of the mixed state.
Where do mixed states come from? To answer that, consider how to generate unpolarized light. One way is to use a system in thermal equilibrium, a statistical mixture of enormous numbers of microstates, each with a certain probability (the Boltzmann factor), switching rapidly from one to the next due to thermal fluctuations. Thermal randomness explains why an incandescent light bulb, for example, emits unpolarized light. A second way to generate unpolarized light is to introduce uncertainty in the preparation of the system, for example, passing it through a birefringent crystal with a rough surface, so that slightly different parts of the beam acquire different polarizations. A third way to generate unpolarized light uses an EPR setup: A radioactive decay can emit two photons traveling in opposite directions, in the quantum state $(|R,L\rangle+|L,R\rangle)/\sqrt{2}$. The two photons together are in a pure state, but if you only look at one of the photons and ignore the other, the photon behaves just like unpolarized light.
More generally, mixed states commonly arise from a statistical mixture of the starting state (such as in thermal equilibrium), from uncertainty in the preparation procedure (such as slightly different paths that a photon can travel), or from looking at a subsystem entangled with something else.

### Mathematical description

The state vector $| \psi \rangle$ of a pure state completely determines the statistical behavior of a measurement. As an example, take an observable quantity, and let A be the associatedobservable operator that has a representation on the Hilbert space $\mathcal{H}$ of the quantum system. For any real-valued function F defined on the real numbers,[5] suppose that F(A) is the result of applying F to the outcome of a measurement. The expectation value of F(A) is
$\langle \psi | F(A) | \psi \rangle\, .$
Now consider a mixed state prepared by statistically combining two different pure states $| \psi \rangle$ and $|\phi\rangle$, with the associated probabilities p and 1 − p, respectively. The associated probabilities mean that the preparation process for the quantum system ends in the state $|\psi\rangle$ with probability p and in the state $|\phi\rangle$ with probability 1 − p.
It is not hard to show that the statistical properties of the observable for the system prepared in such a mixed state are completely determined. However, there is no state vector $|\xi\rangle$which determines this statistical behavior in the sense that the expectation value of F(A) is
$\langle \xi | F(A) | \xi \rangle \, .$
Nevertheless, there is a unique operator ρ such that the expectation value of F(A) can be written as
$\operatorname{tr}[\rho F(A)]\, ,$
where the operator ρ is the density operator of the mixed system. A simple calculation shows that the operator ρ for the above example is given by
$\rho = p | \psi\rangle \langle \psi | + (1-p) | \phi\rangle \langle \phi |\,.$

## Formulation

For a finite dimensional function space, the most general density operator is of the form
$\rho = \sum_j p_j |\psi_j \rang \lang \psi_j|$
where the coefficients pj are non-negative and add up to one. This represents a statistical mixture of pure states. If the given system is closed, then one can think of a mixed state as representing a single system with an uncertain preparation history, as explicitly detailed above; or we can regard the mixed state as representing an ensemble of systems, i.e. large number of copies of the system in question, where pj is the proportion of the ensemble being in the state $\textstyle |\psi_j \rang$. An ensemble is described by a pure state if every copy of the system in that ensemble is in the same state, i.e. it is a pure ensemble. If the system is not closed, however, then it is simply not correct to claim that it has some definite but unknown state vector, as the density operator may record physical entanglements to other systems.
Consider a quantum ensemble of size N with occupancy numbers n1n2,...,nk corresponding to the orthonormal states $\textstyle |1\rang,...,|k\rang$, respectively, where n1+...+nk = N, and, thus, the coefficients pj = nj /N. For a pure ensemble, where all N particles are in state $\textstyle |i\rang$, we have nj = 0, for all j ≠ i, from which we recover the corresponding density operator $\textstyle\rho = |i\rang\lang i|$. However, the density operator of a mixed state does not capture all the information about a mixture; in particular, the coefficients pj and the kets ψj are not recoverable from the operator ρ without additional information. This non-uniqueness implies that different ensembles or mixtures may correspond to the same density operator. Such equivalent ensembles or mixtures cannot be distinguished by measurement of observables alone. This equivalence can be characterized precisely. Two ensembles ψ, ψ' define the same density operator if and only if there is a matrix U with
$U^*U=I$
i.e., U is unitary and such that
$| \psi_i'\rangle \sqrt {p_i'} = \sum_{j} u_{ij} | \psi_j\rangle \sqrt {p_j}.$
This is simply a restatement of the following fact from linear algebra: for two square matrices M and NM M* = N N* if and only if M = NU for some unitary U. (See square root of a matrix for more details.) Thus there is a unitary freedom in the ket mixture or ensemble that gives the same density operator. However if the kets in the mixture are orthonormal then the original probabilities pj are recoverable as the eigenvalues of the density matrix.
In operator language, a density operator is a positive semidefinitehermitian operator of trace 1 acting on the state space. A density operator describes a pure state if it is a rank one projection. Equivalently, a density operator ρ is a pure state if and only if
$\; \rho = \rho^2$,
i.e. the state is idempotent. This is true regardless of whether H is finite dimensional or not.
Geometrically, when the state is not expressible as a convex combination of other states, it is a pure state. The family of mixed states is a convex set and a state is pure if it is anextremal point of that set.
It follows from the spectral theorem for compact self-adjoint operators that every mixed state is an infinite convex combination of pure states. This representation is not unique. Furthermore, a theorem of Andrew Gleason states that certain functions defined on the family of projections and taking values in [0,1] (which can be regarded as quantum analogues of probability measures) are determined by unique mixed states. See quantum logic for more details.

## Measurement

Let A be an observable of the system, and suppose the ensemble is in a mixed state such that each of the pure states $\textstyle |\psi_j\rang$ occurs with probability pj. Then the corresponding density operator is:
$\rho = \sum_j p_j |\psi_j \rang \lang \psi_j| .$
The expectation value of the measurement can be calculated by extending from the case of pure states (see Measurement in quantum mechanics):
$\lang A \rang = \sum_j p_j \lang \psi_j|A|\psi_j \rang = \operatorname{tr}[\rho A],$
where tr denotes trace. Moreover, if A has spectral resolution
$A = \sum_i a_i |a_i \rang \lang a_i| = \sum _i a_i P_i,$
where $P_i = |a_i \rang \lang a_i|$, the corresponding density operator after the measurement is given by:
$\; \rho ^' = \sum_i P_i \rho P_i.$
Note that the above density operator describes the full ensemble after measurement. The sub-ensemble for which the measurement result was the particular value ai is described by the different density operator
$\rho_i' = \frac{P_i \rho P_i}{\operatorname{tr}[\rho P_i]}.$
This is true assuming that $\textstyle |a_i\rang$ is the only eigenket (up to phase) with eigenvalue ai; more generally, Pi in this expression would be replaced by the projection operator into theeigenspace corresponding to eigenvalue ai.

## Entropy

The von Neumann entropy $S$ of a mixture can be expressed in terms of the eigenvalues of $\rho$ or in terms of the trace and logarithm of the density operator $\rho$. Since $\rho$ is a positive semi-definite operator, it has a spectral decomposition such that $\rho= \sum_i \lambda_i |\varphi_i\rangle\langle\varphi_i|$ where $|\varphi_i\rangle$ are orthonormal vectors. Therefore the entropy of a quantum system with density matrix $\rho$ is
$S = -\sum_i \lambda_i \ln \,\lambda_i = -\operatorname{tr}(\rho \ln \rho)\quad.$
Also it can be shown that
$S\left(\rho=\sum_i p_i\rho_i\right)= H(p_i) + \sum_i p_iS(\rho_i)$
when $\rho_i$ have orthogonal support, where $H(p)$ is the Shannon entropy. This entropy can increase but never decrease with a projective measurement, however generalised measurements can decrease entropy [6][7]. The entropy of a pure state is zero, while that of a proper mixture always greater than zero. Therefore a pure state may be converted into a mixture by a measurement, but a proper mixture can never be converted into a pure state. Thus the act of measurement induces a fundamental irreversible change on the density matrix; this is analogous to the "collapse" of the state vector, or wavefunction collapse. Perhaps counterintuitively, the measurement actually decreases information by erasing quantum interference in the composite system—cf. quantum entanglement and quantum decoherence.
(A subsystem of a larger system can be turned from a mixed to a pure state, but only by increasing the von Neumann entropy elsewhere in the system. This is analogous to how the entropy of an object can be lowered by putting it in a refrigerator: The air outside the refrigerator's heat-exchanger warms up, gaining even more entropy than was lost by the object in the refrigerator. See second law of thermodynamics. See Entropy in thermodynamics and information theory.)

## The Von Neumann equation for time evolution

Just as the Schrödinger equation describes how pure states evolve in time, the von Neumann equation (also known as Liouville-von Neumann equation) describes how a density operator evolves in time (in fact, the two equations are equivalent, in the sense that either can be derived from the other.) The von Neumann equation dictates that[8][9]
$i \hbar \frac{\partial \rho}{\partial t} = [H,\rho]~,$
where the brackets denote a commutator.
Note that this equation only holds when the density operator is taken to be in the Schrödinger picture, even though this equation seems at first look to emulate the Heisenberg equation of motion in the Heisenberg picture, with a crucial sign difference:
$\frac{dA^{(H)}}{dt}=-\frac{i}{\hbar}[A^{(H)},H] ~,$
where $A^{(H)}(t)$ is some Heisenberg picture operator; but in this picture the density matrix is not time-dependent, and the relative sign ensures that the time derivative of the expected value $\langle A \rangle$ comes out the same as in the Schrödinger picture.
Taking the density operator to be in the Schrödinger picture makes sense, since it is composed of 'Schrödinger' kets and bras evolved in time, as per the Schrödinger picture. If the Hamiltonian is time-independent, this differential equation can be easily solved to yield
$\rho(t) = e^{-i H t/\hbar} \rho(0) e^{i H t/\hbar}.$

## "Quantum Liouville", Moyal's equation

The density matrix operator may also be realized in phase space. Under the Wigner map, the density matrix transforms into the equivalent Wigner function,
$W(x,p)\stackrel{\mathrm{def}}{=}\frac{1}{\pi\hbar}\int_{-\infty}^\infty \psi^*(x+y)\psi(x-y)e^{2ipy/\hbar}\,dy ~.$
The equation for the time-evolution of the Wigner function is then the Wigner-transform of the above von Neumann equation,
$\frac{\partial W(q,p,t)}{\partial t} = -\{\{W(q,p,t) , H(q,p )\}\}~,$
where H(q,p) is the Hamiltonian, and { { •,• } } is the Moyal bracket, the transform of the quantum commutator.
The evolution equation for the Wigner function is then analogous to that of its classical limit, the Liouville equation of classical physics. In the limit of vanishing Planck's constant ħ,W(q,p,t) reduces to the classical Liouville probability density function in phase space.
The classical Liouville equation can be solved using the method of characteristics for partial differential equations, the characteristic equations being Hamilton's equations. The Moyal equation in quantum mechanics similarly admits formal solutions in terms of quantum characteristics, predicated on the ∗−product of phase space, although, in actual practice, solution-seeking follows different methods.

## Composite Systems

The joint density matrix of a composite system of two systems A and B is described by $\rho_{AB}$. Then the subsystems are described by their reduced density operator.
$\rho_A=\operatorname{tr}_B\rho_{AB}$
$\operatorname{tr}_B$ is called partial trace over system B. If A and B are two distinct and independent systems then $\rho_{AB}=\rho_{A}\otimes\rho_{B}$ which is a product state.

## C*-algebraic formulation of states

It is now generally accepted that the description of quantum mechanics in which all self-adjoint operators represent observables is untenable.[10][11] For this reason, observables are identified to elements of an abstract C*-algebra A (that is one without a distinguished representation as an algebra of operators) and states are positive linear functionals on A. However, by using the GNS construction, we can recover Hilbert spaces which realize A as a subalgebra of operators.
Geometrically, a pure state on a C*-algebra A is a state which is an extreme point of the set of all states on A. By properties of the GNS construction these states correspond toirreducible representations of A.
The states of the C*-algebra of compact operators K(H) correspond exactly to the density operators and therefore the pure states of K(H) are exactly the pure states in the sense of quantum mechanics.
The C*-algebraic formulation can be seen to include both classical and quantum systems. When the system is classical, the algebra of observables become an abelian C*-algebra. In that case the states become probability measures, as noted in the introduction.

## Notes and references

1. ^ Fano, Ugo (1957), "Description of States in Quantum Mechanics by Density Matrix and Operator Techniques", Reviews of Modern Physics 29: 74–93, Bibcode 1957RvMP...29...74F,doi:10.1103/RevModPhys.29.74.
2. ^ von Neumann, John (1927), "Wahrscheinlichkeitstheoretischer Aufbau der Quantenmechanik", Göttinger Nachrichten 1: 245–272.
3. ^ Landau, L. D. (1927), "Das Dämpfungsproblem in der Wellenmechanik", Zeitschrift für Physik 45 (5–6): 430–441, Bibcode 1927ZPhy...45..430Ldoi:10.1007/BF01343064
4. ^ Landau, L. D., and Lifshitz, E. M. (1977), Quantum Mechanics, Non-Relativistic Theory: Volume 3, Oxford: Pergamon Press, pp. 41, ISBN 0-08-017801-4
5. ^ Technically, F must be a Borel function
6. ^ Nielsen, Michael; Chuang, Isaac (2000), Quantum Computation and Quantum InformationCambridge University PressISBN 978-0-521-63503-5. Chapter 11: Entropy and information, Theorem 11.9, "Projective measurements cannot decrease entropy"
7. ^ Everett, Hugh (1973), "The Theory of the Universal Wavefunction (1956) Appendix I. "Monotone decrease of information for stochastic processes"", The Many-Worlds Interpretation of Quantum Mechanics, Princeton Series in Physics, Princeton University Press, pp. 128–129, ISBN 978-0-691-08131-1
8. ^
9. ^
10. ^ See appendix, Mackey, George Whitelaw (1963), Mathematical Foundations of Quantum Mechanics, Dover Books on Mathematics, New York: Dover PublicationsISBN 978-0-486-43517-6
11. ^ Emch, Gerard G. (1972), Algebraic methods in statistical mechanics and quantum field theoryWiley-InterscienceISBN 978-0-471-23900-0