3.2 Beam-splitters: quantum interference, revisited

One of the simplest quantum devices in which quantum interference can be controlled is a Mach–Zehnder interferometer — see Figure 3.5.67

The Mach–Zehnder interferometer, with the input photon represented by the coloured dot. This experimental set-up is named after the physicists Ludwig Mach and Ludwig Zehnder, who proposed it back in 1890s.

Figure 3.5: The Mach–Zehnder interferometer, with the input photon represented by the coloured dot. This experimental set-up is named after the physicists Ludwig Mach and Ludwig Zehnder, who proposed it back in 1890s.

This is a slightly modified version of the construction shown in Figure 3.2, where we have added two slivers of glass of different thickness into each of the optical paths connecting the two beam-splitters. The slivers are usually referred to as “phase shifters”, and their thicknesses φ0\varphi_0 and φ1\varphi_1 are measured in units of the photon’s wavelength multiplied by 2π2\pi. These phase shifters are so called because they modify the probability amplitudes by phase factors eiφ0e^{i\varphi_0} and eiφ1e^{i\varphi_1}, respectively. The only other change that we make is replacing the symmetric beam-splitters with non-symmetric ones, i.e. they no longer transmit or reflect with equal probability, but instead reflect with some (fixed) probability amplitude iRi\sqrt{R} and transmit with some probability amplitude T\sqrt{T}, where R+T=1R+T=1. As before, the two input ports of the interferometer are labelled as 0|0\rangle and 1|1\rangle, and each of the two output ports, also labelled as 00 and 11, terminates in a photodetector.

A photon (the coloured dot in the figure) impinges on the first beam-splitter from one of the two input ports (here input 0|0\rangle) and begins its journey towards one of the two photodetectors. Let68 UijU_{ij} denote the probability amplitude that the photon initially in input port j=0,1j=0,1 ends up in detector i=0,1i=0,1.

In quantum theory we almost always work with probability amplitudes, and only once we have a full expression for the amplitude of a given outcome do we square its absolute value to get the corresponding probability.

For example, let us calculate U00U_{00}. We notice that there are two alternative ways for the photon in the input port 0|0\rangle to end up at the output port 00: It can take the lower path, through the phase shifter φ0\varphi_0, or the upper path, through the phase shifter φ1\varphi_1. The lower path implies two consecutive transmissions at the beam-splitters and the phase factor eiφ0e^{i\varphi_0}, whereas the upper path implies two consecutive reflections and the phase factor eiφ1e^{i\varphi_1}. Once the photon ends in the output port 00 there is no way of knowing which path was taken, so we add the amplitudes pertaining to each path. The resulting amplitude is U00=Teiφ0T+iReiφ1iR, U_{00} = \sqrt{T} e^{i\varphi_0} \sqrt{T} + i\sqrt{R} e^{i\varphi_1} i \sqrt{R}, and the corresponding probability P00=U002P_{00}=|U_{00}|^2 is P00=Teiφ0T+iReiφ1iR2=Teiφ0Reiφ12=T2+R22TRcos(φ1φ0). \begin{aligned} P_{00} & = \left\vert \sqrt{T}e^{i\varphi_0}\sqrt{T} + i\sqrt{R}e^{i\varphi_1}i\sqrt{R} \right\vert^2 \\&= \left\vert Te^{i\varphi_0} - Re^{i\varphi_1} \right\vert^2 \\& = T^2 + R^2 - 2TR\cos(\varphi_1-\varphi_0). \end{aligned}

The “classical” part of this expression, T2+R2T^2+R^2, basically says that the photon undergoes either two consecutive transmissions with probability T2T^2, or two consecutive reflections with probability R2R^2. The probability of being transmitted through any phase shifter is always 11, hence the phase shifters play no role in the classical description of this process. But the classical description is not correct — it doesn’t agree with physical experiments! — whence the interference term 2TRcos(φ1φ0)2TR\cos(\varphi_1-\varphi_0) in which the phase shifters play the essential role. Depending on the relative phase φ=φ1φ0\varphi=\varphi_1-\varphi_0 the probability that the detector 00 “clicks” after having fired a photon into input 0|0\rangle can vary from (TR)2(T-R)^2 to 11.

If we do not care about the experimental details, we can represent the action of the Mach–Zehnder interferometer in terms of a diagram, as in Figure 3.6.

The Mach--Zehnder interferometer represented as an abstract diagram.

Figure 3.6: The Mach–Zehnder interferometer represented as an abstract diagram.

Here we can follow, from left to right, the multiple different paths that a photon can take in between specific input and output ports. The amplitude along any given path is just the product of the amplitudes pertaining to the path segments (Rule 1, Section 1.1), while the overall amplitude is the sum of the amplitudes for the many different paths (Rule 2, Section 1.1). You can, for example, see that the probability amplitude U10U_{10} is given by U10=Teiφ0iR+iReiφ1T U_{10} = \sqrt{T}e^{i\varphi_0}i\sqrt{R} + i\sqrt{R}e^{i\varphi_1}\sqrt{T} and the corresponding probability is P10=Teiφ0iR+iReiφ1T2=2RT+2RTcos(φ1φ0). \begin{aligned} P_{10} & = \left\vert \sqrt{T}e^{i\varphi_0}i\sqrt{R} + i\sqrt{R}e^{i\varphi_1}\sqrt{T} \right\vert^2 \\& = 2RT + 2RT\cos(\varphi_1-\varphi_0). \end{aligned} Again, the first term is of “classical” origin and represents probabilities corresponding to each path: one reflection followed by one transmission plus one transmission followed by one reflection, that is, RT+TR=2RTRT+TR=2RT. The second term is the interference term. Clearly, the photon entering port 0|0\rangle will end up in one of the two detectors, hence P00+P10=R2+2RT+T2=(T+R)2=1. P_{00} + P_{10} = R^2 + 2RT + T^2 = (T+R)^2 = 1. The action of the interferometer is thus fully described69 by the four probability amplitudes UijU_{ij} (i,j=0,1i,j=0,1).

The most popular instance of a Mach–Zehnder interferometer involves only symmetric beam-splitters (i.e. R=T=12R=T=\frac{1}{2}) and is fully described by the matrix70 U=[sinφ/2cosφ/2cosφ/2sinφ/2] U = \begin{bmatrix} -\sin\varphi/2 & \cos\varphi/2 \\\cos\varphi/2 & \sin\varphi/2 \end{bmatrix} where φ=φ1φ0\varphi=\varphi_1-\varphi_0.


  1. You can play around with a virtual Mach–Zehnder interferometer at Quantum Flytrap’s Virtual Lab. (There are also lots of other things you can do in this virtual lab — go have a look!).↩︎

  2. We will often use ii as an index even though it is also used for the imaginary unit. Hopefully, no confusion will arise for it should be clear from the context which one is which.↩︎

  3. Any isolated quantum device can fully be described by the matrix of probability amplitudes UijU_{ij} that input jj generates output ii.↩︎

  4. Really, when you write down the matrices describing the action of the symmetric beam-splitters and the phase gates, and then multiply them all together (which is an exercise worth doing!), you actually obtain ieiφ0+φ12Uie^{i\frac{\varphi_0+\varphi_1}{2}}U rather than UU, but as we have already said, we can ignore global phase factors.↩︎