Electromagnetic wave
From Citizendium, the Citizens' Compendium
Paul Wormer (Talk  contribs) m 
Revision as of 14:10, 14 January 2009

In physics, an electromagnetic wave is a change, periodic in space and time, of an electric field E(r,t) and a magnetic field B(r,t). A stream of electromagnetic waves is referred to as electromagnetic radiation. Because an electric as well as a magnetic field is involved, the term electromagnetic (EM) is used, a blend of electric and magnetic. Examples of EM waves in order of increasing wavelength are: gamma rays, X-rays, ultraviolet light, visible light, infrared, microwaves, and radio waves. All these waves propagate in vacuum with the same speed c, the speed of light. The speed of light in air (at standard temperature and pressure) is very close to the speed of light in vacuum (the refractive index of air, n, is 1.0002926, meaning that the speed of electromagnetic waves in air is c/n ≈ c).
Classically (non-quantum mechanically), EM radiation is produced by accelerating charges, for instance, by the oscillating charge in a radio antenna. Quantum mechanically, EM radiation is emitted whenever a system in an energetically high state (of energy E_{2}) makes a transition to a state of lower energy (E_{1}); during this transition a photon (light quantum) of energy equal to the difference E_{2} − E_{1} > 0 is emitted. This is what happens in a fluorescent tube: mercury atoms are brought into an energetically high state by collisions with electrons, and upon subsequently falling back to their lowest energy state they emit photons.
Electromagnetic waves were predicted on theoretical grounds by James Clerk Maxwell in 1861 and first emitted and received in the laboratory by Heinrich Hertz a quarter century later. The first to see the applicability for communication purposes was the inventor of radiotelegraphy, Guglielmo Marconi (around 1900). Later applications are radio, television, radar, cellular phones, GPS, and all kinds of wireless applications, from remote control of television sets to internet access in public places.
Properties
In figure 1 we see a snapshot (i.e., a picture at a certain point in time) of the magnetic and electric fields in adjacent points of space. At each point, the vector E is perpendicular to the vector B. The wave propagates to the right, along an axis which we conveniently refer to as the z-axis. Both E and B are perpendicular to the propagation direction, which is expressed by stating that an electromagnetic wave is a transverse wave, in contrast to sound waves, which are longitudinal waves (i.e., air molecules vibrate parallel to the propagation direction of the sound).
Assume that the snapshot in figure 1 is taken at time t, then at a certain point z we see an arrow of certain length representing E(z,t) and also a vector B(z,t). At a point in time Δt later, the same values of E and B (same arrows) are seen at z + c Δt. The arrows seem to have propagated to the right with a speed c.
The time t is fixed and the position z varies in figure 1. Conversely, we can keep the position fixed and imagine what happens if time changes. Focus on a fixed point z: as time progresses, the two vectors E(z,t) and B(z,t) at the point z grow to a maximum value, then shrink to zero, become negative, go to a minimum value, and grow again, passing through zero on their way to the same maximum value. This cycle is repeated indefinitely. When we now plot E and B at the fixed point z as a function of time t, we see the same type of (sine-type) function as in figure 1. The number of times per second that the vectors go through a full cycle is the frequency of the electromagnetic wave.
Periodicity in space means that the EM wave is repeated after a certain distance. This distance, the wavelength, is traditionally designated by λ (see figure 1). If, at a fixed time, we go a distance λ to the right or to the left, we encounter the very same fields E and B.
Basically, the only property distinguishing different kinds of EM waves is their wavelength, see figure 2. Note the enormous span in wavelengths, from one trillionth of a millimeter for gamma rays (radioactive rays) up to the VLF (very low frequency) radio waves of about 100 kilometers.
Frequency of electromagnetic waves
Often EM waves are characterized by their frequency ν, instead of their wavelength λ. If the EM field goes through ν full cycles in a second, the field has a frequency of ν Hz (hertz). The speed of propagation of the EM waves being c, in 1/ν seconds the wave propagates a distance c × (1/ν) meter (according to the formula: distance traveled is speed times traveling time). The distance covered in 1/ν seconds is by definition the wavelength λ:
    λ = c/ν.
If we express c in m/s then λ is obtained in m. To convert quickly from wavelength to frequency we can approximate c by 3·10^{8} m/s.
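The quick conversion between wavelength and frequency described above can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, and the rounded value c ≈ 3·10^{8} m/s from the text is used rather than the exact speed of light.

```python
C_APPROX = 3.0e8  # m/s, the rounded speed of light used in the text


def frequency_from_wavelength(wavelength_m, c=C_APPROX):
    """Return the frequency in Hz for a wavelength in metres (nu = c / lambda)."""
    if wavelength_m <= 0:
        raise ValueError("wavelength must be positive")
    return c / wavelength_m


def wavelength_from_frequency(frequency_hz, c=C_APPROX):
    """Return the wavelength in metres for a frequency in Hz (lambda = c / nu)."""
    if frequency_hz <= 0:
        raise ValueError("frequency must be positive")
    return c / frequency_hz


if __name__ == "__main__":
    print(frequency_from_wavelength(6.0e-7))  # visible light, 600 nm
    print(wavelength_from_frequency(100e6))   # a 100 MHz radio signal
```

Because the same relation λν = c is used in both directions, converting a value back and forth returns the starting value up to rounding.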
As was pointed out above, the wavelengths of the various parts of the EM spectrum differ many orders of magnitude. Furthermore, the sources of the radiations, the interactions with matter, and the detectors employed differ widely, too. So, it is not surprising that in the past different parts of the spectrum were discovered at different times and that electromagnetic radiation of different wavelengths is called by different names. In the table and in figure 2 some illustrative values are given for several kinds of EM waves, together with their names.
 
Some typical values of wavelength (λ), frequency (ν = c/λ), photon energy (hν), cycle time (T = 1/ν), and inverse wavelength [1/(100⋅λ), with λ in m].

EM wave     | λ (m)         | ν (1/s)      | hν (J)        | T (s)         | 1/λ (cm^{−1})
γ-rays      | 1.00⋅10^{−14} | 3.00⋅10^{22} | 1.98⋅10^{−11} | 3.33⋅10^{−23} | 1.00⋅10^{12}
X-rays      | 5.00⋅10^{−10} | 6.00⋅10^{17} | 3.96⋅10^{−16} | 1.67⋅10^{−18} | 2.00⋅10^{7}
Ultraviolet | 2.00⋅10^{−7}  | 1.50⋅10^{15} | 9.90⋅10^{−19} | 6.67⋅10^{−16} | 5.00⋅10^{4}
Visible     | 6.00⋅10^{−7}  | 5.00⋅10^{14} | 3.30⋅10^{−19} | 2.00⋅10^{−15} | 1.67⋅10^{4}
Infrared    | 5.00⋅10^{−6}  | 6.00⋅10^{13} | 3.96⋅10^{−20} | 1.67⋅10^{−14} | 2.00⋅10^{3}
Microwave   | 1.00⋅10^{−2}  | 3.00⋅10^{10} | 1.98⋅10^{−23} | 3.33⋅10^{−11} | 1.00
Radio       | 1.00⋅10^{2}   | 3.00⋅10^{6}  | 1.98⋅10^{−27} | 3.33⋅10^{−7}  | 1.00⋅10^{−4}
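The derived columns of the table follow from the wavelength alone. The sketch below recomputes one row; it assumes the rounded constants c = 3.0·10^{8} m/s and h = 6.6·10^{−34} J s, which appear to be the values behind the tabulated numbers, and the function and key names are made up for this illustration.

```python
C = 3.0e8           # m/s, rounded speed of light
H_PLANCK = 6.6e-34  # J s, rounded Planck constant (assumed to match the table)


def spectrum_row(wavelength_m):
    """Compute the derived table columns for a wavelength given in metres."""
    frequency = C / wavelength_m
    return {
        "frequency_hz": frequency,                           # nu = c / lambda
        "photon_energy_j": H_PLANCK * frequency,             # h * nu
        "period_s": 1.0 / frequency,                         # T = 1 / nu
        "wavenumber_per_cm": 1.0 / (100.0 * wavelength_m),   # 1 / lambda in cm^-1
    }


if __name__ == "__main__":
    for name, value in spectrum_row(6.0e-7).items():  # the "Visible" row
        print(f"{name}: {value:.3g}")
```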
Monochromatic linearly polarized waves
The wave depicted in figure 1 is monochromatic, i.e., it is characterized by a single wavelength (monochromatic means "of one color"; in the visible region, different wavelengths correspond to light of different colors). It is known that EM waves can be linearly superimposed, which is due to the fact that they are solutions of a linear partial differential equation, the wave equation (see next section). A linear superposition of waves is a solution of the same wave equation as the waves themselves. Such a superposition is also an electromagnetic wave (a propagating periodic EM field). If waves of different wavelengths are superimposed, then a non-monochromatic wave is obtained (the term multichromatic wave would be apt, but is never used). By means of Fourier analysis a non-monochromatic wave can be decomposed into its monochromatic components.
The electric field vectors in figure 1 are all in one plane, the plane of polarization. A wave with one fixed polarization plane is called linearly polarized.
The radiation of many lasers is monochromatic and linearly polarized (at least to a very good approximation).
Relation to Maxwell's equations
In this section it will be shown that the electromagnetic wave depicted in figure 1 is a solution of the Maxwell equations in the vacuum.
We assume that at some distance away from the source of EM waves (a radio transmitter, a laser, gamma-radiating nuclei, etc.), there is no charge density ρ and no current density J. For that region of space, the microscopic (vacuum) Maxwell equations become (in SI units):
    ∇ · B = 0,    ∇ · E = 0,    ∇ × B = (1/c^{2}) ∂E/∂t
and
    ∇ × E = − ∂B/∂t.
Apply to the last Maxwell equation the following relation, known from vector analysis and valid for any (differentiable) vector field V,
    ∇ × (∇ × V) = ∇(∇ · V) − ∇^{2}V,
and use that ∇ · E = 0, then E satisfies the wave equation,
    ∇^{2}E = (1/c^{2}) ∂^{2}E/∂t^{2}.
Note that the displacement current (time derivative of E) is essential in this equation: if it were absent (zero), the field E would be a static, time-independent, electric field, and there would be no waves.
In the very same way we derive a wave equation for B,
    ∇^{2}B = (1/c^{2}) ∂^{2}B/∂t^{2}.
Observe that E and B are related by the third and fourth Maxwell equation, which express the fact that a displacement current causes a magnetic field, and a changing magnetic field causes an electric field (Faraday's law of induction), respectively. So a time-dependent electric field that is not associated with a time-dependent magnetic field cannot exist, and conversely. Indeed, in special relativity E and c B can be transformed into one another by a Lorentz transformation of the electromagnetic field tensor, which shows their close relationship.
The wave equation is without doubt the most widely studied differential equation in mathematical physics. In figure 1 the electric field depicts a particular solution, with special initial and boundary conditions. The snapshot that is depicted has the analytic form
    E(r, t) = e_{x} E_{0} sin[k(z − ct)]    with    k ≡ 2π/λ.
The snapshot is taken at t = 2πn/(ck) for some arbitrary integer n. We assumed here that the direction of E defines the direction of the x-axis with unit vector e_{x} along this axis. The quantity E_{0} is the amplitude of the wave. Insertion of this expression in the left-hand side of the wave equation for E gives
    ∇^{2}E = e_{x} E_{0} ∂^{2} sin[k(z − ct)]/∂z^{2} = − k^{2} e_{x} E_{0} sin[k(z − ct)].
Insertion of this expression in the right-hand side of the wave equation for E gives
    (1/c^{2}) ∂^{2}E/∂t^{2} = e_{x} (E_{0}/c^{2}) ∂^{2} sin[k(z − ct)]/∂t^{2} = − k^{2} e_{x} E_{0} sin[k(z − ct)],
so that it follows that the special solution, depicted in figure 1, is indeed a solution of the wave equation for E.
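The same check can be made numerically. The sketch below assumes the plane-wave form E_{x}(z, t) = E_{0} sin[k(z − ct)] and compares central finite-difference estimates of ∂^{2}E_{x}/∂z^{2} and (1/c^{2}) ∂^{2}E_{x}/∂t^{2}. The wavelength, amplitude, step size, and function names are illustrative choices, not values from the article.

```python
import math

C = 3.0e8                        # speed of light in m/s (rounded, as in the text)
WAVELENGTH = 3.0                 # metres; illustrative value
K = 2.0 * math.pi / WAVELENGTH   # wave number k = 2*pi/lambda
E0 = 1.0                         # amplitude in V/m; illustrative value


def e_x(z, t):
    """x-component of the assumed plane wave E_0 sin[k (z - c t)]."""
    return E0 * math.sin(K * (z - C * t))


def second_difference(f, x, h):
    """Central finite-difference estimate of the second derivative of f at x."""
    return (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)


def wave_equation_sides(z, t, step_m=3.0e-4):
    """Return (d2E/dz2, (1/c^2) d2E/dt2), both estimated by finite differences."""
    step_s = step_m / C  # time step giving the same phase change as step_m
    lhs = second_difference(lambda zz: e_x(zz, t), z, step_m)
    rhs = second_difference(lambda tt: e_x(z, tt), t, step_s) / C**2
    return lhs, rhs


if __name__ == "__main__":
    lhs, rhs = wave_equation_sides(1.0, 0.0)
    print(lhs, rhs, -K**2 * e_x(1.0, 0.0))
```

Both sides agree with each other and with −k^{2}E_{x} up to the discretization error of the finite differences, which shrinks as the step is reduced (until rounding error takes over).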
We could now proceed in the very same way and solve the wave equation for B, but then we could easily overlook the relation between the two fields. So we rather substitute the solution for E into the fourth Maxwell equation and use the definition of curl as a determinant (only E_{x} is nonzero and it depends only on z),
    ∇ × E = e_{y} ∂E_{x}/∂z = e_{y} k E_{0} cos[k(z − ct)] = − ∂B/∂t.
It is easy to see that
    B(r, t) = e_{y} B_{0} sin[k(z − ct)]    with    B_{0} ≡ E_{0}/c
is a solution of this equation. It follows that E and B are perpendicular (along the x-axis and y-axis, respectively) and are in phase. That is, E and B are simultaneously zero and attain simultaneously their maximum and minimum. The fact that (in vacuum) the amplitude of B is a factor c smaller than that of E is due to the use of SI units, in which the amplitudes have different dimensions. In Gaussian units this is not the case and E_{0} = B_{0}.
Energy and energy flow in electromagnetic field
Relation between power densities
In this section the following balance of power (energy per unit time) densities will be discussed:
    ∂ℰ_{Field}/∂t + ∇ · S = − E · J        (1)
where
    ℰ_{Field} ≡ ½ (ε_{0} E · E + B · B/μ_{0})    and    S ≡ E × H    with    H ≡ B/μ_{0}.
The terms in equation (1) have dimension W/m^{3} (watt per unit volume). The quantity −E · J represents the rate at which energy is produced per unit volume by ordinary Joulean (resistance) heating. The quantity ℰ_{Field} is the energy density of the EM field and the time derivative ∂ℰ_{Field}/∂t is its rate of increase (power density). The vector S, Poynting's vector, is the power flux, the amount of energy crossing unit area perpendicular to the vector, per unit time.
Multiplying the terms in equation (1) by a volume ΔV and a time span Δt, both small enough that the terms in the equation may be assumed to be constant over the volume and the time span, the equation represents the conservation of energy: the energy generated in ΔVΔt (the right-hand side) is equal to the increase in energy in ΔV plus the net flow of energy leaving ΔV. Hence, when this equation is multiplied by ΔVΔt it is an equation of continuity for energy.
Intensity
The intensity I of an electromagnetic wave is by definition the modulus of the Poynting vector, the amount of energy carried by the wave across a unit surface in unit time [dimension: volt×ampere/meter^{2} = joule/(second×m^{2}) = W/m^{2}],
    I ≡ |S| = |E × H|.
Use E ⋅ H = 0 and that in vacuum and SI units
    |H| = |B|/μ_{0} = ε_{0} c |E|,
then
    I = ε_{0} c E^{2}.
Often I is time-averaged over a complete cycle. Since the average of cos^{2} and sin^{2} over a cycle is 1/2, we get for the time-averaged intensity:
    ⟨I⟩ = ½ ε_{0} c E_{0}^{2},
where E_{0} is the amplitude of the electric field.
Clearly, the cycle-averaged intensity of a wave going through vacuum is constant, independent of the position z along the direction of propagation. This is because in vacuum the wave does not lose energy to the medium. In a medium this may be different: ⟨I⟩ may decrease because of energy loss. The Lambert-Beer law states in that case
    ⟨I(z)⟩ = ⟨I(0)⟩ exp[− kz],
where k here denotes an absorption coefficient (not the wave number).
Hence the electromagnetic wave is damped by the factor exp[− kz/2]. This damping gives rise to an imaginary component of the index of refraction.
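The cycle average and the damping can be illustrated numerically. The sketch below assumes the standard vacuum relation I = ε_{0} c E^{2} for the instantaneous intensity and an exponential Lambert-Beer decay I(z) = I(0) exp[−kz], inferred from the amplitude damping factor exp[−kz/2] quoted above. The function names and the sample values are hypothetical.

```python
import math

EPSILON_0 = 8.854e-12  # F/m, electric constant (rounded)
C = 3.0e8              # m/s, rounded speed of light


def instantaneous_intensity(e0, phase):
    """Intensity eps0 * c * E^2 for a field E = e0 * sin(phase)."""
    return EPSILON_0 * C * (e0 * math.sin(phase)) ** 2


def cycle_averaged_intensity(e0, samples=1000):
    """Average the instantaneous intensity over one full cycle of the phase."""
    total = sum(
        instantaneous_intensity(e0, 2.0 * math.pi * i / samples)
        for i in range(samples)
    )
    return total / samples


def attenuated_intensity(i0, absorption_coeff, z):
    """Lambert-Beer decay of the cycle-averaged intensity after a path length z."""
    return i0 * math.exp(-absorption_coeff * z)


if __name__ == "__main__":
    e0 = 2.0  # V/m, illustrative amplitude
    print(cycle_averaged_intensity(e0), 0.5 * EPSILON_0 * C * e0**2)
```

Since the intensity is proportional to the square of the field amplitude, taking the square root of the attenuated intensity reproduces the amplitude damping factor exp[−kz/2].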
Derivation of relation between power densities
Equation (1), the balance of power, will be proved. Recall from elementary electricity theory the laws of Joule and Ohm. They state that the amount of energy W per unit time, produced by a conduction current I, is equal to
where R is the resistance and V a voltage difference.
Assuming that the current flows along z, we introduce the current density J_{z}, and using
we obtain
We could continue discussing the system with the small volume ΔV. However, because all terms in the equations would be multiplied by the same volume, it is more convenient to consider densities and to divide out the volume. Nevertheless, we still refer to the system. Thus, we define the power density
    − E · J.
This negative quantity is the loss of energy of the system per unit time and per unit volume (according to Joule's and Ohm's laws). One may look upon E · J as the work (per unit time and unit volume) done by the Lorentz force on the moving particles constituting the current density J. Since (v × B) · v = 0 for a particle moving with velocity v, this work depends only on the electric field E.
Apply one of Maxwell's equations (now including the current density J):
    ∇ × B = μ_{0} J + μ_{0} ε_{0} ∂E/∂t    ⟹    − E · J = ε_{0} E · ∂E/∂t − (1/μ_{0}) E · (∇ × B).
Use a rule known from vector analysis and apply another one of Maxwell's equations,
    ∇ · (E × B) = B · (∇ × E) − E · (∇ × B)    and    ∇ × E = − ∂B/∂t,
so that
    − E · J = ε_{0} E · ∂E/∂t + (1/μ_{0}) B · ∂B/∂t + (1/μ_{0}) ∇ · (E × B).
Define
    ℰ_{Field} ≡ ½ (E · D + B · H)
with
    D ≡ ε_{0} E    and    H ≡ B/μ_{0},
where ε_{0} is the electric constant and μ_{0} the magnetic constant of the vacuum. Define also
    S ≡ E × H,
where S is the Poynting vector, named after John Henry Poynting. This vector is perpendicular to the plane of E and B and, by the right-hand rule, it points in the direction of propagation of the EM wave. The divergence of the Poynting vector is the energy flow associated with the electromagnetic wave, i.e., with the pair E(r,t) and B(r,t). By definition ∇·S gives the flow leaving the system and − ∇·S gives the flow entering the system. The total energy balance becomes
    ∂ℰ_{Field}/∂t + ∇ · S = − E · J.
Here we have found an example of the conservation of energy, known as Poynting's theorem: the energy produced per unit time according to Joule's law is equal to the rate of increase of the electromagnetic energy of the system, ∂ℰ_{Field}/∂t, plus the flow of EM radiation ∇·S leaving the system.
If there is no current, J = 0, then
    ∂ℰ_{Field}/∂t = − ∇ · S,
which is the continuity equation. The increase of field energy per unit time is the flow of radiation energy into the system.
Example
As an example we give an order-of-magnitude calculation of an electromagnetic energy density. Consider to that end a radio station with a signal of strength P kW. We compute the energy density at a distance R from the station. First we must assume what the shape is of the waves emitted by the antenna: are they spherical or cylindrical? We choose the latter and call the cylinder height z. Further it is assumed that the power density is homogeneous and that all power crosses the cylindrical walls, that is, power crossing the top and bottom of the cylinder is assumed to be zero. Also no absorption by the atmosphere or the Earth will occur. When a steady state is reached (some time after the beginning of the transmission), the time derivative of the field energy density ℰ_{Field} vanishes. The power P then crosses the cylinder wall of area 2πRz with flux ℰ_{Field} c, so that the energy density at a distance R becomes constant in time,
    ℰ_{Field} = P / (2π R z c),
where c is the speed of propagation of the radio signal (the speed of light ≈ 3·10^{8} m/s).^{[1]} To give a numerical example: P = 100 kW, R = 5 km (about 3 miles), z = 50 m, then ℰ_{Field} = 2.1·10^{−10} J/m^{3}.
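The numerical example can be reproduced with a short calculation. The sketch assumes that the steady-state energy density of the cylindrical model is the transmitted power divided by the wall area 2πRz times c, which is consistent with the quoted result; the function name is hypothetical.

```python
import math


def cylindrical_energy_density(power_w, radius_m, height_m, c=3.0e8):
    """Energy density (J/m^3) when power_w crosses a cylinder wall at speed c.

    Assumes a homogeneous flux through the side wall of area 2*pi*R*z and
    no flow through the top or bottom, as in the model described in the text.
    """
    wall_area = 2.0 * math.pi * radius_m * height_m
    return power_w / (wall_area * c)


if __name__ == "__main__":
    # P = 100 kW, R = 5 km, z = 50 m
    print(cylindrical_energy_density(100e3, 5e3, 50.0))
```

In this model the energy density falls off as 1/R, whereas a spherical wave model would give a 1/R^{2} dependence.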
Fourier expansions of the fields
The quantization of the EM fields leads to photons, light quanta of well-defined energy and momentum. In field quantization a key role is played by the Fourier expansion of the different vector fields. Hence, we will now discuss the Fourier expansions of the fields E, B, and A. It will be seen that the expansion of the vector potential A yields immediately the expansions of the fields E and B.
Fourier expansion of a vector field
For an arbitrary real vector field F its Fourier expansion is the following:
where the bar indicates complex conjugation. Such an expansion, labeled by a discrete (countable) set of vectors k, is always possible when F satisfies periodic boundary conditions, i.e., F(r + p,t) = F(r,t) for some finite vector p. To impose such boundary conditions, it is common to consider EM waves as if they are in a virtual box of finite volume V. Waves on opposite walls of the box are enforced to have the same value (usually zero). Note that the waves are not restricted to the box: the box is replicated an infinite number of times in x, y, and z direction.
Vector potential and its expansion
The magnetic field B is a transverse field and hence can be written as
    B = ∇ × A,
in which the vector potential A is introduced. Also the electric field E is transverse, because earlier we assumed absence of charge distributions. The electric field E also follows from A,
    E = − ∂A/∂t.
The fact that E can be written this way is due to the choice of Coulomb gauge for A:
    ∇ · A = 0.
By definition, a choice of gauge does not affect any measurable properties (the best known example of a choice of gauge is the fixing of the zero of an electric potential, for instance at infinity). The Coulomb gauge makes A transverse as well, and clearly A is in the same plane as E. (The time differentiation does not affect direction.) So, the vector fields A, B, and E are all in a plane perpendicular to the propagation direction and can be written in terms of e_{x} and e_{y} (in the definition of figure 1). It is more convenient to choose complex unit vectors:
which are orthonormal,
The Fourier expansion of the vector potential reads
The vector potential obeys the wave equation. The substitution of the Fourier series of A into the wave equation yields for the individual terms,
It is now an easy matter to construct the corresponding Fourier expansions for E and B from the expansion of the vector potential A. For instance, the expansion for E follows from differentiation with respect to time,
Fourier-expanded energy
The electromagnetic energy density, defined earlier in this article, can be expressed in terms of the Fourier coefficients. We define the total energy (classical Hamiltonian) of a finite volume V by
The classical Hamiltonian in terms of Fourier coefficients takes the form
with
and ε_{0} is the electric constant. The two terms in the summand of H are identical (the factors commute) and may be summed. However, after quantization (interpretation of the expansion coefficients as operators) the factors no longer commute and, according to quantum mechanical rules, one must start from the symmetrized classical Hamiltonian.
Fourier-expanded momentum
The electromagnetic momentum, P_{EM}, of EM radiation enclosed by a volume V is proportional to an integral of the Poynting vector (see above). In SI units:
Quantization of the electromagnetic field
Einstein postulated in 1905 that an electromagnetic field consists of energy parcels (light quanta, later called photons) of energy hν, where h is Planck's constant. In 1927 Paul A. M. Dirac was able to fit the photon concept into the framework of the new quantum mechanics. He applied a technique which is now generally called second quantization,^{[2]} although this term is somewhat of a misnomer for EM fields, because they are, after all, solutions of the classical Maxwell equations, and it is the first time that they are quantized.
Second quantization
Second quantization starts with an expansion of a field in a basis consisting of a complete set of functions. The coefficients multiplying the basis functions are then interpreted as operators and (anti)commutation relations between these new operators are imposed, commutation relations for bosons and anticommutation relations for fermions (nothing happens to the basis functions themselves). By doing this, the expanded field is converted into a fermion or boson operator field. The expansion coefficients have become creation and annihilation operators: a creation operator creates a particle in the corresponding basis function and an annihilation operator annihilates a particle in this function.
In the case of EM fields the required expansion of the field is the Fourier expansion.
Quantization of EM field
The best known example of quantization is the replacement of the time-dependent linear momentum of a particle by the rule
    p(t) → − iħ ∇.
Note that Planck's constant (in the reduced form ħ = h/2π) is introduced here and that time disappears (in the so-called Schrödinger picture).
For the EM field we do something similar and apply the quantization rules:
subject to the boson commutation relations
The square brackets indicate a commutator, defined by
    [A, B] ≡ AB − BA
for any two quantum mechanical operators A and B.
The quantized fields (operator fields) are the following
where ω = c |k| = ck.
Hamiltonian of the field
Substitution of the operators into the classical Hamiltonian gives the Hamilton operator of the EM field
By the use of the commutation relations the second line follows from the first. Note that ħω = hν, which is the well-known Einstein expression for photon energy. Remember that ω depends on k, even though it is not explicit in the notation. The notation ω(k) could have been introduced, but is not common.
Digression: harmonic oscillator
The second quantized treatment of the one-dimensional quantum harmonic oscillator is a well-known topic in quantum mechanical courses. We digress and say a few words about it. The harmonic oscillator Hamiltonian has the form
    H = ħω (a^{†}a + ½),
where ω ≡ 2πν is the fundamental frequency of the oscillator. The ground state of the oscillator is designated by | 0 ⟩ and is referred to as the vacuum state. It can be shown that a^{†} is an excitation operator: it excites from an n-fold excited state to an (n+1)-fold excited state:
    a^{†} | n ⟩ = √(n+1) | n+1 ⟩.
Since harmonic oscillator energies are equidistant, the n-fold excited state | n ⟩ can be looked upon as a single state containing n particles (sometimes called vibrons) all of energy hν. These particles are bosons. For obvious reasons the excitation operator a^{†} is called a creation operator.
From the commutation relation [a, a^{†}] = 1 it follows that the Hermitian adjoint a de-excites:
    a | n ⟩ = √n | n−1 ⟩,    in particular    a | 0 ⟩ = 0,
because a function times the number 0 is the zero function. For obvious reasons the de-excitation operator a is called an annihilation operator.
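The action of creation and annihilation operators can be made concrete with small matrices. The sketch below builds the annihilation operator in a truncated number basis | 0 ⟩ … | dim−1 ⟩, assuming the standard normalization a | n ⟩ = √n | n−1 ⟩ (an assumption about the convention). All function names are hypothetical, and the truncation means the commutator [a, a^{†}] = 1 holds only away from the last basis state.

```python
import math


def annihilation_matrix(dim):
    """Matrix of a in the basis |0>, ..., |dim-1>, with a|n> = sqrt(n)|n-1>."""
    a = [[0.0] * dim for _ in range(dim)]
    for n in range(1, dim):
        a[n - 1][n] = math.sqrt(n)
    return a


def transpose(m):
    """Transpose of a real matrix (equals the Hermitian adjoint here)."""
    return [list(row) for row in zip(*m)]


def matmul(x, y):
    """Plain matrix product of two lists-of-lists."""
    return [
        [sum(x[i][k] * y[k][j] for k in range(len(y))) for j in range(len(y[0]))]
        for i in range(len(x))
    ]


def apply(m, vec):
    """Apply matrix m to a state vector given as a list of coefficients."""
    return [sum(row[j] * vec[j] for j in range(len(vec))) for row in m]


if __name__ == "__main__":
    dim = 5
    a = annihilation_matrix(dim)
    a_dagger = transpose(a)
    number = matmul(a_dagger, a)             # number operator a†a
    print([number[i][i] for i in range(dim)])
    print(apply(a, [1.0, 0.0, 0.0, 0.0, 0.0]))  # a acting on the vacuum
```

The diagonal of a^{†}a lists the occupation numbers 0, 1, 2, …, and applying a to the vacuum vector gives the zero vector, mirroring the statements in the text.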
Suppose now we have a number of non-interacting (independent) one-dimensional harmonic oscillators, each with its own fundamental frequency ω_{i}. Because the oscillators are independent, the Hamiltonian is a simple sum:
    H = Σ_{i} ħω_{i} (a^{†}_{i} a_{i} + ½).
Making the substitution
    i → (k, μ)
we see that the Hamiltonian of the EM field can be looked upon as a Hamiltonian of independent oscillators of frequency ω = |k| c, oscillating along direction e^{(μ)} with μ = 1, −1.
Photon energy
The quantized EM field has a vacuum (no photons) state  0 >. The application of, say,
gives a quantum state of m photons in mode (μ, k) and n photons in mode (μ', k'). We use the proportionality symbol because the state on the right-hand side is not normalized to unity.
We can shift the zero of energy and rewrite the Hamiltonian as
The operator appearing in the sum is the number operator. When acting on a quantum mechanical photon state, it returns the number of photons in mode (μ, k). Such a photon state is an eigenstate of the number operator. This is why the formalism described here is often referred to as the occupation number representation. The effect of H on a single-photon state is
Apparently, the single-photon state is an eigenstate of H and ħω is the corresponding energy.
Example photon density
In an earlier example we introduced a radio station and calculated the electromagnetic energy density that the station creates in its environment; at 5 km from the station this was 2.1·10^{−10} J/m^{3}. Let us now see if we need quantum mechanics to describe the broadcasting of this station.
The classical approximation to EM radiation is good when the number of photons is much larger than unity in the volume
    (λ/2π)^{3},
where λ is the wavelength of the radio waves. In that case quantum fluctuations are negligible and cannot be heard.
Suppose the radio station broadcasts at ν = 100 MHz, then it is sending out photons with an energy content of νh = 1·10^{8} × 6.6·10^{−34} = 6.6·10^{−26} J, where h is Planck's constant. The wavelength of the station is λ = c/ν = 3 m, so that λ/(2π) = 48 cm and the volume is 0.111 m^{3}. The energy content of this volume element is 2.1·10^{−10} × 0.111 = 2.3·10^{−11} J, which amounts to
    3.5·10^{14} photons per (λ/2π)^{3}.
Obviously, 3.5·10^{14} is much larger than one and hence quantum effects do not play a role; the waves emitted by this station are well into the classical limit, even when it plays non-classical music, for instance of Led Zeppelin.
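The photon-count estimate can be recomputed from the stated inputs. The sketch below assumes that the relevant volume is (λ/2π)^{3}, as suggested by the quoted values λ/(2π) = 48 cm and 0.111 m^{3}, and uses the rounded constants h = 6.6·10^{−34} J s and c = 3·10^{8} m/s from the text. The function name is hypothetical.

```python
import math

H_PLANCK = 6.6e-34  # J s, rounded value used in the text
C = 3.0e8           # m/s, rounded speed of light


def photons_per_reduced_wavelength_cube(energy_density, frequency_hz):
    """Number of photons in a volume (lambda / 2 pi)^3 for a given energy density."""
    wavelength = C / frequency_hz
    volume = (wavelength / (2.0 * math.pi)) ** 3
    photon_energy = H_PLANCK * frequency_hz
    return energy_density * volume / photon_energy


if __name__ == "__main__":
    # Energy density 2.1e-10 J/m^3 at 5 km, broadcast frequency 100 MHz
    print(f"{photons_per_reduced_wavelength_cube(2.1e-10, 100e6):.2e}")
```

The result is many orders of magnitude larger than one, so the conclusion that the broadcast is well within the classical limit does not depend on the rounding of the intermediate values.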
Photon momentum
Introducing the operator expansions for E and B into the classical form
yields
The 1/2 that appears can be dropped because when we sum over the allowed k, k cancels with −k. The effect of P_{EM} on a singlephoton state is
Apparently, the single-photon state is an eigenstate of the momentum operator, and ħk is the eigenvalue (the momentum of a single photon).
Photon mass
The photon having nonzero linear momentum, one could imagine that it has a nonvanishing rest mass m_{0}, which is its mass at zero speed. However, we will now show that this is not the case: m_{0} = 0.
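Since the photon propagates with the speed of light, special relativity is called for. The relativistic expressions for energy and momentum squared are,
    E^{2} = m_{0}^{2} c^{4} / (1 − v^{2}/c^{2}),    p^{2} = m_{0}^{2} v^{2} / (1 − v^{2}/c^{2}).
From p^{2}/E^{2},
    v^{2}/c^{2} = c^{2} p^{2}/E^{2}    ⟹    E^{2} = m_{0}^{2} c^{4} / (1 − c^{2} p^{2}/E^{2})    ⟹    m_{0}^{2} c^{4} = E^{2} − c^{2} p^{2}.
Use
    E^{2} = ħ^{2} ω^{2}    and    p^{2} = ħ^{2} k^{2} = ħ^{2} ω^{2}/c^{2}
and it follows that
    m_{0}^{2} c^{4} = E^{2} − c^{2} p^{2} = ħ^{2} ω^{2} − c^{2} ħ^{2} ω^{2}/c^{2} = 0,
so that m_{0} = 0.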
Photon spin
The photon can be assigned a triplet spin with spin quantum number S = 1. This is similar to, say, the nuclear spin of the ^{14}N isotope, but with the important difference that the state with M_{S} = 0 is zero, only the states with M_{S} = ±1 are nonzero.
We define spin operators:
The products between the unit vectors on the right-hand side are dyadic products. The unit vectors are perpendicular to the propagation direction k (the direction of the z-axis, which is the spin quantization axis).
The spin operators satisfy the usual angular momentum commutation relations
Indeed,
Define states
By inspection it follows that
and correspondingly we see that μ labels the photon spin,
Because the vector potential A is a transverse field, the photon has no forward (μ = 0) spin component.
Notes
 ↑ P is the integral of −E⋅J over the volume of the cylinder; it equals the integral of the Poynting vector S over the surface of the cylinder.
 ↑ The name derives from the second quantization of quantum mechanical wave functions. Such a wave function is a scalar field: the "Schrödinger field" and can be quantized in the very same way as EM fields. Since a wave function is derived from a "first" quantized Hamiltonian, the quantization of the Schrödinger field is the second time quantization is performed, hence the name.