Chapter 2
Maxwell’s theory of electromagnetism
2.1 Thepostulate
In 1864, James Clerk Maxwell proposed one of the most successful theories in the
history of science. In a famous memoir to the Royal Society [125] he presented nine
equations summarizing all known laws on electricity and magnetism. This was more
than a mere cataloging of the laws of nature. By postulating the need for an additional
term to make the set of equations self-consistent, Maxwell was able to put forth what
is still considered a complete theory of macroscopic electromagnetism. The beauty of
Maxwell’s equations led Boltzmann to ask, “Was it a god who wrote these lines ...?”
[185].
Since that time authors have struggled to ?nd the best way to present Maxwell’s
theory. Although it is possible to study electromagnetics from an “empirical–inductive”
viewpoint (roughly following the historical order of development beginning with static
?elds), it is only by postulating the complete theory that we can do justice to Maxwell’s
vision. His concept of the existence of an electromagnetic “?eld” (as introduced by
Faraday) is fundamental to this theory, and has become one of the most signi?cant
principles of modern science.
We ?nd controversy even over the best way to present Maxwell’s equations. Maxwell
worked at a time before vector notation was completely in place, and thus chose to
use scalar variables and equations to represent the ?elds. Certainly the true beauty
of Maxwell’s equations emerges when they are written in vector form, and the use of
tensors reduces the equations to their underlying physical simplicity. We shall use vector
notation in this book because of its wide acceptance by engineers, but we still must
decide whether it is more appropriate to present the vector equations in integral or point
form.
On one side of this debate, the brilliant mathematician David Hilbert felt that the
fundamental natural laws should be posited as axioms, each best described in terms
of integral equations [154]. This idea has been championed by Truesdell and Toupin
[199]. On the other side, we may quote from the great physicist Arnold Sommerfeld:
“The general development of Maxwell’s theory must proceed from its di?erential form;
for special problems the integral form may, however, be more advantageous” ([185], p.
23). Special relativity ?ows naturally from the point forms, with ?elds easily converted
between moving reference frames. For stationary media, it seems to us that the only
di?erence between the two approaches arises in how we handle discontinuities in sources
and materials. If we choose to use the point forms of Maxwell’s equations, then we must
also postulate the boundary conditions at surfaces of discontinuity. This is pointed out
clearly by Tai [192], who also notes that if the integral forms are used, then their validity
across regions of discontinuity should be stated as part of the postulate.
We have decided to use the point form in this text. In doing so we follow a long
history begun by Hertz in 1890 [85] when he wrote down Maxwell’s di?erential equations
as a set of axioms, recognizing the equations as the launching point for the theory of
electromagnetism. Also, by postulating Maxwell’s equations in point form we can take
full advantage of modern developments in the theory of partial di?erential equations; in
particular, the idea of a “well-posed” theory determines what sort of information must
be speci?ed to make the postulate useful.
We must also decide which form of Maxwell’s di?erential equations to use as the basis
of our postulate. There are several competing forms, each di?ering on the manner in
which materials are considered. The oldest and most widely used form was suggested
by Minkowski in 1908 [130]. In the Minkowski form the di?erential equations contain
no mention of the materials supporting the ?elds; all information about material media
is relegated to the constitutive relationships. This places simplicity of the di?erential
equations above intuitive understanding of the behavior of ?elds in materials. We choose
the Maxwell–Minkowski form as the basis of our postulate, primarily for ease of ma-
nipulation. But we also recognize the value of other versions of Maxwell’s equations.
We shall present the basic ideas behind the Bo? form, which places some information
about materials into the di?erential equations (although constitutive relationships are
still required). Missing, however, is any information regarding the velocity of a moving
medium. By using the polarization and magnetization vectors P and M rather than the
?elds D and H, it is sometimes easier to visualize the meaning of the ?eld vectors and
to understand (or predict) the nature of the constitutive relations.
The Chu and Amperian forms of Maxwell’s equations have been promoted as useful
alternatives to the Minkowski and Bo? forms. These include explicit information about
the velocity of a moving material, and di?er somewhat from the Bo? form in the physical
interpretation of the electric and magnetic properties of matter. Although each of these
models matter in terms of charged particles immersed in free space, magnetization in the
Bo? and Amperian forms arises from electric current loops, while the Chu form employs
magnetic dipoles. In all three forms polarization is modeled using electric dipoles. For a
detailed discussion of the Chu and Amperian forms, the reader should consult the work
of Kong [101], Tai [193], Pen?eld and Haus [145], or Fano, Chu and Adler [70].
Importantly, all of these various forms of Maxwell’s equations produce the same values
of the physical ?elds (at least external to the material where the ?elds are measurable).
We must include several other constituents, besides the ?eld equations, to make the
postulate complete. To form a complete ?eld theory we need a source ?eld, a mediating
?eld, and a set of ?eld di?erential equations. This allows us to mathematically describe
the relationship between e?ect (the mediating ?eld) and cause (the source ?eld). In
a well-posed postulate we must also include a set of constitutive relationships and a
speci?cation of some ?eld relationship over a bounding surface and at an initial time. If
the electromagnetic ?eld is to have physical meaning, we must link it to some observable
quantity such as force. Finally, to allow the solution of problems involving mathematical
discontinuities we must specify certain boundary, or “jump,” conditions.
2.1.1 The Maxwell–Minkowski equations
In Maxwell’s macroscopic theory of electromagnetics, the source ?eld consists of the
vector ?eld J(r,t) (the current density) and the scalar ?eld ρ(r,t) (the charge density).
In Minkowski’s form of Maxwell’s equations, the mediating ?eld is the electromagnetic
?eld consisting of the set of four vector ?elds E(r,t), D(r,t), B(r,t), and H(r,t). The ?eld
equations are the four partial di?erential equations referred to as the Maxwell–Minkowski
equations
?×E(r,t)=?
?
?t
B(r,t), (2.1)
?×H(r,t)= J(r,t)+
?
?t
D(r,t), (2.2)
?·D(r,t)=ρ(r,t), (2.3)
?·B(r,t)= 0, (2.4)
along with the continuity equation
?·J(r,t)=?
?
?t
ρ(r,t). (2.5)
Here (2.1) is called Faraday’s law, (2.2) is called Ampere’s law, (2.3) is called Gauss’s
law, and (2.4) is called the magnetic Gauss’s law. For brevity we shall often leave the
dependence on r and t implicit, and refer to the Maxwell–Minkowski equations as simply
the “Maxwell equations,” or “Maxwell’s equations.”
Equations (2.1)–(2.5), the point forms of the ?eld equations, describe the relation-
ships between the ?elds and their sources at each point in space where the ?elds are
continuously di?erentiable (i.e., the derivatives exist and are continuous). Such points
are called ordinary points. We shall not attempt to de?ne the ?elds at other points,
but instead seek conditions relating the ?elds across surfaces containing these points.
Normally this is necessary on surfaces across which either sources or material parameters
are discontinuous.
The electromagnetic ?elds carry SI units as follows: E is measured in Volts per meter
(V/m), B is measured in Teslas (T), H is measured in Amperes per meter (A/m), and
D is measured in Coulombs per square meter (C/m
2
). In older texts we ?nd the units of
B given as Webers per square meter (Wb/m
2
) to re?ect the role of B as a ?ux vector; in
that case the Weber (Wb = T·m
2
) is regarded as a unit of magnetic ?ux.
The interdependence of Maxwell’s equations. It is often claimed that the diver-
gence equations (2.3) and (2.4) may be derived from the curl equations (2.1) and (2.2).
While this is true, it is not proper to say that only the two curl equations are required
to describe Maxwell’s theory. This is because an additional physical assumption, not
present in the two curl equations, is required to complete the derivation. Either the
divergence equations must be speci?ed, or the values of certain constants that ?x the
initial conditions on the ?elds must be speci?ed. It is customary to specify the divergence
equations and include them with the curl equations to form the complete set we now call
“Maxwell’s equations.”
To identify the interdependence we take the divergence of (2.1) to get
?·(?×E)=?·
parenleftbigg
?
?B
?t
parenrightbigg
,
hence
?
?t
(?·B)= 0
by (B.49). This requires that ?·B be constant with time, say ?·B(r,t) = C
B
(r).
The constant C
B
must be speci?ed as part of the postulate of Maxwell’s theory, and
the choice we make is subject to experimental validation. We postulate that C
B
(r)= 0,
which leads us to (2.4). Note that if we can identify a time prior to which B(r,t) ≡ 0,
then C
B
(r) must vanish. For this reason, C
B
(r)= 0 and (2.4) are often called the “initial
conditions” for Faraday’s law [159]. Next we take the divergence of (2.2) to ?nd that
?·(?×H)=?·J +
?
?t
(?·D).
Using (2.5) and (B.49), we obtain
?
?t
(ρ??·D)= 0
and thus ρ ??·D must be some temporal constant C
D
(r). Again, we must postulate
the value of C
D
as part of the Maxwell theory. We choose C
D
(r) = 0 and thus obtain
Gauss’s law (2.3). If we can identify a time prior to which both D and ρ are everywhere
equal to zero, then C
D
(r) must vanish. Hence C
D
(r) = 0 and (2.3) may be regarded
as “initial conditions” for Ampere’s law. Combining the two sets of initial conditions,
we ?nd that the curl equations imply the divergence equations as long as we can ?nd a
time prior to which all of the ?elds E,D,B,H and the sources J and ρ are equal to zero
(since all the ?elds are related through the curl equations, and the charge and current are
related through the continuity equation). Conversely, the empirical evidence supporting
the two divergence equations implies that such a time should exist.
Throughout this book we shall refer to the two curl equations as the “fundamental”
Maxwell equations, and to the two divergence equations as the “auxiliary” equations.
The fundamental equations describe the relationships between the ?elds while, as we
have seen, the auxiliary equations provide a sort of initial condition. This does not
imply that the auxiliary equations are of lesser importance; indeed, they are required
to establish uniqueness of the ?elds, to derive the wave equations for the ?elds, and to
properly describe static ?elds.
Field vector terminology. Various terms are used for the ?eld vectors, sometimes
harkening back to the descriptions used by Maxwell himself, and often based on the
physical nature of the ?elds. We are attracted to Sommerfeld’s separation of the ?elds
into entities of intensity (E,B) and entities of quantity (D,H). In this system E is called
the electric ?eld strength, B the magnetic ?eld strength, D the electric excitation, and H
the magnetic excitation [185]. Maxwell separated the ?elds into a set (E,H) of vectors
that appear within line integrals to give work-related quantities, and a set (B,D) of
vectors that appear within surface integrals to give ?ux-related quantities; we shall see
this clearly when considering the integral forms of Maxwell’s equations. By this system,
authors such as Jones [97] and Ramo, Whinnery, and Van Duzer [153] call E the electric
intensity, H the magnetic intensity, B the magnetic ?ux density, and D the electric ?ux
density.
Maxwell himself designated names for each of the vector quantities. In his classic
paper “A Dynamical Theory of the Electromagnetic Field,” [178] Maxwell referred to
the quantity we now designate E as the electromotive force, the quantity D as the elec-
tric displacement (with a time rate of change given by his now famous “displacement
current”), the quantity H as the magnetic force, and the quantity B as the magnetic
induction (although he described B as a density of lines of magnetic force). Maxwell
also included a quantity designated electromagnetic momentum as an integral part of his
theory. We now know this as the vector potential A which is not generally included as a
part of the electromagnetics postulate.
Many authors follow the original terminology of Maxwell, with some slight modi?ca-
tions. For instance, Stratton [187] calls E the electric ?eld intensity, H the magnetic
?eld intensity, D the electric displacement, and B the magnetic induction. Jackson [91]
calls E the electric ?eld, H the magnetic ?eld, D the displacement, and B the magnetic
induction.
Other authors choose freely among combinations of these terms. For instance, Kong
[101] calls E the electric ?eld strength, H the magnetic ?eld strength, B the magnetic ?ux
density, and D the electric displacement. We do not wish to inject further confusion into
the issue of nomenclature; still, we ?nd it helpful to use as simple a naming system as
possible. We shall refer to E as the electric ?eld, H as the magnetic ?eld, D as the electric
?ux density and B as the magnetic ?ux density. When we use the term electromagnetic
?eld we imply the entire set of ?eld vectors (E,D,B,H) used in Maxwell’s theory.
Invariance of Maxwell’s equations. Maxwell’s di?erential equations are valid for
anysysteminuniformrelativemotionwithrespecttothelaboratoryframeofreferencein
which we normally do our measurements. The ?eld equations describe the relationships
between the source and mediating ?elds within that frame of reference. This property
was ?rst proposed for moving material media by Minkowski in 1908 (using the term
covariance) [130]. For this reason, Maxwell’s equations expressed in the form (2.1)–(2.2)
are referred to as the Minkowski form.
2.1.2 Connection to mechanics
Our postulate must include a connection between the abstract quantities of charge and
?eld and a measurable physical quantity. A convenient means of linking electromagnetics
to other classical theories is through mechanics. We postulate that charges experience
mechanical forces given by the Lorentz force equation. If a small volume element dV
contains a total charge ρ dV, then the force experienced by that charge when moving at
velocity v in an electromagnetic ?eld is
dF =ρ dV E +ρv dV × B. (2.6)
As with any postulate, we verify this equation through experiment. Note that we write
the Lorentz force in terms of charge ρ dV, rather than charge density ρ, since charge is
an invariant quantity under a Lorentz transformation.
The important links between the electromagnetic ?elds and energy and momentum
must also be postulated. We postulate that the quantity
S
em
= E × H (2.7)
represents the transport density of electromagnetic power, and that the quantity
g
em
= D × B (2.8)
represents the transport density of electromagnetic momentum.
2.2 Thewell-posednatureofthepostulate
It is important to investigate whether Maxwell’s equations, along with the point form
of the continuity equation, su?ce as a useful theory of electromagnetics. Certainly we
must agree that a theory is “useful” as long as it is de?ned as such by the scientists and
engineers who employ it. In practice a theory is considered useful if it predicts accurately
the behavior of nature under given circumstances, and even a theory that often fails may
be useful if it is the best available. We choose here to take a more narrow view and
investigate whether the theory is “well-posed.”
A mathematical model for a physical problem is said to be well-posed,orcorrectly set,
if three conditions hold:
1. the model has at least one solution (existence);
2. the model has at most one solution (uniqueness);
3. the solution is continuously dependent on the data supplied.
The importance of the ?rst condition is obvious: if the electromagnetic model has no
solution, it will be of little use to scientists and engineers. The importance of the second
condition is equally obvious: if we apply two di?erent solution methods to the same
model and get two di?erent answers, the model will not be very helpful in analysis or
design work. The third point is more subtle; it is often extended in a practical sense to
the following statement:
3
prime
. Small changes in the data supplied produce equally small changes in the solution.
That is, the solution is not sensitive to errors in the data. To make sense of this we
must decide which quantity is speci?ed (the independent quantity) and which remains
to be calculated (the dependent quantity). Commonly the source ?eld (charge) is taken
as the independent quantity, and the mediating (electromagnetic) ?eld is computed from
it; in such cases it can be shown that Maxwell’s equations are well-posed. Taking the
electromagnetic ?eld to be the independent quantity, we can produce situations in which
the computed quantity (charge or current) changes wildly with small changes in the
speci?ed ?elds. These situations (called inverse problems) are of great importance in
remote sensing, where the ?eld is measured and the properties of the object probed are
thereby deduced.
At this point we shall concentrate on the “forward” problem of specifying the source
?eld (charge) and computing the mediating ?eld (the electromagnetic ?eld). In this case
we may question whether the ?rst of the three conditions (existence) holds. We have
twelve unknown quantities (the scalar components of the four vector ?elds), but only
eight equations to describe them (from the scalar components of the two fundamental
Maxwell equations and the two scalar auxiliary equations). With fewer equations than
unknowns we cannot be sure that a solution exists, and we refer to Maxwell’s equations
as being inde?nite. To overcome this problem we must specify more information in
the form of constitutive relations among the ?eld quantities E, B, D, H, and J. When
these are properly formulated, the number of unknowns and the number of equations
are equal and Maxwell’s equations are in de?nite form. If we provide more equations
than unknowns, the solution may be non-unique. When we model the electromagnetic
properties of materials we must supply precisely the right amount of information in the
constitutive relations, or our postulate will not be well-posed.
OnceMaxwell’sequationsareinde?niteform, standardmethodsforpartialdi?erential
equations can be used to determine whether the electromagnetic model is well-posed. In
a nutshell, the system (2.1)–(2.2) of hyperbolic di?erential equations is well-posed if and
only if we specify E and H throughout a volume region V at some time instant and also
specify, at all subsequent times,
1. the tangential component of E over all of the boundary surface S,or
2. the tangential component of H over all of S,or
3. the tangential component of E over part of S, and the tangential component of H
over the remainder of S.
Proof of all three of the conditions of well-posedness is quite tedious, but a simpli?ed
uniqueness proof is often given in textbooks on electromagnetics. The procedure used
by Stratton [187] is reproduced below. The interested reader should refer to Hansen [81]
for a discussion of the existence of solutions to Maxwell’s equations.
2.2.1 Uniqueness of solutions to Maxwell’sequations
Consider a simply connected region of space V bounded by a surface S, where both
V and S contain only ordinary points. The ?elds within V are associated with a current
distribution J, which may be internal to V (entirely or in part). By the initial conditions
that imply the auxiliary Maxwell’s equations, we know there is a time, say t = 0, prior
to which the current is zero for all time, and thus by causality the ?elds throughout V
are identically zero for all times t < 0. We next assume that the ?elds are speci?ed
throughout V at some time t
0
> 0, and seek conditions under which they are determined
uniquely for all t > t
0
.
Let the ?eld set (E
1
,D
1
,B
1
,H
1
) be a solution to Maxwell’s equations (2.1)–(2.2)
associated with the current J (along with an appropriate set of constitutive relations),
and let (E
2
,D
2
,B
2
,H
2
) be a second solution associated with J. To determine the con-
ditions for uniqueness of the ?elds, we look for a situation that results in E
1
= E
2
,
B
1
= B
2
, and so on. The electromagnetic ?elds must obey
?×E
1
=?
?B
1
?t
,
?×H
1
= J +
?D
1
?t
,
?×E
2
=?
?B
2
?t
,
?×H
2
= J +
?D
2
?t
.
Subtracting, we have
?×(E
1
? E
2
)=?
?(B
1
? B
2
)
?t
, (2.9)
?×(H
1
? H
2
)=
?(D
1
? D
2
)
?t
, (2.10)
hence de?ning E
0
= E
1
? E
2
, B
0
= B
1
? B
2
, and so on, we have
E
0
·(?×H
0
)= E
0
·
?D
0
?t
, (2.11)
H
0
·(?×E
0
)=?H
0
·
?B
0
?t
. (2.12)
Subtracting again, we have
E
0
·(?×H
0
)? H
0
·(?×E
0
)= H
0
·
?B
0
?t
+ E
0
·
?D
0
?t
,
hence
?? ·(E
0
× H
0
)= E
0
·
?D
0
?t
+ H
0
·
?B
0
?t
by (B.44). Integrating both sides throughout V and using the divergence theorem on the
left-hand side, we get
?
contintegraldisplay
S
(E
0
× H
0
)· dS =
integraldisplay
V
parenleftbigg
E
0
·
?D
0
?t
+ H
0
·
?B
0
?t
parenrightbigg
dV.
Breaking S into two arbitrary portions and using (B.6), we obtain
integraldisplay
S
1
E
0
·(?n × H
0
)dS?
integraldisplay
S
2
H
0
·(?n × E
0
)dS=
integraldisplay
V
parenleftbigg
E
0
·
?D
0
?t
+ H
0
·
?B
0
?t
parenrightbigg
dV.
Now if ?n × E
0
= 0 or ?n × H
0
= 0 over all of S, or some combination of these conditions
holds over all of S, then
integraldisplay
V
parenleftbigg
E
0
·
?D
0
?t
+ H
0
·
?B
0
?t
parenrightbigg
dV = 0. (2.13)
This expression implies a relationship between E
0
, D
0
, B
0
, and H
0
. Since V is arbitrary,
we see that one possibility is simply to have D
0
and B
0
constant with time. However,
since the ?elds are identically zero for t < 0, if they are constant for all time then those
constant values must be zero. Another possibility is to have one of each pair (E
0
,D
0
)
and (H
0
,B
0
) equal to zero. Then, by (2.9) and (2.10), E
0
= 0 implies B
0
= 0, and
D
0
= 0 implies H
0
= 0.ThusE
1
= E
2
, B
1
= B
2
, and so on, and the solution is unique
throughout V. However, we cannot in general rule out more complicated relationships.
The number of possibilities depends on the additional constraints on the relationship
between E
0
, D
0
, B
0
, and H
0
that we must supply to describe the material supporting
the ?eld — i.e., the constitutive relationships. For a simple medium described by the
time-constant permittivity epsilon1 and permeability μ, (13) becomes
integraldisplay
V
parenleftbigg
E
0
·epsilon1
?E
0
?t
+ H
0
·μ
?H
0
?t
parenrightbigg
dV = 0,
or
1
2
?
?t
integraldisplay
V
(epsilon1E
0
· E
0
+μH
0
· H
0
)dV = 0.
Since the integrand is always positive or zero (and not constant with time, as mentioned
above), the only possible conclusion is that E
0
and H
0
must both be zero, and thus the
?elds are unique.
When establishing more complicated constitutive relations, we must be careful to en-
sure that they lead to a unique solution, and that the condition for uniqueness is un-
derstood. In the case above, the assumption ?n × E
0
vextendsingle
vextendsingle
S
= 0 implies that the tangential
components of E
1
and E
2
are identical over S — that is, we must give speci?c values of
these quantities on S to ensure uniqueness. A similar statement holds for the condition
?n × H
0
vextendsingle
vextendsingle
S
= 0. Requiring that constitutive relations lead to a unique solution is known
as just setting, and is one of several factors that must be considered, as discussed in the
next section.
Uniqueness implies that the electromagnetic state of an isolated region of space may
be determined without the knowledge of conditions outside the region. If we wish to
solve Maxwell’s equations for that region, we need know only the source density within
the region and the values of the tangential ?elds over the bounding surface. The e?ects
of a complicated external world are thus reduced to the speci?cation of surface ?elds.
This concept has numerous applications to problems in antennas, di?raction, and guided
waves.
2.2.2 Constitutive relations
We now supply a set of constitutive relations to complete the conditions for well-
posedness. We generally split these relations into two sets. The ?rst describes the
relationships between the electromagnetic ?eld quantities, and the second describes me-
chanical interaction between the ?elds and resulting secondary sources. All of these
relations depend on the properties of the medium supporting the electromagnetic ?eld.
Material phenomena are quite diverse, and it is remarkable that the Maxwell–Minkowski
equations hold for all phenomena yet discovered. All material e?ects, from nonlinearity
to chirality to temporal dispersion, are described by the constitutive relations.
The speci?cation of constitutive relationships is required in many areas of physical
science to describe the behavior of “ideal materials”: mathematical models of actual
materials encountered in nature. For instance, in continuum mechanics the constitutive
equations describe the relationship between material motions and stress tensors [209].
Truesdell and Toupin [199] give an interesting set of “guiding principles” for the con-
cerned scientist to use when constructing constitutive relations. These include consider-
ation of consistency (with the basic conservation laws of nature), coordinate invariance
(independence of coordinate system), isotropy and aeolotropy (dependence on, or inde-
pendence of, orientation), just setting (constitutive parameters should lead to a unique
solution), dimensional invariance (similarity), material indi?erence (non-dependence on
the observer), and equipresence (inclusion of all relevant physical phenomena in all of
the constitutive relations across disciplines).
The constitutive relations generally involve a set of constitutive parameters and a set
of constitutive operators. The constitutive parameters may be as simple as constants
of proportionality between the ?elds or they may be components in a dyadic relation-
ship. The constitutive operators may be linear and integro-di?erential in nature, or may
imply some nonlinear operation on the ?elds. If the constitutive parameters are spa-
tially constant within a certain region, we term the medium homogeneous within that
region. If the constitutive parameters vary spatially, the medium is inhomogeneous.If
the constitutive parameters are constants with time, we term the medium stationary;
if they are time-changing, the medium is nonstationary. If the constitutive operators
involve time derivatives or integrals, the medium is said to be temporally dispersive;if
space derivatives or integrals are involved, the medium is spatially dispersive. Examples
of all these e?ects can be found in common materials. It is important to note that the
constitutive parameters may depend on other physical properties of the material, such
as temperature, mechanical stress, and isomeric state, just as the mechanical constitu-
tive parameters of a material may depend on the electromagnetic properties (principle
of equipresence).
Many e?ects produced by linear constitutive operators, such as those associated with
temporal dispersion, have been studied primarily in the frequency domain. In this case
temporal derivative and integral operations produce complex constitutive parameters. It
is becoming equally important to characterize these e?ects directly in the time domain
for use with direct time-domain ?eld solving techniques such as the ?nite-di?erence time-
domain (FDTD) method. We shall cover the very basic properties of dispersive media
in this section. A detailed description of frequency-domain ?elds (and a discussion of
complex constitutive parameters) is deferred until later in this book.
It is di?cult to ?nd a simple and consistent means for classifying materials by their
electromagnetic e?ects. One way is to separate linear and nonlinear materials, then cate-
gorize linear materials by the way in which the ?elds are coupled through the constitutive
relations:
1. Isotropic materials are those in which D is related to E, B is related to H, and
the secondary source current J is related to E, with the ?eld direction in each pair
aligned.
2.Inanisotropic materials the pairings are the same, but the ?elds in each pair are
generally not aligned.
3. In biisotropic materials (such as chiral media) the ?elds D and B depend on both
E and H, but with no realignment of E or H; for instance, D is given by the
addition of a scalar times E plus a second scalar times H. Thus the contributions
to D involve no changes to the directions of E and H.
4. Bianisotropic materials exhibit the most general behavior: D and H depend on both
E and B, with an arbitrary realignment of either or both of these ?elds.
In 1888, Roentgen showed experimentally that a material isotropic in its own station-
aryreferenceframeexhibitsbianisotropicpropertieswhenobservedfromamovingframe.
Only recently have materials bianisotropic in their own rest frame been discovered. In
1894 Curie predicted that in a stationary material, based on symmetry, an electric ?eld
might produce magnetic e?ects and a magnetic ?eld might produce electric e?ects. These
e?ects, coined magnetoelectric by Landau and Lifshitz in 1957, were sought unsuccess-
fully by many experimentalists during the ?rst half of the twentieth century. In 1959 the
Soviet scientist I.E. Dzyaloshinskii predicted that, theoretically, the antiferromagnetic
material chromium oxide (Cr
2
O
3
) should display magnetoelectric e?ects. The magneto-
electric e?ect was ?nally observed soon after by D.N. Astrov in a single crystal of Cr
2
O
3
using a 10 kHz electric ?eld. Since then the e?ect has been observed in many di?erent
materials. Recently, highly exotic materials with useful electromagnetic properties have
been proposed and studied in depth, including chiroplasmas and chiroferrites [211]. As
the technology of materials synthesis advances, a host of new and intriguing media will
certainly be created.
The most general forms of the constitutive relations between the ?elds may be written
in symbolic form as
D = D[E,B], (2.14)
H = H[E,B]. (2.15)
That is, D and H have some mathematically descriptive relationship to E and B. The
speci?c forms of the relationships may be written in terms of dyadics as [102]
cD =
ˉ
P · E +
ˉ
L ·(cB), (2.16)
H =
ˉ
M · E +
ˉ
Q ·(cB), (2.17)
where each of the quantities
ˉ
P,
ˉ
L,
ˉ
M,
ˉ
Q may be dyadics in the usual sense, or dyadic
operators containing space or time derivatives or integrals, or some nonlinear operations
on the ?elds. We may write these expressions as a single matrix equation
bracketleftbigg
cD
H
bracketrightbigg
= [
ˉ
C]
bracketleftbigg
E
cB
bracketrightbigg
(2.18)
where the 6 × 6 matrix
[
ˉ
C] =
bracketleftbigg
ˉ
P
ˉ
L
ˉ
M
ˉ
Q
bracketrightbigg
.
This most general relationship between ?elds is the property of a bianisotropic material.
We may wonder why D is not related to (E,B,H), E to (D,B), etc. The reason is
that since the ?eld pairs (E,B) and (D,H) convert identically under a Lorentz transfor-
mation, a constitutive relation that maps ?elds as in (2.18) is form invariant, as are the
Maxwell–Minkowski equations. That is, although the constitutive parameters may vary
numerically between observers moving at di?erent velocities, the form of the relationship
given by (2.18) is maintained.
Many authors choose to relate (D,B) to (E,H), often because the expressions are
simpler and can be more easily applied to speci?c problems. For instance, in a linear,
isotropic material (as shown below) D is directly proportional to E and B is directly
proportional to H. To provide the appropriate expression for the constitutive relations,
we need only remap (2.18). This gives
D = ˉepsilon1· E +
ˉ
ξ· H, (2.19)
B =
ˉ
ζ · E + ˉμ· H, (2.20)
or
bracketleftbigg
D
B
bracketrightbigg
=
bracketleftbig
ˉ
C
EH
bracketrightbig
bracketleftbigg
E
H
bracketrightbigg
, (2.21)
where the new constitutive parameters ˉepsilon1,
ˉ
ξ,
ˉ
ζ, ˉμ can be easily found from the original
constitutive parameters
ˉ
P,
ˉ
L,
ˉ
M,
ˉ
Q. We do note, however, that in the form (2.19)–(2.20)
the Lorentz invariance of the constitutive equations is not obvious.
In the following paragraphs we shall characterize some of the most common materials
according to these classi?cations. With this approach e?ects such as temporal or spatial
dispersion are not part of the classi?cation process, but arise from the nature of the
constitutive parameters. Hence we shall not dwell on the particulars of the constitutive
parameters, but shall concentrate on the form of the constitutive relations.
Constitutive relations for ?elds in free space. In a vacuum the ?elds are related
by the simple constitutive equations
D =epsilon1
0
E, (2.22)
H =
1
μ
0
B. (2.23)
The quantities μ
0
and epsilon1
0
are, respectively, the free-space permeability and permittivity
constants. It is convenient to use three numerical quantities to describe the electromag-
netic properties of free space — μ
0
, epsilon1
0
, and the speed of light c — and interrelate them
through the equation
c = 1/(μ
0
epsilon1
0
)
1/2
.
Historically it has been the practice to de?neμ
0
, measure c, and computeepsilon1
0
. In SI units
μ
0
= 4π × 10
?7
H/m,
c = 2.998 × 10
8
m/s,
epsilon1
0
= 8.854 × 10
?12
F/m.
With the two constitutive equations we have enough information to put Maxwell’s
equations into de?nite form. Traditionally (2.22) and (2.23) are substituted into (2.1)–
(2.2) to give
?×E =?
?B
?t
, (2.24)
?×B =μ
0
J +μ
0
epsilon1
0
?E
?t
. (2.25)
These are two vector equations in two vector unknowns (equivalently, six scalar equations
in six scalar unknowns).
In terms of the general constitutive relation (2.18), we ?nd that free space is isotropic
with
ˉ
P =
ˉ
Q =
1
η
0
ˉ
I,
ˉ
L =
ˉ
M = 0,
where η
0
= (μ
0
/epsilon1
0
)
1/2
is called the intrinsic impedance of free space. This emphasizes
the fact that free space has, along with c, only a single empirical constant associated
with it (i.e.,epsilon1
0
orη
0
). Since no derivative or integral operators appear in the constitutive
relations, free space is nondispersive.
Constitutive relations in a linear isotropic material. In a linear isotropic mate-
rial there is proportionality between D and E and between B and H. The constants of
proportionality are the permittivity epsilon1 and the permeability μ. If the material is nondis-
persive, the constitutive relations take the form
D =epsilon1E, B =μH,
where epsilon1 and μ may depend on position for inhomogeneous materials. Often the permit-
tivity and permeability are referenced to the permittivity and permeability of free space
according to
epsilon1 =epsilon1
r
epsilon1
0
,μ=μ
r
μ
0
.
Here the dimensionless quantities epsilon1
r
and μ
r
are called, respectively, the relative permit-
tivity and relative permeability.
When dealing with the Maxwell–Bo? equations (§ 2.4) the di?erence between the
material and free space values of D and H becomes important. Thus for linear isotropic
materials we often write the constitutive relations as
D =epsilon1
0
E +epsilon1
0
χ
e
E, (2.26)
B =μ
0
H +μ
0
χ
m
H, (2.27)
where the dimensionless quantities χ
e
=epsilon1
r
? 1 and χ
m
=μ
r
? 1 are called, respectively,
the electric and magnetic susceptibilities of the material. In terms of (2.18) we have
ˉ
P =
epsilon1
r
η
0
ˉ
I,
ˉ
Q =
1
η
0
μ
r
ˉ
I,
ˉ
L =
ˉ
M = 0.
Generally a material will have either its electric or magnetic properties dominant. If
μ
r
= 1 and epsilon1
r
negationslash= 1 then the material is generally called a perfect dielectric or a perfect
insulator, and is said to be an electric material. If epsilon1
r
= 1 and μ
r
negationslash= 1, the material is
said to be a magnetic material.
A linear isotropic material may also have conduction properties. In a conducting
material, a constitutive relation is generally used to describe the mechanical interaction
of ?eld and charge by relating the electric ?eld to a secondary electric current. For
a nondispersive isotropic material, the current is aligned with, and proportional to, the
electric ?eld; there are no temporal operators in the constitutive relation, which is simply
J =σE. (2.28)
This is known as Ohm’s law. Here σ is the conductivity of the material.
If μ
r
≈ 1 and σ is very small, the material is generally called a good dielectric.If
σ is very large, the material is generally called a good conductor. The conditions by
which we say the conductivity is “small” or “large” are usually established using the
frequency response of the material. Materials that are good dielectrics over broad ranges
of frequency include various glasses and plastics such as fused quartz, polyethylene,
and te?on. Materials that are good conductors over broad ranges of frequency include
common metals such as gold, silver, and copper.
For dispersive linear isotropic materials, the constitutive parameters become nonsta-
tionary (time dependent), and the constitutive relations involve time operators. (Note
that the name dispersive describes the tendency for pulsed electromagnetic waves to
spread out, or disperse, in materials of this type.) If we assume that the relationships
given by (2.26), (2.27), and (2.28) retain their product form in the frequency domain,
then by the convolution theorem we have in the time domain the constitutive relations
D(r,t)=epsilon1
0
parenleftbigg
E(r,t)+
integraldisplay
t
?∞
χ
e
(r,t ? t
prime
)E(r,t
prime
)dt
prime
parenrightbigg
, (2.29)
B(r,t)=μ
0
parenleftbigg
H(r,t)+
integraldisplay
t
?∞
χ
m
(r,t ? t
prime
)H(r,t
prime
)dt
prime
parenrightbigg
, (2.30)
J(r,t)=
integraldisplay
t
?∞
σ(r,t ? t
prime
)E(r,t
prime
)dt
prime
. (2.31)
These expressions were ?rst introduced by Volterra in 1912[199]. We see that for a linear
dispersive material of this type the constitutive operators are time integrals, and that
the behavior of D(t) depends not only on the value of E at time t, but on its values at
all past times. Thus, in dispersive materials there is a “time lag” between the e?ect of
the applied ?eld and the polarization or magnetization that results. In the frequency
domain, temporal dispersion is associated with complex values of the constitutive pa-
rameters, which, to describe a causal relationship, cannot be constant with frequency.
The nonzero imaginary component is identi?ed with the dissipation of electromagnetic
energy as heat. Causality is implied by the upper limit being t in the convolution inte-
grals, which indicates that D(t)cannot depend on future values of E(t). This assumption
leads to a relationship between the real and imaginary parts of the frequency domain
constitutive parameters as described through the Kronig–Kramers equations.
Constitutive relations for ?elds in perfect conductors. In a perfect electric con-
ductor (PEC) or a perfect magnetic conductor (PMC) the ?elds are exactly speci?ed as
the null ?eld:
E = D = B = H = 0.
By Ampere’s and Faraday’s laws we must also have J = J
m
= 0; hence, by the continuity
equation, ρ =ρ
m
= 0.
In addition to the null ?eld, we have the condition that the tangential electric ?eld
on the surface of a PEC must be zero. Similarly, the tangential magnetic ?eld on the
surface of a PMC must be zero. This implies (§ 2.8.3) that an electric surface current
may exist on the surface of a PEC but not on the surface of a PMC, while a magnetic
surface current may exist on the surface of a PMC but not on the surface of a PEC.
A PEC may be regarded as the limit of a conducting material as σ →∞. In many
practical cases, good conductors such as gold and copper can be assumed to be perfect
electric conductors, which greatly simpli?es the application of boundary conditions. No
physical material is known to behave as a PMC, but the concept is mathematically
useful for applying symmetry conditions (in which a PMC is sometimes referred to as a
“magnetic wall”) and for use in developing equivalence theorems.
Constitutive relations in a linear anisotropic material. In a linear anisotropic
material there are relationships between B and H and between D and E, but the ?eld
vectors are not aligned as in the isotropic case. We can thus write
D = ˉepsilon1· E, B = ˉμ· H, J = ˉσ· E,
where ˉepsilon1 is called the permittivity dyadic, ˉμ is the permeability dyadic, and ˉσ is the
conductivity dyadic. In terms of the general constitutive relation (2.18) we have
ˉ
P = cˉepsilon1,
ˉ
Q =
ˉμ
?1
c
,
ˉ
L =
ˉ
M = 0.
Many di?erent types of materials demonstrate anisotropic behavior, including opti-
cal crystals, magnetized plasmas, and ferrites. Plasmas and ferrites are examples of
gyrotropic media. With the proper choice of coordinate system, the frequency-domain
permittivity or permeability can be written in matrix form as
[
?
ˉepsilon1] =
?
?
epsilon1
11
epsilon1
12
0
?epsilon1
12
epsilon1
11
0
00epsilon1
33
?
?
, [
?
ˉμ] =
?
?
μ
11
μ
12
0
?μ
12
μ
11
0
00μ
33
?
?
. (2.32)
Each of the matrix entries may be complex. For the special case of a lossless gyrotropic
material, the matrices become hermitian:
[
?
ˉepsilon1] =
?
?
epsilon1 ?jδ 0
jδepsilon10
00epsilon1
3
?
?
, [
?
ˉμ] =
?
?
μ ?jκ 0
jκμ0
00μ
3
?
?
, (2.33)
where epsilon1, epsilon1
3
, δ, μ, μ
3
, and κ are real numbers.
Crystals have received particular attention because of their birefringent properties. A
birefringent crystal can be characterized by a symmetric permittivity dyadic that has real
permittivity parameters in the frequency domain; equivalently, the constitutive relations
do not involve constitutive operators. A coordinate system called the principal system,
with axes called the principal axes, can always be found so that the permittivity dyadic
in that system is diagonal:
[
?
ˉepsilon1] =
?
?
epsilon1
x
00
0 epsilon1
y
0
00epsilon1
z
?
?
.
The geometrical structure of a crystal determines the relationship between epsilon1
x
, epsilon1
y
, and
epsilon1
z
.Ifepsilon1
x
= epsilon1
y
<epsilon1
z
, then the crystal is positive uniaxial (e.g., quartz). If epsilon1
x
= epsilon1
y
>epsilon1
z
,
the crystal is negative uniaxial (e.g., calcite). If epsilon1
x
negationslash=epsilon1
y
negationslash=epsilon1
z
, the crystal is biaxial (e.g.,
mica). In uniaxial crystals the z-axis is called the optical axis.
If the anisotropic material is dispersive, we can generalize the convolutional form of
the isotropic dispersive media to obtain the constitutive relations
D(r,t)=epsilon1
0
parenleftbigg
E(r,t)+
integraldisplay
t
?∞
ˉχ
e
(r,t ? t
prime
)· E(r,t
prime
)dt
prime
parenrightbigg
, (2.34)
B(r,t)=μ
0
parenleftbigg
H(r,t)+
integraldisplay
t
?∞
ˉχ
m
(r,t ? t
prime
)· H(r,t
prime
)dt
prime
parenrightbigg
, (2.35)
J(r,t)=
integraldisplay
t
?∞
ˉσ(r,t ? t
prime
)· E(r,t
prime
)dt
prime
. (2.36)
Constitutive relations for biisotropic materials. A biisotropic material is an
isotropic magnetoelectric material. Here we have D related to E and B, and H related to
E and B, but with no realignment of the ?elds as in anisotropic (or bianisotropic) mate-
rials. Perhaps the simplest example is the Tellegen medium devised by B.D.H. Tellegen
in 1948 [196], having
D =epsilon1E +ξH, (2.37)
B =ξE +μH. (2.38)
Tellegen proposed that his hypothetical material be composed of small (but macroscopic)
ferromagnetic particles suspended in a liquid. This is an example of a synthetic mate-
rial, constructed from ordinary materials to have an exotic electromagnetic behavior.
Other examples include arti?cial dielectrics made from metallic particles imbedded in
lightweight foams [66], and chiral materials made from small metallic helices suspended
in resins [112].
Chiral materials are also biisotropic, and have the constitutive relations
D =epsilon1E ?χ
?H
?t
, (2.39)
B =μH +χ
?E
?t
, (2.40)
where the constitutive parameter χ is called the chirality parameter. Note the presence
of temporal derivative operators. Alternatively,
D =epsilon1(E +β?×E), (2.41)
B =μ(H +β?×H), (2.42)
by Faraday’s and Ampere’s laws. Chirality is a natural state of symmetry; many natural
substances are chiral materials, including DNA and many sugars. The time derivatives
in (2.39)–(2.40) produce rotation of the polarization of time harmonic electromagnetic
waves propagating in chiral media.
Constitutive relations in nonlinear media. Nonlinear electromagnetic e?ects have
been studied by scientists and engineers since the beginning of the era of electrical tech-
nology. Familiar examples include saturation and hysteresis in ferromagnetic materials
and the behavior of p-n junctions in solid-state recti?ers. The invention of the laser
extended interest in nonlinear e?ects to the realm of optics, where phenomena such as
parametric ampli?cation and oscillation, harmonic generation, and magneto-optic inter-
actions have found applications in modern devices [174].
Provided that the external ?eld applied to a nonlinear electric material is small com-
pared to the internal molecular ?elds, the relationship between E and D can be expanded
in a Taylor series of the electric ?eld. For an anisotropic material exhibiting no hysteresis
e?ects, the constitutive relation is [131]
D
i
(r,t)=epsilon1
0
E
i
(r,t)+
3
summationdisplay
j=1
χ
(1)
ij
E
j
(r,t)+
3
summationdisplay
j,k=1
χ
(2)
ijk
E
j
(r,t)E
k
(r,t)+
+
3
summationdisplay
j,k,l=1
χ
(3)
ijkl
E
j
(r,t)E
k
(r,t)E
l
(r,t)+··· (2.43)
where the index i = 1,2,3 refers to the three components of the ?elds D and E. The
?rst sum in (2.43) is identical to the constitutive relation for linear anisotropic materi-
als. Thus, χ
(1)
ij
is identical to the susceptibility dyadic of a linear anisotropic medium
considered earlier. The quantity χ
(2)
ijk
is called the second-order susceptibility, and is a
three-dimensional matrix (or third rank tensor) describing the nonlinear electric e?ects
quadratic in E. Similarly χ
(3)
ijkl
is called the third-order susceptibility, and is a four-
dimensional matrix (or fourth rank tensor) describing the nonlinear electric e?ects cubic
in E. Numerical values of χ
(2)
ijk
and χ
(3)
ijkl
are given in Shen [174] for a variety of crystals.
When the material shows hysteresis e?ects, D at any point r and time t is due not only
to the value of E at that point and at that time, but to the values of E at all points and
times. That is, the material displays both temporal and spatial dispersion.
2.3 Maxwell’sequationsinmovingframes
The essence of special relativity is that the mathematical forms of Maxwell’s equa-
tions are identical in all inertial reference frames: frames moving with uniform velocities
relative to the laboratory frame of reference in which we perform our measurements.
This form invariance of Maxwell’s equations is a speci?c example of the general physical
principle of covariance. In the laboratory frame we write the di?erential equations of
Maxwell’s theory as
?×E(r,t)=?
?B(r,t)
?t
,
?×H(r,t)= J(r,t)+
?D(r,t)
?t
,
?·D(r,t)=ρ(r,t),
?·B(r,t)= 0,
?·J(r,t)=?
?ρ(r,t)
?t
.
Figure 2.1: Primed coordinate system moving with velocity v relative to laboratory
(unprimed) coordinate system.
Similarly, in an inertial frame having four-dimensional coordinates (r
prime
,t
prime
) we have
?
prime
× E
prime
(r
prime
,t
prime
)=?
?B
prime
(r
prime
,t
prime
)
?t
prime
,
?
prime
× H
prime
(r
prime
,t
prime
)= J
prime
(r
prime
,t
prime
)+
?D
prime
(r
prime
,t
prime
)
?t
prime
,
?
prime
· D
prime
(r
prime
,t
prime
)=ρ
prime
(r
prime
,t
prime
),
?
prime
· B
prime
(r
prime
,t
prime
)= 0,
?
prime
· J
prime
(r
prime
,t
prime
)=?
?ρ
prime
(r
prime
,t
prime
)
?t
prime
.
The primed ?elds measured in the moving system do not have the same numerical values
as the unprimed ?elds measured in the laboratory. To convert between E and E
prime
, B and
B
prime
, and so on, we must ?nd a way to convert between the coordinates (r,t) and (r
prime
,t
prime
).
2.3.1 Field conversions under Galilean transformation
We shall assume that the primed coordinate system moves with constant velocity v
relativetothelaboratoryframe(Figure2.1).Priortotheearlypartofthetwentieth
century, converting between the primed and unprimed coordinate variables was intuitive
and obvious: it was thought that time must be measured identically in each coordinate
system, and that the relationship between the space variables can be determined simply
by the displacement of the moving system at time t = t
prime
. Under these assumptions, and
under the further assumption that the two systems coincide at time t = 0, we can write
t
prime
= t, x
prime
= x ?v
x
t, y
prime
= y ?v
y
t, z
prime
= z ?v
z
t,
or simply
t
prime
= t, r
prime
= r ? vt.
This is called a Galilean transformation. We can use the chain rule to describe the
manner in which di?erential operations transform, i.e., to relate derivatives with respect
to the laboratory coordinates to derivatives with respect to the inertial coordinates. We
have, for instance,
?
?t
=
?t
prime
?t
?
?t
prime
+
?x
prime
?t
?
?x
prime
+
?y
prime
?t
?
?y
prime
+
?z
prime
?t
?
?z
prime
=
?
?t
prime
?v
x
?
?x
prime
?v
y
?
?y
prime
?v
z
?
?z
prime
=
?
?t
prime
?(v ·?
prime
). (2.44)
Similarly
?
?x
=
?
?x
prime
,
?
?y
=
?
?y
prime
,
?
?z
=
?
?z
prime
,
from which
?×A(r,t)=?
prime
× A(r,t), ?·A(r,t)=?
prime
· A(r,t), (2.45)
for each vector ?eld A.
Newton was aware that the laws of mechanics are invariant with respect to Galilean
transformations. DoMaxwell’sequationsalsobehaveinthisway? LetususetheGalilean
transformation to determine which relationship between the primed and unprimed ?elds
resultsinforminvarianceofMaxwell’sequations. We?rstexamine?
prime
×E, thespatialrate
of change of the laboratory ?eld with respect to the inertial frame spatial coordinates:
?
prime
× E =?×E =?
?B
?t
=?
?B
?t
prime
+(v ·?
prime
)B
by (2.45) and (2.44). Rewriting the last term by (B.45) we have
(v ·?
prime
)B =??
prime
×(v × B)
since v is constant and ?
prime
· B =?·B = 0, hence
?
prime
×(E + v × B)=?
?B
?t
prime
. (2.46)
Similarly
?
prime
× H =?×H = J +
?D
?t
= J +
?D
?t
prime
+?
prime
×(v × D)? v(?
prime
· D)
where ?
prime
· D =?·D =ρ so that
?
prime
×(H ? v × D)=
?D
?t
prime
?ρv + J. (2.47)
Also
?
prime
· J =?·J =?
?ρ
?t
=?
?ρ
?t
prime
+(v ·?
prime
)ρ
and we may use (B.42) to write
(v ·?
prime
)ρ = v ·(?
prime
ρ)=?
prime
·(ρv),
obtaining
?
prime
·(J ?ρv)=?
?ρ
?t
prime
. (2.48)
Equations (2.46), (2.47), and (2.48) show that the forms of Maxwell’s equations in the
inertial and laboratory frames are identical provided that
E
prime
= E + v × B, (2.49)
D
prime
= D, (2.50)
H
prime
= H ? v × D, (2.51)
B
prime
= B, (2.52)
J
prime
= J ?ρv, (2.53)
ρ
prime
=ρ. (2.54)
That is, (2.49)–(2.54) result in form invariance of Faraday’s law, Ampere’s law, and the
continuity equation under a Galilean transformation. These equations express the ?elds
measured by a moving observer in terms of those measured in the laboratory frame. To
convert the opposite way, we need only use the principle of relativity. Neither observer
can tell whether he or she is stationary — only that the other observer is moving relative
to him or her. To obtain the ?elds in the laboratory frame we simply change the sign on
v and swap primed with unprimed ?elds in (2.49)–(2.54):
E = E
prime
? v × B
prime
, (2.55)
D = D
prime
, (2.56)
H = H
prime
+ v × D
prime
, (2.57)
B = B
prime
, (2.58)
J = J
prime
+ρ
prime
v, (2.59)
ρ =ρ
prime
. (2.60)
According to (2.53), a moving observer interprets charge stationary in the laboratory
frame as an additional current moving opposite the direction of his or her motion. This
seems reasonable. However, while E depends on both E
prime
and B
prime
, the ?eld B is unchanged
under the transformation. Why should B have this special status? In fact, we may
uncover an inconsistency among the transformations by considering free space where
(2.22) and (2.23) hold: in this case (2.49) gives
D
prime
/epsilon1
0
= D/epsilon1
0
+ v ×μ
0
H
or
D
prime
= D + v × H/c
2
rather than (2.50). Similarly, from (2.51) we get
B
prime
= B ? v × E/c
2
instead of (2.52). Using these, the set of transformations becomes
E
prime
= E + v × B, (2.61)
D
prime
= D + v × H/c
2
, (2.62)
H
prime
= H ? v × D, (2.63)
B
prime
= B ? v × E/c
2
, (2.64)
J
prime
= J ?ρv, (2.65)
ρ
prime
=ρ. (2.66)
These can also be written using dyadic notation as
E
prime
=
ˉ
I · E +
ˉ
β·(cB), (2.67)
cB
prime
=?
ˉ
β· E +
ˉ
I ·(cB), (2.68)
and
cD
prime
=
ˉ
I ·(cD)+
ˉ
β· H, (2.69)
H
prime
=?
ˉ
β·(cD)+
ˉ
I · H, (2.70)
where
[
ˉ
β] =
?
?
0 ?β
z
β
y
β
z
0 ?β
x
?β
y
β
x
0
?
?
with β = v/c. This set of equations is self-consistent among Maxwell’s equations. How-
ever, the equations are not consistent with the assumption of a Galilean transformation
of the coordinates, and thus Maxwell’s equations are not covariant under a Galilean
transformation. Maxwell’s equations are only covariant under a Lorentz transforma-
tion as described in the next section. Expressions (2.61)–(2.64) turn out to be accurate
to order v/c, hence are the results of a ?rst-order Lorentz transformation. Only when
v is an appreciable fraction of c do the ?eld conversions resulting from the ?rst-order
Lorentz transformation di?er markedly from those resulting from a Galilean transforma-
tion; those resulting from the true Lorentz transformation require even higher velocities
to di?er markedly from the ?rst-order expressions. Engineering accuracy is often accom-
plished using the Galilean transformation. This pragmatic observation leads to quite a
bit of confusion when considering the large-scale forms of Maxwell’s equations, as we
shall soon see.
2.3.2 Field conversions under Lorentz transformation
To ?nd the proper transformation under which Maxwell’s equations are covariant,
we must discard our notion that time progresses the same in the primed and the un-
primed frames. The proper transformation of coordinates that guarantees covariance of
Maxwell’s equations is the Lorentz transformation
ct
prime
=γct ?γβ· r, (2.71)
r
prime
= ˉα· r ?γβct, (2.72)
where
γ =
1
radicalbig
1 ?β
2
, ˉα =
ˉ
I +(γ ? 1)
ββ
β
2
,β=|β|.
This is obviously more complicated than the Galilean transformation; only as β → 0 are
the Lorentz and Galilean transformations equivalent.
Not surprisingly, ?eld conversions between inertial reference frames are more com-
plicated with the Lorentz transformation than with the Galilean transformation. For
simplicity we assume that the velocity of the moving frame has only an x-component:
v = ?xv. Later we can generalize this to any direction. Equations (2.71) and (2.72)
become
x
prime
= x +(γ ? 1)x ?γvt, (2.73)
y
prime
= y, (2.74)
z
prime
= z, (2.75)
ct
prime
=γct ?γ
v
c
x, (2.76)
and the chain rule gives
?
?x
=γ
?
?x
prime
?γ
v
c
2
?
?t
prime
, (2.77)
?
?y
=
?
?y
prime
, (2.78)
?
?z
=
?
?z
prime
, (2.79)
?
?t
=?γv
?
?x
prime
+γ
?
?t
prime
. (2.80)
We begin by examining Faraday’s law in the laboratory frame. In component form we
have
?E
z
?y
?
?E
y
?z
=?
?B
x
?t
, (2.81)
?E
x
?z
?
?E
z
?x
=?
?B
y
?t
, (2.82)
?E
y
?x
?
?E
x
?y
=?
?B
z
?t
. (2.83)
These become
?E
z
?y
prime
?
?E
y
?z
prime
=γv
?B
x
?x
prime
?γ
?B
x
?t
prime
, (2.84)
?E
x
?z
prime
?γ
?E
z
?x
prime
+γ
v
c
2
?E
z
?t
prime
=γv
?B
y
?x
prime
?γ
?B
y
?t
prime
, (2.85)
γ
?E
y
?x
prime
?γ
v
c
2
?E
y
?t
prime
?
?E
x
?y
prime
=γv
?B
z
?x
prime
?γ
?B
z
?t
prime
, (2.86)
after we use (2.77)–(2.80) to convert the derivatives in the laboratory frame to derivatives
with respect to the moving frame coordinates. To simplify (2.84) we consider
?·B =
?B
x
?x
+
?B
y
?y
+
?B
z
?z
= 0.
Converting the laboratory frame coordinates to the moving frame coordinates, we have
γ
?B
x
?x
prime
?γ
v
c
2
?B
x
?t
prime
+
?B
y
?y
prime
+
?B
z
?z
prime
= 0
or
?γv
?B
x
?x
prime
=?γ
v
2
c
2
?B
x
?t
prime
+v
?B
y
?y
prime
+v
?B
z
?z
prime
.
Substituting this into (2.84) and rearranging (2.85) and (2.86), we obtain
?
?y
prime
γ(E
z
+vB
y
)?
?
?z
prime
γ(E
y
?vB
z
)=?
?B
x
?t
prime
,
?E
x
?z
prime
?
?
?x
prime
γ(E
z
+vB
y
)=?
?
?t
prime
γ
parenleftBig
B
y
+
v
c
2
E
z
parenrightBig
,
?
?x
prime
γ(E
y
?vB
z
)?
?E
x
?y
prime
=?
?
?t
prime
γ
parenleftBig
B
z
?
v
c
2
E
y
parenrightBig
.
Comparison with (2.81)–(2.83) shows that form invariance of Faraday’s law under the
Lorentz transformation requires
E
prime
x
= E
x
, E
prime
y
=γ(E
y
?vB
z
), E
prime
z
=γ(E
z
+vB
y
),
and
B
prime
x
= B
x
, B
prime
y
=γ
parenleftBig
B
y
+
v
c
2
E
z
parenrightBig
, B
prime
z
=γ
parenleftBig
B
z
?
v
c
2
E
y
parenrightBig
.
To generalize v to any direction, we simply note that the components of the ?elds parallel
to the velocity direction are identical in the moving and laboratory frames, while the
components perpendicular to the velocity direction convert according to a simple cross
product rule. After similar analyses with Ampere’s and Gauss’s laws (see Problem 2.2),
we ?nd that
E
prime
bardbl
= E
bardbl
, B
prime
bardbl
= B
bardbl
, D
prime
bardbl
= D
bardbl
, H
prime
bardbl
= H
bardbl
,
E
prime
⊥
=γ(E
⊥
+β× cB
⊥
), (2.87)
cB
prime
⊥
=γ(cB
⊥
?β× E
⊥
), (2.88)
cD
prime
⊥
=γ(cD
⊥
+β× H
⊥
), (2.89)
H
prime
⊥
=γ(H
⊥
?β× cD
⊥
), (2.90)
and
J
prime
bardbl
=γ(J
bardbl
?ρv), (2.91)
J
prime
⊥
= J
⊥
, (2.92)
cρ
prime
=γ(cρ?β· J), (2.93)
where the symbols bardbl and ⊥ designate the components of the ?eld parallel and perpen-
dicular to v, respectively.
These conversions are self-consistent, and the Lorentz transformation is the transfor-
mation under which Maxwell’s equations are covariant. If v
2
lessmuch c
2
, then γ ≈ 1 and to
?rst order (2.87)–(2.93) reduce to (2.61)–(2.66). If v/c lessmuch 1, then the ?rst-order ?elds
reduce to the Galilean ?elds (2.49)–(2.54).
To convert in the opposite direction, we can swap primed and unprimed ?elds and
change the sign on v:
E
⊥
=γ(E
prime
⊥
?β× cB
prime
⊥
), (2.94)
cB
⊥
=γ(cB
prime
⊥
+β× E
prime
⊥
), (2.95)
cD
⊥
=γ(cD
prime
⊥
?β× H
prime
⊥
), (2.96)
H
⊥
=γ(H
prime
⊥
+β× cD
prime
⊥
), (2.97)
and
J
bardbl
=γ(J
prime
bardbl
+ρ
prime
v), (2.98)
J
⊥
= J
prime
⊥
, (2.99)
cρ =γ(cρ
prime
+β· J
prime
). (2.100)
The conversion formulas can be written much more succinctly in dyadic notation:
E
prime
=γ ˉα
?1
· E +γ
ˉ
β·(cB), (2.101)
cB
prime
=?γ
ˉ
β· E +γ ˉα
?1
·(cB), (2.102)
cD
prime
=γ ˉα
?1
·(cD)+γ
ˉ
β· H, (2.103)
H
prime
=?γ
ˉ
β·(cD)+γ ˉα
?1
· H, (2.104)
and
cρ
prime
=γ(cρ?β· J), (2.105)
J
prime
= ˉα· J ?γβcρ, (2.106)
where ˉα
?1
· ˉα =
ˉ
I, and thus ˉα
?1
= ˉα?γββ.
Maxwell’s equations are covariant under a Lorentz transformation but not under a
Galilean transformation; the laws of mechanics are invariant under a Galilean transfor-
mationbutnotunderaLorentztransformation. Howthenshouldweanalyzeinteractions
between electromagnetic ?elds and particles or materials? Einstein realized that the laws
of mechanics needed revision to make them Lorentz covariant: in fact, under his theory of
special relativity all physical laws should demonstrate Lorentz covariance. Interestingly,
charge is then Lorentz invariant, whereas mass is not (recall that invariance refers to a
quantity, whereas covariance refers to the form of a natural law). We shall not attempt
to describe all the rami?cations of special relativity, but instead refer the reader to any
of the excellent and readable texts on the subject, including those by Bohm [14], Einstein
[62], and Born [18], and to the nice historical account by Miller [130]. However, we shall
examine the importance of Lorentz invariants in electromagnetic theory.
Lorentz invariants. Although the electromagnetic ?elds are not Lorentz invariant
(e.g., the numerical value of E measured by one observer di?ers from that measured by
another observer in uniform relative motion), several quantities do give identical values
regardless of the velocity of motion. Most fundamental are the speed of light and the
quantity of electric charge which, unlike mass, is the same in all frames of reference.
Other important Lorentz invariants include E · B, H · D, and the quantities
B · B ? E · E/c
2
,
H · H ? c
2
D · D,
B · H ? E · D,
cB · D + E · H/c.
(See Problem 2.3.) To see the importance of these quantities, consider the special case
of ?elds in empty space. If E·B = 0 in one reference frame, then it is zero in all reference
frames. Then if B · B ? E · E/c
2
= 0 in any reference frame, the ratio of E to B is
always c
2
regardless of the reference frame in which the ?elds are measured. This is the
characteristic of a plane wave in free space.
If E · B = 0 and c
2
B
2
> E
2
, then we can ?nd a reference frame using the conversion
formulas (2.101)–(2.106) (see Problem 2.5) in which the electric ?eld is zero but the
magnetic ?eld is nonzero. In this case we call the ?elds purely magnetic in any reference
frame, even if both E and B are nonzero. Similarly, if E · B = 0 and c
2
B
2
< E
2
then
we can ?nd a reference frame in which the magnetic ?eld is zero but the electric ?eld is
nonzero. We call ?elds of this type purely electric.
The Lorentz force is not Lorentz invariant. Consider a point charge at rest in the
laboratory frame. While we measure only an electric ?eld in the laboratory frame, an
inertial observer measures both electric and magnetic ?elds. A test charge Q in the
laboratory frame experiences the Lorentz force F = QE; in an inertial frame the same
charge experiences F
prime
= QE
prime
+ Qv×B
prime
(see Problem 2.6). The conversion formulas show
that F and F
prime
are not identical.
We see that both E and B are integral components of the electromagnetic ?eld: the
separation of the ?eld into electric and magnetic components depends on the motion
of the reference frame in which measurements are made. This has obvious implications
when considering static electric and magnetic ?elds.
DerivationofMaxwell’sequationsfromCoulomb’slaw. Considerapointcharge
at rest in the laboratory frame. If the magnetic component of force on this charge arises
naturally through motion of an inertial reference frame, and if this force can be expressed
intermsofCoulomb’slawinthelaboratoryframe, thenperhapsthemagnetic?eldcanbe
derived directly from Coulomb’s and the Lorentz transformation. Perhaps it is possible
to derive all of Maxwell’s theory with Coulomb’s law and Lorentz invariance as the only
postulates.
Several authors, notably Purcell [152] and Elliott [65], have used this approach. How-
ever, Jackson [91] has pointed out that many additional assumptions are required to
deduce Maxwell’s equations beginning with Coulomb’s law. Feynman [73] is critical of
the approach, pointing out that we must introduce a vector potential which adds to the
scalar potential from electrostatics in order to produce an entity that transforms accord-
ing to the laws of special relativity. In addition, the assumption of Lorentz invariance
seems to involve circular reasoning since the Lorentz transformation was originally in-
troduced to make Maxwell’s equations covariant. But Lucas and Hodgson [117] point
out that the Lorentz transformation can be deduced from other fundamental principles
(such as causality and the isotropy of space), and that the postulate of a vector potential
is reasonable. Schwartz [170] gives a detailed derivation of Maxwell’s equations from
Coulomb’s law, outlining the necessary assumptions.
Transformationofconstitutiverelations. Minkowski’sinterestinthecovarianceof
Maxwell’s equations was aimed not merely at the relationship between ?elds in di?erent
moving frames of reference, but at an understanding of the electrodynamics of moving
media. Hewishedtoascertainthee?ectofamovingmaterialbodyontheelectromagnetic
?elds in some region of space. By proposing the covariance of Maxwell’s equations in
materials as well as in free space, he extended Maxwell’s theory to moving material
bodies.
We have seen in (2.101)–(2.104) that (E,cB) and (cD,H) convert identically under a
Lorentz transformation. Since the most general form of the constitutive relations relate
cD and H to the ?eld pair (E,cB) (see § 2.2.2) as
bracketleftbigg
cD
H
bracketrightbigg
=
bracketleftbig
ˉ
C
bracketrightbig
bracketleftbigg
E
cB
bracketrightbigg
,
this form of the constitutive relations must be Lorentz covariant. That is, in the reference
frame of a moving material we have
bracketleftbigg
cD
prime
H
prime
bracketrightbigg
=
bracketleftbig
ˉ
C
prime
bracketrightbig
bracketleftbigg
E
prime
cB
prime
bracketrightbigg
,
and should be able to convert [
ˉ
C
prime
] to [
ˉ
C]. We should be able to ?nd the constitutive
matrix describing the relationships among the ?elds observed in the laboratory frame.
It is somewhat laborious to obtain the constitutive matrix [
ˉ
C] for an arbitrary moving
medium. Detailed expressions for isotropic, bianisotropic, gyrotropic, and uniaxial media
are given by Kong [101]. The rather complicated expressions can be written in a more
compact form if we consider the expressions for B and D in terms of the pair (E,H).
For a linear isotropic material such that D
prime
= epsilon1
prime
E
prime
and B
prime
= μ
prime
H
prime
in the moving frame,
the relationships in the laboratory frame are [101]
B =μ
prime
ˉ
A · H ??× E, (2.107)
D =epsilon1
prime
ˉ
A · E +?× H, (2.108)
where
ˉ
A =
1 ?β
2
1 ? n
2
β
2
bracketleftbigg
ˉ
I ?
n
2
? 1
1 ?β
2
ββ
bracketrightbigg
, (2.109)
? =
n
2
? 1
1 ? n
2
β
2
β
c
, (2.110)
and where n = c(μ
prime
epsilon1
prime
)
1/2
is the optical index of the medium. A moving material that
is isotropic in its own moving reference frame is bianisotropic in the laboratory frame.
If, for instance, we tried to measure the relationship between the ?elds of a moving
isotropic?uid, butusedinstrumentsthatwerestationaryinourlaboratory(e.g., attached
to our measurement bench) we would ?nd that D depends not only on E but also on
H, and that D aligns with neither E nor H. That a moving material isotropic in its
own frame of reference is bianisotropic in the laboratory frame was known long ago.
Roentgen showed experimentally in 1888 that a dielectric moving through an electric
?eld becomes magnetically polarized, while H.A. Wilson showed in 1905 that a dielectric
moving through a magnetic ?eld becomes electrically polarized [139].
If v
2
/c
2
lessmuch 1, we can consider the form of the constitutive equations for a ?rst-order
Lorentz transformation. Ignoring terms to order v
2
/c
2
in (2.109) and (2.110), we obtain
ˉ
A =
ˉ
I and ?= v(n
2
? 1)/c
2
. Then, by (2.107) and (2.108),
B =μ
prime
H ?(n
2
? 1)
v × E
c
2
, (2.111)
D =epsilon1
prime
E +(n
2
? 1)
v × H
c
2
. (2.112)
We can also derive these from the ?rst-order ?eld conversion equations (2.61)–(2.64).
From (2.61) and (2.62) we have
D
prime
= D + v × H/c
2
=epsilon1
prime
E
prime
=epsilon1
prime
(E + v × B).
Eliminating B via (2.64), we have
D + v × H/c
2
=epsilon1
prime
E +epsilon1
prime
v ×(v × E/c
2
)+epsilon1
prime
v × B
prime
=epsilon1
prime
E +epsilon1
prime
v × B
prime
where we have neglected terms of order v
2
/c
2
. Since B
prime
= μ
prime
H
prime
= μ
prime
(H ? v × D),we
have
D + v × H/c
2
=epsilon1
prime
E +epsilon1
prime
μ
prime
v × H ?epsilon1
prime
μ
prime
v × v × D.
Using n
2
= c
2
μ
prime
epsilon1
prime
and neglecting the last term since it is of order v
2
/c
2
, we obtain
D =epsilon1
prime
E +(n
2
? 1)
v × H
c
2
,
which is identical to the expression (2.112) obtained by approximating the exact result
to ?rst order. Similar steps produce (2.111). In a Galilean frame where v/c lessmuch 1, the
expressions reduce to D =epsilon1
prime
E and B =μ
prime
H, and the isotropy of the ?elds is preserved.
For a conducting medium having
J
prime
=σ
prime
E
prime
in a moving reference frame, Cullwick [48] shows that in the laboratory frame
J =σ
prime
γ[
ˉ
I ?ββ] · E +σ
prime
γcβ× B.
For v lessmuch c we can set γ ≈ 1 and see that
J =σ
prime
(E + v × B)
to ?rst order.
Constitutive relations in deforming or rotating media. The transformations
discussed in the previous paragraphs hold for media in uniform relative motion. When
a material body undergoes deformation or rotation, the concepts of special relativity are
not directly applicable. However, authors such as Pauli [144] and Sommerfeld [185] have
maintained that Minkowski’s theory is approximately valid for deforming or rotating
media if v is taken to be the instantaneous velocity at each point within the body.
The reasoning is that at any instant in time each point within the body has a velocity
v that may be associated with some inertial reference frame (generally di?erent for
each point). Thus the constitutive relations for the material at that point, within some
small time interval taken about the observation time, may be assumed to be those of
a stationary material, and the relations measured by an observer within the laboratory
frame may be computed using the inertial frame for that point. This instantaneous rest-
frame theory is most accurate at small accelerations dv/dt. Van Bladel [201] outlines
its shortcomings. See also Anderson [3] and Mo [132] for detailed discussions of the
electromagnetic properties of material media in accelerating frames of reference.
2.4 TheMaxwell–Bo?equations
In any version of Maxwell’s theory, the mediating ?eld is the electromagnetic ?eld
described by four ?eld vectors. In Minkowski’s form of Maxwell’s equations we use E,
D, B, and H. As an alternative consider the electromagnetic ?eld as represented by the
vector ?elds E, B, P, and M, and described by
?×E =?
?B
?t
, (2.113)
?×(B/μ
0
? M)= J +
?
?t
(epsilon1
0
E + P), (2.114)
?·(epsilon1
0
E + P)=ρ, (2.115)
?·B = 0. (2.116)
These Maxwell–Bo? equations are named after L. Bo?, who formalized them for moving
media [13]. The quantity P is the polarization vector, and M is the magnetization vector.
The use of P and M in place of D and H is sometimes called an application of the principle
of Ampere and Lorentz [199].
Let us examine the rami?cation of using (2.113)–(2.116) as the basis for a postulate
of electromagnetics. These equations are similar to the Maxwell–Minkowski equations
used earlier; must we rebuild all the underpinning of a new postulate, or can we use
our original arguments based on the Minkowski form? For instance, how do we invoke
uniqueness if we no longer have the ?eld H? What represents the ?ux of energy, formerly
found using E×H? And, importantly, are (2.113)–(2.114) form invariant under a Lorentz
transformation?
It turns out that the set of vector ?elds (E,B,P,M) is merely a linear mapping of
the set (E,D,B,H). As pointed out by Tai [193], any linear mapping of the four ?eld
vectors from Minkowski’s form onto any other set of four ?eld vectors will preserve the
covariance of Maxwell’s equations. Bo? chose to keep E and B intact and to introduce
only two new ?elds; he could have kept H and D instead, or used a mapping that
introduced four completely new ?elds (as did Chu). Many authors retain E and H.
This is somewhat more cumbersome since these vectors do not convert as a pair under
a Lorentz transformation. A discussion of the idea of ?eld vector “pairing” appears in
§ 2.6.
The usefulness of the Bo? form lies in the speci?c mapping chosen. Comparison of
(2.113)–(2.116) to (2.1)–(2.4) quickly reveals that
P = D ?epsilon1
0
E, (2.117)
M = B/μ
0
? H. (2.118)
We see that P is the di?erence between D in a material and D in free space, while M is
the di?erence between H in free space and H in a material. In free space, P = M = 0.
Equivalent polarization and magnetization sources. The Bo? formulation pro-
videsanewwaytoregard E and B. Maxwellgrouped(E,H)asapairof“forcevectors”to
be associated with line integrals (or curl operations in the point forms of his equations),
and (D,B) as a pair of “?ux vectors” associated with surface integrals (or divergence
operations). That is, E is interpreted as belonging to the computation of “emf” as a line
integral, while B is interpreted as a density of magnetic “?ux” passing through a surface.
Similarly, H yields the “mmf” about some closed path and D the electric ?ux through
a surface. The introduction of P and M allows us to also regard E as a ?ux vector and
B as a force vector — in essence, allowing the two ?elds E and B to take on the duties
that required four ?elds in Minkowski’s form. To see this, we rewrite the Maxwell–Bo?
equations as
?×E =?
?B
?t
,
?×
B
μ
0
=
parenleftbigg
J +?×M +
?P
?t
parenrightbigg
+
?epsilon1
0
E
?t
,
?·(epsilon1
0
E)=(ρ??·P),
?·B = 0,
and compare them to the Maxwell–Minkowski equations for sources in free space:
?×E =?
?B
?t
,
?×
B
μ
0
= J +
?epsilon1
0
E
?t
,
?·(epsilon1
0
E)=ρ,
?·B = 0.
The forms are preserved if we identify ?P/?t and ?×M as new types of current density,
and ?·P as a new type of charge density. We de?ne
J
P
=
?P
?t
(2.119)
as an equivalent polarization current density, and
J
M
=?×M
asanequivalent magnetization current density(sometimescalledtheequivalent Amperian
currents of magnetized matter [199]). We de?ne
ρ
P
=??·P
as an equivalent polarization charge density (sometimes called the Poisson–Kelvin equiv-
alent charge distribution [199]). Then the Maxwell–Bo? equations become simply
?×E =?
?B
?t
, (2.120)
?×
B
μ
0
=(J + J
M
+ J
P
)+
?epsilon1
0
E
?t
, (2.121)
?·(epsilon1
0
E)=(ρ+ρ
P
), (2.122)
?·B = 0. (2.123)
Here is the new view. A material can be viewed as composed of charged particles of
matter immersed in free space. When these charges are properly considered as “equiv-
alent” polarization and magnetization charges, all ?eld e?ects (describable through ?ux
and force vectors) can be handled by the two ?elds E and B. Whereas in Minkowski’s
form D diverges from ρ, in Bo?’s form E diverges from a total charge density consisting
of ρ and ρ
P
. Whereas in the Minkowski form H curls around J, in the Bo? form B curls
around the total current density consisting of J, J
M
, and J
P
.
This view was pioneered by Lorentz, who by 1892considered matter as consisting of
bulk molecules in a vacuum that would respond to an applied electromagnetic ?eld [130].
The resulting motion of the charged particles of matter then became another source
term for the “fundamental” ?elds E and B. Using this reasoning he was able to reduce
the fundamental Maxwell equations to two equations in two unknowns, demonstrating a
simplicity appealing to many (including Einstein). Of course, to apply this concept we
must be able to describe how the charged particles respond to an applied ?eld. Simple
microscopic models of the constituents of matter are generally used: some combination
of electric and magnetic dipoles, or of loops of electric and magnetic current.
The Bo? equations are mathematically appealing since they now specify both the curl
and divergence of the two ?eld quantities E and B. By the Helmholtz theorem we know
that a ?eld vector is uniquely speci?ed when both its curl and divergence are given. But
this assumes that the equivalent sources produced by P and M are true source ?elds in
the same sense as J. We have precluded this by insisting in Chapter 1 that the source
?eld must be independent of the mediating ?eld it sources. If we view P and M as
merely a mapping from the original vector ?elds of Minkowski’s form, we still have four
vector ?elds with which to contend. And with these must also be a mapping of the
constitutive relationships, which now link the ?elds E, B, P, and M. Rather than argue
the actual physical existence of the equivalent sources, we note that a real bene?t of
the new view is that under certain circumstances the equivalent source quantities can be
determined through physical reasoning, hence we can create physical models of P and M
and deduce their links to E and B. We may then ?nd it easier to understand and deduce
the constitutive relationships. However we do not in general consider E and B to be in
any way more “fundamental” than D and H.
CovarianceoftheBo?form. Becauseofthelinearrelationships(2.117)and(2.118),
covariance of the Maxwell–Minkowski equations carries over to the Maxwell–Bo? equa-
tions. However, the conversion between ?elds in di?erent moving reference frames will
now involve P and M. Since Faraday’s law is unchanged in the Bo? form, we still have
E
prime
bardbl
= E
bardbl
, (2.124)
B
prime
bardbl
= B
bardbl
, (2.125)
E
prime
⊥
=γ(E
⊥
+β× cB
⊥
), (2.126)
cB
prime
⊥
=γ(cB
⊥
?β× E
⊥
). (2.127)
To see how P and M convert, we note that in the laboratory frame D = epsilon1
0
E + P and
H = B/μ
0
? M, while in the moving frame D
prime
=epsilon1
0
E
prime
+ P
prime
and H
prime
= B
prime
/μ
0
? M
prime
.Thus
P
prime
bardbl
= D
prime
bardbl
?epsilon1
0
E
prime
bardbl
= D
bardbl
?epsilon1
0
E
bardbl
= P
bardbl
and
M
prime
bardbl
= B
prime
bardbl
/μ
0
? H
prime
bardbl
= B
bardbl
/μ
0
? H
bardbl
= M
bardbl
.
For the perpendicular components
D
prime
⊥
=γ(D
⊥
+β× H
⊥
/c)=epsilon1
0
E
prime
⊥
+ P
prime
⊥
=epsilon1
0
[γ(E
⊥
+β× cB
⊥
)] + P
prime
⊥
;
substitution of H
⊥
= B
⊥
/μ
0
? M
⊥
then gives
P
prime
⊥
=γ(D
⊥
?epsilon1
0
E
⊥
)?γepsilon1
0
β× cB
⊥
+γβ× B
⊥
/(cμ
0
)?γβ× M
⊥
/c
or
cP
prime
⊥
=γ(cP
⊥
?β× M
⊥
).
Similarly,
M
prime
⊥
=γ(M
⊥
+β× cP
⊥
).
Hence
E
prime
bardbl
= E
bardbl
, B
prime
bardbl
= B
bardbl
, P
prime
bardbl
= P
bardbl
, M
prime
bardbl
= M
bardbl
, J
prime
⊥
= J
⊥
, (2.128)
and
E
prime
⊥
=γ(E
⊥
+β× cB
⊥
), (2.129)
cB
prime
⊥
=γ(cB
⊥
?β× E
⊥
), (2.130)
cP
prime
⊥
=γ(cP
⊥
?β× M
⊥
), (2.131)
M
prime
⊥
=γ(M
⊥
+β× cP
⊥
), (2.132)
J
prime
bardbl
=γ(J
bardbl
?ρv). (2.133)
In the case of the ?rst-order Lorentz transformation we can set γ ≈ 1 to obtain
E
prime
= E + v × B, (2.134)
B
prime
= B ?
v × E
c
2
, (2.135)
P
prime
= P ?
v × M
c
2
, (2.136)
M
prime
= M + v × P, (2.137)
J
prime
= J ?ρv. (2.138)
To convert from the moving frame to the laboratory frame we simply swap primed with
unprimed ?elds and let v →?v.
As a simple example, consider a linear isotropic medium having
D
prime
=epsilon1
0
epsilon1
prime
r
E
prime
, B
prime
=μ
0
μ
prime
r
H
prime
,
in a moving reference frame. From (117) we have
P
prime
=epsilon1
0
epsilon1
prime
r
E
prime
?epsilon1
0
E
prime
=epsilon1
0
χ
prime
e
E
prime
where χ
prime
e
=epsilon1
prime
r
? 1 is the electric susceptibility of the moving material. Similarly (2.118)
yields
M
prime
=
B
prime
μ
0
?
B
prime
μ
0
μ
prime
r
=
B
prime
χ
prime
m
μ
0
μ
prime
r
where χ
prime
m
=μ
prime
r
?1 is the magnetic susceptibility of the moving material. How are P and
M related to E and B in the laboratory frame? For simplicity, we consider the ?rst-order
expressions. From (2.136) we have
P = P
prime
+
v × M
prime
c
2
=epsilon1
0
χ
prime
e
E
prime
+
v × B
prime
χ
prime
m
μ
0
μ
prime
r
c
2
.
Substituting for E
prime
and B
prime
from (2.134) and (2.135), and using μ
0
c
2
= 1/epsilon1
0
,wehave
P =epsilon1
0
χ
prime
e
(E + v × B)+epsilon1
0
χ
prime
m
μ
prime
r
v ×
parenleftbigg
B ?
v × E
c
2
parenrightbigg
.
Neglecting the last term since it varies as v
2
/c
2
,weget
P =epsilon1
0
χ
prime
e
E +epsilon1
0
parenleftbigg
χ
prime
e
+
χ
prime
m
μ
prime
r
parenrightbigg
v × B. (2.139)
Similarly,
M =
χ
prime
m
μ
0
μ
prime
r
B ?epsilon1
0
parenleftbigg
χ
prime
e
+
χ
prime
m
μ
prime
r
parenrightbigg
v × E. (2.140)
2.5 Large-scaleformofMaxwell’sequations
We can write Maxwell’s equations in a form that incorporates the spatial variation of
the ?eld in a certain region of space. To do this, we integrate the point form of Maxwell’s
Figure 2.2: Open surface having velocity v relative to laboratory (unprimed) coordinate
system. Surface is non-deforming.
equations over a region of space, then perform some succession of manipulations until
we arrive at a form that provides us some bene?t in our work with electromagnetic
?elds. The results are particularly useful for understanding the properties of electric and
magnetic circuits, and for predicting the behavior of electrical machinery.
We shall consider two important situations: a mathematical surface that moves with
constant velocity v and with constant shape, and a surface that moves and deforms
arbitrarily.
2.5.1 Surface moving with constant velocity
Consider an open surface S moving with constant velocity v relative to the laboratory
frame(Figure2.2).Assumeeverypointonthesurfaceisanordinarypoint.Atany
instant t we can express the relationship between the ?elds at points on S in either
frame. In the laboratory frame we have
?×E =?
?B
?t
, ?×H =
?D
?t
+ J,
while in the moving frame
?
prime
× E
prime
=?
?B
prime
?t
prime
, ?
prime
× H
prime
=
?D
prime
?t
prime
+ J
prime
.
If we integrate over S and use Stokes’s theorem, we get for the laboratory frame
contintegraldisplay
Gamma1
E · dl =?
integraldisplay
S
?B
?t
· dS, (2.141)
contintegraldisplay
Gamma1
H · dl =
integraldisplay
S
?D
?t
· dS +
integraldisplay
S
J · dS, (2.142)
and for the moving frame
contintegraldisplay
Gamma1
prime
E
prime
· dl
prime
=?
integraldisplay
S
prime
?B
prime
?t
prime
· dS
prime
, (2.143)
contintegraldisplay
Gamma1
prime
H
prime
· dl
prime
=
integraldisplay
S
prime
?D
prime
?t
prime
· dS
prime
+
integraldisplay
S
prime
J
prime
· dS
prime
. (2.144)
Here boundary contour Gamma1 has sense determined by the right-hand rule. We use the
notation Gamma1
prime
, S
prime
, etc., to indicate that all integrations for the moving frame are computed
using space and time variables in that frame. Equation (2.141) is the integral form of
Faraday’s law, while (2.142) is the integral form of Ampere’s law.
Faraday’s law states that the net circulation of E about a contour Gamma1 (sometimes called
the electromotive force or emf) is determined by the ?ux of the time-rate of change of the
?ux vector B passing through the surface bounded by Gamma1. Ampere’s law states that the
circulation of H (sometimes called the magnetomotive force or mmf) is determined by
the ?ux of the current J plus the ?ux of the time-rate of change of the ?ux vector D.Itis
the term containing ?D/?t that Maxwell recognized as necessary to make his equations
consistent; since it has units of current, it is often referred to as the displacement current
term.
Equations (2.141)–(2.142) are the large-scale or integral forms of Maxwell’s equations.
They are the integral-form equivalents of the point forms, and are form invariant under
Lorentz transformation. If we express the ?elds in terms of the moving reference frame,
we can write
contintegraldisplay
Gamma1
prime
E
prime
· dl
prime
=?
d
dt
integraldisplay
S
prime
B
prime
· dS
prime
, (2.145)
contintegraldisplay
Gamma1
prime
H
prime
· dl
prime
=
d
dt
integraldisplay
S
prime
D
prime
· dS
prime
+
integraldisplay
S
prime
J
prime
· dS
prime
. (2.146)
These hold for a stationary surface, since the surface would be stationary to an observer
who moves with it. We are therefore justi?ed in removing the partial derivative from the
integral. Although the surfaces and contours considered here are purely mathematical,
they often coincide with actual physical boundaries. The surface may surround a moving
material medium, for instance, or the contour may conform to a wire moving in an
electrical machine.
We can also convert the auxiliary equations to large-scale form. Consider a volume
region V surrounded by a surface S that moves with velocity v relative to the laboratory
frame(Figure2.3).IntegratingthepointformofGauss’slawover V wehave
integraldisplay
V
?·D dV =
integraldisplay
V
ρ dV.
Using the divergence theorem and recognizing that the integral of charge density is total
charge, we obtain
contintegraldisplay
S
D · dS =
integraldisplay
V
ρ dV = Q(t) (2.147)
where Q(t) is the total charge contained within V at time t. This large-scale form of
Gauss’s law states that the total ?ux of D passing through a closed surface is identical
to the electric charge Q contained within. Similarly,
contintegraldisplay
S
B · dS = 0 (2.148)
Figure 2.3: Non-deforming volume region having velocity v relative to laboratory (un-
primed) coordinate system.
is the large-scale magnetic ?eld Gauss’s law. It states that the total ?ux of B passing
through a closed surface is zero, since there are no magnetic charges contained within
(i.e., magnetic charge does not exist).
Since charge is an invariant quantity, the large-scale forms of the auxiliary equations
take the same form in a moving reference frame:
contintegraldisplay
S
prime
D
prime
· dS
prime
=
integraldisplay
V
prime
ρ
prime
dV
prime
= Q(t) (2.149)
and
contintegraldisplay
S
prime
B
prime
· dS
prime
= 0. (2.150)
The large-scale forms of the auxiliary equations may be derived from the large-scale
forms of Faraday’s and Ampere’s laws. To obtain Gauss’s law, we let the open surface
in Ampere’s law become a closed surface. Then
contintegraltext
H · dl vanishes, and application of
the large-scale form of the continuity equation (1.10) produces (2.147). The magnetic
Gauss’s law (2.148) is found from Faraday’s law (2.141) by a similar transition from an
open surface to a closed surface.
The values obtained from the expressions (2.141)–(2.142) will not match those ob-
tained from (2.143)–(2.144), and we can use the Lorentz transformation ?eld conversions
to study how they di?er. That is, we can write either side of the laboratory equations in
terms of the moving reference frame ?elds, or vice versa. For most engineering applica-
tions where v/c lessmuch 1 this is not done via the Lorentz transformation ?eld relations, but
rather via the Galilean approximations to these relations (see Tai [194] for details on us-
ing the Lorentz transformation ?eld relations). We consider the most common situation
in the next section.
Kinematicformofthelarge-scaleMaxwellequations. Confusioncanresultfrom
the fact that the large-scale forms of Maxwell’s equations can be written in a number of
Figure 2.4: Non-deforming closed contour moving with velocity v through a magnetic
?eld B given in the laboratory (unprimed) coordinate system.
ways. A popular formulation of Faraday’s law, the emf formulation, revolves around the
concept of electromotive force. Unfortunately, various authors o?er di?erent de?nitions
of emf in a moving circuit.
Consider a non-deforming contour in space, moving with constant velocity v relative
tothelaboratoryframe(Figure2.4).Intermsofthelaboratory?eldswehavethelarge-
scale form of Faraday’s law (2.141). The ?ux term on the right-hand side of this equation
can be written di?erently by employing the Helmholtz transport theorem (A.63). If a
non-deforming surface S moves with uniform velocity v relative to the laboratory frame,
and a vector ?eld A(r,t) is expressed in the stationary frame, then the time derivative
of the ?ux of A through S is
d
dt
integraldisplay
S
A · dS =
integraldisplay
S
bracketleftbigg
?A
?t
+ v(?·A)??×(v × A)
bracketrightbigg
· dS. (2.151)
Using this with (2.141) we have
contintegraldisplay
Gamma1
E · dl =?
d
dt
integraldisplay
S
B · dS +
integraldisplay
S
v(?·B)· dS ?
integraldisplay
S
?×(v × B)· dS.
Remembering that ?·B = 0 and using Stokes’s theorem on the last term, we obtain
contintegraldisplay
Gamma1
(E + v × B)· dl =?
d
dt
integraldisplay
S
B · dS =?
dPsi1(t)
dt
(2.152)
where the magnetic ?ux
integraldisplay
S
B · dS =Psi1(t)
represents the ?ux of B through S. Following Sommerfeld [185], we may set
E
?
= E + v × B
to obtain the kinematic form of Faraday’s law
contintegraldisplay
Gamma1
E
?
· dl =?
d
dt
integraldisplay
S
B · dS =?
dPsi1(t)
dt
. (2.153)
(The asterisk should not be confused with the notation for complex conjugate.)
Much confusion arises from the similarity between (2.153) and (2.145). In fact, these
expressions are di?erent and give di?erent results. This is because B
prime
in (2.145) is
measured in the frame of the moving circuit, while B in (2.153) is measured in the frame
of the laboratory. Further confusion arises from various de?nitions of emf. Many authors
(e.g., Hermann Weyl [213]) de?ne emf to be the circulation of E
?
. In that case the emf
is equal to the negative time rate of change of the ?ux of the laboratory frame magnetic
?eld B through S. Since the Lorentz force experienced by a charge q moving with the
contour is given by qE
?
= q(E + v × B), this emf is the circulation of Lorentz force
per unit charge along the contour. If the contour is aligned with a conducting circuit,
then in some cases this emf can be given physical interpretation as the work required
to move a charge around the entire circuit through the conductor against the Lorentz
force. Unfortunately the usefulness of this de?nition of emf is lost if the time or space
rate of change of the ?elds is so large that no true loop current can be established
(hence Kircho?’s law cannot be employed). Such a problem must be treated as an
electromagnetic “scattering” problem with consideration given to retardation e?ects.
Detailed discussions of the physical interpretation of E
?
in the de?nition of emf are given
by Scanlon [165] and Cullwick [48].
Other authors choose to de?ne emf as the circulation of the electric ?eld in the frame
of the moving contour. In this case the circulation of E
prime
in (2.145) is the emf, and is
related to the ?ux of the magnetic ?eld in the frame of the moving circuit. As pointed
out above, the result di?ers from that based on the Lorentz force. If we wish, we can
also write this emf in terms of the ?elds expressed in the laboratory frame. To do this we
must convert?B
prime
/?t
prime
to the laboratory ?elds using the rules for a Lorentz transformation.
The result, given by Tai [194], is quite complicated and involves both the magnetic and
electric laboratory-frame ?elds.
The moving-frame emf as computed from the Lorentz transformation is rarely used as
a working de?nition of emf, mostly because circuits moving at relativistic velocities are
seldom used by engineers. Unfortunately, more confusion arises for the case v lessmuch c, since
for a Galilean frame the Lorentz-force and moving-frame emfs become identical. This
is apparent if we use (2.52) to replace B
prime
with the laboratory frame ?eld B, and (2.49)
to replace E
prime
with the combination of laboratory frame ?elds E + v × B. Then (2.145)
becomes
contintegraldisplay
Gamma1
E
prime
· dl =
contintegraldisplay
Gamma1
(E + v × B)· dl =?
d
dt
integraldisplay
S
B · dS,
which is identical to (2.153). For circuits moving with low velocity then, the circulation
of E
prime
can be interpreted as work per unit charge. As an added bit of confusion, the term
contintegraldisplay
Gamma1
(v × B)· dl =
integraldisplay
S
?×(v × B)· dS
is sometimes called motional emf, since it is the component of the circulation of E
?
that
is directly attributable to the motion of the circuit.
Although less commonly done, we can also rewrite Ampere’s law (2.142) using (2.151).
This gives
contintegraldisplay
Gamma1
H · dl =
integraldisplay
S
J · dS +
d
dt
integraldisplay
S
D · dS ?
integraldisplay
S
(v?·D)· dS +
integraldisplay
S
?×(v × D)· dS.
Using ?·D =ρ and using Stokes’s theorem on the last term, we obtain
contintegraldisplay
Gamma1
(H ? v × D)· dl =
d
dt
integraldisplay
S
D · dS +
integraldisplay
S
(J ?ρv)· dS.
Finally, letting H
?
= H ? v × D and J
?
= J ?ρv we can write the kinematic form of
Ampere’s law:
contintegraldisplay
Gamma1
H
?
· dl =
d
dt
integraldisplay
S
D · dS +
integraldisplay
S
J
?
· dS. (2.154)
In a Galilean frame where we use (2.49)–(2.54), we see that (2.154) is identical to
contintegraldisplay
Gamma1
H
prime
· dl =
d
dt
integraldisplay
S
D
prime
· dS +
integraldisplay
S
J
prime
· dS (2.155)
where the primed ?elds are measured in the frame of the moving contour. This equiv-
alence does not hold when the Lorentz transformation is used to represent the primed
?elds.
Alternative form of the large-scale Maxwell equations. We can write Maxwell’s
equations in an alternative large-scale form involving only surface and volume integrals.
This will be useful later for establishing the ?eld jump conditions across a material or
source discontinuity. Again we begin with Maxwell’s equations in point form, but instead
of integrating them over an open surface we integrate over a volume region V moving
withvelocity v (Figure2.3).Inthelaboratoryframethisgives
integraldisplay
V
(?×E)dV =?
integraldisplay
V
?B
?t
dV,
integraldisplay
V
(?×H)dV =
integraldisplay
V
parenleftbigg
?D
?t
+ J
parenrightbigg
dV.
An application of curl theorem (B.24) then gives
contintegraldisplay
S
(?n × E)dS =?
integraldisplay
V
?B
?t
dV, (2.156)
contintegraldisplay
S
(?n × H)dS =
integraldisplay
V
parenleftbigg
?D
?t
+ J
parenrightbigg
dV. (2.157)
Similar results are obtained for the ?elds in the moving frame:
contintegraldisplay
S
prime
(?n
prime
× E
prime
)dS
prime
=?
integraldisplay
V
prime
?B
prime
?t
prime
dV
prime
,
contintegraldisplay
S
prime
(?n
prime
× H
prime
)dS
prime
=
integraldisplay
V
prime
parenleftbigg
?D
prime
?t
prime
+ J
prime
parenrightbigg
dV
prime
.
These large-scale forms are an alternative to (2.141)–(2.144). They are also form-
invariant under a Lorentz transformation.
An alternative to the kinematic formulation of (2.153) and (2.154) can be achieved
by applying a kinematic identity for a moving volume region. If V is surrounded by a
surface S that moves with velocity v relative to the laboratory frame, and if a vector ?eld
A is measured in the laboratory frame, then the vector form of the general transport
theorem (A.68) states that
d
dt
integraldisplay
V
A dV =
integraldisplay
V
?A
?t
dV +
contintegraldisplay
S
A(v · ?n)dS. (2.158)
Applying this to (2.156) and (2.157) we have
contintegraldisplay
S
[?n × E ?(v · ?n)B] dS =?
d
dt
integraldisplay
V
B dV, (2.159)
contintegraldisplay
S
[?n × H +(v · ?n)D] dS =
integraldisplay
V
J dV +
d
dt
integraldisplay
V
D dV. (2.160)
We can also apply (2.158) to the large-scale form of the continuity equation (2.10) and
obtain the expression for a volume region moving with velocity v:
contintegraldisplay
S
(J ?ρv)· dS =?
d
dt
integraldisplay
V
ρ dV.
2.5.2 Moving, deforming surfaces
Because (2.151) holds for arbitrarily moving surfaces, the kinematic versions (2.153)
and (2.154) hold when v is interpreted as an instantaneous velocity. However, if the
surface and contour lie within a material body that moves relative to the laboratory
frame, the constitutive equations relating E, D, B, H, and J in the laboratory frame
di?er from those relating the ?elds in the stationary frame of the body (if the body is
not accelerating), and thus the concepts of § 2.3.2 must be employed. This is important
when boundary conditions at a moving surface are needed. Particular care must be taken
when the body accelerates, since the constitutive relations are then only approximate.
The representation (2.145)–(2.146) is also generally valid, provided we de?ne the
primed ?elds as those converted from laboratory ?elds using the Lorentz transforma-
tion with instantaneous velocity v. Here we should use a di?erent inertial frame for each
point in the integration, and align the frame with the velocity vector v at the instant
t. We certainly may do this since we can choose to integrate any function we wish.
However, this representation may not ?nd wide application.
We thus choose the following expressions, valid for arbitrarily moving surfaces con-
taining only regular points, as our general forms of the large-scale Maxwell equations:
contintegraldisplay
Gamma1(t)
E
?
· dl =?
d
dt
integraldisplay
S(t)
B · dS =?
dPsi1(t)
dt
,
contintegraldisplay
Gamma1(t)
H
?
· dl =
d
dt
integraldisplay
S(t)
D · dS +
integraldisplay
S(t)
J
?
· dS,
where
E
?
= E + v × B,
H
?
= H ? v × D,
J
?
= J ?ρv,
and where all ?elds are taken to be measured in the laboratory frame with v the in-
stantaneous velocity of points on the surface and contour relative to that frame. The
constitutive parameters must be considered carefully if the contours and surfaces lie in
a moving material medium.
Kinematic identity (2.158) is also valid for arbitrarily moving surfaces. Thus we have
the following, valid for arbitrarily moving surfaces and volumes containing only regular
points:
contintegraldisplay
S(t)
[?n × E ?(v · ?n)B] dS =?
d
dt
integraldisplay
V(t)
B dV,
contintegraldisplay
S(t)
[?n × H +(v · ?n)D] dS =
integraldisplay
V(t)
J dV +
d
dt
integraldisplay
V(t)
D dV.
We also ?nd that the two Gauss’s law expressions,
contintegraldisplay
S(t)
D · dS =
integraldisplay
V(t)
ρ dV,
contintegraldisplay
S(t)
B · dS = 0,
remain valid.
2.5.3 Large-scale form of the Bo? equations
The Maxwell–Bo? equations can be written in large-scale form using the same ap-
proach as with the Maxwell–Minkowski equations. Integrating (2.120) and (2.121) over
an open surface S and applying Stokes’s theorem, we have
contintegraldisplay
Gamma1
E · dl =?
integraldisplay
S
?B
?t
· dS, (2.161)
contintegraldisplay
Gamma1
B · dl =μ
0
integraldisplay
S
parenleftbigg
J + J
M
+ J
P
+
?epsilon1
0
E
?t
parenrightbigg
· dS, (2.162)
for ?elds in the laboratory frame, and
contintegraldisplay
Gamma1
prime
E
prime
· dl
prime
=?
integraldisplay
S
prime
?B
prime
?t
prime
· dS
prime
,
contintegraldisplay
Gamma1
prime
B
prime
· dl
prime
=μ
0
integraldisplay
S
prime
parenleftbigg
J
prime
+ J
prime
M
+ J
prime
P
+
?epsilon1
0
E
prime
?t
prime
parenrightbigg
· dS
prime
,
for ?elds in a moving frame. We see that Faraday’s law is unmodi?ed by the introduction
of polarization and magnetization, hence our prior discussion of emf for moving contours
remains valid. However, Ampere’s law must be interpreted somewhat di?erently. The
?ux vector B also acts as a force vector, and its circulation is proportional to the out-
?ux of total current, consisting of J plus the equivalent magnetization and polarization
currents plus the displacement current in free space, through the surface bounded by the
circulation contour.
The large-scale forms of the auxiliary equations can be found by integrating (2.122)
and (2.123) over a volume region and applying the divergence theorem. This gives
contintegraldisplay
S
E · dS =
1
epsilon1
0
integraldisplay
V
(ρ+ρ
P
)dV,
contintegraldisplay
S
B · dS = 0,
for the laboratory frame ?elds, and
contintegraldisplay
S
prime
E
prime
· dS
prime
=
1
epsilon1
0
integraldisplay
V
prime
(ρ
prime
+ρ
prime
P
)dV
prime
,
contintegraldisplay
S
prime
B
prime
· dS
prime
= 0,
for the moving frame ?elds. Here we ?nd the force vector E also acting as a ?ux vector,
with the out?ux of E over a closed surface proportional to the sum of the electric and
polarization charges enclosed by the surface.
To provide the alternative representation, we integrate the point forms over V and use
the curl theorem to obtain
contintegraldisplay
S
(?n × E)dS =?
integraldisplay
V
?B
?t
dV, (2.163)
contintegraldisplay
S
(?n × B)dS =μ
0
integraldisplay
V
parenleftbigg
J + J
M
+ J
P
+
?epsilon1
0
E
?t
parenrightbigg
dV, (2.164)
for the laboratory frame ?elds, and
contintegraldisplay
S
prime
(?n
prime
× E
prime
)dS
prime
=?
integraldisplay
V
prime
?B
prime
?t
prime
dV
prime
,
contintegraldisplay
S
prime
(?n
prime
× B
prime
)dS
prime
=μ
0
integraldisplay
V
prime
parenleftbigg
J
prime
+ J
prime
M
+ J
prime
P
+
?epsilon1
0
E
prime
?t
prime
parenrightbigg
dV
prime
,
for the moving frame ?elds.
The large-scale forms of the Bo? equations can also be put into kinematic form using
either (2.151) or (2.158). Using (2.151) on (2.161) and (2.162) we have
contintegraldisplay
Gamma1(t)
E
?
· dl =?
d
dt
integraldisplay
S(t)
B · dS, (2.165)
contintegraldisplay
Gamma1(t)
B
?
· dl =
integraldisplay
S(t)
μ
0
J
?
· dS +
1
c
2
d
dt
integraldisplay
S(t)
E · dS, (2.166)
where
E
?
= E + v × B,
B
?
= B ?
1
c
2
v × E,
J
?
= J + J
M
+ J
P
?(ρ+ρ
P
)v.
Here B
?
is equivalent to the ?rst-order Lorentz transformation representation of the ?eld
in the moving frame (2.64). (The dagger ? should not be confused with the symbol for
the hermitian operation.) Using (2.158) on (2.163) and (2.164) we have
contintegraldisplay
S(t)
[?n × E ?(v · ?n)B] dS=?
d
dt
integraldisplay
V(t)
B dV, (2.167)
and
contintegraldisplay
S(t)
bracketleftbigg
?n × B +
1
c
2
(v · ?n)E
bracketrightbigg
dS =μ
0
integraldisplay
V(t)
(J + J
M
+ J
P
)dV +
1
c
2
d
dt
integraldisplay
V(t)
E dV.
(2.168)
In each case the ?elds are measured in the laboratory frame, and v is measured with
respect to the laboratory frame and may vary arbitrarily over the surface or contour.
2.6 Thenatureofthefour?eldquantities
Since the very inception of Maxwell’s theory, its students have been distressed by the
factthatwhiletherearefourelectromagnetic?elds(E,D,B,H), thereareonlytwofunda-
mental equations (the curl equations) to describe their interrelationship. The relegation
of additional required information to constitutive equations that vary widely between
classes of materials seems to lessen the elegance of the theory. While some may ?nd
elegant the separation of equations into a set expressing the basic wave nature of electro-
magnetism and a set describing how the ?elds interact with materials, the history of the
discipline is one of categorizing and pairing ?elds as “fundamental” and “supplemental”
in hopes of reducing the model to two equations in two unknowns.
Lorentz led the way in this area. With his electrical theory of matter, all material ef-
fects could be interpreted in terms of atomic charge and current immersed in free space.
We have seen how the Maxwell–Bo? equations seem to eliminate the need for D and H,
and indeed for simple media where there is a linear relation between the remaining “fun-
damental” ?elds and the induced polarization and magnetization, it appears that only
E and B are required. However, for more complicated materials that display nonlinear
and bianisotropic e?ects we are only able to supplant D and H with two other ?elds P
and M, along with (possibly complicated) constitutive relations relating them to E and
B.
Even those authors who do not wish to eliminate two of the ?elds tend to categorize
the ?elds into pairs based on physical arguments, implying that one or the other pair
is in some way “more fundamental.” Maxwell himself separated the ?elds into the pair
(E,H) that appears within line integrals to give work and the pair (B,D) that appears
within surface integrals to give ?ux. In what other ways might we pair the four vectors?
Most prevalent is the splitting of the ?elds into electric and magnetic pairs: (E,D)and
(B,H). In Poynting’s theorem E · D describes one component of stored energy (called
“electric energy”) and B · H describes another component (called “magnetic energy”).
These pairs also occur in Maxwell’s stress tensor. In statics, the ?elds decouple into
electric and magnetic sets. But biisotropic and bianisotropic materials demonstrate how
separation into electric and magnetic e?ects can become problematic.
In the study of electromagnetic waves, the ratio of E to H appears to be an important
quantity, called the “intrinsic impedance.” The pair (E,H) also determines the Poynting
?ux of power, and is required to establish the uniqueness of the electromagnetic ?eld.
In addition, constitutive relations for simple materials usually express (D,B) in terms
of (E,H). Models for these materials are often conceived by viewing the ?elds (E,H)
as interacting with the atomic structure in such a way as to produce secondary e?ects
describable by (D,B). These considerations, along with Maxwell’s categorization into
a pair of work vectors and a pair of ?ux vectors, lead many authors to formulate elec-
tromagnetics with E and H as the “fundamental” quantities. But the pair (B,D) gives
rise to electromagnetic momentum and is also perpendicular to the direction of wave
propagation in an anisotropic material; in these senses, we might argue that these ?elds
must be equally “fundamental.”
Perhaps the best motivation for grouping ?elds comes from relativistic considerations.
We have found that (E,B) transform together under a Lorentz transformation, as do
(D,H). In each of these pairs we have one polar vector (E or D) and one axial vector (B
or H). A polar vector retains its meaning under a change in handedness of the coordinate
system, while an axial vector does not. The Lorentz force involves one polar vector (E)
and one axial vector (B) that we also call “electric” and “magnetic.” If we follow the
lead of some authors and choose to de?ne E and B through measurements of the Lorentz
force, then we recognize that B must be axial since it is not measured directly, but as
part of the cross product v × B that changes its meaning if we switch from a right-hand
to a left-hand coordinate system. The other polar vector (D) and axial vector (H) arise
through the “secondary” constitutive relations. Following this reasoning we might claim
that E and B are “fundamental.”
Sommerfeld also associates E with B and D with H. The vectors E and B are
called entities of intensity, describing “how strong,” while D and H are called entities
of quantity, describing “how much.” This is in direct analogy with stress (intensity) and
strain (quantity) in materials. We might also say that the entities of intensity describe
a “cause” while the entities of quantity describe an “e?ect.” In this view E “induces”
(causes) a polarization P, and the ?eld D = epsilon1
0
E + P is the result. Similarly B creates
M, and H = B/μ
0
? M is the result. Interestingly, each of the terms describing energy
and momentum in the electromagnetic ?eld (D · E, B · H, E × H, D × B) involves the
interaction of an entity of intensity with an entity of quantity.
Although there is a natural tendency to group things together based on conceptual
similarity, there appears to be little reason to believe that any of the four ?eld vectors are
more“fundamental”thantherest. PerhapswearefortunatethatwecanapplyMaxwell’s
theory without worrying too much about such questions of underlying philosophy.
2.7 Maxwell’sequationswithmagneticsources
Researchers have yet to discover the “magnetic monopole”: a magnetic source from
which magnetic ?eld would diverge. This has not stopped speculation on the form that
Maxwell’s equations might take if such a discovery were made. Arguments based on
fundamental principles of physics (such as symmetry and conservation laws) indicate
that in the presence of magnetic sources Maxwell’s equations would assume the forms
?×E =?J
m
?
?B
?t
, (2.169)
?×H = J +
?D
?t
, (2.170)
?·B =ρ
m
, (2.171)
?·D =ρ, (2.172)
where J
m
is a volume magnetic current density describing the ?ow of magnetic charge in
exactly the same manner as J describes the ?ow of electric charge. The density of this
magnetic charge is given by ρ
m
and should, by analogy with electric charge density, obey
a conservation law
?·J
m
+
?ρ
m
?t
= 0.
This is the magnetic source continuity equation.
It is interesting to inquire as to the units of J
m
and ρ
m
. From (2.169) we see that if B
has units of Wb/m
2
, then J
m
has units of (Wb/s)/m
2
. Similarly, (2.171) shows that ρ
m
must have units of Wb/m
3
. Hence magnetic charge is measured in Wb, magnetic current
in Wb/s. This gives a nice symmetry with electric sources where charge is measured in
C and current in C/s.
3
The physical symmetry is equally appealing: magnetic ?ux lines
diverge from magnetic charge, and the total ?ux passing through a surface is given by the
total magnetic charge contained within the surface. This is best seen by considering the
large-scale forms of Maxwell’s equations for stationary surfaces. We need only modify
(2.145) to include the magnetic current term; this gives
contintegraldisplay
Gamma1
E · dl =?
integraldisplay
S
J
m
· dS ?
d
dt
integraldisplay
S
B · dS, (2.173)
contintegraldisplay
Gamma1
H · dl =
integraldisplay
S
J · dS +
d
dt
integraldisplay
S
D · dS. (2.174)
If we modify (2.148) to include magnetic charge, we get the auxiliary equations
contintegraldisplay
S
D · dS =
integraldisplay
V
ρ dV,
contintegraldisplay
S
B · dS =
integraldisplay
V
ρ
m
dV.
Anyofthelarge-scaleformsofMaxwell’sequationscanbesimilarlymodi?edtoinclude
magnetic current and charge. For arbitrarily moving surfaces we have
contintegraldisplay
Gamma1(t)
E
?
· dl =?
d
dt
integraldisplay
S(t)
B · dS ?
integraldisplay
S(t)
J
?
m
· dS,
contintegraldisplay
Gamma1(t)
H
?
· dl =
d
dt
integraldisplay
S(t)
D · dS +
integraldisplay
S(t)
J
?
· dS,
where
E
?
= E + v × B,
H
?
= H ? v × D,
J
?
= J ?ρv,
J
?
m
= J
m
?ρ
m
v,
and all ?elds are taken to be measured in the laboratory frame with v the instantaneous
velocity of points on the surface and contour relative to the laboratory frame. We also
have the alternative forms
contintegraldisplay
S
(?n × E)dS =
integraldisplay
V
parenleftbigg
?
?B
?t
? J
m
parenrightbigg
dV, (2.175)
contintegraldisplay
S
(?n × H)dS =
integraldisplay
V
parenleftbigg
?D
?t
+ J
parenrightbigg
dV, (2.176)
and
contintegraldisplay
S(t)
[?n × E ?(v · ?n)B] dS =?
integraldisplay
V(t)
J
m
dV ?
d
dt
integraldisplay
V(t)
B dV, (2.177)
contintegraldisplay
S(t)
[?n × H +(v · ?n)D] dS =
integraldisplay
V(t)
J dV +
d
dt
integraldisplay
V(t)
D dV, (2.178)
3
We note that if the modern unit of T is used to describe B, then ρ
m
is described using the more
cumbersome units of T/m, while J
m
is given in terms of T/s. Thus, magnetic charge is measured in Tm
2
and magnetic current in (Tm
2
)/s.
and the two Gauss’s law expressions
contintegraldisplay
S(t)
D · ?n dS =
integraldisplay
V(t)
ρ dV,
contintegraldisplay
S(t)
B · ?n dS =
integraldisplay
V(t)
ρ
m
dV.
Magnetic sources also allow us to develop equivalence theorems in which di?cult prob-
lems involving boundaries are replaced by simpler problems involving magnetic sources.
Although these sources may not physically exist, the mathematical solutions are com-
pletely valid.
2.8 Boundary(jump)conditions
If we restrict ourselves to regions of space without spatial (jump) discontinuities in
either the sources or the constitutive relations, we can ?nd meaningful solutions to the
Maxwell di?erential equations. We also know that for given sources, if the ?elds are
speci?ed on a closed boundary and at an initial time the solutions are unique. The
standard approach to treating regions that do contain spatial discontinuities is to isolate
thediscontinuitiesonsurfaces. Thatis, weintroducesurfacesthatservetoseparatespace
into regions in which the di?erential equations are solvable and the ?elds are well de?ned.
To make the solutions in adjoining regions unique, we must specify the tangential ?elds
on each side of the adjoining surface. If we can relate the ?elds across the boundary, we
can propagate the solution from one region to the next; in this way, information about
the source in one region is e?ectively passed on to the solution in an adjacent region. For
uniqueness, only relations between the tangential components need be speci?ed.
We shall determine the appropriate boundary conditions (BC’s) via two distinct ap-
proaches. We ?rst model a thin source layer and consider a discontinuous surface source
layerasalimitingcaseofthecontinuousthinlayer. Withnotruediscontinuity, Maxwell’s
di?erential equations hold everywhere. We then consider a true spatial discontinuity be-
tween material surfaces (with possible surface sources lying along the discontinuity). We
mustthenisolatetheregioncontainingthediscontinuityand postulate a?eldrelationship
that is both physically meaningful and experimentally veri?able.
We shall also consider both stationary and moving boundary surfaces, and surfaces
containing magnetic as well as electric sources.
2.8.1 Boundaryconditions across a stationary, thin source layer
In § 1.3.3 we discussed how in the macroscopic sense a surface source is actually a
volume distribution concentrated near a surface S. We write the charge and current in
terms of the point r on the surface and the normal distance x from the surface at r as
ρ(r,x,t)=ρ
s
(r,t)f(x,Delta1), (2.179)
J(r,x,t)= J
s
(r,t)f(x,Delta1), (2.180)
where f(x,Delta1)is the source density function obeying
integraldisplay
∞
?∞
f(x,Delta1)dx = 1. (2.181)
Figure 2.5: Derivation of the electromagnetic boundary conditions across a thin contin-
uous source layer.
The parameter Delta1 describes the “width” of the source layer normal to the reference
surface.
We use (2.156)–(2.157) to study ?eld behavior across the source layer. Consider a
volumeregion V thatintersectsthesourcelayerasshowninFigure2.5.Letthetopand
bottom surfaces be parallel to the reference surface, and label the ?elds on the top and
bottom surfaces with subscripts 1 and 2, respectively. Since points on and within V are
all regular, (2.157) yields
integraldisplay
S
1
?n
1
× H
1
dS+
integraldisplay
S
2
?n
2
× H
2
dS+
integraldisplay
S
3
?n
3
× H dS=
integraldisplay
V
parenleftbigg
J +
?D
?t
parenrightbigg
dV.
We now choose δ = kDelta1 (k > 1) so that most of the source lies within V.AsDelta1 → 0
the thin source layer recedes to a surface layer, and the volume integral of displacement
current and the integral of tangential H over S
3
both approach zero by continuity of
the ?elds. By symmetry S
1
= S
2
and ?n
1
=??n
2
= ?n
12
, where ?n
12
is the surface normal
directed into region 1 from region 2. Thus
integraldisplay
S
1
?n
12
×(H
1
? H
2
)dS=
integraldisplay
V
J dV. (2.182)
Note that
integraldisplay
V
J dV =
integraldisplay
S
1
integraldisplay
δ/2
?δ/2
J dSdx =
integraldisplay
δ/2
?δ/2
f(x,Delta1)dx
integraldisplay
S
1
J
s
(r,t)dS.
Since we assume that the majority of the source current lies within V, the integral can
be evaluated using (2.181) to give
integraldisplay
S
1
[?n
12
×(H
1
? H
2
)? J
s
] dS= 0,
hence
?n
12
×(H
1
? H
2
)= J
s
.
The tangential magnetic ?eld across a thin source distribution is discontinuous by an
amount equal to the surface current density.
Similar steps with Faraday’s law give
?n
12
×(E
1
? E
2
)= 0.
The tangential electric ?eld is continuous across a thin source.
We can also derive conditions on the normal components of the ?elds, although these
arenotrequiredforuniqueness.Gauss’slaw(2.147)appliedtothevolume V inFigure
2.5gives
integraldisplay
S
1
D
1
· ?n
1
dS+
integraldisplay
S
2
D
2
· ?n
2
dS+
integraldisplay
S
3
D · ?n
3
dS=
integraldisplay
V
ρ dV.
AsDelta1→ 0, the thin source layer recedes to a surface layer. The integral of normal D over
S
3
tends to zero by continuity of the ?elds. By symmetry S
1
= S
2
and ?n
1
=??n
2
= ?n
12
.
Thus
integraldisplay
S
1
(D
1
? D
2
)· ?n
12
dS=
integraldisplay
V
ρ dV. (2.183)
The volume integral is
integraldisplay
V
ρ dV =
integraldisplay
S
1
integraldisplay
δ/2
?δ/2
ρ dSdx =
integraldisplay
δ/2
?δ/2
f(x,Delta1)dx
integraldisplay
S
1
ρ
s
(r,t)dS.
Since δ = kDelta1 has been chosen so that most of the source charge lies within V, (2.181)
gives
integraldisplay
S
1
[(D
1
? D
2
)· ?n
12
?ρ
s
] dS= 0,
hence
(D
1
? D
2
)· ?n
12
=ρ
s
.
The normal component of D is discontinuous across a thin source distribution by an
amount equal to the surface charge density. Similar steps with the magnetic Gauss’s law
yield
(B
1
? B
2
)· ?n
12
= 0.
The normal component of B is continuous across a thin source layer.
We can follow similar steps when a thin magnetic source layer is present. When
evaluating Faraday’s law we must include magnetic surface current and when evaluating
the magnetic Gauss’s law we must include magnetic charge. However, since such sources
arenotphysicalwepostponetheirconsiderationuntilthenextsection, whereappropriate
boundary conditions are postulated rather than derived.
2.8.2 Boundaryconditions across a stationarylayer of ?eld disconti-
nuity
Providedthatwemodelasurfacesourceasalimitingcaseofaverythinbutcontinuous
volume source, we can derive boundary conditions across a surface layer. We might ask
whetherwecanextendthisideatosurfacesofmaterialswheretheconstitutiveparameters
change from one region to another. Indeed, if we take Lorentz’ viewpoint and visualize a
material as a conglomerate of atomic charge, we should be able to apply this same idea.
After all, a material should demonstrate a continuous transition (in the macroscopic
Figure 2.6: Derivation of the electromagnetic boundary conditions across a discontinuous
source layer.
sense) across its boundary, and we can employ the Maxwell–Bo? equations to describe
the relationship between the “equivalent” sources and the electromagnetic ?elds.
We should note, however, that the limiting concept is not without its critics. Stokes
suggested as early as 1848 that jump conditions should never be derived from smooth
solutions [199]. Let us therefore pursue the boundary conditions for a surface of true
?eld discontinuity. This will also allow us to treat a material modeled as having a true
discontinuity in its material parameters (which we can always take as a mathematical
model of a more gradual transition) before we have studied in a deeper sense the physical
propertiesofmaterials. Thisapproach, takenbymanytextbooks, mustbedonecarefully.
There is a logical di?culty with this approach, lying in the application of the large-
scale forms of Maxwell’s equations. Many authors postulate Maxwell’s equations in point
form, integrate to obtain the large-scale forms, then apply the large-scale forms to regions
of discontinuity. Unfortunately, the large-scale forms thus obtained are only valid in the
same regions where their point form antecedents were valid — discontinuities must be
excluded. Schelkuno? [167] has criticized this approach, calling it a “swindle” rather
than a proof, and has suggested that the proper way to handle true discontinuities
is to postulate the large-scale forms of Maxwell’s equations, and to include as part
of the postulate the assumption that the large-scale forms are valid at points of ?eld
discontinuity. Does this mean we must reject our postulate of the point form Maxwell
equations and reformulate everything in terms of the large-scale forms? Fortunately, no.
Tai [192] has pointed out that it is still possible to postulate the point forms, as long
as we also postulate appropriate boundary conditions that make the large-scale forms,
as derived from the point forms, valid at surfaces of discontinuity. In essence, both
approaches require an additional postulate for surfaces of discontinuity: the large scale
forms require a postulate of applicability to discontinuous surfaces, and from there the
boundary conditions can be derived; the point forms require a postulate of the boundary
conditions that result in the large-scale forms being valid on surfaces of discontinuity.
Let us examine how the latter approach works.
Consider a surface across which the constitutive relations are discontinuous, containing
electric and magnetic surface currents and charges J
s
,ρ
s
, J
ms
,andρ
ms
(Figure2.6).
We locate a volume region V
1
abovethesurfaceofdiscontinuity;thisvolumeisbounded
byasurface S
1
andanother surface S
10
which is parallel to, and a small distance δ/2
above, the surface of discontinuity. A second volume region V
2
is similarly situated below
the surface of discontinuity. Because these regions exclude the surface of discontinuity
we can use (2.176) to get
integraldisplay
S
1
?n × H dS+
integraldisplay
S
10
?n × H dS =
integraldisplay
V
1
parenleftbigg
J +
?D
?t
parenrightbigg
dV,
integraldisplay
S
2
?n × H dS+
integraldisplay
S
20
?n × H dS =
integraldisplay
V
2
parenleftbigg
J +
?D
?t
parenrightbigg
dV.
Adding these we obtain
integraldisplay
S
1
+S
2
?n × H dS?
integraldisplay
V
1
+V
2
parenleftbigg
J +
?D
?t
parenrightbigg
dV ?
?
integraldisplay
S
10
?n
10
× H
1
dS?
integraldisplay
S
20
?n
20
× H
2
dS= 0, (2.184)
where we have used subscripts to delineate the ?elds on each side of the discontinuity
surface.
If δ is very small (but nonzero), then ?n
10
=??n
20
= ?n
12
and S
10
= S
20
. Letting
S
1
+ S
2
= S and V
1
+ V
2
= V, we can write (184) as
integraldisplay
S
(?n × H)dS?
integraldisplay
V
parenleftbigg
J +
?D
?t
parenrightbigg
dV =
integraldisplay
S
10
?n
12
×(H
1
? H
2
)dS. (2.185)
Now suppose we use the same volume region V, but let it intersect the surface of
discontinuity(Figure2.6),andsupposethatthelarge-scaleformofAmpere’slawholds
even if V contains points of ?eld discontinuity. We must include the surface current in
the computation. Since
integraltext
V
J dV becomes
integraltext
S
J
s
dS on the surface, we have
integraldisplay
S
(?n × H)dS?
integraldisplay
V
parenleftbigg
J +
?D
?t
parenrightbigg
dV =
integraldisplay
S
10
J
s
dS. (2.186)
We wish to have this give the same value for the integrals over V and S as (2.185), which
included in its derivation no points of discontinuity. This is true provided that
?n
12
×(H
1
? H
2
)= J
s
. (2.187)
Thus, under the condition (2.187) we may interpret the large-scale form of Ampere’s law
(as derived from the point form) as being valid for regions containing discontinuities.
Note that this condition is not “derived,” but must be regarded as a postulate that
results in the large-scale form holding for surfaces of discontinuous ?eld.
Similar reasoning can be used to determine the appropriate boundary condition on
tangential E from Faraday’s law. Corresponding to (2.185) we obtain
integraldisplay
S
(?n × E)dS?
integraldisplay
V
parenleftbigg
?J
m
?
?B
?t
parenrightbigg
dV =
integraldisplay
S
10
?n
12
×(E
1
? E
2
)dS. (2.188)
Employing (2.175) over the region containing the ?eld discontinuity surface we get
integraldisplay
S
(?n × E)dS?
integraldisplay
V
parenleftbigg
?J
m
?
?B
?t
parenrightbigg
dV =?
integraldisplay
S
10
J
ms
dS. (2.189)
To have (2.188) and (2.189) produce identical results, we postulate
?n
12
×(E
1
? E
2
)=?J
ms
(2.190)
as the boundary condition appropriate to a surface of ?eld discontinuity containing a
magnetic surface current.
We can also postulate boundary conditions on the normal ?elds to make Gauss’s laws
valid for surfaces of discontinuous ?elds. Integrating (2.147) over the regions V
1
and V
2
and adding, we obtain
integraldisplay
S
1
+S
2
D · ?n dS?
integraldisplay
S
10
D
1
· ?n
10
dS?
integraldisplay
S
20
D
2
· ?n
20
dS=
integraldisplay
V
1
+V
2
ρ dV.
As δ → 0 this becomes
integraldisplay
S
D · ?n dS?
integraldisplay
V
ρ dV =
integraldisplay
S
10
(D
1
? D
2
)· ?n
12
dS. (2.191)
IfweintegrateGauss’slawovertheentireregion V, includingthesurfaceofdiscontinuity,
we get
contintegraldisplay
S
D · ?n dS=
integraldisplay
V
ρ dV +
integraldisplay
S
10
ρ
s
dS. (2.192)
In order to get identical answers from (2.191) and (2.192), we must have
(D
1
? D
2
)· ?n
12
=ρ
s
as the boundary condition appropriate to a surface of ?eld discontinuity containing an
electric surface charge. Similarly, we must postulate
(B
1
? B
2
)· ?n
12
=ρ
ms
as the condition appropriate to a surface of ?eld discontinuity containing a magnetic
surface charge.
We can determine an appropriate boundary condition on current by using the large-
scale form of the continuity equation. Applying (2.10) over each of the volume regions
ofFigure2.6andaddingtheresults,wehave
integraldisplay
S
1
+S
2
J · ?n dS?
integraldisplay
S
10
J
1
· ?n
10
dS?
integraldisplay
S
20
J
2
· ?n
20
dS=?
integraldisplay
V
1
+V
2
?ρ
?t
dV.
As δ → 0 we have
integraldisplay
S
J · ?n dS?
integraldisplay
S
10
(J
1
? J
2
)· ?n
12
dS=?
integraldisplay
V
?ρ
?t
dV. (2.193)
Applying the continuity equation over the entire region V and allowing it to intersect
the discontinuity surface, we get
integraldisplay
S
J · ?n dS+
integraldisplay
Gamma1
J
s
· ?m dl =?
integraldisplay
V
?ρ
?t
dV ?
integraldisplay
S
10
?ρ
s
?t
dS.
By the two-dimensional divergence theorem (B.20) we can write this as
integraldisplay
S
J · ?n dS+
integraldisplay
S
10
?
s
· J
s
dS=?
integraldisplay
V
?ρ
?t
dV ?
integraldisplay
S
10
?ρ
s
?t
dS.
In order for this expression to produce the same values of the integrals over S and V as
in (2.193) we require
?
s
· J
s
=??n
12
·(J
1
? J
2
)?
?ρ
s
?t
,
which we take as our postulate of the boundary condition on current across a surface
containing discontinuities. A similar set of steps carried out using the continuity equation
for magnetic sources yields
?
s
· J
ms
=??n
12
·(J
m1
? J
m2
)?
?ρ
ms
?t
.
In summary, we have the following boundary conditions for ?elds across a surface
containing discontinuities:
?n
12
×(H
1
? H
2
)= J
s
, (2.194)
?n
12
×(E
1
? E
2
)=?J
ms
, (2.195)
?n
12
·(D
1
? D
2
)=ρ
s
, (2.196)
?n
12
·(B
1
? B
2
)=ρ
ms
, (2.197)
and
?n
12
·(J
1
? J
2
)=??
s
· J
s
?
?ρ
s
?t
, (2.198)
?n
12
·(J
m1
? J
m2
)=??
s
· J
ms
?
?ρ
ms
?t
, (2.199)
where ?n
12
points into region 1 from region 2.
2.8.3 Boundaryconditions at the surface of a perfect conductor
We can easily specialize the results of the previous section to the case of perfect electric
or magnetic conductors. In § 2.2.2 we saw that the constitutive relations for perfect
conductors requires the null ?eld within the material. In addition, a PEC requires zero
tangential electric ?eld, while a PMC requires zero tangential magnetic ?eld. Using
(2.194)–(2.199), we ?nd that the boundary conditions for a perfect electric conductor
are
?n × H = J
s
, (2.200)
?n × E = 0, (2.201)
?n · D =ρ
s
, (2.202)
?n · B = 0, (2.203)
and
?n · J =??
s
· J
s
?
?ρ
s
?t
, ?n · J
m
= 0. (2.204)
For a PMC the conditions are
?n × H = 0, (2.205)
?n × E =?J
ms
, (2.206)
?n · D = 0, (2.207)
?n · B =ρ
ms
, (2.208)
and
?n · J
m
=??
s
· J
ms
?
?ρ
ms
?t
, ?n · J = 0. (2.209)
We note that the normal vector ?n points out of the conductor and into the adjacent
region of nonzero ?elds.
2.8.4 Boundaryconditions across a stationarylayer of ?eld disconti-
nuityusing equivalent sources
So far we have avoided using the physical interpretation of the equivalent sources in the
Maxwell–Bo? equations so that we might investigate the behavior of ?elds across true
discontinuities. Now that we have the appropriate boundary conditions, it is interesting
to interpret them in terms of the equivalent sources.
If we put H = B/μ
0
? M into (2.194) and rearrange, we get
?n
12
×(B
1
? B
2
)=μ
0
(J
s
+ ?n
12
× M
1
? ?n
12
× M
2
). (2.210)
The terms on the right involving ?n
12
×M have the units of surface current and are called
equivalent magnetization surface currents. De?ning
J
Ms
=??n × M (2.211)
where ?n is directed normally outward from the material region of interest, we can rewrite
(2.210) as
?n
12
×(B
1
? B
2
)=μ
0
(J
s
+ J
Ms1
+ J
Ms2
). (2.212)
We note that J
Ms
replaces atomic charge moving along the surface of a material with an
equivalent surface current in free space.
If we substitute D =epsilon1
0
E + P into (2.196) and rearrange, we get
?n
12
·(E
1
? E
2
)=
1
epsilon1
0
(ρ
s
? ?n
12
· P
1
+ ?n
12
· P
2
). (2.213)
The terms on the right involving ?n
12
· P have the units of surface charge and are called
equivalent polarization surface charges. De?ning
ρ
Ps
= ?n · P, (2.214)
we can rewrite (2.213) as
?n
12
·(E
1
? E
2
)=
1
epsilon1
0
(ρ
s
+ρ
Ps1
+ρ
Ps2
). (2.215)
We note that ρ
Ps
replaces atomic charge adjacent to a surface of a material with an
equivalent surface charge in free space.
In summary, the boundary conditions at a stationary surface of discontinuity written
in terms of equivalent sources are
?n
12
×(B
1
? B
2
)=μ
0
(J
s
+ J
Ms1
+ J
Ms2
), (2.216)
?n
12
×(E
1
? E
2
)=?J
ms
, (2.217)
?n
12
·(E
1
? E
2
)=
1
epsilon1
0
(ρ
s
+ρ
Ps1
+ρ
Ps2
), (2.218)
?n
12
·(B
1
? B
2
)=ρ
ms
. (2.219)
2.8.5 Boundaryconditionsacrossamovinglayerof?elddiscontinuity
With a moving material body it is often necessary to apply boundary conditions de-
scribing the behavior of the ?elds across the surface of the body. If a surface of discon-
tinuity moves with constant velocity v, the boundary conditions (2.194)–(2.199) hold as
long as all ?elds are expressed in the frame of the moving surface. We can also derive
boundary conditions for a deforming surface moving with arbitrary velocity by using
equations (2.177)–(2.178). In this case all ?elds are expressed in the laboratory frame.
Proceeding through the same set of steps that gave us (2.194)–(2.197), we ?nd
?n
12
×(H
1
? H
2
)+(?n
12
· v)(D
1
? D
2
)= J
s
, (2.220)
?n
12
×(E
1
? E
2
)?(?n
12
· v)(B
1
? B
2
)=?J
ms
, (2.221)
?n
12
·(D
1
? D
2
)=ρ
s
, (2.222)
?n
12
·(B
1
? B
2
)=ρ
ms
. (2.223)
Note that when ?n
12
· v = 0 these boundary conditions reduce to those for a stationary
surface. This occurs not only when v = 0 but also when the velocity is parallel to the
surface.
The reader must be wary when employing (2.220)–(2.223). Since the ?elds are mea-
sured in the laboratory frame, if the constitutive relations are substituted into the bound-
ary conditions they must also be represented in the laboratory frame. It is probable that
the material parameters would be known in the rest frame of the material, in which case
a conversion to the laboratory frame would be necessary.
2.9 Fundamentaltheorems
In this section we shall consider some of the important theorems of electromagnetics
that pertain directly to Maxwell’s equations. They may be derived without reference to
the solutions of Maxwell’s equations, and are not connected with any specialization of
the equations or any speci?c application or geometrical con?guration. In this sense these
theorems are fundamental to the study of electromagnetics.
2.9.1 Linearity
Recall that a mathematical operator L is linear if
L(α
1
f
1
+α
2
f
2
)=α
1
L(f
1
)+α
2
L(f
2
)
holds for any two functions f
1,2
in the domain of L and any two scalar constants α
1,2
.A
standard observation regarding the equation
L(f)= s, (2.224)
where L is a linear operator and s is a given forcing function, is that if f
1
and f
2
are
solutions to
L(f
1
)= s
1
, L(f
2
)= s
2
, (2.225)
respectively, and
s = s
1
+ s
2
, (2.226)
then
f = f
1
+ f
2
(2.227)
is a solution to (2.224). This is the principle of superposition; if convenient, we can
decompose s in equation (2.224) as a sum (2.226) and solve the two resulting equations
(2.225) independently. The solution to (2.224) is then (2.227), “by superposition.” Of
course, we are free to split the right side of (2.224) into more than two terms — the
method extends directly to any ?nite number of terms.
Because the operators ?·, ?×, and ?/?t are all linear, Maxwell’s equations can be
treated by this method. If, for instance,
?×E
1
=?
?B
1
?t
, ?×E
2
=?
?B
2
?t
,
then
?×E =?
?B
?t
where E = E
1
+ E
2
and B = B
1
+ B
2
. The motivation for decomposing terms in a
particular way is often based on physical considerations; we give one example here and
defer others to later sections of the book. We saw earlier that Maxwell’s equations can
be written in terms of both electric and (?ctitious) magnetic sources as in equations
(2.169)–(2.172). Let E = E
e
+ E
m
where E
e
is produced by electric-type sources and E
m
is produced by magnetic-type sources, and decompose the other ?elds similarly. Then
?×E
e
=?
?B
e
?t
, ?×H
e
= J +
?D
e
?t
, ?·D
e
=ρ, ?·B
e
= 0,
with a similar equation set for the magnetic sources. We may, if desired, solve these
two equation sets independently for E
e
, D
e
, B
e
, H
e
and E
m
, D
m
, E
m
, H
m
, and then use
superposition to obtain the total ?elds E, D, B, H.
2.9.2 Duality
The intriguing symmetry of Maxwell’s equations leads us to an observation that can
reduce the e?ort required to compute solutions. Consider a closed surface S enclosing a
region of space that includes an electric source current J and a magnetic source current
J
m
. The ?elds (E
1
,D
1
,B
1
,H
1
) within the region (which may also contain arbitrary
media) are described by
?×E
1
=?J
m
?
?B
1
?t
, (2.228)
?×H
1
= J +
?D
1
?t
, (2.229)
?·D
1
=ρ, (2.230)
?·B
1
=ρ
m
. (2.231)
Suppose we have been given a mathematical description of the sources (J,J
m
) and have
solved for the ?eld vectors (E
1
,D
1
,B
1
,H
1
). Of course, we must also have been supplied
with a set of boundary values and constitutive relations in order to make the solution
unique. We note that if we replace the formula for J with the formula for J
m
in (2.229)
(and ρ with ρ
m
in (2.230)) and also replace J
m
with ?J in (2.228) (and ρ
m
with ?ρ
in (2.231)) we get a new problem to solve, with a di?erent solution. However, the
symmetry of the equations allows us to specify the solution immediately. The new set of
curl equations requires
?×E
2
= J ?
?B
2
?t
, (2.232)
?×H
2
= J
m
+
?D
2
?t
. (2.233)
As long as we can resolve the question of how the constitutive parameters must be altered
to re?ect these replacements, we can conclude by comparing (2.232) with (2.229) and
(2.233) with (2.228) that the solution to these equations is merely
E
2
= H
1
,
B
2
=?D
1
,
D
2
= B
1
,
H
2
=?E
1
.
That is, if we have solved the original problem, we can use those solutions to ?nd the
new ones. This is an application of the general principle of duality.
Unfortunately, this approach is a little awkward since the units of the sources and
?elds in the two problems are di?erent. We can make the procedure more convenient by
multiplying Ampere’s law by η
0
=(μ
0
/epsilon1
0
)
1/2
. Then we have
?×E =?J
m
?
?B
?t
, (2.234)
?×(η
0
H)=(η
0
J)+
?(η
0
D)
?t
. (2.235)
Thus if the original problem has solution (E
1
,η
0
D
1
,B
1
,η
0
H
1
), then the dual problem
with J replaced by J
m
/η
0
and J
m
replaced by ?η
0
J has solution
E
2
=η
0
H
1
, (2.236)
B
2
=?η
0
D
1
, (2.237)
η
0
D
2
= B
1
, (2.238)
η
0
H
2
=?E
1
. (2.239)
The units on the quantities in the two problems are now identical.
Of course, the constitutive parameters for the dual problem must be altered from
those of the original problem to re?ect the change in ?eld quantities. From (2.19) and
(2.20) we know that the most general forms of the constitutive relations (those for linear,
bianisotropic media) are
D
1
=
ˉ
ξ
1
· H
1
+ ˉepsilon1
1
· E
1
, (2.240)
B
1
= ˉμ
1
· H
1
+
ˉ
ζ
1
· E
1
, (2.241)
for the original problem, and
D
2
=
ˉ
ξ
2
· H
2
+ ˉepsilon1
2
· E
2
, (2.242)
B
2
= ˉμ
2
· H
2
+
ˉ
ζ
2
· E
2
, (2.243)
for the dual problem. Substitution of (2.236)–(2.239) into (2.240) and (2.241) gives
D
2
=(?
ˉ
ζ
1
)· H
2
+
parenleftbigg
ˉμ
1
η
2
0
parenrightbigg
· E
2
, (2.244)
B
2
=
parenleftbig
η
2
0
ˉepsilon1
1
parenrightbig
· H
2
+(?
ˉ
ξ
1
)· E
2
. (2.245)
Comparing (2.244) with (2.242) and (2.245) with (2.243), we conclude that
ˉ
ζ
2
=?
ˉ
ξ
1
,
ˉ
ξ
2
=?
ˉ
ζ
1
, ˉμ
2
=η
2
0
ˉepsilon1
1
, ˉepsilon1
2
=
ˉμ
1
η
2
0
.
As an important special case, we see that for a linear, isotropic medium speci?ed by a
permittivity epsilon1 and permeability μ, the dual problem is obtained by replacing epsilon1
r
with μ
r
and μ
r
with epsilon1
r
. The solution to the dual problem is then given by
E
2
=η
0
H
1
,η
0
H
2
=?E
1
,
as before. We thus see that the medium in the dual problem must have electric properties
numerically equal to the magnetic properties of the medium in the original problem, and
magnetic properties numerically equal to the electric properties of the medium in the
original problem. This is rather inconvenient for most applications. Alternatively, we
may divide Ampere’s law by η = (μ/epsilon1)
1/2
instead of η
0
. Then the dual problem has
J replaced by J
m
/η, and J
m
replaced by ?ηJ, and the solution to the dual problem is
given by
E
2
=ηH
1
,ηH
2
=?E
1
.
In this case there is no need to swapepsilon1
r
andμ
r
, since information about these parameters
is incorporated into the replacement sources.
We must also remember that to obtain a unique solution we need to specify the bound-
ary values of the ?elds. In a true dual problem, the boundary values of the ?elds used
in the original problem are used on the swapped ?elds in the dual problem. A typical
example of this is when the condition of zero tangential electric ?eld on a perfect electric
conductor is replaced by the condition of zero tangential magnetic ?eld on the surface of
a perfect magnetic conductor. However, duality can also be used to obtain the mathe-
matical form of the ?eld expressions, often in a homogeneous (source-free) situation, and
boundary values can be applied later to specify the solution appropriate to the problem
geometry. This approach is often used to compute waveguide modal ?elds and the elec-
tromagnetic ?elds scattered from objects. In these cases a TE/TM ?eld decomposition
is employed, and duality is used to ?nd one part of the decomposition once the other is
known.
Dualityof electric and magnetic point source ?elds. By duality, we can some-
times use the known solution to one problem to solve a related problem by merely sub-
stituting di?erent variables into the known mathematical expression. An example of this
is the case in which we have solved for the ?elds produced by a certain distribution of
electric sources and wish to determine the ?elds when the same distribution is used to
describe magnetic sources.
Let us consider the case when the source distribution is that of a point current, or
Hertzian dipole, immersed in free space. As we shall see in Chapter 5, the ?elds for a
general source may be found by using the ?elds produced by these point sources. We
begin by ?nding the ?elds produced by an electric dipole source at the origin aligned
along the z-axis,
J = ?zI
0
δ(r),
then use duality to ?nd the ?elds produced by a magnetic current source J
m
= ?zI
m0
δ(r).
The ?elds produced by the electric source must obey
?×E
e
=?
?
?t
μ
0
H
e
, (2.246)
?×H
e
= ?zI
0
δ(r)+
?
?t
epsilon1
0
E
e
, (2.247)
?·epsilon1
0
E
e
=ρ, (2.248)
?·H
e
= 0, (2.249)
while those produced by the magnetic source must obey
?×E
m
=??zI
m0
δ(r)?
?
?t
μ
0
H
m
, (2.250)
?×H
m
=
?
?t
epsilon1
0
E
m
, (2.251)
?·E
m
= 0, (2.252)
?·μ
0
H
m
=ρ
m
. (2.253)
We see immediately that the second set of equations is the dual of the ?rst, as long
as we scale the sources appropriately. Multiplying (2.250) by ?I
0
/I
m0
and (2.251) by
I
0
η
2
0
/I
m0
, we have the curl equations
?×
parenleftbigg
?
I
0
I
m0
E
m
parenrightbigg
= ?zI
0
δ(r)+
?
?t
parenleftbigg
μ
0
I
0
I
m0
H
m
parenrightbigg
, (2.254)
?×
parenleftbigg
I
0
η
2
0
I
m0
H
m
parenrightbigg
=?
?
?t
parenleftbigg
?epsilon1
0
I
0
η
2
0
I
m0
E
m
parenrightbigg
. (2.255)
Comparing (2.255) with (2.246) and (2.254) with (2.247) we see that
E
m
=?
I
m0
I
0
H
e
, H
m
=
I
m0
I
0
E
e
η
2
0
.
We note that it is impossible to have a point current source without accompanying
point charge sources terminating each end of the dipole current. The point charges are
required to satisfy the continuity equation, and vary in time as the moving charge that
establishes the current accumulates at the ends of the dipole. From (2.247) we see that
the magnetic ?eld curls around the combination of the electric ?eld and electric current
source, while from (2.246) the electric ?eld curls around the magnetic ?eld, and from
(2.248) diverges from the charges located at the ends of the dipole. From (2.250) we
see that the electric ?eld must curl around the combination of the magnetic ?eld and
magnetic current source, while (2.251) and (2.253) show that the magnetic ?eld curls
around the electric ?eld and diverges from the magnetic charge.
Dualityin a source-free region. Consider a closed surface S enclosing a source-free
region of space. For simplicity, assume that the medium within S is linear, isotropic, and
homogeneous. The ?elds within S are described by Maxwell’s equations
?×E
1
=?
?
?t
μH
1
, (2.256)
?×ηH
1
=
?
?t
epsilon1ηE
1
, (2.257)
?·epsilon1E
1
= 0, (2.258)
?·μH
1
= 0. (2.259)
Under these conditions the concept of duality takes on a di?erent face. The symmetry
of the equations is such that the mathematical form of the solution for E is the same as
that for ηH. That is, the ?elds
E
2
=ηH
1
, (2.260)
H
2
=?E
1
/η, (2.261)
are also a solution to Maxwell’s equations, and thus the dual problem merely involves
replacing E by ηH and H by ?E/η. However, the ?nal forms of E and H will not be
identical after appropriate boundary values are imposed.
This form of duality is very important for the solution of ?elds within waveguides or
the ?elds scattered by objects where the sources are located outside the region where the
?elds are evaluated.
2.9.3 Reciprocity
The reciprocity theorem, also called the Lorentz reciprocity theorem, describes a spe-
ci?c and often useful relationship between sources and the electromagnetic ?elds they
produce. Under certain special circumstances we ?nd that an interaction between inde-
pendent source and mediating ?elds called “reaction” is a spatially symmetric quantity.
The reciprocity theorem is used in the study of guided waves to establish the orthogonal-
ity of guided wave modes, in microwave network theory to obtain relationships between
terminal characteristics, and in antenna theory to demonstrate the equivalence of trans-
mission and reception patterns.
Consider a closed surface S enclosing a volume V. Assume that the ?elds within and
on S are produced by two independent source ?elds. The source (J
a
,J
ma
) produces the
?eld (E
a
,D
a
,B
a
,H
a
) as described by Maxwell’s equations
?×E
a
=?J
ma
?
?B
a
?t
, (2.262)
?×H
a
= J
a
+
?D
a
?t
, (2.263)
while the source ?eld (J
b
,J
mb
) produces the ?eld (E
b
,D
b
,B
b
,H
b
) as described by
?×E
b
=?J
mb
?
?B
b
?t
, (2.264)
?×H
b
= J
b
+
?D
b
?t
. (2.265)
The sources may be distributed in any way relative to S: they may lie completely inside,
completely outside, or partially inside and partially outside. Material media may lie
within S, and their properties may depend on position.
Let us examine the quantity
R ≡?·(E
a
× H
b
? E
b
× H
a
).
By (B.44) we have
R = H
b
·?×E
a
? E
a
·?×H
b
? H
a
·?×E
b
+ E
b
·?×H
a
so that by Maxwell’s curl equations
R =
bracketleftbigg
H
a
·
?B
b
?t
? H
b
·
?B
a
?t
bracketrightbigg
?
bracketleftbigg
E
a
·
?D
b
?t
? E
b
·
?D
a
?t
bracketrightbigg
+
+ [J
a
· E
b
? J
b
· E
a
? J
ma
· H
b
+ J
mb
· H
a
].
The useful relationships we seek occur when the ?rst two bracketed quantities on the
right-hand side of the above expression are zero. Whether this is true depends not only
onthebehaviorofthe?elds, butonthepropertiesofthemediumatthepointinquestion.
Though we have assumed that the sources of the ?eld sets are independent, it is apparent
that they must share a similar time dependence in order for the terms within each of the
bracketed quantities to cancel. Of special interest is the case where the two sources are
both sinusoidal in time with identical frequencies, but with di?ering spatial distributions.
We shall consider this case in detail in § 4.10.2after we have discussed the properties of
the time harmonic ?eld. Importantly, we will ?nd that only certain characteristics of the
constitutive parameters allow cancellation of the bracketed terms; materials with these
characteristics are called reciprocal, and the ?elds they support are said to display the
property of reciprocity. To see what this property entails, we set the bracketed terms to
zero and integrate over a volume V to obtain
contintegraldisplay
S
(E
a
× H
b
? E
b
× H
a
)· dS =
integraldisplay
V
(J
a
· E
b
? J
b
· E
a
? J
ma
· H
b
+ J
mb
· H
a
)dV,
which is the time-domain version of the Lorentz reciprocity theorem.
Two special cases of this theorem are important to us. If all sources lie outside S,we
have Lorentz’s lemma
contintegraldisplay
S
(E
a
× H
b
? E
b
× H
a
)· dS = 0.
This remarkable expression shows that a relationship exists between the ?elds produced
by completely independent sources, and is useful for establishing waveguide mode or-
thogonality for time harmonic ?elds. If sources reside within S but the surface integral
is equal to zero, we have
integraldisplay
V
(J
a
· E
b
? J
b
· E
a
? J
ma
· H
b
+ J
mb
· H
a
)dV = 0.
This occurs when the surface is bounded by a special material (such as an impedance
sheet or a perfect conductor), or when the surface recedes to in?nity; the expression is
useful for establishing the reciprocity conditions for networks and antennas. We shall
interpret it for time harmonic ?elds in § 4.10.2.
2.9.4 Similitude
A common approach in physical science involves the introduction of normalized vari-
ables to provide for scaling of problems along with a chance to identify certain physically
signi?cant parameters. Similarity as a general principle can be traced back to the earliest
attempts to describe physical e?ects with mathematical equations, with serious study un-
dertaken by Galileo. Helmholtz introduced the ?rst systematic investigation in 1873, and
the concept was rigorized by Reynolds ten years later [216]. Similitude is now considered
a fundamental guiding principle in the modeling of materials [199].
Theprocessoftenbeginswithaconsiderationofthefundamentaldi?erentialequations.
In electromagnetics we may introduce a set of dimensionless ?eld and source variables
E, D, B, H, J,ρ, (2.266)
by setting
E = Ek
E
, B = Bk
B
, D = Dk
D
,
H = Hk
H
, J = Jk
J
,ρ=ρk
ρ
. (2.267)
Here we regard the quantities k
E
,k
B
,... as base units for the discussion, while the
dimensionless quantities (2.266) serve to express the actual ?elds E,B,...in terms of
these base units. Of course, the time and space variables can also be scaled: we can write
t = tk
t
, l = lk
l
, (2.268)
if l is any length of interest. Again, the quantities t and l are dimensionless measure
numbers used to express the actual quantities t and l relative to the chosen base amounts
k
t
and k
l
. With (2.267) and (2.268), Maxwell’s curl equations become
? × E =?
k
B
k
E
k
l
k
t
?B
?t
, ? × H =
k
J
k
l
k
H
J +
k
D
k
H
k
l
k
t
?D
?t
(2.269)
while the continuity equation becomes
? · J =?
k
ρ
k
J
k
l
k
t
?ρ
?t
, (2.270)
where ? has been normalized by k
l
. These are examples of ?eld equations cast into
dimensionless form — it is easily veri?ed that the similarity parameters
k
B
k
E
k
l
k
t
,
k
J
k
l
k
H
,
k
D
k
H
k
l
k
t
,
k
ρ
k
J
k
l
k
t
, (2.271)
are dimensionless. The idea behind electromagnetic similitude is that a given set of
normalized values E,B,...can satisfy equations (2.269) and (2.270) for many di?erent
physical situations, provided that the numerical values of the coe?cients (2.271) are all
?xed across those situations. Indeed, the di?erential equations would be identical.
To make this discussion a bit more concrete, let us assume a conducting linear medium
where
D =epsilon1E, B =μH, J =σE,
and use
epsilon1 =epsilon1k
epsilon1
,μ=μk
μ
,σ=σk
σ
,
to express the material parameters in terms of dimensionless values epsilon1, μ, and σ. Then
D =
k
epsilon1
k
E
k
D
epsilon1E, B =
k
μ
k
H
k
B
μH, J =
k
σ
k
E
k
J
σE,
and equations (2.269) become
? × E =?
parenleftbigg
k
μ
k
l
k
t
k
H
k
E
parenrightbigg
μ
?H
?t
,
? × H =
parenleftbigg
k
σ
k
l
k
E
k
H
parenrightbigg
σE +
parenleftbigg
k
epsilon1
k
l
k
t
k
E
k
H
parenrightbigg
epsilon1
?E
?t
.
De?ning
α =
k
μ
k
l
k
t
k
H
k
E
,γ= k
σ
k
l
k
E
k
H
,β=
k
epsilon1
k
l
k
t
k
E
k
H
,
we see that under the current assumptions similarity holds between two electromagnetics
problems only ifαμ,γσ, andβepsilon1are numerically the same in both problems. A necessary
condition for similitude, then, is that the products
(αμ)(βepsilon1)= k
μ
k
epsilon1
parenleftbigg
k
l
k
t
parenrightbigg
2
μepsilon1,(αμ)(γσ)= k
μ
k
σ
k
2
l
k
t
μσ,
(which do not involve k
E
or k
H
) stay constant between problems. We see, for example,
that we may compensate for a halving of the length scale k
l
by (a) a quadrupling of the
permeability μ, or (b) a simultaneous halving of the time scale k
t
and doubling of the
conductivityσ. A much less subtle special case is that for whichσ = 0, k
epsilon1
=epsilon1
0
, k
μ
=μ
0
,
and epsilon1 =μ= 1; we then have free space and must simply maintain
k
l
/k
t
= constant
so that the time and length scales stay proportional. In the sinusoidal steady state, for
instance, the frequency would be made to vary inversely with the length scale.
2.9.5 Conservation theorems
ThemisconceptionthatPoynting’stheoremcanbe“derived”fromMaxwell’sequations
iswidespreadandingrained. Wemust, infact, postulate theideathattheelectromagnetic
?eld can be associated with an energy ?ux propagating at the speed of light. Since
the form of the postulate is patterned after the well-understood laws of mechanics, we
begin by developing the basic equations of momentum and energy balance in mechanical
systems. Then we shall see whether it is sensible to ascribe these principles to the
electromagnetic ?eld.
Maxwell’s theory allows us to describe, using Maxwell’s equations, the behavior of
the electromagnetic ?elds within a (possibly) ?nite region V of space. The presence of
any sources or material objects outside V are made known through the speci?cation of
tangential ?elds over the boundary of V, as required for uniqueness. Thus, the in?uence
of external e?ects can always be viewed as being transported across the boundary. This
is true of mechanical as well as electromagnetic e?ects. A charged material body can
be acted on by physical contact with another body, by gravitational forces, and by the
Lorentz force, each e?ect resulting in momentum exchange across the boundary of the
object. These e?ects must all be taken into consideration if we are to invoke momentum
conservation, resulting in a very complicated situation. This suggests that we try to
decompose the problem into simpler “systems” based on physical e?ects.
The system concept in the physical sciences. The idea of decomposing a com-
plicated system into simpler, self-contained systems is quite common in the physical
sciences. Pen?eld and Haus [145] invoke this concept by introducing an electromagnetic
system where the e?ects of the Lorentz force equation are considered to accompany a
mechanical system where e?ects of pressure, stress, and strain are considered, and a
thermodynamic system where the e?ects of heat exchange are considered. These systems
can all be interrelated in a variety of ways. For instance, as a material heats up it can
expand, and the resulting mechanical forces can alter the electrical properties of the
material. We will follow Pen?eld and Haus by considering separate electromagnetic and
mechanical subsystems; other systems may be added analogously.
If we separate the various systems by physical e?ect, we will need to know how to
“reassemble the information.” Two conservation theorems are very helpful in this re-
gard: conservation of energy, and conservation of momentum. Engineers often employ
these theorems to make tacit use of the system idea. For instance, when studying elec-
tromagnetic waves propagating in a waveguide, it is common practice to compute wave
attenuation by calculating the Poynting ?ux of power into the walls of the guide. The
power lost from the wave is said to “heat up the waveguide walls,” which indeed it does.
This is an admission that the electromagnetic system is not “closed”: it requires the
inclusion of a thermodynamic system in order that energy be conserved. Of course, the
detailed workings of the thermodynamic system are often ignored, indicating that any
thermodynamic “feedback” mechanism is weak. In the waveguide example, for instance,
the heating of the metallic walls does not alter their electromagnetic properties enough
to couple back into an e?ect on the ?elds in the walls or in the guide. If such e?ects were
important, they would have to be included in the conservation theorem via the bound-
ary ?elds; it is therefore reasonable to associate with these ?elds a “?ow” of energy or
momentum into V. Thus, we wish to develop conservation laws that include not only the
Lorentz force e?ects within V, but a ?ow of external e?ects into V through its boundary
surface.
To understand how external in?uences may e?ect the electromagnetic subsystem, we
look to the behavior of the mechanical subsystem as an analogue. In the electromagnetic
system, e?ectsarefeltbothinternallytoaregion(becauseoftheLorentzforcee?ect)and
through the system boundary (by the dependence of the internal ?elds on the boundary
?elds). In the mechanical and thermodynamic systems, a region of mass is a?ected both
internally (through transfer of heat and gravitational forces) and through interactions
occurring across its surface (through transfers of energy and momentum, by pressure
and stress). One beauty of electromagnetic theory is that we can ?nd a mathematical
symmetry between electromagnetic and mechanical e?ects which parallels the above con-
ceptual symmetry. This makes applying conservation of energy and momentum to the
total system (electromagnetic, thermodynamic, and mechanical) very convenient.
Conservation of momentum and energyin mechanical systems. We begin by
reviewing the interactions of material bodies in a mechanical system. For simplicity we
concentrate on ?uids (analogous to charge in space); the extension of these concepts to
solid bodies is straightforward.
Consider a ?uid with mass density ρ
m
. The momentum of a small subvolume of the
?uid is given by ρ
m
v dV, where v is the velocity of the subvolume. So the momentum
density isρ
m
v. Newton’s second law states that a force acting throughout the subvolume
results in a change in its momentum given by
D
Dt
(ρ
m
v dV)= f dV, (2.272)
where f is the volume force density and the D/Dt notation shows that we are interested
in the rate of change of the momentum as observed by the moving ?uid element (see
§ A.2). Here f could be the weight force, for instance. Addition of the results for all
elements of the ?uid body gives
D
Dt
integraldisplay
V
ρ
m
v dV =
integraldisplay
V
f dV (2.273)
as the change in momentum for the entire body. If on the other hand the force exerted
on the body is through contact with its surface, the change in momentum is
D
Dt
integraldisplay
V
ρ
m
v dV =
contintegraldisplay
S
t dS (2.274)
where t is the “surface traction.”
We can write the time-rate of change of momentum in a more useful form by applying
the Reynolds transport theorem (A.66):
D
Dt
integraldisplay
V
ρ
m
v dV =
integraldisplay
V
?
?t
(ρ
m
v)dV +
contintegraldisplay
S
(ρ
m
v)v · dS. (2.275)
Superposing (2.273) and (2.274) and substituting into (2.275) we have
integraldisplay
V
?
?t
(ρ
m
v)dV +
contintegraldisplay
S
(ρ
m
v)v · dS =
integraldisplay
V
f dV +
contintegraldisplay
S
t dS. (2.276)
If we de?ne the dyadic quantity
ˉ
T
k
=ρ
m
vv
then (2.276) can be written as
integraldisplay
V
?
?t
(ρ
m
v)dV +
contintegraldisplay
S
?n ·
ˉ
T
k
dS=
integraldisplay
V
f dV +
contintegraldisplay
S
t dS. (2.277)
This principle of linear momentum [214] can be interpreted as a large-scale form of
conservation of kinetic linear momentum. Here ?n ·
ˉ
T
k
represents the ?ow of kinetic mo-
mentum across S, and the sum of this momentum transfer and the change of momentum
within V stands equal to the forces acting internal to V and upon S.
The surface traction may be related to the surface normal ?n through a dyadic quantity
ˉ
T
m
called the mechanical stress tensor:
t = ?n ·
ˉ
T
m
.
With this we may write (2.277) as
integraldisplay
V
?
?t
(ρ
m
v)dV +
contintegraldisplay
S
?n ·
ˉ
T
k
dS=
integraldisplay
V
f dV +
contintegraldisplay
S
?n ·
ˉ
T
m
dS
and apply the dyadic form of the divergence theorem (B.19) to get
integraldisplay
V
?
?t
(ρ
m
v)dV +
integraldisplay
V
?·(ρ
m
vv)dV =
integraldisplay
V
f dV +
integraldisplay
V
?·
ˉ
T
m
dV. (2.278)
Combining the volume integrals and setting the integrand to zero we have
?
?t
(ρ
m
v)+?·(ρ
m
vv)= f +?·
ˉ
T
m
,
which is the point-form equivalent of (2.277). Note that the second term on the right-
hand side is nonzero only for points residing on the surface of the body. Finally, letting
g denote momentum density we obtain the simple expression
?·
ˉ
T
k
+
?g
k
?t
= f
k
, (2.279)
where
g
k
=ρ
m
v
is the density of kinetic momentum and
f
k
= f +?·
ˉ
T
m
(2.280)
is the total force density.
Equation (2.279) is somewhat analogous to the electric charge continuity equation
(1.11). For each point of the body, the total out?ux of kinetic momentum plus the time
rate of change of kinetic momentum equals the total force. The resemblance to (1.11)
is strong, except for the nonzero term on the right-hand side. The charge continuity
equation represents a closed system: charge cannot spontaneously appear and add an
extra term to the right-hand side of (1.11). On the other hand, the change in total
momentum at a point can exceed that given by the momentum ?owing out of the point
if there is another “source” (e.g., gravity for an internal point, or pressure on a boundary
point).
To obtain a momentum conservation expression that resembles the continuity equa-
tion, we must consider a “subsystem” with terms that exactly counterbalance the extra
expressions on the right-hand side of (2.279). For a ?uid acted on only by external
pressure the sole e?ect enters through the traction term, and [145]
?·
ˉ
T
m
=??p (2.281)
where p is the pressure exerted on the ?uid body. Now, using (B.63), we can write
??p =??·
ˉ
T
p
(2.282)
where
ˉ
T
p
= p
ˉ
I
and
ˉ
I is the unit dyad. Finally, using (2.282), (2.281), and (2.280) in (2.279), we obtain
?·(
ˉ
T
k
+
ˉ
T
p
)+
?
?t
g
k
= 0
and we have an expression for a closed system including all possible e?ects. Now, note
that we can form the above expression as
parenleftbigg
?·
ˉ
T
k
+
?
?t
g
k
parenrightbigg
+
parenleftbigg
?·
ˉ
T
p
+
?
?t
g
p
parenrightbigg
= 0 (2.283)
where g
p
= 0 since there are no volume e?ects associated with pressure. This can be
viewed as the sum of two closed subsystems
?·
ˉ
T
k
+
?
?t
g
k
= 0, (2.284)
?·
ˉ
T
p
+
?
?t
g
p
= 0.
We now have the desired viewpoint. The conservation formula for the complete closed
system can be viewed as a sum of formulas for open subsystems, each having the form
of a conservation law for a closed system. In case we must include the e?ects of gravity,
for instance, we need only determine
ˉ
T
g
and g
g
such that
?·
ˉ
T
g
+
?
?t
g
g
= 0
and add this new conservation equation to (2.283). If we can ?nd a conservation ex-
pression of form similar to (2.284) for an “electromagnetic subsystem,” we can include
its e?ects along with the mechanical e?ects by merely adding together the conservation
laws. We shall ?nd just such an expression later in this section.
We stated in § 1.3 that there are four fundamental conservation principles. We have
now discussed linear momentum; the principle of angular momentum follows similarly.
Our next goal is to ?nd an expression similar to (2.283) for conservation of energy. We
may expect the conservation of energy expression to obey a similar law of superposition.
We begin with the fundamental de?nition of work: for a particle moving with velocity v
under the in?uence of a force f
k
the work is given by f
k
·v. Dot multiplying (2.272) by v
and replacing f by f
k
(to represent both volume and surface forces), we get
v ·
D
Dt
(ρ
m
v)dV = v · f
k
dV
or equivalently
D
Dt
parenleftbigg
1
2
ρ
m
v · v
parenrightbigg
dV = v · f
k
dV.
Integration over a volume and application of the Reynolds transport theorem (A.66) then
gives
integraldisplay
V
?
?t
parenleftbigg
1
2
ρ
m
v
2
parenrightbigg
dV +
contintegraldisplay
S
?n ·
parenleftbigg
v
1
2
ρ
m
v
2
parenrightbigg
dS=
integraldisplay
V
f
k
· v dV.
Hence the sum of the time rate of change in energy internal to the body and the ?ow
of kinetic energy across the boundary must equal the work done by internal and surface
forces acting on the body. In point form,
?·S
k
+
?
?t
W
k
= f
k
· v (2.285)
where
S
k
= v
1
2
ρ
m
v
2
is the density of the ?ow of kinetic energy and
W
k
=
1
2
ρ
m
v
2
is the kinetic energy density. Again, the system is not closed (the right-hand side of
(2.285) is not zero) because the balancing forces are not included. As was done with the
momentum equation, the e?ect of the work done by the pressure forces can be described
in a closed-system-type equation
?·S
p
+
?
?t
W
p
= 0. (2.286)
Combining (2.285) and (2.286) we have
?·(S
k
+ S
p
)+
?
?t
(W
k
+ W
p
)= 0,
the energy conservation equation for the closed system.
Conservation in the electromagnetic subsystem. We would now like to achieve
closed-system conservation theorems for the electromagnetic subsystem so that we can
add in the e?ects of electromagnetism. For the momentum equation, we can proceed
exactly as we did with the mechanical system. We begin with
f
em
=ρE + J × B.
This force term should appear on one side of the point form of the momentum conserva-
tion equation. The term on the other side must involve the electromagnetic ?elds, since
they are the mechanism for exerting force on the charge distribution. Substituting for J
from (2.2) and for ρ from (2.3) we have
f
em
= E(?·D)? B ×(?×H)+ B ×
?D
?t
.
Using
B ×
?D
?t
=?
?
?t
(D × B)+ D ×
?B
?t
and substituting from Faraday’s law for ?B/?t we have
? [E(?·D)? D ×(?×E)+ H(?·B)? B ×(?×H)] +
?
?t
(D × B)=?f
em
. (2.287)
Here we have also added the null term H(?·B).
The forms of (2.287) and (2.279) would be identical if the bracketed term could be
written as the divergence of a dyadic function
ˉ
T
em
. This is indeed possible for linear,
homogeneous, bianisotropic media, provided that the constitutive matrix [
ˉ
C
EH
] in (2.21)
is symmetric [101]. In that case
ˉ
T
em
=
1
2
(D · E + B · H)
ˉ
I ? DE ? BH, (2.288)
which is called the Maxwell stress tensor. Let us demonstrate this equivalence for a
linear, isotropic, homogeneous material. Putting D = epsilon1E and H = B/μ into (2.287) we
obtain
?·T
em
=?epsilon1E(?·E)+
1
μ
B ×(?×B)+epsilon1E ×(?×E)?
1
μ
B(?·B). (2.289)
Now (B.46) gives
?(A · A)= 2A ×(?×A)+ 2(A ·?)A
so that
E(?·E)? E ×(?×E)= E(?·E)+(E ·?)E ?
1
2
?(E
2
).
Finally, (B.55) and (B.63) give
E(?·E)? E ×(?×E)=?·
parenleftbigg
EE ?
1
2
ˉ
IE · E
parenrightbigg
.
Substituting this expression and a similar one for B into (2.289) we have
?·
ˉ
T
em
=?·
bracketleftbigg
1
2
(D · E + B · H)
ˉ
I ? DE ? BH
bracketrightbigg
,
which matches (2.288).
Replacing the term in brackets in (2.287) by ?·
ˉ
T
em
, we get
?·
ˉ
T
em
+
?g
em
?t
=?f
em
(2.290)
where
g
em
= D × B.
Equation (2.290) is the point form of the electromagnetic conservation of momentum
theorem. It is mathematically identical in form to the mechanical theorem (2.279).
Integration over a volume gives the large-scale form
contintegraldisplay
S
ˉ
T
em
· dS +
integraldisplay
V
?g
em
?t
dV =?
integraldisplay
V
f
em
dV. (2.291)
If we interpret this as we interpreted the conservation theorems from mechanics, the ?rst
term on the left-hand side represents the ?ow of electromagnetic momentum across the
boundaryof V, whilethesecondtermrepresentsthechangeinmomentumwithin V. The
sum of these two quantities is exactly compensated by the total Lorentz force acting on
the charges within V. Thus we identify g
em
as the transport density of electromagnetic
momentum.
Because(2.290)isnotzeroontheright-handside, itdoesnotrepresentaclosedsystem.
If the Lorentz force is the only force acting on the charges within V, then the mechanical
reaction to the Lorentz force should be described by Newton’s third law. Thus we have
the kinematic momentum conservation formula
?·
ˉ
T
k
+
?g
k
?t
= f
k
=?f
em
.
Subtracting this expression from (2.290) we obtain
?·(
ˉ
T
em
?
ˉ
T
k
)+
?
?t
(g
em
? g
k
)= 0, (2.292)
which describes momentum conservation for the closed system.
It is also possible to derive a conservation theorem for electromagnetic energy that
resembles the corresponding theorem for mechanical energy. Earlier we noted that v · f
represents the volume density of work produced by moving an object at velocity v under
the action of a force f. For the electromagnetic subsystem the work is produced by
charges moving against the Lorentz force. So the volume density of work delivered to
the currents is
w
em
= v · f
em
= v ·(ρE + J × B)=(ρv)· E +ρv ·(v × B). (2.293)
Using (B.6) on the second term in (2.293) we get
w
em
=(ρv)· E +ρB ·(v × v).
The second term vanishes by de?nition of the cross product. This is the familiar property
that the magnetic ?eld does no work on moving charge. Hence
w
em
= J · E. (2.294)
This important relation says that charge moving in an electric ?eld experiences a force
which results in energy transfer to (or from) the charge. We wish to write this energy
transfer in terms of an energy ?ux vector, as we did with the mechanical subsystem.
As with our derivation of the conservation of electromagnetic momentum, we wish to
relate the energy transfer to the electromagnetic ?elds. Substitution of J from (2.2) into
(2.294) gives
w
em
=(?×H)· E ?
?D
?t
· E,
hence
w
em
=??·(E × H)+ H ·(?×E)?
?D
?t
· E
by (B.44). Substituting for ?×E from (2.1) we have
w
em
=??·(E × H)?
bracketleftbigg
E ·
?D
?t
+ H ·
?B
?t
bracketrightbigg
.
This is not quite of the form (2.285) since a single term representing the time rate of
change of energy density is not present. However, for a linear isotropic medium in which
epsilon1 and μ do not depend on time (i.e., a nondispersive medium) we have
E ·
?D
?t
=epsilon1E ·
?E
?t
=
1
2
epsilon1
?
?t
(E · E)=
1
2
?
?t
(D · E), (2.295)
H ·
?B
?t
=μH ·
?H
?t
=
1
2
μ
?
?t
(H · H)=
1
2
?
?t
(H · B). (2.296)
Using this we obtain
?·S
em
+
?
?t
W
em
=?f
em
· v =?J · E (2.297)
where
W
em
=
1
2
(D · E + B · H)
and
S
em
= E × H. (2.298)
Equation (2.297) is the point form of the energy conservation theorem, also called Poynt-
ing’s theorem after J.H. Poynting who ?rst proposed it. The quantity S
em
given in
(2.298) is known as the Poynting vector. Integrating (2.297) over a volume and using the
divergence theorem, we obtain the large-scale form
?
integraldisplay
V
J · E dV =
integraldisplay
V
1
2
?
?t
(D · E + B · H)dV +
contintegraldisplay
S
(E × H)· dS. (2.299)
This also holds for a nondispersive, linear, bianisotropic medium with a symmetric con-
stitutive matrix [101, 185].
We see that the electromagnetic energy conservation theorem (2.297) is identical in
form to the mechanical energy conservation theorem (2.285). Thus, if the system is com-
posed of just the kinetic and electromagnetic subsystems, the mechanical force exactly
balances the Lorentz force, and (2.297) and (2.285) add to give
?·(S
em
+ S
k
)+
?
?t
(W
em
+ W
k
)= 0, (2.300)
showing that energy is conserved for the entire system.
As in the mechanical system, we identify W
em
as the volume electromagnetic energy
density in V, and S
em
as the density of electromagnetic energy ?owing across the bound-
ary of V. This interpretation is somewhat controversial, as discussed below.
Interpretation of the energyand momentum conservation theorems. There
has been some controversy regarding Poynting’s theorem (and, equally, the momentum
conservation theorem). While there is no question that Poynting’s theorem is mathe-
matically correct, we may wonder whether we are justi?ed in associating W
em
with W
k
and S
em
with S
k
merely because of the similarities in their mathematical expressions.
Certainly there is some justi?cation for associating W
k
, the kinetic energy of particles,
with W
em
, since we shall show that for static ?elds the term
1
2
(D · E + B · H) represents
the energy required to assemble the charges and currents into a certain con?guration.
However, the term S
em
is more problematic. In a mechanical system, S
k
represents the
?ow of kinetic energy associated with moving particles — does that imply that S
em
rep-
resents the ?ow of electromagnetic energy? That is the position generally taken, and it is
widely supported by experimental evidence. However, the interpretation is not clear-cut.
If we associate S
em
with the ?ow of electromagnetic energy at a point in space, then
we must de?ne what a ?ow of electromagnetic energy is. We naturally associate the
?ow of kinetic energy with moving particles; with what do we associate the ?ow of
electromagnetic energy? Maxwell felt that electromagnetic energy must ?ow through
space as a result of the mechanical stresses and strains associated with an unobserved
substance called the “aether.” A more modern interpretation is that the electromagnetic
?elds propagate as a wave through space at ?nite velocity; when those ?elds encounter a
chargedparticleaforceisexerted, workisdone, andenergyis“transferred”fromthe?eld
to the particle. Hence the energy ?ow is associated with the “?ow” of the electromagnetic
wave.
Unfortunately, it is uncertain whether E × H is the appropriate quantity to associate
with this ?ow, since only its divergence appears in Poynting’s theorem. We could add
any other term S
prime
that satis?es ?·S
prime
= 0 to S
em
in (2.297), and the conservation theorem
would be unchanged. (Equivalently, we could add to (2.299) any term that integrates to
zero over S.) There is no such ambiguity in the mechanical case because kinetic energy
is rigorously de?ned. We are left, then, to postulate that E × H represents the density
of energy ?ow associated with an electromagnetic wave (based on the symmetry with
mechanics), and to look to experimental evidence as justi?cation. In fact, experimental
evidence does point to the correctness of this hypothesis, and the quantity E×H is widely
and accurately used to compute the energy radiated by antennas, carried by waveguides,
etc.
Confusion also arises regarding the interpretation of W
em
. Since this term is so con-
veniently paired with the mechanical volume kinetic energy density in (2.300) it would
seem that we should interpret it as an electromagnetic energy density. As such, we can
think of this energy as “localized” in certain regions of space. This viewpoint has been
criticized [187, 145, 69] since the large-scale form of energy conservation for a space re-
gion only requires that the total energy in the region be speci?ed, and the integrand
(energy density) giving this energy is not unique. It is also felt that energy should be
associated with a “con?guration” of objects (such as charged particles) and not with an
arbitrary point in space. However, we retain the concept of localized energy because it
is convenient and produces results consistent with experiment.
The validity of extending the static ?eld interpretation of
1
2
(D · E + B · H)
as the energy “stored” by a charge and a current arrangement to the time-varying case
has also been questioned. If we do extend this view to the time-varying case, Poynting’s
theorem suggests that every point in space somehow has an energy density associated
with it, and the ?ow of energy from that point (via S
em
) must be accompanied by a
change in the stored energy at that point. This again gives a very useful and intuitively
satisfying point of view. Since we can associate the ?ow of energy with the propagation
of the electromagnetic ?elds, we can view the ?elds in any region of space as having the
potential to do work on charged particles in that region. If there are charged particles in
that region then work is done, accompanied by a transfer of energy to the particles and
a reduction in the amplitudes of the ?elds.
We must also remember that the association of stored electromagnetic energy density
W
em
with the mechanical energy density W
k
is only possible if the medium is nondisper-
sive. If we cannot make the assumptions that justify (2.295) and (2.296), then Poynting’s
theorem must take the form
?
integraldisplay
V
J · E dV =
integraldisplay
V
bracketleftbigg
E ·
?D
?t
+ H ·
?B
?t
bracketrightbigg
dV +
contintegraldisplay
S
(E × H)· dS. (2.301)
Fordispersivemedia,thevolumetermontheright-handsidedescribesnotonlythestored
electromagnetic energy, but also the energy dissipated within the material produced by
a time lag between the ?eld applied to the medium and the resulting polarization or
magnetization of the atoms. This is clearly seen in (2.29), which shows that D(t)depends
on the value of E at time t and at all past times. The stored energy and dissipative terms
are hard to separate, but we can see that there must always be a stored energy term by
substituting D =epsilon1
0
E + P and H = B/μ
0
? M into (2.301) to obtain
?
integraldisplay
V
[(J + J
P
)· E + J
H
· H] dV =
1
2
?
?t
integraldisplay
V
(epsilon1
0
E · E +μ
0
H · H)dV +
contintegraldisplay
S
(E × H)· dS. (2.302)
Here J
P
is the equivalent polarization current (2.119) and J
H
is an analogous magnetic
polarization current given by
J
H
=μ
0
?M
?t
.
In this form we easily identify the quantity
1
2
(epsilon1
0
E · E +μ
0
H · H)
as the electromagnetic energy density for the ?elds E and H in free space. Any dissipa-
tion produced by polarization and magnetization lag is now handled by the interaction
between the ?elds and equivalent current, just as J · E describes the interaction of the
electric current (source and secondary) with the electric ?eld. Unfortunately, the equiv-
alent current interaction terms also include the additional stored energy that results
from polarizing and magnetizing the material atoms, and again the e?ects are hard to
separate.
Finally, let us consider the case of static ?elds. Setting the time derivative to zero in
(2.299), we have
?
integraldisplay
V
J · E dV =
contintegraldisplay
S
(E × H)· dS.
This shows that energy ?ux is required to maintain steady current ?ow. For instance,
we need both an electromagnetic and a thermodynamic subsystem to account for energy
conservation in the case of steady current ?ow through a resistor. The Poynting ?ux
describes the electromagnetic energy entering the resistor and the thermodynamic ?ux
describes the heat dissipation. For the sum of the two subsystems conservation of energy
requires
?·(S
em
+ S
th
)=?J · E + P
th
= 0.
To compute the heat dissipation we can use
P
th
= J · E =??·S
em
and thus either use the boundary ?elds or the ?elds and current internal to the resistor
to ?nd the dissipated heat.
Boundaryconditions on the Poynting vector. The large-scale form of Poynting’s
theorem may be used to determine the behavior of the Poynting vector on either side
of a boundary surface. We proceed exactly as in § 2.8.2. Consider a surface S across
which the electromagnetic sources and constitutive parameters are discontinuous (Figure
2.6).Asbefore,let ?n
12
be the unit normal directed into region 1. We now simplify the
notation and write S instead of S
em
. If we apply Poynting’s theorem
integraldisplay
V
parenleftbigg
J · E + E ·
?D
?t
+ H ·
?B
?t
parenrightbigg
dV +
contintegraldisplay
S
S · n dS= 0
tothetwoseparatesurfacesshowninFigure2.6,weobtain
integraldisplay
V
parenleftbigg
J · E + E ·
?D
?t
+ H ·
?B
?t
parenrightbigg
dV +
integraldisplay
S
S · n dS=
integraldisplay
S
10
?n
12
·(S
1
? S
2
)dS. (2.303)
If on the other hand we apply Poynting’s theorem to the entire volume region including
the surface of discontinuity and include the contribution produced by surface current, we
get
integraldisplay
V
parenleftbigg
J · E + E ·
?D
?t
+ H ·
?B
?t
parenrightbigg
dV +
integraldisplay
S
S · n dS=?
integraldisplay
S
10
J
s
· E dS. (2.304)
Since we are uncertain whether to use E
1
or E
2
in the surface term on the right-hand side,
if we wish to have the integrals over V and S in (2.303) and (2.304) produce identical
results we must postulate the two conditions
?n
12
×(E
1
? E
2
)= 0
and
?n
12
·(S
1
? S
2
)=?J
s
· E. (2.305)
The ?rst condition is merely the continuity of tangential electric ?eld as originally postu-
lated in § 2.8.2; it allows us to be nonspeci?c as to which value of E we use in the second
condition, which is the desired boundary condition on S.
It is interesting to note that (2.305) may also be derived directly from the two pos-
tulated boundary conditions on tangential E and H. Here we write with the help of
(B.6)
?n
12
·(S
1
? S
2
)= ?n
12
·(E
1
× H
1
? E
2
× H
2
)= H
1
·(?n
12
× E
1
)? H
2
·(?n
12
× E
2
).
Since ?n
12
× E
1
= ?n
12
× E
2
= ?n
12
× E,wehave
?n
12
·(S
1
? S
2
)=(H
1
? H
2
)·(?n
12
× E)= [??n
12
×(H
1
? H
2
)] · E.
Finally, using ?n
12
×(H
1
? H
2
)= J
s
we arrive at (2.305).
The arguments above suggest an interesting way to look at the boundary conditions.
Onceweidentify S withthe?owofelectromagneticenergy, wemayconsiderthecondition
on normal S as a fundamental statement of the conservation of energy. This statement
implies continuity of tangential E in order to have an unambiguous interpretation for the
meaning of the term J
s
· E. Then, with continuity of tangential E established, we can
derive the condition on tangential H directly.
An alternative formulation of the conservation theorems. As we saw in the
paragraphs above, our derivation of the conservation theorems lacks strong motivation.
We manipulated Maxwell’s equations until we found expressions that resembled those
for mechanical momentum and energy, but in the process found that the validity of the
expressions is somewhat limiting. For instance, we needed to assume a linear, homoge-
neous, bianisotropic medium in order to identify the Maxwell stress tensor (2.288) and
the energy densities in Poynting’s theorem (2.299). In the end, we were reduced to pos-
tulating the meaning of the individual terms in the conservation theorems in order for
the whole to have meaning.
An alternative approach is popular in physics. It involves postulating a single La-
grangian density function for the electromagnetic ?eld, and then applying the stationary
property of the action integral. The results are precisely the same conservation expres-
sions for linear momentum and energy as obtained from manipulating Maxwell’s equa-
tions (plus the equation for conservation of angular momentum), obtained with fewer
restrictions regarding the constitutive relations. This process also separates the stored
energy, Maxwell stress tensor, momentum density, and Poynting vector as natural com-
ponents of a tensor equation, allowing a better motivated interpretation of the meaning
of these components. Since this approach is also a powerful tool in mechanics, its ap-
plication is more strongly motivated than merely manipulating Maxwell’s equations. Of
course, some knowledge of the structure of the electromagnetic ?eld is required to provide
an appropriate postulate of the Lagrangian density. Interested readers should consult
Kong [101], Jackson [91], Doughty [57], or Tolstoy [198].
2.10 Thewave natureoftheelectromagnetic?eld
Throughout this chapter our goal has been a fundamental understanding of Maxwell’s
theory of electromagnetics. We have concentrated on developing and understanding the
equations relating the ?eld quantities, but have done little to understand the nature of
the ?eld itself. We would now like to investigate, in a very general way, the behavior
of the ?eld. We shall not attempt to solve a vast array of esoteric problems, but shall
instead concentrate on a few illuminating examples.
The electromagnetic ?eld can take on a wide variety of characteristics. Static ?elds
di?er qualitatively from those which undergo rapid time variations. Time-varying ?elds
exhibit wave behavior and carry energy away from their sources. In the case of slow
time variation this wave nature may often be neglected in favor of the nearby coupling
of sources we know as the inductance e?ect, hence circuit theory may su?ce to describe
the ?eld-source interaction. In the case of extremely rapid oscillations, particle concepts
may be needed to describe the ?eld.
Thedynamiccouplingbetweenthevarious?eldvectorsinMaxwell’sequationsprovides
a means of characterizing the ?eld. Static ?elds are characterized by decoupling of the
electric and magnetic ?elds. Quasistatic ?elds exhibit some coupling, but the wave
characteristic of the ?eld is ignored. Tightly coupled ?elds are dominated by the wave
e?ect, but may still show a static-like spatial distribution near the source. Any such
“near-zone” e?ects are generally ignored for ?elds at light-wave frequencies, and the
particle nature of light must often be considered.
2.10.1 Electromagnetic waves
AnearlyresultofMaxwell’stheorywasthepredictionandlaterveri?cationbyHeinrich
Hertz of the existence of electromagnetic waves. We now know that nearly any time-
varying source produces waves, and that these waves have certain important properties.
An electromagnetic wave is a propagating electromagnetic ?eld that travels with ?nite
velocity as a disturbance through a medium. The ?eld itself is the disturbance, rather
thanmerelyrepresentingaphysicaldisplacementorothere?ectonthemedium. Thisfact
is fundamental for understanding how electromagnetic waves can travel through a true
vacuum. Many speci?c characteristics of the wave, such as velocity and polarization,
depend on the properties of the medium through which it propagates. The evolution
of the disturbance also depends on these properties: we say that a material exhibits
“dispersion” if the disturbance undergoes a change in its temporal behavior as the wave
progresses. As waves travel they carry energy and momentum away from their source.
This energy may be later returned to the source or delivered to some distant location.
Wavesarealsocapableoftransferringenergyto, orwithdrawingenergyfrom, themedium
through which they propagate. When energy is carried outward from the source never
to return, we refer to the process as “electromagnetic radiation.” The e?ects of radiated
?elds can be far-reaching; indeed, radio astronomers observe waves that originated at the
very edges of the universe.
Light is an electromagnetic phenomenon, and many of the familiar characteristics of
light that we recognize from our everyday experience may be applied to all electromag-
netic waves. For instance, radio waves bend (or “refract”) in the ionosphere much as
light waves bend while passing through a prism. Microwaves re?ect from conducting sur-
faces in the same way that light waves re?ect from a mirror; detecting these re?ections
forms the basis of radar. Electromagnetic waves may also be “con?ned” by re?ecting
boundaries to form waves standing in one or more directions. With this concept we can
use waveguides or transmission lines to guide electromagnetic energy from spot to spot,
or to concentrate it in the cavity of a microwave oven.
The manifestations of electromagnetic waves are so diverse that no one book can
possibly describe the entire range of phenomena or application. In this section we shall
merely introduce the reader to some of the most fundamental concepts of electromagnetic
wave behavior. In the process we shall also introduce the three most often studied types
of traveling electromagnetic waves: plane waves, spherical waves, and cylindrical waves.
In later sections we shall study some of the complicated interactions of these waves with
objects and boundaries, in the form of guided waves and scattering problems.
Mathematically, electromagneticwavesariseasasubsetofsolutionstoMaxwell’sequa-
tions. These solutions obey the electromagnetic “wave equation,” which may be derived
from Maxwell’s equations under certain circumstances. Not all electromagnetic ?elds
satisfy the wave equation. Obviously, time-invariant ?elds cannot represent evolving
wave disturbances, and must obey the static ?eld equations. Time-varying ?elds in cer-
tain metals may obey the di?usion equation rather than the wave equation, and must
thereby exhibit di?erent behavior. In the study of quasistatic ?elds we often ignore the
displacement current term in Maxwell’s equations, producing solutions that are most
important near the sources of the ?elds and having little associated radiation. When the
displacement term is signi?cant we produce solutions with the properties of waves.
2.10.2 Wave equation for bianisotropic materials
In deriving electromagnetic wave equations we transform the ?rst-order coupled par-
tial di?erential equations we know as Maxwell’s equations into uncoupled second-order
equations. That is, we perform a set of operations (and make appropriate assumptions)
to reduce the set of four di?erential equations in the four unknown ?elds E, D, B, and
H, into a set of di?erential equations each involving a single unknown (usually E or
H). It is possible to derive wave equations for E and H even for the most general cases
of inhomogeneous, bianisotropic media, as long as the constitutive parameters ˉμ and
ˉ
ξ are constant with time. Substituting the constitutive relations (2.19)–(2.20) into the
Maxwell–Minkowski curl equations (2.169)–(2.170) we get
?×E =?
?
?t
(
ˉ
ζ · E + ˉμ· H)? J
m
, (2.306)
?×H =
?
?t
(ˉepsilon1· E +
ˉ
ξ· H)+ J. (2.307)
Separate equations for E and H are facilitated by introducing a new dyadic operator
ˉ
?,
which when dotted with a vector ?eld V gives the curl:
ˉ
?·V =?×V. (2.308)
It is easy to verify that in rectangular coordinates
ˉ
? is
[
ˉ
?] =
?
?
0 ??/?z ?/?y
?/?z 0 ??/?x
??/?y ?/?x 0
?
?
.
With this notation, Maxwell’s curl equations (2.306)–(2.307) become simply
parenleftbigg
ˉ
?+
?
?t
ˉ
ζ
parenrightbigg
· E =?
?
?t
ˉμ· H ? J
m
, (2.309)
parenleftbigg
ˉ
??
?
?t
ˉ
ξ
parenrightbigg
· H =
?
?t
ˉepsilon1· E + J. (2.310)
Obtaining separate equations for E and H is straightforward. De?ning the inverse
dyadic ˉμ
?1
through
ˉμ· ˉμ
?1
= ˉμ
?1
· ˉμ =
ˉ
I,
we can write (2.309) as
?
?t
H =?ˉμ
?1
·
parenleftbigg
ˉ
?+
?
?t
ˉ
ζ
parenrightbigg
· E ? ˉμ
?1
· J
m
(2.311)
where we have assumed that ˉμ is independent of time. Assuming that
ˉ
ξ is also indepen-
dent of time, we can di?erentiate (2.310) with respect to time to obtain
parenleftbigg
ˉ
??
?
?t
ˉ
ξ
parenrightbigg
·
?H
?t
=
?
2
?t
2
(ˉepsilon1· E)+
?J
?t
.
Substituting ?H/?t from (2.311) and rearranging, we get
bracketleftbiggparenleftbigg
ˉ
??
?
?t
ˉ
ξ
parenrightbigg
· ˉμ
?1
·
parenleftbigg
ˉ
?+
?
?t
ˉ
ζ
parenrightbigg
+
?
2
?t
2
ˉepsilon1
bracketrightbigg
· E =?
parenleftbigg
ˉ
??
?
?t
ˉ
ξ
parenrightbigg
· ˉμ
?1
· J
m
?
?J
?t
.
(2.312)
This is the general wave equation for E. Using an analogous set of steps, and assuming
ˉepsilon1 and
ˉ
ζ are independent of time, we can ?nd
bracketleftbiggparenleftbigg
ˉ
?+
?
?t
ˉ
ζ
parenrightbigg
· ˉepsilon1
?1
·
parenleftbigg
ˉ
??
?
?t
ˉ
ξ
parenrightbigg
+
?
2
?t
2
ˉμ
bracketrightbigg
· H =
parenleftbigg
ˉ
?+
?
?t
ˉ
ζ
parenrightbigg
· ˉepsilon1
?1
· J ?
?J
m
?t
.
(2.313)
This is the wave equation for H. The case in which the constitutive parameters are
time-dependent will be handled using frequency domain techniques in later chapters.
Wave equations for anisotropic, isotropic, and homogeneous media are easily obtained
from (2.312) and (2.313) as special cases. For example, the wave equations for a homo-
geneous, isotropic medium can be found by setting
ˉ
ζ =
ˉ
ξ = 0, ˉμ =μ
ˉ
I, and ˉepsilon1 =epsilon1
ˉ
I:
1
μ
ˉ
?·(
ˉ
?·E)+epsilon1
?
2
E
?t
2
=?
1
μ
ˉ
?·J
m
?
?J
?t
,
1
epsilon1
ˉ
?·(
ˉ
?·H)+μ
?
2
H
?t
2
=
1
epsilon1
ˉ
?·J ?
?J
m
?t
.
Returning to standard curl notation we ?nd that these become
?×(?×E)+μepsilon1
?
2
E
?t
2
=??×J
m
?μ
?J
?t
, (2.314)
?×(?×H)+μepsilon1
?
2
H
?t
2
=?×J ?epsilon1
?J
m
?t
. (2.315)
In each of the wave equations it appears that operations on the electromagnetic ?elds
have been separated from operations on the source terms. However, we have not yet
invoked any coupling between the ?elds and sources associated with secondary interac-
tions. That is, we need to separate the impressed sources, which are independent of
the ?elds they source, with secondary sources resulting from interactions between the
sourced ?elds and the medium in which the ?elds exist. The simple case of an isotropic
conducting medium will be discussed below.
Wave equation using equivalent sources. An alternative approach for studying
wave behavior in general media is to use the Maxwell–Bo? form of the ?eld equations
?×E =?
?B
?t
, (2.316)
?×
B
μ
0
=(J + J
M
+ J
P
)+
?epsilon1
0
E
?t
, (2.317)
?·(epsilon1
0
E)=(ρ+ρ
P
), (2.318)
?·B = 0. (2.319)
Taking the curl of (2.316) we have
?×(?×E)=?
?
?t
?×B.
Substituting for ?×B from (2.317) we then obtain
?×(?×E)+μ
0
epsilon1
0
?
2
E
?t
2
=?μ
0
?
?t
(J + J
M
+ J
P
), (2.320)
which is the wave equation for E. Taking the curl of (2.317) and substituting from (2.316)
we obtain the wave equation
?×(?×B)+μ
0
epsilon1
0
?
2
B
?t
2
=μ
0
?×(J + J
M
+ J
P
) (2.321)
for B. Solutionofthewaveequationsisoftenfacilitatedbywritingthecurl-curloperation
in terms of the vector Laplacian. Using (B.47), and substituting for the divergence from
(2.318) and (2.319), we can write the wave equations as
?
2
E ?μ
0
epsilon1
0
?
2
E
?t
2
=
1
epsilon1
0
?(ρ+ρ
P
)+μ
0
?
?t
(J + J
M
+ J
P
), (2.322)
?
2
B ?μ
0
epsilon1
0
?
2
B
?t
2
=?μ
0
?×(J + J
M
+ J
P
). (2.323)
The simplicity of these equations relative to (2.312) and (2.313) is misleading. We have
not considered the constitutive equations relating the polarization P and magnetization
M to the ?elds, nor have we considered interactions leading to secondary sources.
2.10.3 Wave equation in a conducting medium
As an example of the type of wave equation that arises when secondary sources are
included, considerahomogeneousisotropicconductingmediumdescribedbypermittivity
epsilon1, permeability μ, and conductivity σ. In a conducting medium we must separate the
source ?eld into a causative impressed term J
i
that is independent of the ?elds it sources,
and a secondary term J
s
that is an e?ect of the sourced ?elds. In an isotropic conducting
medium the e?ect is described by Ohm’s law J
s
= σE. Writing the total current as
J = J
i
+ J
s
, and assuming that J
m
= 0, we write the wave equation (2.314) as
?×(?×E)+μepsilon1
?
2
E
?t
2
=?μ
?(J
i
+σE)
?t
. (2.324)
Using (B.47) and substituting ?·E =ρ/epsilon1, we can write the wave equation for E as
?
2
E ?μσ
?E
?t
?μepsilon1
?
2
E
?t
2
=μ
?J
i
?t
+
1
epsilon1
?ρ. (2.325)
Substituting J = J
i
+σE into (2.315) and using (B.47), we obtain
?(?·H)??
2
H +μepsilon1
?
2
H
?t
2
=?×J
i
+σ?×E.
Since ?×E =??B/?t and ?·H =?·B/μ= 0,wehave
?
2
H ?μσ
?H
?t
?μepsilon1
?
2
H
?t
2
=??×J
i
. (2.326)
This is the wave equation for H.
2.10.4 Scalar wave equation for a conducting medium
In many applications, particularly those involving planar boundary surfaces, it is
convenient to decompose the vector wave equation into cartesian components. Using
?
2
V = ?x?
2
V
x
+ ?y?
2
V
y
+ ?z?
2
V
z
in (2.325) and in (2.326), we ?nd that the rectangular
components of E and H must obey the scalar wave equation
?
2
ψ(r,t)?μσ
?ψ(r,t)
?t
?μepsilon1
?
2
ψ(r,t)
?t
2
= s(r,t). (2.327)
For the electric ?eld wave equation we have
ψ = E
α
, s =μ
?J
i
α
?t
+
1
epsilon1
?α·?ρ,
where α = x, y,z. For the magnetic ?eld wave equations we have
ψ = H
α
, s = ?α·(?? × J
i
).
2.10.5 Fields determined byMaxwell’s equations vs. ?elds deter-
mined bythe wave equation
Although we derive the wave equations directly from Maxwell’s equations, we may
wonder whether the solutions to second-order di?erential equations such as (2.314)–
(2.315) are necessarily the same as the solutions to the ?rst-order Maxwell equations.
Hansen and Yaghjian [81] show that if all information about the ?elds is supplied by the
sources J(r,t) and ρ(r,t), rather than by speci?cation of ?eld values on boundaries, the
solutions to Maxwell’s equations and the wave equations are equivalent as long as the
second derivatives of the quantities
?·E(r,t)?ρ(r,t)/epsilon1, ?·H(r,t),
are continuous functions of r and t. If boundary values are supplied in an attempt to
guarantee uniqueness, then solutions to the wave equation and to Maxwell’s equations
may di?er. This is particularly important when comparing numerical solutions obtained
directly from Maxwell’s equations (using the FDTD method, say) to solutions obtained
from the wave equation. “Spurious” solutions having no physical signi?cance are a con-
tinual plague for engineers who employ numerical techniques. The interested reader
should see Jiang [94].
We note that these conclusions do not hold for static ?elds. The conditions for equiv-
alence of the ?rst-order and second-order static ?eld equations are considered in § 3.2.4.
2.10.6 Transient uniform plane waves in a conducting medium
We can learn a great deal about the wave nature of the electromagnetic ?eld by solving
the wave equation (2.325) under simple circumstances. In Chapter 5 we shall solve for
the ?eld produced by an arbitrary distribution of impressed sources, but here we seek a
simple solution to the homogeneous form of the equation. This allows us to study the
phenomenology of wave propagation without worrying about the consequences of speci?c
source functions. We shall also assume a high degree of symmetry so that we are not
bogged down in details about the vector directions of the ?eld components.
We seek a solution of the wave equation in which the ?elds are invariant over a chosen
planar surface. The resulting ?elds are said to comprise a uniform plane wave. Although
we can envision a uniform plane wave as being created by a uniform surface source of
doubly-in?nite extent, plane waves are also useful as models for spherical waves over
localized regions of the wavefront.
We choose the plane of ?eld invariance to be the xy-plane and later generalize the
resulting solution to any planar surface by a simple rotation of the coordinate axes. Since
the ?elds vary with z only we choose to write the wave equation (2.325) in rectangular
coordinates, giving for a source-free region of space
4
?x
?
2
E
x
(z,t)
?z
2
+ ?y
?
2
E
y
(z,t)
?z
2
+ ?z
?
2
E
z
(z,t)
?z
2
?μσ
?E(z,t)
?t
?μepsilon1
?
2
E(z,t)
?t
2
= 0. (2.328)
If we return to Maxwell’s equations, we soon ?nd that not all components of E are
present in the plane-wave solution. Faraday’s law states that
?×E(z,t)=??x
?E
y
(z,t)
?z
+ ?y
?E
x
(z,t)
?z
= ?z ×
?E(z,t)
?z
=?μ
?H(z,t)
?t
. (2.329)
We see that ?H
z
/?t = 0, hence H
z
must be constant with respect to time. Because
a nonzero constant ?eld component would not exhibit wave-like behavior, we can only
have H
z
= 0 in our wave solution. Similarly, Ampere’s law in a homogeneous conducting
region free from impressed sources states that
?×H(z,t)= J +
?D(z,t)
?t
=σE(z,t)+epsilon1
?E(z,t)
?t
or
? ?x
?H
y
(z,t)
?z
+ ?y
?H
x
(z,t)
?z
= ?z ×
?H(z,t)
?z
=σE(z,t)+epsilon1
?E(z,t)
?t
. (2.330)
This implies that
σE
z
(z,t)+epsilon1
?E
z
(z,t)
?t
= 0,
which is a di?erential equation for E
z
with solution
E
z
(z,t)= E
0
(z)e
?
σ
epsilon1
t
.
Since we are interested only in wave-type solutions, we choose E
z
= 0.
Hence E
z
= H
z
= 0, and thus both E and H are perpendicular to the z-direction.
Using (2.329) and (2.330), we also see that
?
?t
(E · H)= E ·
?H
?t
+ H ·
?E
?t
=?
1
μ
E ·
parenleftbigg
?z ×
?E
?z
parenrightbigg
? H ·
parenleftBig
σ
epsilon1
E
parenrightBig
+
1
epsilon1
H ·
parenleftbigg
?z ×
?H
?z
parenrightbigg
or
parenleftbigg
?
?t
+
σ
epsilon1
parenrightbigg
(E · H)=
1
μ
?z ·
parenleftbigg
E ×
?E
?z
parenrightbigg
?
1
epsilon1
?z ·
parenleftbigg
H ×
?H
?z
parenrightbigg
.
We seek solutions of the type E(z,t)= ?pE(z,t)and H(z,t)= ?qH(z,t), where ?p and ?q are
constant unit vectors. Under this condition we have E×?E/?z = 0 and H×?H/?z = 0,
giving
parenleftbigg
?
?t
+
σ
epsilon1
parenrightbigg
(E · H)= 0.
4
The term “source free” applied to a conducting region implies that the region is devoid of impressed
sources and, because of the relaxation e?ect, has no free charge. See the discussion in Jones [97].
Thus we also have E · H = 0, and ?nd that E must be perpendicular to H.SoE, H,
and ?z comprise a mutually orthogonal triplet of vectors. A wave having this property is
said to be TEM to the z-direction or simply TEM
z
. Here “TEM” stands for transverse
electromagnetic, indicating the orthogonal relationship between the ?eld vectors and the
z-direction. Note that
?p × ?q =±?z.
The constant direction described by ?p is called the polarization of the plane wave.
We are now ready to solve the source-free wave equation (2.328). If we dot both sides
of the homogeneous expression by ?p we obtain
?p · ?x
?
2
E
x
?z
2
+ ?p · ?y
?
2
E
y
?z
2
?μσ
?(?p · E)
?t
?μepsilon1
?
2
(?p · E)
?t
2
= 0.
Noting that
?p · ?x
?
2
E
x
?z
2
+ ?p · ?y
?
2
E
y
?z
2
=
?
2
?z
2
(?p · ?xE
x
+ ?p · ?yE
y
)=
?
2
?z
2
(?p · E),
we have the wave equation
?
2
E(z,t)
?z
2
?μσ
?E(z,t)
?t
?μepsilon1
?
2
E(z,t)
?t
2
= 0. (2.331)
Similarly, dotting both sides of (2.326) with ?q and setting J
i
= 0 we obtain
?
2
H(z,t)
?z
2
?μσ
?H(z,t)
?t
?μepsilon1
?
2
H(z,t)
?t
2
= 0. (2.332)
In a source-free homogeneous conducting region E and H satisfy identical wave equations.
Solutions are considered in § A.1. There we solve for the total ?eld for all z,t given
the value of the ?eld and its derivative over the z = 0 plane. This solution can be
directly applied to ?nd the total ?eld of a plane wave re?ected by a perfect conductor.
Let us begin by considering the lossless case where σ = 0, and assuming the region z < 0
contains a perfect electric conductor. The conditions on the ?eld in the z = 0 plane are
determined by the required boundary condition on a perfect conductor: the tangential
electric ?eld must vanish. From (2.330) we see that since E ⊥ ?z, requiring
?H(z,t)
?z
vextendsingle
vextendsingle
vextendsingle
vextendsingle
z=0
= 0 (2.333)
gives E(0,t)= 0 and thus satis?es the boundary condition. Writing
H(0,t)= H
0
f(t),
?H(z,t)
?z
vextendsingle
vextendsingle
vextendsingle
vextendsingle
z=0
= H
0
g(t)= 0, (2.334)
and setting Omega1= 0 in (A.41) we obtain the solution to (2.332):
H(z,t)=
H
0
2
f
parenleftBig
t ?
z
v
parenrightBig
+
H
0
2
f
parenleftBig
t +
z
v
parenrightBig
, (2.335)
where v = 1/(μepsilon1)
1/2
. Since we designate the vector direction of H as ?q, the vector ?eld
is
H(z,t)= ?q
H
0
2
f
parenleftBig
t ?
z
v
parenrightBig
+ ?q
H
0
2
f
parenleftBig
t +
z
v
parenrightBig
. (2.336)
Figure 2.7: Propagation of a transient plane wave in a lossless medium.
From (2.329) we also have the solution for E(z,t):
E(z,t)= ?p
vμH
0
2
f
parenleftBig
t ?
z
v
parenrightBig
? ?p
vμH
0
2
f
parenleftBig
t +
z
v
parenrightBig
, (2.337)
where
?p × ?q = ?z.
The boundary conditions E(0,t)= 0 and H(0,t)= H
0
f(t) are easily veri?ed by substi-
tution.
This solution displays the quintessential behavior of electromagnetic waves. We may
interpret the term f(t +z/v)as a wave ?eld disturbance, propagating at velocityv in the
?z-direction, incident from z > 0 upon the conductor. The term f(t ? z/v) represents
a wave ?eld disturbance propagating in the +z-direction with velocity v, re?ected from
the conductor. By “propagating” we mean that if we increment time, the disturbance
will occupy a spatial position determined by incrementing z by vt. For free space where
v = 1/(μ
0
epsilon1
0
)
1/2
, the velocity of propagation is the speed of light c.
A speci?c example should serve to clarify our interpretation of the wave solution.
Taking μ=μ
0
and epsilon1 = 81epsilon1
0
, representing typical constitutive values for fresh water, we
can plot (2.335) as a function of position for ?xed values of time. The result is shown in
Figure2.7,wherewehavechosen
f(t)= rect(t/τ) (2.338)
with τ = 1 μs. We see that the disturbance is spatially distributed as a rectangular
pulse of extent L = 2vτ = 66.6 m, where v = 3.33 × 10
7
m/s is the wave velocity,
and where 2τ is the temporal duration of the pulse. At t =?8 μs the leading edge of
the pulse is at z = 233 m, while at ?4 μs the pulse has traveled a distance z = vt =
(3.33 × 10
7
)×(4 × 10
?6
) = 133 m in the ?z-direction, and the leading edge is thus at
100 m. At t =?1 μs the leading edge strikes the conductor and begins to induce a
current in the conductor surface. This current sets up the re?ected wave, which begins
to travel in the opposite (+z) direction. At t =?0.5 μs a portion of the wave has begun
to travel in the +z-direction while the trailing portion of the disturbance continues to
travel in the ?z-direction. At t = 1 μs the wave has been completely re?ected from
the surface, and thus consists only of the component traveling in the +z-direction. Note
that if we plot the total ?eld in the z = 0 plane, the sum of the forward and backward
traveling waves produces the pulse waveform (2.338) as expected.
Using the expressions for E and H we can determine many interesting characteristics
of the wave. We see that the terms f(t ± z/v) represent the components of the waves
traveling in the ?z-directions, respectively. If we were to isolate these waves from each
other (by, for instance, measuring them as functions of time at a position where they do
not overlap) we would ?nd from (2.336) and (2.337) that the ratio of E to H forawave
traveling in either direction is
vextendsingle
vextendsingle
vextendsingle
vextendsingle
E(z,t)
H(z,t)
vextendsingle
vextendsingle
vextendsingle
vextendsingle
=vμ=(μ/epsilon1)
1/2
,
independent of the time and position of the measurement. This ratio, denoted by η and
carrying units of ohms, is called the intrinsic impedance of the medium through which
the wave propagates. Thus, if we let E
0
=ηH
0
we can write
E(z,t)= ?p
E
0
2
f
parenleftBig
t ?
z
v
parenrightBig
? ?p
E
0
2
f
parenleftBig
t +
z
v
parenrightBig
. (2.339)
Wecaneasilydeterminethecurrentinducedintheconductorbyapplyingtheboundary
condition (2.200):
J
s
= ?n × H|
z=0
= ?z × [H
0
?q f(t)] =??pH
0
f(t). (2.340)
We can also determine the pressure exerted on the conductor due to the Lorentz force
interaction between the ?elds and the induced current. The total force on the conductor
can be computed by integrating the Maxwell stress tensor (2.288) over the xy-plane
5
:
F
em
=?
integraldisplay
S
ˉ
T
em
· dS.
The surface traction is
t =
ˉ
T
em
· ?n =
bracketleftbigg
1
2
(D · E + B · H)
ˉ
I ? DE ? BH
bracketrightbigg
· ?z.
Since E and H are both normal to ?z, the last two terms in this expression are zero. Also,
the boundary condition on E implies that it vanishes in the xy-plane. Thus
t =
1
2
(B · H)?z = ?z
μ
2
H
2
(t).
5
We may neglect the momentum term in (2.291), which is small compared to the stress tensor term. See
Problem 2.20.
With H
0
= E
0
/η we have
t = ?z
E
2
0
2η
2
μf
2
(t). (2.341)
As a numerical example, consider a high-altitude nuclear electromagnetic pulse (HEMP)
generated by the explosion of a large nuclear weapon in the upper atmosphere. Such
an explosion could generate a transient electromagnetic wave of short (sub-microsecond)
duration with an electric ?eld amplitude of 50,000 V/m in air [200]. Using (2.341),
we ?nd that the wave would exert a peak pressure of P =|t|=.011 Pa = 1.6 × 10
?6
lb/in
2
if re?ected from a perfect conductor at normal incidence. Obviously, even for this
extreme ?eld level the pressure produced by a transient electromagnetic wave is quite
small. However, from (2.340) we ?nd that the current induced in the conductor would
have a peak value of 133 A/m. Even a small portion of this current could destroy a
sensitive electronic circuit if it were to leak through an opening in the conductor. This is
an important concern for engineers designing circuitry to be used in high-?eld environ-
ments, and demonstrates why the concepts of current and voltage can often supersede
the concept of force in terms of importance.
Finally, let us see how the terms in the Poynting power balance theorem relate. Con-
sider a cubic region V bounded by the planes z = z
1
and z = z
2
, z
2
> z
1
. We choose
the ?eld waveform f(t) and locate the planes so that we can isolate either the forward
or backward traveling wave. Since there is no current in V, Poynting’s theorem (2.299)
becomes
1
2
?
?t
integraldisplay
V
(epsilon1E · E +μH · H)dV =?
contintegraldisplay
S
(E × H)· dS.
Consider the wave traveling in the ?z-direction. Substitution from (2.336) and (2.337)
gives the time-rate of change of stored energy as
S
cube
(t)=
1
2
?
?t
integraldisplay
V
bracketleftbig
epsilon1E
2
(z,t)+μH
2
(z,t)
bracketrightbig
dV
=
1
2
?
?t
integraldisplay
x
integraldisplay
y
dx dy
integraldisplay
z
2
z
1
bracketleftbigg
epsilon1
(vμ)
2
H
2
0
4
f
2
parenleftBig
t +
z
v
parenrightBig
+μ
H
2
0
4
f
2
parenleftBig
t +
z
v
parenrightBig
bracketrightbigg
dz
=
1
2
?
?t
μ
H
2
0
2
integraldisplay
x
integraldisplay
y
dx dy
integraldisplay
z
2
z
1
f
2
parenleftBig
t +
z
v
parenrightBig
dz.
Integration over x and y gives the area A of the cube face. Putting u = t + z/v we see
that
S = Aμ
H
2
0
4
?
?t
integraldisplay
t+z
2
/v
t+z
1
/v
f
2
(u)vdu.
Leibnitz’ rule for di?erentiation (A.30) then gives
S
cube
(t)= A
μvH
2
0
4
bracketleftBig
f
2
parenleftBig
t +
z
2
v
parenrightBig
? f
2
parenleftBig
t +
z
1
v
parenrightBigbracketrightBig
. (2.342)
Again substituting for E(t + z/v) and H(t + z/v) we can write
S
cube
(t)=?
contintegraldisplay
S
(E × H)· dS
=?
integraldisplay
x
integraldisplay
y
vμH
2
0
4
f
2
parenleftBig
t +
z
1
v
parenrightBig
(??p × ?q)·(??z)dx dy?
?
integraldisplay
x
integraldisplay
y
vμH
2
0
4
f
2
parenleftBig
t +
z
2
v
parenrightBig
(??p × ?q)·(?z)dx dy.
Figure 2.8: Propagation of a transient plane wave in a dissipative medium.
The second term represents the energy change in V produced by the backward traveling
wave entering the cube by passing through the plane at z = z
2
, while the ?rst term
represents the energy change in V produced by the wave exiting the cube by passing
through the plane z = z
1
. Contributions from the sides, top, and bottom are zero since
E × H is perpendicular to ?n over those surfaces. Since ?p × ?q = ?z,weget
S
cube
(t)= A
μvH
2
0
4
bracketleftBig
f
2
parenleftBig
t +
z
2
v
parenrightBig
? f
2
parenleftBig
t +
z
1
v
parenrightBigbracketrightBig
,
which matches (2.342) and thus veri?es Poynting’s theorem. We may interpret this result
as follows. The propagating electromagnetic disturbance carries energy through space.
The energy within any region is associated with the ?eld in that region, and can change
with time as the propagating wave carries a ?ux of energy across the boundary of the
region. The energy continues to propagate even if the source is changed or is extinguished
altogether. That is, the behavior of the leading edge of the disturbance is determined
by causality — it is a?ected by obstacles it encounters, but not by changes in the source
that occur after the leading edge has been established.
When propagating through a dissipative region a plane wave takes on a somewhat
di?erent character. Again applying the conditions (2.333) and (2.334), we obtain from
(2.991) the solution to the wave equation (2.332):
H(z,t)=
H
0
2
e
?
Omega1
v
z
f
parenleftBig
t ?
z
v
parenrightBig
+
H
0
2
e
Omega1
v
z
f
parenleftBig
t +
z
v
parenrightBig
?
?
zOmega1
2
H
0
2v
e
?Omega1t
integraldisplay
t+
z
v
t?
z
v
f(u)e
Omega1u
J
1
parenleftBig
Omega1
v
radicalbig
z
2
?(t ? u)
2
v
2
parenrightBig
Omega1
v
radicalbig
z
2
?(t ? u)
2
v
2
du (2.343)
where Omega1 = σ/2epsilon1. The ?rst two terms resemble those for the lossless case, modi?ed
by an exponential damping factor. This accounts for the loss in amplitude that must
accompanythetransferofenergyfromthepropagatingwavetojouleloss(heat)within
theconductingmedium. Theremainingtermappearsonlywhenthemediumislossy, and
results in an extension of the disturbance through the medium because of the currents
inducedbythepassingwavefront. This“wake”followstheleadingedgeofthedisturbance
asisshownclearlyinFigure2.8.HerewehaverepeatedthecalculationofFigure2.7,
but with σ = 2 × 10
?4
, approximating the conductivity of fresh water. As the wave
travels to the left it attenuates and leaves a trailing remnant behind. Upon reaching
the conductor it re?ects much as in the lossless case, resulting in a time dependence at
z = 0 given by the ?nite-duration rectangular pulse (2.338). In order for the pulse to
be of ?nite duration, the wake left by the re?ected pulse must exactly cancel the wake
associated with the incident pulse that continues to arrive after the re?ection. As the
re?ected pulse sweeps forward, the wake is obliterated everywhere behind.
If we were to verify the Poynting theorem for a dissipative medium (which we shall
not attempt because of the complexity of the computation), we would need to include
the E·J term. Here J is the induced conduction current and the integral of E·J accounts
for the joule loss within a region V balanced by the di?erence in Poynting energy ?ux
carried into and out of V.
Once we have the ?elds for a wave propagating along the z-direction, it is a simple
matter to generalize these results to any propagation direction. Assume that ?u is normal
to the surface of a plane over which the ?elds are invariant. Then u = ?u·r describes the
distance from the origin along the direction ?u. We need only replace z by ?u · r in any
of the expressions obtained above to determine the ?elds of a plane wave propagating in
the u-direction. We must also replace the orthogonality condition ?p × ?q = ?z with
?p × ?q = ?u.
For instance, the ?elds associated with a wave propagating through a lossless medium in
the positive u-direction are, from (2.336)–(2.337),
H(r,t)= ?q
H
0
2
f
parenleftbigg
t ?
?u · r
v
parenrightbigg
, E(r,t)= ?p
vμH
0
2
f
parenleftbigg
t ?
?u · r
v
parenrightbigg
.
2.10.7 Propagation of cylindrical waves in a lossless medium
Much as we envisioned a uniform plane wave arising from a uniform planar source, we
can imagine a uniform cylindrical wave arising from a uniform line source. Although this
line source must be in?nite in extent, uniform cylindrical waves (unlike plane waves) dis-
play the physical behavior of diverging from their source while carrying energy outwards
to in?nity.
A uniform cylindrical wave has ?elds that are invariant over a cylindrical surface:
E(r,t)= E(ρ,t), H(r,t)= H(ρ,t). For simplicity, we shall assume that waves propagate
in a homogeneous, isotropic, linear, and lossless medium described by permittivity epsilon1
and permeability μ. From Maxwell’s equations we ?nd that requiring the ?elds to be
independent of φ and z puts restrictions on the remaining vector components. Faraday’s
law states
?×E(ρ,t)=?
?
φ
?E
z
(ρ,t)
?ρ
+ ?z
1
ρ
?
?ρ
[ρE
φ
(ρ,t)] =?μ
?H(ρ,t)
?t
. (2.344)
Equating components we see that ?H
ρ
/?t = 0, and because our interest lies in wave
solutions we take H
ρ
= 0. Ampere’s law in a homogeneous lossless region free from
impressed sources states in a similar manner
?×H(ρ,t)=?
?
φ
?H
z
(ρ,t)
?ρ
+ ?z
1
ρ
?
?ρ
[ρH
φ
(ρ,t)] =epsilon1
?E(ρ,t)
?t
. (2.345)
Equating components we ?nd that E
ρ
= 0. Since E
ρ
= H
ρ
= 0, both E and H are
perpendicular to the ρ-direction. Note that if there is only a z-component of E then
there is only a φ-component of H. This case, termed electric polarization, results in
?E
z
(ρ,t)
?ρ
=μ
?H
φ
(ρ,t)
?t
.
Similarly, if there is only a z-component of H then there is only a φ-component of E.
This case, termed magnetic polarization, results in
?
?H
z
(ρ,t)
?ρ
=epsilon1
?E
φ
(ρ,t)
?t
.
Since E =
?
φE
φ
+ ?zE
z
and H =
?
φH
φ
+ ?zH
z
, we can always decompose a cylindrical
electromagnetic wave into cases of electric and magnetic polarization. In each case the
resulting ?eld is TEM
ρ
since the vectors E,H, ?ρ are mutually orthogonal.
Wave equations for E
z
in the electric polarization case and for H
z
in the magnetic
polarization case can be found in the usual manner. Taking the curl of (2.344) and
substituting from (2.345) we ?nd
?×(?×E)=??z
1
ρ
?
?ρ
parenleftbigg
ρ
?E
z
?ρ
parenrightbigg
?
?
φ
?
?ρ
parenleftbigg
1
ρ
?
?ρ
[ρE
φ
]
parenrightbigg
=?
1
v
2
?
2
E
?t
2
=?
1
v
2
bracketleftbigg
?z
?
2
E
z
?t
2
+
?
φ
?
2
E
φ
?t
2
bracketrightbigg
where v = 1/(μepsilon1)
1/2
. Noting that E
φ
= 0 for the electric polarization case we obtain the
wave equation for E
z
. A similar set of steps beginning with the curl of (2.345) gives an
identical equation for H
z
.Thus
1
ρ
?
?ρ
parenleftbigg
ρ
?
?ρ
bracketleftbigg
E
z
H
z
bracketrightbiggparenrightbigg
?
1
v
2
?
2
?t
2
bracketleftbigg
E
z
H
z
bracketrightbigg
= 0. (2.346)
We can obtain a solution for (2.346) in much the same way as we do for the wave
equations in § A.1. We begin by substituting for E
z
(ρ,t)in terms of its temporal Fourier
representation
E
z
(ρ,t)=
1
2π
integraldisplay
∞
?∞
?
E
z
(ρ,ω)e
jωt
dω
to obtain
1
2π
integraldisplay
∞
?∞
bracketleftbigg
1
ρ
?
?ρ
parenleftbigg
ρ
?
?ρ
?
E
z
(ρ,ω)
parenrightbigg
+
ω
2
v
2
?
E
z
(ρ,ω)
bracketrightbigg
e
jωt
dω= 0.
The Fourier integral theorem implies that the integrand is zero. Then, expanding out
the ρ derivatives, we ?nd that
?
E
z
(ρ,ω) obeys the ordinary di?erential equation
d
2
?
E
z
dρ
2
+
1
ρ
d
?
E
z
dρ
+ k
2
?
E
z
= 0
where k =ω/v. This is merely Bessel’s di?erential equation (A.124). It is a second-order
equation with two independent solutions chosen from the list
J
0
(kρ), Y
0
(kρ), H
(1)
0
(kρ), H
(2)
0
(kρ).
We ?nd that J
0
(kρ)and Y
0
(kρ)are useful for describing standing waves between bound-
aries while H
(1)
0
(kρ) and H
(2)
0
(kρ) are useful for describing waves propagating in the
ρ-direction. Of these, H
(1)
0
(kρ) represents waves traveling inward while H
(2)
0
(kρ) repre-
sents waves traveling outward. Concentrating on the outward traveling wave we ?nd
that
?
E
z
(ρ,ω)=
?
A(ω)
bracketleftBig
?j
π
2
H
(2)
0
(kρ)
bracketrightBig
=
?
A(ω)?g(ρ,ω).
Here A(t) ?
?
A(ω) is the disturbance waveform, assumed to be a real, causal function.
To make E
z
(ρ,t) real we require that the inverse transform of ?g(ρ,ω) be real. This
requires the inclusion of the ?jπ/2 factor in ?g(ρ,ω). Inverting we have
E
z
(ρ,t)= A(t)? g(ρ,t) (2.347)
where g(ρ,t) ? (?jπ/2)H
(2)
0
(kρ).
The inverse transform needed to obtain g(ρ,t) may be found in Campbell [26]:
g(ρ,t)=
F
?1
braceleftBig
?j
π
2
H
(2)
0
parenleftBig
ω
ρ
v
parenrightBigbracerightBig
=
U
parenleftbig
t ?
ρ
v
parenrightbig
radicalBig
t
2
?
ρ
2
v
2
,
where U(t) is the unit step function de?ned in (A.5). Substituting this into (2.347) and
writing the convolution in integral form we have
E
z
(ρ,t)=
integraldisplay
∞
?∞
A(t ? t
prime
)
U(t
prime
?ρ/v)
radicalbig
t
prime2
?ρ
2
/v
2
dt
prime
.
The change of variable x = t
prime
?ρ/v then gives
E
z
(ρ,t)=
integraldisplay
∞
0
A(t ? x ?ρ/v)
radicalbig
x
2
+ 2xρ/v
dx. (2.348)
Those interested in the details of the inverse transform should see Chew [33].
As an example, consider a lossless medium with μ
r
= 1, epsilon1
r
= 81, and a waveform
A(t)= E
0
[U(t)? U(t ?τ)]
where τ = 2 μs. This situation is the same as that in the plane wave example above,
except that the pulse waveform begins at t = 0. Substituting for A(t) into (2.348) and
using the integral
integraldisplay
dx
√
x
√
x + a
= 2ln
bracketleftbig√
x +
√
x + a
bracketrightbig
Figure 2.9: Propagation of a transient cylindrical wave in a lossless medium.
we can write the electric ?eld in closed form as
E
z
(ρ,t)= 2E
0
ln
bracketleftbigg√
x
2
+
√
x
2
+ 2ρ/v
√
x
1
+
√
x
1
+ 2ρ/v
bracketrightbigg
, (2.349)
where x
2
= max[0,t ?ρ/v] and x
1
= max[0,t ?ρ/v?τ]. The ?eld is plotted in Figure
2.9forvariousvaluesoftime.Notethattheleadingedgeofthedisturbancepropagates
outward at a velocityv and a wake trails behind the disturbance. This wake is similar to
that for a plane wave in a dissipative medium, but it exists in this case even though the
medium is lossless. We can think of the wave as being created by a line source of in?nite
extent, pulsed by the disturbance waveform. Although current changes simultaneously
everywhere along the line, it takes the disturbance longer to propagate to an observation
point in the z = 0 plane from source points z negationslash= 0 than from the source point at z = 0.
Thus, the ?eld at an arbitrary observation pointρ arrives from di?erent source points at
di?erenttimes.IfwelookatFigure2.9wenotethatthereisalwaysanonzero?eldnear
ρ = 0 (or any value ofρ<vt) regardless of the time, since at any given t the disturbance
is arriving from some point along the line source.
WealsoseeinFigure2.9thatasρbecomeslargethepeakvalueofthepropagating
disturbanceapproachesacertainvalue. Thisvalueoccursat t
m
=ρ/v+τ or, equivalently,
ρ
m
=v(t ?τ). If we substitute this value into (2.349) we ?nd that
E
z
(ρ,t
m
)= 2E
0
ln
bracketleftbiggradicalbigg
τ
2ρ/v
+
radicalbigg
1 +
τ
2ρ/v
bracketrightbigg
.
For large values of ρ/v,
E
z
(ρ,t
m
)≈ 2E
0
ln
bracketleftbigg
1 +
radicalbigg
τ
2ρ/v
bracketrightbigg
.
Using ln(1 + x)≈ x when x lessmuch 1, we ?nd that
E
z
(ρ,t
m
)≈ E
0
radicalBigg
2τv
ρ
.
Thus, asρ →∞we have E×H ~ 1/ρ andthe?uxofenergypassingthroughacylindrical
surface of areaρ dφdz is independent ofρ. This result is similar to that seen for spherical
waves where E × H ~ 1/r
2
.
2.10.8 Propagation of spherical waves in a lossless medium
In the previous section we found solutions that describe uniform cylindrical waves
dependent only on the radial variable ρ. It turns out that similar solutions are not
possible in spherical coordinates; ?elds that only depend on r cannot satisfy Maxwell’s
equations since, as shown in § 2.10.9, a source having the appropriate symmetry for the
productionofuniformsphericalwaves in factproducesno?eldatallexternaltotheregion
it occupies. As we shall see in Chapter 5, the ?elds produced by localized sources are in
general quite complex. However, certain solutions that are only slightly nonuniform may
be found, and these allow us to investigate the most important properties of spherical
waves. We shall ?nd that spherical waves diverge from a localized point source and
expand outward with ?nite velocity, carrying energy away from the source.
Consider a homogeneous, lossless, source-free region of space characterized by permit-
tivity epsilon1 and permeability μ. We seek solutions to the wave equation that are TEM
r
in
spherical coordinates (H
r
= E
r
= 0), and independent of the azimuthal angle φ.Thus
we may write
E(r,t)=
?
θE
θ
(r,θ,t)+
?
φE
φ
(r,θ,t),
H(r,t)=
?
θH
θ
(r,θ,t)+
?
φH
φ
(r,θ,t).
Maxwell’s equations show that not all of these vector components are required. Faraday’s
law states that
?×E(r,θ,t)= ?r
1
r sinθ
?
?θ
[sinθE
φ
(r,θ,t)] ?
?
θ
1
r
?
?r
[rE
φ
(r,θ,t)] +
?
φ
1
r
?
?r
[rE
θ
(r,θ,t)]
=?μ
?H(r,θ,t)
?t
. (2.350)
Since we require H
r
= 0 we must have
?
?θ
[sinθE
φ
(r,θ,t)] = 0.
This implies that either E
φ
~ 1/sinθ or E
φ
= 0. We shall choose E
φ
= 0 and investigate
whether the resulting ?elds satisfy the remaining Maxwell equations.
In a source-free region of space we have ?·D =epsilon1?·E = 0. Since we now have only a
θ-component of the electric ?eld, this requires
1
r
?
?θ
E
θ
(r,θ,t)+
cotθ
r
E
θ
(r,θ,t)= 0.
From this we see that when E
φ
= 0 the component E
θ
must obey
E
θ
(r,θ,t)=
f
E
(r,t)
sinθ
.
By (2.350) there is only a φ-component of magnetic ?eld, and it must obey H
φ
(r,θ,t)=
f
H
(r,t)/sinθ where
?μ
?
?t
f
H
(r,t)=
1
r
?
?r
[rf
E
(r,t)]. (2.351)
Thus the spherical wave has the property E ⊥ H ⊥ r, and is TEM to the r-direction.
We can obtain a wave equation for E
θ
by taking the curl of (2.350) and substituting
from Ampere’s law:
?×(?×E)=?
?
θ
1
r
?
2
?r
2
[rE
θ
] =?×
bracketleftbigg
?μ
?
?t
H
bracketrightbigg
=?μ
?
?t
bracketleftbigg
σE +epsilon1
?
?t
E
bracketrightbigg
.
This gives
?
2
?r
2
[rf
E
(r,t)] ?μσ
?
?t
[rf
E
(r,t)] ?μepsilon1
?
2
?t
2
[rf
E
(r,t)] = 0, (2.352)
which is the desired wave equation for E. Proceeding similarly we ?nd that H
φ
obeys
?
2
?r
2
[rf
H
(r,t)] ?μσ
?
?t
[rf
H
(r,t)] ?μepsilon1
?
2
?t
2
[rf
H
(r,t)] = 0. (2.353)
We see that the wave equation for rf
E
is identical to that for the plane wave ?eld E
z
(2.331). Thus, we can use the solution obtained in § A.1, as we did with the plane wave,
with a few subtle di?erences. First, we cannot have r < 0. Second, we do not anticipate
a solution representing a wave traveling in the ?r-direction — i.e., a wave converging
toward the origin. (In other situations we might need such a solution in order to form a
standing wave between two spherical boundary surfaces, but here we are only interested
in the basic propagating behavior of spherical waves.) Thus, we choose as our solution
the term (A.45) and ?nd for a lossless medium where Omega1= 0
E
θ
(r,θ,t)=
1
r sinθ
A
parenleftBig
t ?
r
v
parenrightBig
. (2.354)
From (2.351) we see that
H
φ
=
1
μv
1
r sinθ
A
parenleftBig
t ?
r
v
parenrightBig
. (2.355)
Since μv =(μ/epsilon1)
1/2
=η, we can also write this as
H =
?r × E
η
.
We note that our solution is not appropriate for unbounded space since the ?elds have
a singularity at θ = 0. Thus we must exclude the z-axis. This can be accomplished
by using PEC cones of angles θ
1
and θ
2
, θ
2
>θ
1
. Because the electric ?eld E =
?
θE
θ
is
normal to these cones, the boundary condition that tangential E vanishes is satis?ed.
It is informative to see how the terms in the Poynting power balance theorem relate for
a spherical wave. Consider the region between the spherical surfaces r = r
1
and r = r
2
,
r
2
> r
1
. Since there is no current within the volume region, Poynting’s theorem (2.299)
becomes
1
2
?
?t
integraldisplay
V
(epsilon1E · E +μH · H)dV =?
contintegraldisplay
S
(E × H)· dS. (2.356)
From (2.354) and (2.355), the time-rate of change of stored energy is
P
sphere
(t)=
1
2
?
?t
integraldisplay
V
[epsilon1E
2
(r,θ,t)+μH
2
(r,θ,t)] dV
=
1
2
?
?t
integraldisplay
2π
0
dφ
integraldisplay
θ
2
θ
1
dθ
sinθ
integraldisplay
r
2
r
1
bracketleftbigg
epsilon1
1
r
2
A
2
parenleftBig
t ?
r
v
parenrightBig
+μ
1
r
2
1
(vμ)
2
A
2
parenleftBig
t ?
r
v
parenrightBig
bracketrightbigg
r
2
dr
= 2πepsilon1F
?
?t
integraldisplay
r
2
r
1
A
2
parenleftBig
t ?
r
v
parenrightBig
dr
where
F = ln
bracketleftbigg
tan(θ
2
/2)
tan(θ
1
/2)
bracketrightbigg
.
Putting u = t ?r/v we see that
P
sphere
(t)=?2πepsilon1F
?
?t
integraldisplay
t?r
2
/v
t?r
1
/v
A
2
(u)vdu.
An application of Leibnitz’ rule for di?erentiation (A.30) gives
P
sphere
(t)=?
2π
η
F
bracketleftBig
A
2
parenleftBig
t ?
r
2
v
parenrightBig
? A
2
parenleftBig
t ?
r
1
v
parenrightBigbracketrightBig
. (2.357)
Next we ?nd the Poynting ?ux term:
P
sphere
(t)=?
contintegraldisplay
S
(E × H)· dS
=?
integraldisplay
2π
0
dφ
integraldisplay
θ
2
θ
1
bracketleftbigg
1
r
1
A
parenleftBig
t ?
r
1
v
parenrightBig
?
θ
bracketrightbigg
×
bracketleftbigg
1
r
1
1
μv
A
parenleftBig
t ?
r
1
v
parenrightBig
?
φ
bracketrightbigg
·(??r)r
2
1
dθ
sinθ
?
?
integraldisplay
2π
0
dφ
integraldisplay
θ
2
θ
1
bracketleftbigg
1
r
2
A
parenleftBig
t ?
r
2
v
parenrightBig
?
θ
bracketrightbigg
×
bracketleftbigg
1
r
2
1
μv
A
parenleftBig
t ?
r
2
v
parenrightBig
?
φ
bracketrightbigg
· ?rr
2
2
dθ
sinθ
.
The ?rst term represents the power carried by the traveling wave into the volume region
by passing through the spherical surface at r = r
1
, while the second term represents
the power carried by the wave out of the region by passing through the surface r = r
2
.
Integration gives
P
sphere
(t)=?
2π
η
F
bracketleftBig
A
2
parenleftBig
t ?
r
2
v
parenrightBig
? A
2
parenleftBig
t ?
r
1
v
parenrightBigbracketrightBig
, (2.358)
which matches (2.357), thus verifying Poynting’s theorem.
It is also interesting to compute the total energy passing through a surface of radius
r
0
. From (2.358) we see that the ?ux of energy (power density) passing outward through
the surface r = r
0
is
P
sphere
(t)=
2π
η
FA
2
parenleftBig
t ?
r
0
v
parenrightBig
.
The total energy associated with this ?ux can be computed by integrating over all time:
we have
E =
2π
η
F
integraldisplay
∞
?∞
A
2
parenleftBig
t ?
r
0
v
parenrightBig
dt =
2π
η
F
integraldisplay
∞
?∞
A
2
(u)du
after making the substitution u = t ?r
0
/v. The total energy passing through a spherical
surface is independent of the radius of the sphere. This is an important property of
spherical waves. The 1/r dependence of the electric and magnetic ?elds produces a
power density that decays with distance in precisely the right proportion to compensate
for the r
2
-type increase in the surface area through which the power ?ux passes.
2.10.9 Nonradiating sources
Not all time-dependent sources produce electromagnetic waves. In fact, certain local-
ized source distributions produce no ?elds external to the region containing the sources.
Such distributions are said to be nonradiating, and the ?elds they produce (within their
source regions) lack wave characteristics.
Let us consider a speci?c example involving two concentric spheres. The inner sphere,
carrying a uniformly distributed total charge ?Q, is rigid and has a ?xed radius a; the
outer sphere, carrying uniform charge +Q, is a ?exible balloon that can be stretched to
any radius b = b(t). The two surfaces are initially stationary, some external force being
required to hold them in place. Now suppose we apply a time-varying force that results
in b(t) changing from b(t
1
) = b
1
to b(t
2
) = b
2
> b
1
. This creates a radially directed
time-varying current ?rJ
r
(r,t). By symmetry J
r
depends only on r and produces a ?eld
E that depends only on r and is directed radially. An application of Gauss’s law over a
sphere of radius r
0
> b
2
, which contains zero total charge, gives
4πr
2
0
E
r
(r
0
,t)= 0,
hence E(r,t)= 0 for r > r
0
and all time t.SoE = 0 external to the current distribution
and no outward traveling wave is produced. Gauss’s law also shows that E = 0 inside
the rigid sphere, while between the spheres
E(r,t)=??r
Q
4πepsilon1
0
r
2
.
Now work is certainly required to stretch the balloon and overcome the Lorentz force
between the two charged surfaces. But an application of Poynting’s theorem over a
surfaceenclosingbothspheresshowsthatnoenergyiscarriedawaybyanelectromagnetic
wave. Where does the expended energy go? The presence of only two nonzero terms in
Poynting’s theorem clearly indicates that the power term
integraltext
V
E · J dV corresponding to
the external work must be balanced exactly by a change in stored energy. As the radius
of the balloon increases, so does the region of nonzero ?eld as well as the stored energy.
In free space any current source expressible in the form
J(r,t)=?
parenleftbigg
?ψ(r,t)
?t
parenrightbigg
(2.359)
and localized to a volume region V, such as the current in the example above, is nonra-
diating. Indeed, Ampere’s law states that
?×H =epsilon1
0
?E
?t
+?
parenleftbigg
?ψ(r,t)
?t
parenrightbigg
(2.360)
for r ∈ V; taking the curl we have
?×(?×H)=epsilon1
0
??×E
?t
+?×?
parenleftbigg
?ψ(r,t)
?t
parenrightbigg
.
But the second term on the right is zero, so
?×(?×H)=epsilon1
0
??×E
?t
and this equation holds for all r. By Faraday’s law we can rewrite it as
parenleftbigg
(?×?×) +
1
c
2
?
2
?t
2
parenrightbigg
H(r,t)= 0.
So H obeysthehomogeneouswave equationeverywhere,and H = 0 followsfromcausality.
The laws of Ampere and Faraday may also be combined with (2.359) to show that
parenleftbigg
(?×?×) +
1
c
2
?
2
?t
2
parenrightbiggbracketleftbigg
E(r,t)+
1
epsilon1
0
?ψ(r,t)
bracketrightbigg
= 0
for all r. By causality
E(r,t)=?
1
epsilon1
0
?ψ(r,t) (2.361)
everywhere. But since ψ(r,t) = 0 external to V, we must also have E = 0 there.
Note that E =??ψ/epsilon1
0
is consistent with Ampere’s law (2.360) provided that H = 0
everywhere.
We see that sources having spherical symmetry such that
J(r,t)= ?rJ
r
(r,t)=?
parenleftbigg
?ψ(r,t)
?t
parenrightbigg
= ?r
?
2
ψ(r,t)
?r?t
obey(2.359)andarethereforenonradiating. Hencethe?eldsassociatedwithanyoutward
traveling spherical wave must possess some angular variation. This holds, for example,
for the ?elds far removed from a time-varying source of ?nite extent.
As pointed out by Lindell [113], nonradiating sources are not merely hypothetical.
The out?owing currents produced by a highly symmetric nuclear explosion in outer
space or in a homogeneous atmosphere would produce no electromagnetic ?eld outside
the source region. The large electromagnetic-pulse e?ects discussed in § 2.10.6 are due
to inhomogeneities in the earth’s atmosphere. We also note that the ?elds produced
by a radiating source J
r
(r,t) do not change external to the source if we superpose a
nonradiating component J
nr
(r,t) to create a new source J = J
nr
+ J
r
. We say that the
two sources J and J
r
are equivalent for the region V external to the sources. This presents
di?culties in remote sensing where investigators are often interested in reconstructing an
unknown source by probing the ?elds external to (and usually far away from) the source
region. Unique reconstruction is possible only if the ?elds within the source region are
also measured.
For the time harmonic case, Devaney and Wolf [54] provide the most general possible
form for a nonradiating source. See § 4.11.9 for details.
2.11 Problems
2.1 Consider the constitutive equations (2.16)–(2.17) relating E, D, B, and H in a
bianisotropic medium. Using the de?nition for P and M, show that the constitutive
equations relating E, B, P, and M are
P =
parenleftbigg
1
c
ˉ
P ?epsilon1
0
ˉ
I
parenrightbigg
· E +
ˉ
L · B,
M =?
ˉ
M · E ?
parenleftbigg
c
ˉ
Q ?
1
μ
0
ˉ
I
parenrightbigg
· B.
Also ?nd the constitutive equations relating E, H, P, and M.
2.2 Consider Ampere’s law and Gauss’s law written in terms of rectangular compo-
nents in the laboratory frame of reference. Assume that an inertial frame moves with
velocity v = ?xv with respect to the laboratory frame. Using the Lorentz transformation
given by (2.73)–(2.76), show that
cD
prime
⊥
=γ(cD
⊥
+β× H
⊥
),
H
prime
⊥
=γ(H
⊥
?β× cD
⊥
),
J
prime
bardbl
=γ(J
bardbl
?ρv),
J
prime
⊥
= J
⊥
,
cρ
prime
=γ(cρ?β· J),
where “⊥” means perpendicular to the direction of the velocity and “bardbl” means parallel
to the direction of the velocity.
2.3 Show that the following quantities are invariant under Lorentz transformation:
(a) E · B,
(b) H · D,
(c) B · B ? E · E/c
2
,
(d) H · H ? c
2
D · D,
(e) B · H ? E · D,
(f) cB · D + E · H/c.
2.4 Show that if c
2
B
2
> E
2
holds in one reference frame, then it holds in all other
reference frames. Repeat for the inequality c
2
B
2
< E
2
.
2.5 Show that if E·B = 0 and c
2
B
2
> E
2
holds in one reference frame, then a reference
frame may be found such that E = 0. Show that if E·B = 0 and c
2
B
2
< E
2
holds in one
reference frame, then a reference frame may be found such that B = 0.
2.6 A test charge Q at rest in the laboratory frame experiences a force F = QE as
measured by an observer in the laboratory frame. An observer in an inertial frame
measures a force on the charge given by F
prime
= QE
prime
+ Qv×B
prime
. Show that F negationslash= F
prime
and ?nd
the formula for converting between F and F
prime
.
2.7 Consider a material moving with velocity v with respect to the laboratory frame of
reference. When the ?elds are measured in the moving frame, the material is found to be
isotropic with D
prime
=epsilon1
prime
E
prime
and B
prime
=μ
prime
H
prime
. Show that the ?elds measured in the laboratory
frame are given by (2.107) and (2.108), indicating that the material is bianisotropic when
measured in the laboratory frame.
2.8 Show that by assuming v
2
/c
2
lessmuch 1 in (2.61)–(2.64) we may obtain (2.111).
2.9 Derive the following expressions that allow us to convert the value of the magneti-
zation measured in the laboratory frame of reference to the value measured in a moving
frame:
M
prime
⊥
=γ(M
⊥
+β× cP
⊥
), M
prime
bardbl
= M
bardbl
.
2.10 Beginning with the expressions (2.61)–(2.64) for the ?eld conversions under a
?rst-order Lorentz transformation, show that
P
prime
= P ?
v × M
c
2
, M
prime
= M + v × P.
2.11 Considerasimpleisotropicmaterialmovingthroughspacewithvelocity v relative
to the laboratory frame. The relative permittivity and permeability of the material
measured in the moving frame are epsilon1
prime
r
and μ
prime
r
, respectively. Show that the magnetization
as measured in the laboratory frame is related to the laboratory frame electric ?eld and
magnetic ?ux density as
M =
χ
prime
m
μ
0
μ
prime
r
B ?epsilon1
0
parenleftbigg
χ
prime
e
+
χ
prime
m
μ
prime
r
parenrightbigg
v × E
when a ?rst-order Lorentz transformation is used. Here χ
prime
e
=epsilon1
prime
r
? 1 and χ
prime
m
=μ
prime
r
? 1.
2.12 Considerasimpleisotropicmaterialmovingthroughspacewithvelocity v relative
to the laboratory frame. The relative permittivity and permeability of the material
measured in the moving frame are epsilon1
prime
r
and μ
prime
r
, respectively. Derive the formulas for the
magnetization and polarization in the laboratory frame in terms of E and B measured in
the laboratory frame by using the Lorentz transformations (2.128) and (2.129)–(2.132).
Show that these expressions reduce to (2.139) and (2.140) under the assumption of a
?rst-order Lorentz transformation (v
2
/c
2
lessmuch 1).
2.13 Derive the kinematic form of the large-scale Maxwell–Bo? equations (2.165) and
(2.166). Derive the alternative form of the large-scale Maxwell–Bo? equations (2.167)
and (2.168).
2.14 Modify the kinematic form of the Maxwell–Bo? equations (2.165)–(2.166) to
account for the presence of magnetic sources. Repeat for the alternative forms (2.167)–
(2.168).
2.15 Consider a thin magnetic source distribution concentrated near a surface S. The
magnetic charge and current densities are given by
ρ
m
(r,x,t)=ρ
ms
(r,t)f(x,Delta1), J
m
(r,x,t)= J
ms
(r,t)f(x,Delta1),
where f(x,Delta1)satis?es
integraldisplay
∞
?∞
f(x,Delta1)dx = 1.
Let Delta1→ 0 and derive the boundary conditions on (E,D,B,H) across S.
2.16 Beginning with the kinematic forms of Maxwell’s equations (2.177)–(2.178), de-
rive the boundary conditions for a moving surface
?n
12
×(H
1
? H
2
)+(?n
12
· v)(D
1
? D
2
)= J
s
,
?n
12
×(E
1
? E
2
)?(?n
12
· v)(B
1
? B
2
)=?J
ms
.
2.17 Beginning with Maxwell’s equations and the constitutive relationships for a bian-
isotropic medium (2.19)–(2.20), derive the wave equation for H (2.313). Specialize the
result for the case of an anisotropic medium.
2.18 Consider an isotropic but inhomogeneous material, so that
D(r,t)=epsilon1(r)E(r,t), B(r,t)=μ(r)H(r,t).
Show that the wave equations for the ?elds within this material may be written as
?
2
E ?μepsilon1
?
2
E
?t
2
+?
bracketleftbigg
E ·
parenleftbigg
?epsilon1
epsilon1
parenrightbiggbracketrightbigg
?(?×E)×
parenleftbigg
?μ
μ
parenrightbigg
=μ
?J
?t
+?
parenleftBig
ρ
epsilon1
parenrightBig
,
?
2
H ?μepsilon1
?
2
H
?t
2
+?
bracketleftbigg
H ·
parenleftbigg
?μ
μ
parenrightbiggbracketrightbigg
?(?×H)×
parenleftbigg
?epsilon1
epsilon1
parenrightbigg
=??×J ? J ×
parenleftbigg
?epsilon1
epsilon1
parenrightbigg
.
2.19 Consider a homogeneous, isotropic material in which D =epsilon1E and B =μH. Using
the de?nitions of the equivalent sources, show that the wave equations (2.322)–(2.323)
are equivalent to (2.314)–(2.315).
2.20 When we calculate the force on a conductor produced by an incident plane wave,
we often neglect the momentum term
?
?t
(D × B).
Compute this term for the plane wave ?eld (2.336) in free space at the surface of the
conductor and compare to the term obtained from the Maxwell stress tensor (2.341).
What is the relative di?erence in amplitude?
2.21 When a material is only slightly conducting, and thus Omega1 is very small, we often
neglect the third term in the plane wave solution (2.343). Reproduce the plot of Figure
2.8withthistermomittedandcompare.Discusshowtheomittedterma?ectstheshape
of the propagating waveform.
2.22 A total charge Q is evenly distributed over a spherical surface. The surface
expands outward at constant velocity so that the radius of the surface is b =vt at time
t. (a) Use Gauss’s law to ?nd E everywhere as a function of time. (b) Show that E may
be found from a potential function
ψ(r,t)=
Q
4πr
(r ?vt)U(r ?vt)
according to (2.361). Here U(t) is the unit step function. (c) Write down the form of
J for the expanding sphere and show that since it may be found from (2.359) it is a
nonradiating source.