Supersymmetry: Herbi K. Dreiner Howard E. Haber Stephen P. Martin September 22, 2004

Supersymmetry
Herbi K. Dreiner Howard E. Haber Stephen P. Martin

September 22, 2004
Contents
Part 1: Fermions in Quantum Field Theory and the
Standard Model 7
1 Two-component formalism for Spin-1/2 Fermions 9
1.1 The Lorentz group and its Lie algebra 9
1.2 The Poincare group and its Lie algebra 12
1.3 Spin-1/2 representation of the Lorentz group 14
1.4 Bilinear covariants of two-component spinors 17
1.5 Lagrangians for free spin-1/2 fermions 23
1.6 The fermion mass-matrix and its diagonalization 26
1.7 Discrete spacetime and internal symmetries 29
1.8 Parity transformation of two-component spinors 32
1.9 Time-reversal transformation of two-component spinors 36
1.10 Charge conjugation of two-component spinors 40
1.11 CP and CPT conjugation of two-component spinors 41
References 48
2 Feynman Rules for Fermions 49
2.1 Fermion creation and annihilation operators 49
2.2 Properties of the two-component spinor wave functions 50
2.3 Charged two-component fermion elds 54
2.4 Feynman rules for external two-component fermion lines 56
2.5 Feynman rules for two-component fermion propagators 57
2.6 Feynman rules for two-component fermion interactions 61
2.7 General structure and rules for Feynman graphs 65
2.8 Simple examples of Feynman diagrams and amplitudes 67
2.8.1 Tree-level decays 67
2.8.2 Tree-level scattering processes 69
2
Contents 3
2.9 Conventions for fermion and anti-fermion names and elds 76
3 From Two-Component to Four-Component Spinors 83
3.1 Four-component spinors 84
3.2 Lagrangians for free four-component fermions 91
3.3 Properties of the four-component spinor wave functions 92
3.4 Feynman rules for four-component Majorana fermions 94
3.5 Simple examples of Feynman diagrams revisited 99
References 106
4 Gauge Theories and the Standard Model 107
4.1 Abelian Gauge eld theory 107
4.2 Non-abelian gauge groups and their Lie algebras 109
4.3 Non-abelian gauge eld theory 112
4.4 Feynman rules for Gauge theories 116
4.5 Spontaneously broken gauge theories 119
4.5.1 Goldstones theorem 119
4.5.2 Massive gauge bosons 120
4.5.3 The unitary and R
gauges 123
4.5.4 The physical Higgs bosons 125
4.6 Complex representations of scalar elds 127
4.7 The Standard Model of particle physics 127
4.8 Parameter count of the Standard Model 127
4.9 Grand Unication and Running Couplings 128
Part 2: Constructing Supersymmetric Theories 129
5 Introduction to Supersymmetry 131
5.1 Motivation: the Hierarchy Problem 131
5.2 Enter supersymmetry 134
5.3 Historical analogies 145
6 Supersymmetric Lagrangians 146
6.1 A free chiral supermultiplet 146
6.2 Interactions of chiral supermultiplets 152
6.3 Supersymmetric Gauge Theories 155
6.4 Gauge interactions for chiral supermultiplets 158
6.5 Summary: How to build a supersymmetric model 160
6.6 Soft supersymmetry-breaking interactions 163
7 Superelds 166
4 Contents
Part 3: Realistic Supersymmetric Models 167
8 The Minimal Supersymmetric Standard Model 169
8.1 The superpotential and supersymmetric interactions 169
8.2 R-parity (also known as matter parity) and its consequences 174
8.3 Soft supersymmetry breaking in the MSSM 178
8.4 Hints of an Organizing Principle 179
8.5 Electroweak symmetry breaking and the Higgs bosons 184
8.6 Neutralinos and charginos 191
8.7 The gluino 195
8.8 Squarks and sleptons 196
8.9 Summary: the MSSM sparticle spectrum 202
9 Origins of supersymmetry breaking 206
9.1 General considerations for supersymmetry breaking 206
9.2 The goldstino and the gravitino 212
9.3 Gravity-mediated supersymmetry breaking models 217
9.4 Gauge-mediated supersymmetry breaking models 219
10 From model assumptions to the low-energy MSSM 227
10.1 Renormalization Group Equations 227
10.2 The Eective Potential 233
10.3 Radiative Corrections to Particle Masses 233
11 R-parity violation 234
Part 4: Advanced Topics 235
12 Origin of the term 237
12.1 The next-to-minimal supersymmetric standard model 237
12.2 Non-renormalizable operators and the term 238
Part 5: The Appendices 239
Appendix A: Notation and Conventions 241
A.1 Matrix notation and the summation convention 241
A.2 Complex conjugation and the avor index 241
A.3 Spacetime notation 242
A.4 Two-component spinor notation 243
A.5 Four-component spinors and the Dirac matrices 245
Contents 5
Appendix B: Compendium of Useful Relations for Two-
Component Notation 250
B.1 Sigma matrices and associated identities 250
B.2 Behavior of 2-component fermion bilinears under C, P, T 253
References 258
Index 269
6 Contents
Part 1
Fermions in Quantum Field Theory and
the Standard Model
1
Two-component formalism for Spin-1/2
Fermions
In this chapter, we examine the incorporation of spin-1/2 fermions into
quantum eld theory. Underlying the relativistic theory of quantized
elds is special relativity and the invariance of the Lagrangian under
the Poincare group, which consists of the Lorentz group plus space-time
translations. Representations of the Poincare group correspond to particle
states of denite mass and spin.
1.1 The Lorentz group and its Lie algebra
Under Lorentz transformations,
, the spacetime coordinates, x
,
transform as x
.
1
The condition that g
is invariant
under Lorentz transformations implies that
= g
. (1.1)
We can also dene
which governs the transformation of a covariant

four-vector: x
. One then easily derives:
= g
.
Using eq. (1.1), it follows that
= g
. The set of all possible

forms a Lie group O(3,1).
Eq. (1.1) implies that possesses the following two properties:
(i) det = 1 and (ii) [
0
0
[ 1. Thus, Lorentz transformations
fall into four disconnected classes, which can be denoted by a pair
of signs:
_
sgn(det ) , sgn(
0
0
)
_
. The proper orthochronous Lorentz
transformations correspond to (+, +) and are continuously connected to
the identity. The group of such transformations forms a subgroup of
O(3,1), which we shall denote by SO
+
(3,1).
2
If SO
+
(3, 1), we can
1
Our conventions for spacetime notation can be found in Appendix A.
2
In this notation, the S (which stands for special) corresponds to the condition
det = 1 and the subscript + corresponds to
0
0 +1.
9
10 1 Two-component formalism for Spin-1/2 Fermions
generate all elements of the other classes of Lorentz transformations by
introducing the space-inversion matrix,
P
= diag(1, 1, 1, 1), the
time-inversion matrix
T
= diag(1, 1, 1, 1) and the spacetime inversion
matrix
P
T
= I
4
. Then, ,
P
,
T
,
P
T
spans the full
Lorentz group.
Innitesimal Lorentz transformations must be proper and or-
thochronous since these are continuously connected to the identity. These
transformations are best studied by exploring the properties of the
SO(3,1) Lie algebra. Later (see section 1.7), we shall examine the
implications of the improper Lorentz transformationsspace-inversion
and time-inversion.
The most general proper orthochronous Lorentz transformation,
corresponding to a rotation by an angle about an axis n [
n]
and a boost vector

v tanh
1
[where v v/[v[ and [v[/c],
3
is a
4 4 matrix given by:
= exp
_
i
s i
k
_
= exp
_
i
2
_
, (1.2)
where
i
1
2
ijk
jk
,
i

i0
=
0i
, s
i
1
2
ijk
s
jk
, k
i
s
0i
= s
i0
and
(s
= i(g
) . (1.3)
Here, the indices i, j, k = 1, 2, 3 and
123
= +1.
Note that s
is an antisymmetric 4 4 matrix, i.e., s
= s
, and
satises the commutation relations of the SO(3,1) SL(2,C) Lie algebra,
[s
, s
] = i(g
+g
). (1.4)
The elds of spin-zero and spin-one transform under a general Lorentz
transformation as
(x
) = (x) , spin 0 , (1.5)

A
(x
) =
(x) , spin 1 . (1.6)

For a eld of spin-s, the general transformation law reads
(x
) = exp
_
i
2
(x) , (1.7)
where the S
are (nite-dimensional) irreducible matrix representations

of the Lie algebra of the Lorentz group, and and label the components
of the matrix representation space. The dimension of this space is related
to the spin of the particle. In particular, S
is an antisymmetric tensor,
S
= S
, that satises the commutation relations of the SL(2,C) Lie

3
Henceforth, we work in units where c = 1.
1.1 The Lorentz group and its Lie algebra 11
algebra [eq. (1.4)]. Dierent irreducible nite dimensional representations
of SL(2,C) correspond to particles of dierent spin. It is convenient to
identify the following pieces of S
S
i
1
2
ijk
S
jk
, K
i
S
0i
, (1.8)
where i, j, k = 1, 2, 3. One can show that the S
i
generate three-
dimensional rotations in space and the K
i
generate the Lorentz-boosts [2].
Using eq. (1.4), it follows that S
i
and N
i
satisfy the commutation relations
[S
i
, S
j
] = i
ijk
S
k
, (1.9)
[S
i
, K
j
] = i
ijk
K
k
, (1.10)
[K
i
, K
j
] = i
ijk
S
k
. (1.11)
It is convenient to dene the following linear combinations of the
generators
S
+

1
2
(
S +i
K) , (1.12)
1
2
(
S i
K) . (1.13)
Then, the commutation relations [eqs. (1.9)(1.11)] decouple into two
independent SU(2) algebras
[S
i
+
, S
j
+
] = i
ijk
S
k
+
, (1.14)
[S
i
, S
j
] = i
ijk
S
k
, (1.15)
[S
i
+
, S
j
] = 0 . (1.16)
The nite-dimensional irreducible representations of SU(2) are well
known: these are the (2s + 1) (2s + 1) representation matrices
corresponding to spin-s, where s = 0,
1
2
, 1,
3
2
. . . (whose matrix elements
appear in most textbooks of quantum mechanics). Hence, the irreducible
representations of the Lorentz group can be characterized by two numbers
(s
+
, s
), where s
is non-negative and either an integer or half-integer.

The eigenvalues of

S
2
are given by s
(s
+ 1). The dimension of the

representation corresponding to (s
+
, s
) is (2s
+
+ 1)(2s
+ 1).
Using eqs. (1.7) and (1.8), one can write an innitesimal Lorentz
transformation as
M exp
_
i
2
_
I i

S i

K, (1.17)
where
i
and
i
are dened below eq. (1.2) and I is the identity. The
simplest (trivial) representation is the one-dimensional (0, 0) representa-
tion which corresponds to a spin-zero scalar eld. In this representation,
S =

K = 0 and we recover from eq. (1.7) the transformation law given
in eq. (1.5). The spin-one transformation of eq. (1.6) corresponds to
the four-dimensional (
1
2
,
1
2
) representation. However, in a quantum eld
theory of spin-one elds, only three of the four degrees of freedom are
physical. Moreover, in gauge theories of massless spin-one elds, gauge
invariance introduces an additional constraint and only two degrees of
freedom are physical. This is described in detail in ref. [1] to which we
refer the reader.
1.2 The Poincare group and its Lie algebra
The Poincare group consists of Lorentz transformations and spacetime
translations. That is, under a Poincare transformation, the spacetime
co-ordinates transform as x

=
+a
, where is given by eq. (1.2)

and a
is a constant four-vector. For a eld of spin s, the transformation

law given by eq. (1.7) still holds under Poincare transformations. It is
convenient to rewrite this transformation law as follows:
(x) = exp
_
i
2
1
(x a)
_
, (1.18)
where we have used x =
1
(x
a) and redened the dummy variable x
by removing the prime. The Poincare algebra is obtained by considering

an innitesimal Poincare transformation. Expanding in a Taylor series
about = I
4
and a = 0, we may rewrite eq. (1.18) as:
4
(x)
_
I +ia
i
2
(L
+S
(x) , (1.19)
where P
and L
are the dierential operators

P
, L
i(x
) . (1.20)
We further dene J
+ S
. Thus, the Poincare algebra consists

of the ten generators P
and J
which obey the following commutation

relations:
5
[P
, P
] = 0 , (1.21)
_
J
, P
_
= i(g
) , (1.22)
_
J
, J
_
= i(g
+g
) . (1.23)
4
The operators I, P
and L
include an implicit factor of
, whereas the spin

operator S
depends non-trivially on and (except for the case of spin zero,

when S = 0).
5
It is easy to show that eq. (1.22) follows from the transformation law of the four-vector
P
under Lorentz transformations.

1.2 The Poincare group and its Lie algebra 13
The Poincare algebra possesses two independent Casimir operators
(these are polynomial functions of the generators that commute with the
generators P
and J
). The rst one is P

2
P
. To construct the
second one, we introduce the Pauli-Lubanski vector w
:
w

1
2
, (1.24)
where
0123
= 1. Explicitly,
w
= (
J

P ; P
0
J +

N

P) , (1.25)
where J
i
1
2
ijk
J
jk
and N
i
J
0i
. From its denition, it follows
that w
= 0. The second Casimir operator is w

2
w
. The
representations of the Poincare algebra can be labelled by the eigenvalues
of P
2
and w
2
when acting on the physical states.
6
The eigenvalue of P
2
is
m
2
, where m is the mass. To see the physical interpretation of w
2
, let us
evaluate the eigenvalue of w
2
, consider rst the case of m ,= 0. In this case,
we are free to evaluate w
2
in the particle rest frame (since w
2
is a Lorentz
scalar). In this frame, w
= (0 ; m
S), where S
i
is dened in eq. (1.8).
Hence, w
2
= m
2
S
2
, with eigenvalues m
2
s(s + 1), s = 0,
1
2
, 1, . . .. We
conclude that massive (positive energy) states can be labelled by (m, s),
where m is the mass and s is the spin of the state.
If m = 0, the previous analysis is not valid, since we cannot evaluate
w
2
in the rest frame. Nevertheless, if we take the m 0 limit, it follows
from the results above that either w
2
= 0 or the corresponding states have
innite spin. We reject the second possibility (which does not appear to
be realized in nature) and assume that w
2
= 0. Thus, we must solve the
equations w
2
= P
2
= w
= 0. It is simplest to choose a frame in which

P = P
0
(1; 0, 0, 1) where P
0
> 0. In this frame, it is easy to show that
w = w
0
(1; 0, 0, 1). That is, in any Lorentz frame,
w
= hP
, (1.26)
where h is called the helicity operator. From eq. (1.26), we derive:
7
h =
w
0
P
0
=

J

P
P
0
=

S

P
P
0
. (1.27)
Eigenvalues of h are called the helicity (and are denoted by ). One can
check that h commutes with all the generators of the Poincare algebra
(as long as P
0
= [
P[ ,= 0). Thus, can be used to label states. In

6
Here, we restrict our consideration to states of non-negative energy P
0
. This is
possible since the sign of P
0
is also invariant under proper Poincare transformations.
7
We dene the dierential operator L
i
1
2
ijk
L
jk
. Then, noting that

L = x

P, it
follows that

J

P =

S

P.
particular, noting that for massless states, the eigenvalue of

S

P/P
0
is equal to that of

S

P (where

P

P/[
P[) and corresponds to the

projection of the the spin along its direction of motion. The spectrum
of

S

P consists of = 0,
1
2
, 1, . . .. Under a CPT transformation,
. Thus, massless (positive energy) particles are labeled by [[
(colloquially called spin-[[), where both [[ helicity states appear in
the theory.
1.3 Spin-1/2 representation of the Lorentz group
In this chapter, we focus on the simplest non-trivial irreducible
representations of the Lorentz algebra. These are the two-dimensional
representations: (
1
2
, 0) and (0,
1
2
). Explicitly, the corresponding two-
dimensional representations of the Lorentz generators are given by
(
1
2
, 0) :

S
+
=
1
2
(
S +i
K) =
1
2
,

S
=
1
2
(
S i
K) = 0 , (1.28)
which corresponds to

S = /2 and

K = i/2, and
(0,
1
2
) :

S
+
=
1
2
(
S +i
K) = 0 ,

S
=
1
2
(
S i
K) =
1
2
,(1.29)
which corresponds to

S = /2 and

K = i/2. Here, = (
1
,
2
,
3
) are
the usual Pauli spin matrices
1
=
_
0 1
1 0
_
,
2
=
_
0 i
i 0
_
,
3
=
_
1 0
0 1
_
. (1.30)
We shall dene a fourth matrix,
0
I
2
, which is the 2 2 identity
matrix.
Consider the innitesimal Lorentz transformation in the (
1
2
, 0) repre-
sentation. Inserting the (
1
2
, 0) generators [eq. (1.28)] into eq. (1.17) yields:
M I
2
i
2

1
2
. (1.31)
A two-component (
1
2
, 0) spinor eld is denoted by
(x), and transforms

as
(x)
(x
) = M
(x), , = 1, 2. (1.32)
If M is a matrix representation of SL(n,C), then, M
, (M
1
)
T
, and
(M
1
)
are also matrix representations of the same dimension. For n > 2,

all four representations are inequivalent. For SL(2,C), there are at most
two distinct matrix representations corresponding to a given dimension:
1.3 Spin-1/2 representation of the Lorentz group 15
(j
1
, j
2
) and (j
2
, j
1
). Using eq. (1.31), and the following property of Pauli
matrices:
2
(
2
)
T
=
T
, (1.33)
where the transpose of the -matrices are:
T
= (
1
,
2
,
3
). It is a
simple matter to check that M and (M
1
)
T
are related by
(M
1
)
T
= i
2
M(i
2
)
T
. (1.34)
Since (i
2
)
T
= (i
2
)
1
, the matrices M and (M
1
)
T
are related by
a similarity transformation corresponding to a unitary change in basis.
Hence, M and (M
1
)
T
are equivalent representations.
8
It is convenient
to introduce the two-component spinor eld
(x), ( = 1, 2), which

transforms under the contragradient representation (M
1
)
T
(x)
(x
) = (M
1
)
T
(x) ,
= [i
2
M(i
2
)
T
]
(x) . (1.35)
This motivates the denitions
i
2
=
_
0 1
1 0
_
,
(i
2
)
1
=
_
0 1
1 0
_
, (1.36)
i.e.
12
=
21
=
21
=
12
= 1. The -tensor satises:
, (1.37)
from which it follows that:
9
. (1.38)
Then, eqs. (1.32) and (1.35) imply that
. (1.39)
Since M and (M
1
)
T
are equivalent representations, either
or
are
equally good candidates to describe the (
1
2
, 0) representation.
Consider next the innitesimal Lorentz transformation in the (0,
1
2
)
representation [eqs. (1.17) and (1.29)],
(M
1
)
I
2
i
2
+
1
2
, (1.40)
8
This corresponds to the well known result that the 2 and 2* representations of SU(2)
are equivalent.
9
Of course,
, so the distinction is somewhat pedantic. Nevertheless, it is

useful to keep track of this distinction when reinterpreting such equations in terms
of matrix multiplication.
where we have used eq. (1.31) to identify the right-hand side above as
(M
1
)
.
A two-component (0,
1
2
) spinor eld is denoted by

(x), and transforms
as

(x)

(x
) = (M
1
)

(x) , ,

=

1,

2. (1.41)
The so-called dotted indices have been introduced to distinguish the
(0,
1
2
) representation from the (
1
2
, 0) representation. The bar over the
spinor is a convention that serves as an extra (admittedly redundant)
reminder that this is the (0,
1
2
)-representation. The equivalent description
of this representation is obtained via the conjugate representation M
.
That is, M
is related by a similarity transformation to (M

1
)
. Under
Lorentz transformations

(x)

(x
) = (M
(x), (1.42)
where

=

,

=

, (1.43)
and
= i
2
=
_
0 1
1 0
_
,

= (i
2
)
1
=
_
0 1
1 0
_
. (1.44)
Hence,

and
are numerically equal, but the dotted and undotted

indices transform dierently under Lorentz transformations [eqs. (1.32)
and (1.41)] and must always be kept distinct.
The dotted epsilon tensor satises:

, (1.45)
from which it follows that
10

=

,

=

. (1.46)
Note that

and (
have the same transformation law, as do

and
(
. Hence, we may equate them:

11

= (
,

= (
. (1.47)
10
Of course,

=

See footnote 9.
11
One could also consider the complex-conjugated two-component spinor elds.
However, in quantum eld theory, the two-component spinor elds are also operators,
and the use of the dagger () indicates hermitian conjugation with respect to the
quantum Hilbert space. The hermitian conjugated elds rather than the complex-
conjugated elds are the ones that appear in the Lagrangian of the theory.
The Lorentz transformation property of

can then be written in the form
(
(M
, where (M
= (M
simply corresponds to
the well known denition of the hermitian adjoint matrix as the complex
conjugate transpose of the matrix.
So far, we have employed a notation in which unbarred two-component
fermion elds correspond to the (
1
2
, 0) representation of the Lorentz group,
while barred elds correspond to the (0,
1
2
) representation. Of course, one
could always consider a (0,
1
2
) as a fundamental eld and assign it a symbol
without a bar. However, it is useful to establish a convention in which all
unbarred elds correspond to the (
1
2
, 0) representation. Henceforth, we
shall always work in this convention. Since (
= , this convention can

be adopted with no loss of generality.
We will encounter two types of spinor quantities in this book. First, we
shall examine spinor elds that are used to construct Lagrangians for eld
theoretic models. These spinors are anti-commuting objects. We shall
then consider free-eld plane wave expansions of these spinor elds. That
is, we expand the spinor elds in terms of sums of plane waves multiplied
by spinor wave-functions times the appropriate creation or annihilation
operator. These wave-functions are commuting objects that satisfy the
free-eld Dirac equation, whereas the creation and annihilation operators
satisfy the usual canonical anti-commutation relations.
1.4 Bilinear covariants of two-component spinors
It is well known that one can construct Lorentz scalar, vector and tensor
quantities that are bilinear in the fermion elds. Subsequently, these
vector and tensor quantities can be further combined to make new Lorentz
scalars. Such scalars can be utilized in the construction of Lagrangians for
theories of fermion elds. The Lagrangian of a relativistic quantum eld
theory is Lorentz invariant, i.e., it must transform as a Lorentz scalar.
The simplest quantum eld theory of fermions consists of a single (
1
2
, 0)
fermion eld. More generally, one can consider a theory of a multiplet
of fermions elds. Since the corresponding Lagrangian is bilinear in the
fermion elds, it is sucient to consider two (
1
2
, 0) fermion elds and their
Lorentz transformation properties:
. (1.48)
In order to construct a Lorentz invariant (i.e., scalar) bilinear of the two-
component fermion elds, we rst note a basic property of the Lorentz
transformation matrix M. From the denition of the determinant,
det M =
, (1.49)
where we have used det M = 1 at the last step. It then follows that the
bilinear combination

(1.50)
is invariant under Lorentz transformations. Similarly, given two (0,
1
2
)
fermion elds, the bilinear combination

(1.51)
is also Lorentz invariant. Note carefully the placement of the indices. We
have adopted a convention that the heights of indices must be consistent
in the sense that lowered indices must always be contracted with raised
indices. Moreover, whenever possible undotted and dotted indices are
contracted with the placement of indices as follows:
and

. (1.52)
In this case, the spinor indices can be omitted without ambiguity as
indicated in eqs. (1.50) and (1.51).
We consider next the behavior of eqs. (1.50) and (1.51) under hermitian
conjugation. Since hermitian conjugation of a spinor product reverses the
order of the spinors, it follows that
()
= (

=

. (1.53)
Finally, we note that for anti-commuting two-component fermion elds,
=
= , (1.54)
where we have used
and relabeled the dummy indices.

Likewise,

=

(1.55)
()
. (1.56)
In order to construct fermion bilinears that transform as Lorentz vectors
and tensors, we must introduce the sigma-matrices,
and

, dened
by
(I
2
; ) ,
(I
2
; ) , (1.57)
where = 0, 1, 2, 3. These quantities possess a specic spinor index
structure. To determine it, rst note that
p
= p
0
I
2
p =
_
p
0
p
3
p
1
+ip
2
p
1
ip
2
p
0
+p
3
_
(1.58)
is an hermitian 2 2 matrix. Moreover, any hermitian 2 2 matrix can
be written in the form of eq. (1.58). Since Mp
is also hermitian,
there exists a p
such that
p
= Mp
. (1.59)
Using det(p
) = p
2
0
[ p [
2
and det M = 1, it follows that
p
2
0
[ p
[
2
= p
2
0
[ p[
2
, (1.60)
and hence p
is a Lorentz transformation. Moreover, one can check

that for M I
2
i
2

1
2
, one obtains p
where is
the 4 4 Lorentz transformation matrix [eq. (1.2)] parameterized by

and

. In order for eq. (1.59) to be Lorentz-covariant, both sides of the
equation must possess the same index structure. Using the known index
structure of the Lorentz transformation matrices M [eq. (1.32)] and M
[eq. (1.42)], it follows that the spinor index structure of eq. (1.59) is given
by
p

= M
(M
, (1.61)
where we have used (M

= (M
. Likewise, for the same four-

vectors p and p
as above,
p

= [(M
)
1
]

(M
1
)
. (1.62)
We thus obtain the index structure:

and

. (1.63)
Both
and
are hermitian matrices, which is expressed by the

following relations:
(

, (
)

=
. (1.64)
Some useful relations between
and
are
,

=
, (1.65)

. (1.66)
Finally, we note the following identities:
[
= 2g
, (1.67)
[
= 2g
. (1.68)
We may now construct the bilinear covariant that transforms as a
Lorentz vector:
. (1.69)
Note that the convention of eq. (1.52) is respected and allows us to
suppress the spinor indices. Moreover, the structure of the spinor and
Lorentz indices guarantees that
transforms as a Lorentz vector.

Nevertheless, an explicit proof is instructive. If
and A
are Lorentz
vectors, then
is a Lorentz scalar. Under a Lorentz transformation,

A
and

transform according to eqs. (1.6), (1.35) and (1.41),

respectively. Then,
is a Lorentz scalar if the following condition

is satised:
(M
1
)
(M
1
)

. (1.70)
To verify eq. (1.70), we need an explicit expression for M. This will be
given after the introduction of the second-rank tensor bilinear covariants
below, so we postpone the rest of the proof until then.
Likewise, we may dene:
. (1.71)
One may rewrite this Lorentz covariant bilinear in terms of
by using
the identity:
, (1.72)
which is valid for anti-commuting two-component spinors. Under hermi-
tian conjugation,
(
, (1.73)
a result that holds true for both commuting and anti-commuting spinors.
Combining eq. (1.72) and eq. (1.73) yields
(
(1.74)
Using the fact that the quantity

is a Lorentz scalar, we obtain

a second consistency conditions following similar arguments given above:
M

. (1.75)
Next we proceed to construct bilinear covariants that transform as a
Lorentz second-rank tensor. Using the matrices
and
we rst dene
(
i
4
_

_
, (1.76)
(

i
4
_

_
. (1.77)
These matrices satisfy self-duality relations:
=
1
2
i
=
1
2
i
. (1.78)
The components of
and
are easily evaluated:
ij
=
ij
=
1
2
ijk
k
, (1.79)
i0
=
0i
=
i0
=
0i
=
i
2
i
. (1.80)
The hermiticity of implies that
(

. (1.81)
Previously [see eqs. (1.17), (1.28) and (1.29)], we noted that the
Lorentz transformation matrices for the (
1
2
, 0) and (0,
1
2
) representations,
respectively are given by
exp
_
i
2
_
I
2
i
2

1
2
, for (
1
2
, 0) ,
I
2
i
2
+
1
2
, for (0,
1
2
) .
(1.82)
From this we deduce that
S
, for the (
1
2
, 0) representation , (1.83)
S
, for the (0,

1
2
) representation . (1.84)
Indeed, S
i
=
1
2
ijk
S
jk
=
1
2
i
for both (
1
2
, 0) and (0,
1
2
) representations,
and thus w
2
= m
2
S
2
with eigenvalues
3
4
m
2
as expected for a spin-1/2
fermion.
We may now construct two bilinear covariants that transform as second-
rank Lorentz tensors:
, (1.85)

. (1.86)
Again, we note that the convention of eq. (1.52) is respected and allows
us to suppress the spinor indices. For anti-commuting two-component
spinors, one easily veries that
. (1.87)
Under hermitian conjugation, eq. (1.81) yields
(
, (1.88)
a result that holds true for both commuting and anti-commuting spinors.
We now return to complete the proof of eqs. (1.70) and (1.75). It is
sucient to use the innitesimal forms
12
for M, M
and
M I
2
i
2
, (1.89)
M
I
2
+
i
2
, (1.90)
+
1
2
_
_
. (1.91)
Inserting these results in eqs. (1.70) and (1.75), it is straightforward to
verify that these results are equivalent to the identities:
= i(g
) , (1.92)
= i(g
) , (1.93)
which are easily veried. This completes the proof of eqs.(1.70) and (1.75).
Many more useful identities involving
and
can be found
in the Appendix.
The complete list of independent bilinear covariants is given in
Table 1.1. To show that the list is complete, we note that an arbitrary
2 2 complex matrix can be written as a complex linear combination of
four linearly independent 2 2 matrices, which may be chosen, e.g., to
be
( = 0, 1, 2, 3). However, in constructing the bilinear covariants,

both undotted and dotted two-component spinors are employed. Thus,
the relevant space is the sixteen-dimensional product space of two
independent spaces of 2 2 matrices, spanned by the sixteen matrices
13
,

,
. (1.94)
Here, it is important to note that
and

each possess
only three independent components due to the self-duality relations of
eq. (1.78). Hence, the total number of independent components of the
bilinear covariants is indeed equal to 16, as indicated in Table 1.1. One
consequence of these considerations is that products of three or more
or matrices can always be expressed as a linear combination of the
matrices given in eq. (1.94) . Examples can be found in the Appendix.
In a theory of a single two-component spinor, there are only six
independent components of the non-vanishing bilinear covariants. To
obtain this result, let us set = in the bilinear covariants listed in
Table 1.1, and note that

and
= 0. The
12
The inverses of these quantities are obtained (to rst order in ) by replacing .
13
For example, inside the sixteen dimensional product space there exist four-
dimensional subspaces spanned by {
} and {

} respectively.
Table 1.1. Independent bilinear covariants involving a pair of two-component
spinor elds.
bilinear covariants number of components
1

3
latter result is derived as follows. Using eqs. (1.38), (1.54) and (1.85),
=
1
2
=
1
2
Tr
= 0 . (1.95)
The derivation of
= 0 is similar. One consequence of this result is

that the magnetic moment of a real Majorana fermion must vanish.
1.5 Lagrangians for free spin-1/2 fermions
We now have the ingredients to construct a Lorentz invariant hermitian
Lagrangian for free fermions. First, consider a theory of a single two-
component fermion, . From the previous section, it is clear that the
following quantity is a Lorentz scalar:
i

. (1.96)
The factor of i is inserted since the hermitian quantity
i
2

i
2
[
) (
] , (1.97)
diers from i
by a total divergence. Hence, eq. (1.96) is a

candidate for a kinetic energy term in the Lagrangian. A second hermitian
Lorentz scalar,
m +m
, (1.98)
is a candidate for a mass term in the Lagrangian for a theory of fermions.
Thus, the Lagrangian for a free two-component fermion is given by
L = i

1
2
(m +m
) . (1.99)
Without loss of generality, we can absorb the phase of the parameter m
into the denition of the eld . Henceforth, we shall always work in a
convention in which m is real and non-negative. The corresponding eld
equations are:
i
m = 0 , (1.100)
i
m = 0 , (1.101)
where eq. (1.101) is the conjugate of eq. (1.100). This is a theory of a
(neutral) Majorana spin-1/2 fermion.
Next, consider a theory of two free two-component fermion elds,
1
and
2
. The corresponding Lagrangian can be written in the following
form:
14
L = i
1
1
+i
2
1
2
m
1
(
1
1
+
1
1
)
1
2
m
2
(
2
2
+
2
2
) ,
(1.102)
where m
1
and m
2
are real and non-negative mass parameters. A special
case arises when m
1
= m
2
= m. In this case one can perform complex
eld redenitions
=
1
2
(
1
+i
2
) , (1.103)
=
1
2
(
1
i
2
) , (1.104)
to obtain
L = i
+i
m( + ) . (1.105)
The corresponding eld equations for and are:
i
m = 0 , i
m = 0 , (1.106)
i
m = 0 , i
m = 0 , (1.107)
The theory of two free two-component fermion elds of equal mass
[eq. (1.102) with m
1
= m
2
] possesses a global internal O(2) symmetry,
15
i
O
ij
j
, where O
T
O = I
2
. Corresponding to this symmetry is an
hermitian conserved Noether current:
J
= i(
1
1
) , (1.108)
14
The theory specied by eq. (1.99) is in a form in which the mass matrix is diagonal.
In Section 1.6, we will generalize to the case of n two-component elds (n 2) and
discuss the general procedure of fermion mass matrix diagonalization.
15
The kinetic energy term alone is invariant under a larger global U(2) symmetry.
However, when m = 0, the global symmetry of the free fermion eld theory is O(2).
with a corresponding conserved charge, Q =
_
J
0
d
3
x. In the
i
-basis,
the Noether current is o-diagonal. However, after converting to the
basis via eqs. (1.103) and (1.104), the current is diagonal:
J
, (1.109)
which provides the motivation for this basis choice. In particular, we can
now identify and as elds of denite and opposite charge Q. Together,
and constitute a single Dirac spin-1/2 fermion.
Finally, we introduce the concept of o-shell degrees of freedom and
contrast this with on-shell or physical degrees of freedom. Here, o-shell
[on-shell] is shorthand for o- [on-] mass shell and distinguishes between
virtual and real particle propagation, respectively. Given a particle
with four-momentum p, real particle propagation must satisfy p
2
= m
2
(the mass-shell condition), whereas for virtual particles (e.g., particle
exchange inside Feynman diagrams), the four-momentum p and the mass
m are independent. The eld operator corresponding to real particle
propagation satises the eld equations, whereas the eld operator of a
virtual particle is not constrained by the eld equations.
For scalar (spin-0) elds, there is no distinction between on-shell and
o-shell degrees of freedom. The on-shell scalar eld satises the Klein-
Gordon equation, but this does not alter the fact that a real (complex)
scalar eld consists of one (two) degree(s) of freedom. For a spin-s
eld (s > 0), the number of on-shell degrees of freedom is less than
the number of o-shell degrees of freedom. Although all on-shell spin-s
elds satisfy the Klein-Gordon equation, the spin-s eld equations provide
additional constraints that reduce the number of degrees of freedom
originally present. We illustrate this in the case of free fermion eld
theory.
A theory of a neutral self-conjugate fermion is described in terms of
a single complex two-component fermion eld
(x). This corresponds

initially to four o-shell degrees of freedom, since
and

(
are independent degrees of freedom. But, when the eld equations are
imposed [see eq. (1.100)], (x) is determined in terms of (x). Thus,
the number of physical (on-shell) degrees of freedom for a neutral self-
conjugate fermion is equal to two.
A theory of a charged fermion is described in terms of a pair of mass-
degenerate two-component fermions. We shall work in a basis, where the
charged fermion is represented by a pair of two-component fermion elds
and . The number of o-shell degrees of freedom for a charged fermion
is eight. Applying the eld equations determines in terms of and
in terms of [see eqs. (1.106) and (1.107)]. Thus, the number of physical
on-shell degrees of freedom for a charged fermion is equal to four.
Although we have illustrated the counting of degrees of freedom for free
fermion eld theory, the same counting applies in an interacting theory.
1.6 The fermion mass-matrix and its diagonalization
We now generalize the discussion of Section 1.5 and consider a collection
of n free anti-commuting two-component spin-1/2 elds,
i
(x), which
transform as (
1
2
, 0) elds under the Lorentz group. Here, is the spinor
index, and i is a avor index (i = 1, 2, . . . , n) that labels the distinct elds
of the collection. The free-eld Lagrangian is given by
L = i
1
2
M
ij

i

j
1
2
M
ij

j
, (1.110)
where we are following the avor index conventions of Appendix A.
In particular, note that M
ij
(M
ij
)
. Moreover, M
ij
is a complex
symmetric matrix, since the product of anticommuting two-component
elds satises
i

j
=
j

i
[with the spinor contraction rule according to
eq. (1.52)].
In general, the matrix M is not diagonal, so we cannot directly identify

i
(x) as a eld of denite mass. If the Lagrangian were also to include
interaction terms, we would call
i
(x) an interaction-eigenstate eld.
16
One can diagonalize the mass matrix and rewrite the Lagrangian in
terms of mass eigenstates
i
that exhibit real non-negative masses m
i
,
respectively. To do this, we introduce the mass-eigenstates:

i
=
i
j
j
,

i
=
i
j

j
, (1.111)
where is a unitary nn matrix (to be determined by eq. (1.113) below),
and
i
j
(
i
j
)
. Under this transformation, the kinetic energy term is

unchanged:
i
i
=

i

i
, but the mass term is transformed:
M
ij

i

j
= M
ij
i
k
k
m
k
k
, (1.112)
where the m
k
are real and non-negative, and M
ij
i
k
= m
k
k
, (no
sum over k). Equivalently, in matrix notation with suppressed indices,
T
M = M
D
diag(m
1
, m
2
, . . .). (1.113)
One can prove that a unitary matrix satisfying eq. (1.113) always exists.
This result corresponds to the Takagi factorization of a general complex
symmetric matrix [3, 4]. That is, for any complex symmetric matrix
16
In general, we shall use hatted elds to represent interaction eigenstate elds and
un-hatted elds to describe states of denite mass
1.6 The fermion mass-matrix and its diagonalization 27
M there exists a unitary matrix such that
T
M is diagonal with
non-negative entries given by the positive square roots of the eigenvalues
of MM
(or M
M). To compute the values of the (non-negative real)

diagonal elements of M
D
, note that eq. (1.113) implies that
T
MM
= M
D
M
D
= M
2
D
. (1.114)
However, we know that the hermitian matrix MM
can be diagonalized
by a unitry matrix.
17
Hence, the diagonal elements of M
2
D
are the
eigenvlaues of MM
, which implies that the diagonal elements of M

D
are
the non-negative square roots of the corresponding eigenvalues of MM
,
as asserted above. If the eigenvalues of MM
are non-degenerate, then

is unique up to permuations of the rows (or columns) of and a possible
overall multiplication of any row (or column) of by 1. If there are
degenerate eigenvalues, then if satises eq. (1.113) and so does K,
where K is real and orthogonal within the degenerate subspace.
Thus, in terms of the mass eigenstates, the general form for the free-eld
Lagrangian of n two-component spinors is given by:
L = i
i
1
2
i
m
i
(
i
i
+
i

i
) . (1.115)
If all the masses are non-degenerate, then this Lagrangian does not exhibit
any nontrivial global (internal) symmetry. However, if some of the masses
are degenerate, global internal symmetries will exist (some of which may
still be respected once interactions are included). For example, if all
masses are degenerate, then the model exhibits an O(n) global symmetry
corresponding to
i
O
ij
j
with O
T
O = I
n
.
In the case where some of the spin-1/2 fermion elds are massive Dirac
fermions carrying a conserved charge, it is often convenient to modify the
above procedure. If
is a charged massive eld, then there must be

an associated independent two-component spinor eld
of equal mass
with the opposite charge. Although the preceding mass diagonalization
procedure will always work, it is often more convenient to employ mass
eigenstates that are also eigenstates of the charge operator. In the case
of two elds of opposite charge, this means writing the corresponding
Lagrangian in the form given by eq. (1.105) [whereas the form given by
eq. (1.102) results from the diagonalization procedure described above].
A general theory of charged fermions deserves special attention.
Consider a collection of such free anti-commuting charged massive
17
Note however that one cannot in general use eq. (1.114) to determine the matrix
. For example, it is possible to have a non-diagonal matrix M such that MM
is proportional to the identity matrix. In this case, drops out completely from
eq. (1.114), and must be determined directly from eq. (1.113).
spin-1/2 elds, which can be represented by pairs of two-component
(interaction-eiegenstate) elds

i
(x),
i
(x), where
i
(x) transforms in
a (possibly reducible) representation of the unbroken symmetry group
that is the complex conjugate of the representation of

i
(x). The most
general free-eld Lagrangian is given by
L = i
i
+i

i
M
ij
i

j
M
ij

j
, (1.116)
where M
ij
is an arbitrary complex matrix, and M
ij
(M
ij
)
as before.
We diagonalize the mass matrix by introducing the mass-eigenstate elds
i
and
i
and unitary matrices L and R (to be determined below),

i
= L
i
k
k
,
i
= R
i
k
k
, (1.117)
and demand that M
ij
L
i
k
R
j
= m
k
k
(no sum over k). In matrix form,
this is written as:
L
T
MR = M
D
= diag(m
1
, m
2
, . . .), (1.118)
with the m
i
real and non-negative. One can prove that the unitary
matrices L and R satisfying eq. (1.118) always exist. This result
corresponds to the singular value decomposition of a general complex
matrix [5]. That is, for any complex matrix M there exist unitary matrices
L and R such that L
T
MR is diagonal with non-negative entries given
by the positive square roots of the eigenvalues of MM
(or M
M). In
particular, from eq. (1.118) it follows that
L
T
(MM
)L
= M
D
M
D
= M
2
D
, (1.119)
R
(M
M)R = M
D
M
D
= M
2
D
. (1.120)
This illustrates the fact that since MM
and M
M are both hermitian

(these two matrices are not equal in general, although they possess the
same real non-negative eigenvalues), they can be diagonalized by unitary
matrices. The diagonal elements of M
D
are therefore the non-negative
square roots of the corresponding eigenvalues of MM
(or M
M), as
asserted above.
Thus, in terms of the mass eigenstates,
L = i
i
i
+i
i
i
m
i
(
i
i
+
i

i
) . (1.121)
The mass matrix now consists of 2 2 blocks
_
0 m
i
m
i
0
_
along the
diagonal. More importantly, the mass matrix is diagonal with doubly-
degenerate entries m
2
i
and describes a collection of Dirac fermions.
18
In the most general theory of spin-1/2 elds, the Lagrangian can be
written in terms of two-component (
1
2
, 0) fermion elds

i
(x), which in
general may consist of neutral fermions
i
, and charged fermion pairs

i
and
i
. The mass eigenstate basis is achieved by a unitary rotation U
i
j
on the avor indices. In matrix form:
= U
0 0
0 L 0
0 0 R
, (1.122)
where , L, and R are constructed as described above. The result of
the mass diagonalization procedure in a general theory therefore always
consists of a collection of Majorana fermions as in eq. (1.115), plus a
collection of Dirac fermions as in eq. (1.121).
1.7 Discrete spacetime and internal symmetries
The Lorentz bilinear covariant quantities given in Table 1.1 transform
as a scalar, vector or second-rank tensor under the proper Lorentz
transformations (which are continuously connected to the identity). But,
it also proves useful to study the behavior of the bilinear covariants under
improper Lorentz transformations. To accomplish this, it is sucient
to study the behavior of the bilinear covariants under time reversal (T)
and under parity (P). In addition, for bilinear covariants that depend on a
pair of two-component spinor elds, a discrete transformation that can be
interpreted as charge conjugation (C) is also relevant. These symmetries
act on the Hilbert space of states via the anti-unitarity operator T
and the unitary operators T and (, respectively. The second-quantized
fermion elds, which are operators that act on on the Hilbert space,
transform under T, P and C via similarity transformations: T T
1
,
TT
1
and ((
1
.
The transformation laws for the two-component spin-1/2 fermions
under P and T cannot be determined a priori from the transformation
laws under proper Lorentz transformations. This can be understood
18
Of course, one could always choose instead to treat the Dirac fermions in a basis
with a fully diagonalized mass matrix, as in equation (1.115), by dening 2i1 =
(i +i)/
2 and 2i = i(i i)/
2. These fermion elds do not carry well-dened

charges, and are analogous to writing a charged scalar eld and its oppositely-
charged conjugate
in terms of their real and imaginary parts. However, it is rarely,

if ever, convenient to do so; practical calculations only require that the mass matrix
is diagonal, and it is of course more pleasant to use elds that carry well-dened
charges.
by noting that the two-component spinors transform under SL(2,C)
which is the universal covering group of SO
+
(3,1), the group of proper
orthochronous Lorentz transformations.
19
In particular, SL(2,C) does
not contain space-inversion or time-inversion. As a result, there is
some freedom in dening the action of P and T on the two-component
fermion elds. We shall x this freedom by demanding that the free-eld
fermionic Lagrangians respect the P and T discrete symmetries. That
is, L
P
(x, t) = L(x, t) and L
T
(x, t) = L(x, t), which ensures that the
corresponding free-eld action is invariant under P and T, respectively.
We expect that under a parity transformation a (
1
2
, 0) fermion is
transformed into a (0,
1
2
) fermion and vice versa. Thus, it is convenient
to dene an hermitian matrix P such that
T
(x)T
1
= i
P
P
(x
P
) , T

(x)T
1
= i
P
P

(x
P
) , (1.123)
where x
P
(t ; x) and
P
is initially an arbitrary complex phase.
Note that the hermiticity of P implies that (P
= P

. By imposing
L
P
(x, t) = L(x, t), where L is the free fermion Lagrangian [eq. (1.99)],
the matrix P is determined and the possible values of
P
are restricted.
Here, we follow the standard convention where the mass term is of the
form L
m
=
1
2
(m+m
) with m real and non-negative.

20
Invariance
of the kinetic energy term [eq. (1.96)] determines P
=
0
(up to an
overall sign that can be absorbed into the denition of
P
). To derive
this result, we note that any hermitian 2 2 matrix can be written
as P = a
. Invariance of the kinetic energy term then implies that

a
0
= 1 and a
i
= 0. It is convenient to separate out an explicit factor
of i in the denition of
P
, as we have done in eq. (1.123). Then, the
invariance of the mass term requires
P
=
P
or
P
= 1, as shown in
Section 1.8.
Time-reversal transforms a (
1
2
, 0) fermion into itself [and likewise for
the (0,
1
2
) fermion]. Thus, it is convenient to dene an hermitian matrix
T such that
T
(x)T
1
=
T
T

(x
T
) , T

(x)T
1
=
T
T
(x
T
) , (1.124)
where x
T
(t ; x) and
T
is initially an arbitrary complex phase. Here,
it should be noted that T
(x, t)T
1
transforms as

and T

(x, t)T
1
19
As Lie groups, SO+(3,1)

= SL(2,C)/Z2, corresponding to a double covering of
SO+(3,1) by SL(2,C). For example, the SL(2,C) matrices I2 and I2 correspond
to the identity element I4 SO+(3,1). For fermions, this is signicant, since the
fermion wave function changes by an overall minus sign under a 360
rotation.
20
Note that if m is initially complex, one is always free to absorb its phase in a eld
redenition of .
transforms as
, which explains the spinor index structure of eq. (1.124).

This behavior is a consequence of the fact that T is an anti-unitary
operator that satises T zT
1
= z
for any complex number z. By

imposing L
T
(x, t) = L(x, t), where L is the free fermion Lagrangian
[eq. (1.99)], the matrix T is determined and the possible values of
T
are
restricted. Invariance of the kinetic energy term [eq. (1.96)] determines
T
=
0
(up to an overall sign that can be absorbed into the denition

of
T
). Invariance of the mass term requires
T
=
T
or
T
= 1, as
shown in Section 1.9.
Finally, we examine charge conjugation. In a free-fermion theory,
charge conjugation simply interchanges particles and antiparticles, so
(
2
= 1. In a theory of a single neutral two-component Majorana fermion
(which is its own antiparticle), (
(
1
=
C
. Then, (
2
= 1 implies
that
2
C
= 1, and we conclude that
C
= 1. Clearly, in the free fermion
theory charge conjugation is trivial (and we are free to choose
C
= 1).
However, when interactions are included, in some cases charge conjugation
symmetry is maintained only if
C
= 1. Charge conjugation is less
trivial in a theory of charged fermions. Consider a theory of a pair of two-
component fermion elds of equal (non-zero) mass. The corresponding
free-fermion Lagrangian is given by eq. (1.102) with m
1
= m
2
. As before,
we assume that the phases of the elds have been chosen such that m
1
and m
2
are real and positive. In this case, the Lagrangian exhibits a
global O(2) symmetry,
i
C
ij
j
, where C O(2). Corresponding to
this symmetry is an hermitian conserved Noether current [eq. (1.108)]
with a corresponding conserved charge Q. Under the action of charge
conjugation, the eigenvalues of Q change sign (i.e., (Q(
1
= Q). Thus,
the charge conjugation operator satises:
(J
(
1
= J
. (1.125)
We can use this result to x the form of the charge conjugation matrix C.
To accomplish this, consider the transformation law of the elds under
charge conjugation:
(
i
(x)(
1
= C
ij
j
(x) , (
i
(x)(
1
= C
ij
j
(x) , (1.126)
where C is a real orthogonal 2 2 matrix. Using eq. (1.125) and the
explicit form of J
[eq. (1.108)], it follows that det C = 1. The most

general transformation of this type is given by
_
C
1
C
2
_
=
_
1 0
0 1
__
cos sin
sin cos
__
2
_
. (1.127)
However, one can always redene the fermion elds to absorb the angle
(while leaving the free-eld Lagrangian unchanged). Thus, without loss
of generality one can choose C =
C
3
, where
C
= 1. Explicitly,
(
1
(x)(
1
=
C
1
(x) , (
2
(x)(
1
=
C
2
(x) . (1.128)
It is often more convenient to employ two-component fermion elds of
denite (and opposite) charge, denoted by and . More precisely, for a
fermion of charge +1 (in arbitrary units), the elds and correspond to
charge +1 elds and the elds and correspond to charge 1 elds. The
elds and can be expressed in terms of
1
and
2
using eqs. (1.103)
and (1.104). Acting on and , the charge conjugation transformation
[eq. (1.128)] is given by
((x)(
1
=
C
(x) , ((x)(
1
=
C
(x) . (1.129)
Free eld theories are always separately invariant under P, C and T.
When interactions are included, this may no longer be the case. To
test whether an interacting eld theory is invariant under one or more
of the discrete symmetries, one must exhibit some choice of the phases
(
P
,
C
and
T
) of the elds for which the Lagrangian of the interacting
theory is invariant. However, even if none of the discrete symmetries
are separately conserved, a deep theorem of quantum eld theory asserts
that the combined discrete symmetry of CPT must be conserved by all
relativistic local (free or interacting) quantum eld theories. The CPT-
invariance of quantum eld theory imposes a condition on the overall
phase
CPT
=
C
T
. For all spin-s bosonic elds,
CPT
= (1)
s
.
For fermion elds,
CPT
must also be independent of particle species
(although its sign is not determined since any term in a Lagrangian must
contain an even number of fermion elds). By convention, we shall assume
that
CPT
= +1 for all species of spin-1/2 fermion elds. Then, for a
charged fermion, represented by a pair of two-component fermion elds
of denite and opposite charge, and ,
(TT (x)((TT )
1
= i(x) , (TT (x)((TT )
1
= i (x) . (1.130)
We now examine P, T and C transformations in more detail.
1.8 Parity transformation of two-component spinors
Under the parity (or more precisely, the space-inversion) transformation,
x
P
= (
P
)
= (t ; x), where
(
P
)
1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1
. (1.131)
Consider a theory of a single two-component fermion eld (x). Under
parity the two-component spinor transforms as
T
(x)T
1

P
(x) =
P
i
0
(x
P
) , (1.132)
T
(x)T
1

P
(x) =
P
i
0

(x
P
) , (1.133)
T

(x)T
1

P

(x) =
P
i
0

(x
P
) , (1.134)
T

(x)T
1

P
(x) =
P
i
0
(x
P
) . (1.135)
where [
P
[ = 1. Eq. (1.133) is obtained from the rst by raising the indices
[see eq. (1.39)] and using eq. (1.66). Eqs. (1.134) and (1.135) are obtained
from eqs. (1.132) and (1.133) by hermitian conjugation, respectively. Note
that when the parity transformation is applied twice, we obtain:
(
P
)
P
= , (
P
)
P
= . (1.136)
This is consistent with the more general result that for any neutral self-
conjugate spin-s eld, P
2
= (1)
2s
.
If one redenes e
i
[and e
i
], then
P
e
2i
P
. If
the mass of the fermion is nonzero,
21
we may restrict the choice of the
phase
P
by establishing a convention in which the mass parameter m
in the Lagrangian of eq. (1.99) is taken to be real and positive. Then,
P
= 1
22
as a consequence of the parity invariance of + , as we
now demonstrate.
To show that the Lagrangian of eq. (1.99) is invariant under parity, we
investigate the transformation properties of the scalar bilinear covariants.
For
1
2
and
1
2
we obtain
P
1
(x)
P
2
(x) =
P1
P2
1
(x
P
)
2
(x
P
) , (1.137)
P
1
(x)
P
2
(x) = (
P1
P2
)
1
(x
P
)
2
(x
P
) , (1.138)
where we have used
0

0
=
. As promised, + , transforms
as a scalar for
P
= 1. In this case, the hermitian linear combination
i( ) transforms as a pseudoscalar.
The Lorentz vector bilinear covariants transform as
P
1
(x)
P
2
(x) =
P1
P2
(
P
)
1
(x
P
)
2
(x
P
) , (1.139)
P
1
(x)

P
2
(x) =
P1
P2
(
P
)
1
(x
P
)
2
(x
P
) , (1.140)
21
For massless fermions, there are no additional restriction in the choice of phases
that enter the discrete symmetry transformation laws. However, in order to have
a continuous m 0 limit, we will choose these phase factors according to the
restrictions (if any) of the massive theory.
22
The standard phase convention in the literature (and in textbooks that treat such
things carefully) absorbs the factor of i into P in eqs. (1.132)(1.135). In this
convention, P = i. In this chapter, we have chosen the simpler procedure of
making the factor of i explicit so that in our convention P = 1.
where
P
is given in eq. (1.131).
23
In deriving these results, we have
made use of:
24

0
= (
P
)
, (1.141)
= (
P
)
. (1.142)
The kinetic energy term of the action [see eq. (1.96)] is therefore a scalar
under parity
25
(independently of the choice of
P
). This is easily deduced
from eq. (1.139) by noting that
= (
P
)
(where
P
/x
P
is
the parity-transformed derivative).
For the Lorentz tensor bilinear covariants we have
P
1
(x)
P
2
(x) =
P1
P2
(
P
)
(
P
)
1
(x
P
)
2
(x
P
) ,
(1.143)
P
1
(x)
P
2
(x) =
P1
P2
(
P
)
(
P
)
1
(x
P
)
2
(x
P
) .
(1.144)
In deriving these results, we have made use of:
0

= (
P
)
(
P
)

, (1.145)

0
= (
P
)
(
P
)
. (1.146)
In theories with multiple two-component fermion elds with no global
symmetries, the parity properties of the kth fermion is given by
eqs. (1.132)(1.135), with (
P
)
k
= 1 (in the convention where all mass
parameters are real). When interactions are included, if there is some
choice of the (
P
)
k
such that the action is invariant under parity, then
the theory is parity conserving. If there is an internal global symmetry
(
i
U
ij
j
) under which the Lagrangian is invariant, then there are
relations among the parity transformations of the fermion elds. Here,
we examine the simplest case of two mass-degenerate fermion elds.
As shown in section 1.7, a theory with a pair of mass-degenerate two-
component fermion elds possesses a conserved charge Q [see eq. (1.108)].
This implies that one can choose linear combinations of the fermions that
are eigenstates of Q. In particular, and [with the properties noted in
23
Numerically, (P )
= g, and many books therefore employ g (no sum over )

in the parity transformation of the bilinear covariants. We prefer the more accurate
notation above.
24
Roughly speaking, the factor of (P )
converts
into
, while the spinor index

structure is preserved by the multiplication of the appropriate factors of the identity
matrix (either
0
or
0
).
25
Under parity, the Lagrangian satises PL(x)P
1
= L(xP ). Integrating this result to
get the action S
R
Ld
4
x yields PSP
1
= S as desired.
eq. (1.129)] are states of denite (and opposite) charge. The elds and
transform under parity as:
T
(x)T
1

P
(x) =
P
i
0
(x) , (1.147)
T
(x)T
1

P
(x) =
P
i
0
(x) . (1.148)
That is, the phases that appear in the parity transformation laws of two-
component fermion elds of opposite charge are complex conjugates of
each other. This ensures that the mass term of the free Lagrangian,
m(+ + with mreal, is parity invariant. Notice that if we return to the
case of a single uncharged two-component fermion by taking = in the
transformation laws above [eqs. (1.147) and (1.148)], then consistency of
the and transformation laws implies that
P
=
P
as previously noted.
In the case of the charged fermion pair, the phase
P
is not constrained.
However, we can always choose to work in another basis. For example, if
we work in terms of the elds
1
and
2
[eqs. (1.103) and (1.104)], then
it follows that:
T
1
(x)T
1
= (Re
P
)i
0
1
(x
P
) (Im
P
)i
0
2
(x
P
) , (1.149)
T
2
(x)T
1
= (Im
P
)i
0
1
(x
P
) + (Re
P
)i
0
2
(x
P
) . (1.150)
If we choose
P
=
P
, then
1
and
2
have simple transformation
properties under parity:
T
i
(x)T
1
=
P
i
0
i
(x
P
) , (i = 1, 2) . (1.151)
That is, the elds
1
and
2
obey the standard parity transformation
laws of a single two-component fermion eld [eqs. (1.132)(1.135)], with
the same phase factor
P
in each case. Had one chosen
P
,=
P
in
eqs. (1.147) and (1.148), then the parity transformation laws of
1
and
2
would have been more complicated [eqs. (1.149) and (1.150)]. But, in
this case, one is free to make a further SO(2) rotation to transform
1
and
2
into new elds that do exhibit the simpler parity transformation
laws [eq. (1.151)].
We noted previously that P
2
= 1 for a neutral self-conjugate spin-1/2
fermion. However, for a charged fermion, eqs. (1.147) and (1.148) imply
that:
T
2
(x)(T
2
)
1
=
2
P
(x) , (1.152)
T
2

(x)(T
2
)
1
=
2
P

(x) . (1.153)
That is, P
2
=
2
P
when acting on and (these are elds with the
same value of the conserved charge). Note that unless
P
= 1 or i,
P
2
is not an element of the Lorentz group! Some authors insist that
P
2
= 1 when applied to charged fermions, although there is nothing
inconsistent with a more general phase factor. In fact, for any choice of
P
, one can always redene P (by multiplication by an appropriate gauge
transformation) to have any desired behavior. Nevertheless, there is some
motivation for choosing
P
= 1 so that P
2
= 1 as in the case of the
neutral self-conjugate spin-1/2 fermion.
One can now work out the parity properties of bilinear covariants
constructed out of the elds of a charged fermion pair. The results are
listed in Table B.3.
1.9 Time-reversal transformation of two-component spinors
Under the time-reversal (or more precisely, the time-inversion) transfor-
mation, x
T
= (
T
)
= (t ; x), where
(
T
)
1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1
. (1.154)
Since the time-reversal transformation is anti-unitary it requires a careful
consideration. Time-reversal is treated dierently in the rst-quantized
and the second-quantized theories.
26
Here, we shall only consider the
second-quantized version of time-reversal, which is governed by an anti-
unitary operator T that acts on the Hilbert space. Consider a theory
of a single two-component fermion eld (x). Under time-reversal, the
two-component spinor transforms as
T
(x)T
1

t

(x) =
T
(x
T
) , (1.155)
T
(x)T
1

t
(x) =
T
(x
T
) , (1.156)
T

(x)T
1

t
(x) =
(x
T
) , (1.157)
T

(x)T
1

t
(x) =
(x
T
) , (1.158)
where [
T
[ = 1. The notation requires some explanation. Due to the
anti-unitary nature of the operator T , the two-component spinor quantity
T
(x)T
1
transforms as the complex conjugate of a (
1
2
, 0) spinor with
a lower spinor index. Thus, we shall denote this quantity by
t

(x)
(where the superscript t is used in place of T in order to avoid confusion
with the symbol for the matrix transpose). Eq. (1.156) is obtained from
26
For parity and charge conjugation the classical and eld theoretic descriptions are
identical. See ref. [6] for a more detailed discussion.
eq. (1.155) by rst raising the indices [eq. (1.43)] and then using eq. (1.66).
Eqs. (1.157) and (1.158) are obtained from eqs. (1.155) and (1.156) by
hermitian conjugation, respectively. Note that when time-reversal is
applied twice in succession, we use T
T
T
1
=
T
and [
T
[ = 1 to obtain:
(
t
)
t
= , (
t
)
t
= . (1.159)
This is consistent with the more general result that:
T
2
= (1)
2s
(1.160)
for any (charged or neutral) spin-s quantum eld.
If one redenes e
i
[and e
i
], then
T
e
2i
T
.
As before, we may restrict the choice of the phase
T
by choosing the
convention in which the mass parameter m in the Lagrangian of eq. (1.99)
is taken to be real and non-negative. Then, as shown below,
T
= 1 as
a consequence of the time-reversal invariance of +.
To show that the Lagrangian of eq. (1.99) is invariant under time
reversal, we investigate the transformation properties of the scalar bilinear
covariants. For
1
2
and
1
2
we obtain
T
1
(x)
2
(x)T
1
= T
1
T
1
T
2
T
1
=
t
1
(x)
t
2
(x)
=
T1
T2
1
(x
T
)
2
(x
T
) , (1.161)
and
T
1
(x)
2
(x)T
1
= (
T1
T2
)
1
(x
T
)
2
(x
T
) . (1.162)
Thus, + is even under time-reversal if
T
= 1. In this case, the
hermitian linear combination i() is odd under time-reversal (after
noting that T iT
1
= i).
The Lorentz vector bilinear covariants transform as
T
1
2
T
1
= (T
1
T
1
)(T

T
1
)(T
2
T
1
) , (1.163)
and an analogous equation involving
. Since T

(x)T
1

t
(x) and
T
(x)T
1

t

(x), in order to have the spinor indices properly matched
in eq. (1.163) [and in the analogous equation involving
], one must make

use of the following results:
T
T
1
=

, T

T
1
=
, (1.164)
which follow from T T
1
=
=
T
. We then obtain:
T
1
(x)
2
(x)T
1
=
T1
T2

1
(x
T
)
0

(
)
0
2
(x
T
)
=
T1
T2
2
(x
T
)
0
)
0

1
(x
T
)
=
T1
T2
(
T
)
2
(x
T
)
1
(x
T
)
=
T1
T2
(
T
)
1
(x
T
)
2
(x
T
) . (1.165)
The minus sign in the second step arises due to anti-commuting spinors.
In the third step, we have made use of:

0
= (
T
)
, (1.166)
= (
T
)
, (1.167)
which follow immediately from eqs. (1.141) and (1.142) after noting that
(
T
)
= (
P
)
. Finally, applying eq. (1.72) yields the nal result

given in eq. (1.165). Similarly,
T
1
(x)
2
(x)T
1
=
T1
T2
(
T
)
1
(x
T
)
2
(x
T
) . (1.168)
The kinetic energy term of the action [see eq. (1.96)] is therefore a
scalar under time reversal (independently of the choice of
P
). This is
easily deduced from eq. (1.165) by noting that
= (
T
)
(where
= /x
T
is the time reversal-transformed derivative) and remembering
to complex conjugate the factor of i.
For the Lorentz tensor bilinear covariants, we examine:
T
1
(
2
T
1
= (T
1
T
1
)(T (
T
1
)(T
2
T
1
) , (1.169)
and an analogous equation involving
. We then employ results similar

to those of eq. (1.164):
T (
T
1
= (

, T (
T
1
= (
. (1.170)
The rest of the derivation is straightforward, and the end result is:
T
1
(x)
2
(x)T
1
=
T1
T2
(
T
)
(
T
)
1
(x
T
)
2
(x
T
) , (1.171)
T
1
(x)
2
(x)T
1
=
T1
T2
(
T
)
(
T
)
1
(x
T
)
2
(x
T
) , (1.172)
where we have used eqs. (1.87) and the following results:
0

= (
T
)
(
T
)

, (1.173)

0
= (
T
)
(
T
)
. (1.174)
which follow immediately from eqs. (1.145) and (1.146).
symmetries, the time reversal properties of each fermion is given by
eqs. (1.155)(1.158), with
T
= 1 (in the convention where all mass
parameters are real). When interactions are included, if there is some
choice of the
T
such that the action is invariant under time reversal,
then the theory is time reversal invariant. If there is an internal global
symmetry (
i
U
ij
j
) under which the Lagrangian is invariant, then
there are relations among the time reversal transformations of the fermion
elds. Here, we again examine the simplest case of two mass-degenerate
fermion elds,
1
and
2
. In terms of the linear combinations and
[eqs. (1.103) and (1.104)] corresponding to the elds of denite charge,
we dene the transformations under time reversal as follows. The elds
and transform under time reversal as:
T
(x)T
1

t

(x) =
T
(x
T
) , (1.175)
T
(x)T
1

t

(x) =
(x
T
) , (1.176)
That is, the phases that appear in the time-reversal transformation laws of
two-component fermion elds of opposite charge are complex conjugates
of each other. If we apply the time-reversal operator twice,
T
2
(x)(T
2
)
1
=
(x) , (1.177)
T
2
(x)(T
2
)
1
=
(x) , (1.178)
and so T
2
= 1 for both neutral and charged spin-1/2 fermions [as noted
in eq. (1.160)]. In contrast to the case of P
2
considered in section 1.8, the
phase
T
drops out in the computation of T
2
due to the anti-linearity of
the time-reversal operator.
Notice that if we return to the case of a single uncharged two-component
fermion by taking = in the transformation laws above [eqs. (1.175)
and (1.176)], then consistency of the and transformation laws implies
that
T
=
T
as previously noted. In the case of the charged fermion
pair, the phase
T
is not constrained. However, we can always choose to
work in another basis. For example, if we work in terms of the elds
1
and
2
[eqs. (1.103) and (1.104)], then it follows that:
T
1
(x)T
1
= (Re
T
)
0
1
(x
T
) (Im
T
)
0

2
(x
T
) , (1.179)
T
2
(x)T
1
= (Im
P
)
0
1
(x
T
) (Re
P
)
0

2
(x
T
) . (1.180)
If we choose
T
=
T
, then
1
and
2
properties under time reversal:
T
1
(x)T
1
=
T
1
(x
T
) , (1.181)
T
2
(x)T
1
=
T
2
(x
T
) . (1.182)
That is, the elds
1
and
2
obey the standard time reversal transforma-
tion laws of a single two-component fermion eld [eqs. (1.155)(1.158)],
but with opposite sign phase factors
T
in each case. Had one chosen
T
,=
T
in eqs. (1.175) and (1.176), then the time reversal transformation
laws of
1
and
2
would have been more complicated [eqs. (1.179) and
(1.180)]. But as before, one is free to make a further SO(2) rotation
to transform
1
and
2
into new elds that do exhibit the simpler time
reversal transformation laws [eq. (1.151)].
One can now work out the time reversal properties of bilinear covariants
constructed out of the elds of a charged fermion pair. The results are
listed in Table B.3.
1.10 Charge conjugation of two-component spinors
Charge conjugation was introduced in section 1.7. The charge conjugation
operator is a discrete operator that interchanges particles and their C-
conjugates. Here, conjugation refers to some conserved charge operator
Q, where (Q(
1
= Q. The conjugate of the conjugate eld is the
original eld, so that C
2
= 1. For a single two-component fermion eld
(
(x)(
1

C
(x) =
C
(x) , (1.183)
(

(x)(
1

C

(x) =

(x) , (1.184)
where [
C
[ = 1. In this case, no conserved charge exists, so that charge
conjugation is trivial. In particular, as noted in section 1.7, the invariance
of the mass term + implies that
C
=
C
, and hence
C
= 1.
27
The behavior of the bilinear covariants under C is simple:
C
1
O
C
2
=
C1
C2
1
O
2
, (1.185)
for any O = I
2
,
and
(where bars should appear over the

appropriate
i
depending on the choice of O).
symmetries, the charge conjugation properties of each fermion is given
by eqs. (1.183) and (1.184), with
C
= 1. When interactions are
included, if there is some choice of the
C
such that the action is invariant
under charge conjugation, then the theory is charge conjugation invariant.
If there is an internal global symmetry (
i
U
ij
j
) under which
the Lagrangian is invariant, then there are relations among the charge
conjugation transformations of the fermion elds. Here, we consider
27
In a theory with just one fermion eld, no physical quantity can depend on the sign of
C since the fermion eld must appear quadratically in the Lagrangian. So, without
loss of generality, we can take C = 1.
the simplest case of two mass-degenerate fermion elds,
1
and
2
, and
identify the conserved charge as Q =
_
J
0
d
3
x, where the conserved
current J
is given in eq. (1.108). In terms of the linear combinations

and [eqs. (1.103) and (1.104)] corresponding to the elds of denite
charge, we dene the charge conjugation transformations as follows.
(
(x)(
1

C
(x) =
C
(x) , (

(x)(
1

C

(x) =
C

(x) , (1.186)
(
(x)(
1

C
(x) =
(x) , (

(x)(
1

C

(x) =
C

(x) . (1.187)
Notice that if we return to the case of a single uncharged two-component
fermion by taking = in the transformation laws above [eqs. (1.186)
and (1.187)], then consistency of the and transformation laws implies
that
C
=
C
as previously noted. In the case of the charged fermion pair,
the phase
C
is not constrained (i.e., C
2
= 1 independent of the value of
the phase
C
). However, we can always choose to work in another basis.
For example, if we work in terms of the elds
1
and
2
[eqs. (1.103) and
(1.104)], then it follows that:
(
1
(x)(
1
= (Re
C
)
1
(x) + (Im
C
)
2
(x) , (1.188)
(
2
(x)(
1
= (Im
C
)
1
(x) (Re
C
)
2
(x) . (1.189)
If we choose
C
=
C
, then
1
and
2
properties under charge conjugation:
(
1
(x)(
1
=
C
1
(x) , (
2
(x)(
1
=
C
2
(x) . (1.190)
That is, the elds
1
and
2
obey the standard charge conjugation
transformation laws of a single two-component fermion eld [eqs. (1.183)
and (1.184)], but with opposite sign phase factors
C
in each case. Had one
chosen
C
,=
C
in eqs. (1.186) and (1.187), then the charge conjugation
transformation laws of
1
and
2
would have been more complicated
[eqs. (1.188) and (1.189)]. But once again, one is free to make a further
SO(2) rotation to transform
1
and
2
into new elds that do exhibit the
simpler charge conjugation transformation laws [eq. (1.190)].
One can now work out the charge conjugation properties of bilinear
covariants constructed out of the elds of a charged fermion pair. The
results are listed in Table B.3.
1.11 CP and CPT conjugation of two-component spinors
Given the results of the previous three sections, it is a simple matter
to work out the eects of CP and CPT transformations on the two-
component spinors and the bilinear covariants. Here, we focus on
the case of two mass-degenerate fermion elds,
1
and
2
, with the
associated conserved charge J
given in eq. (1.108). In terms of the linear

combinations and [eqs. (1.103) and (1.104)] corresponding to the elds
of denite charge, the CP transformations of the two-component spinors
are given by:
(T
(x)((T)
1
=
CP
i
0
(x
P
) , (1.191)
(T
(x)((T)
1
=
CP
i
0
(x
P
) , (1.192)
where
CP

C
P
. Again, observe that if we return to the case of a single
uncharged two-component fermion by taking = in the transformation
laws above [eqs. (1.191) and (1.192)], then consistency of the and
transformation laws implies that
CP
=
CP
as expected. In the case of
the charged fermion pair, the phase
CP
is not constrained. However, if
we work in terms of the elds
1
and
2
[eqs. (1.103) and (1.104)], and
choose
CP
= 1 (as required for uncharged two-component elds) then
as before it follows that the
i
have simple transformation properties:
(T
1
(x)((T)
1
=
CP
i
0
1
(x
P
) , (1.193)
(T
2
(x)((T)
1
=
CP
i
0
2
(x
P
) . (1.194)
That is, the elds
1
and
2
obey the standard CP transformation laws of
a single two-component fermion eld [eqs. (1.191) and (1.192)], but with
opposite sign phase factors
CP
in each case. This is the origin of the oft-
quoted statement in the literature that a theory of two mass-degenerate
Majorana fermions of opposite CP quantum numbers can be combined
into a single Dirac fermion.
In general, when acting on charged fermion elds (T ,= T(. A simple
computation yields: PC = (
P
)
2
CP when acting on the and elds.
28
However, for self-conjugate spin-1/2 fermion elds, PC = CP since
P
=
1. This provides another motivation for the choice of
CP
=
CP
above
in the case of charged fermion elds. One can also examine the behavior
of the fermion elds under the other possible products of C, P and T.
Again, the order of the operators can be signicant. For example, TC =
(
T
)
2
CT and TP = (
P
)
2
PT when acting on the and elds,
due to the anti-linearity of the time-reversal operator. For self-conjugate
spin-1/2 fermion elds, TC = CT and TP = PT since
C
,
P
and
T
are real (and equal to either 1). One is also free to employ these phase
conventions in the case of charged fermion elds.
28
When acting on the

and elds (whose conserved charge is opposite to that of
and ), one must employ the complex conjugate of the phases above.
Finally, we consider the eect of a CPT transformation.
29
The
corresponding transformation laws under CPT conjugation take on even
simpler forms:
(TT
(x)((TT )
1
= i
CPT

(x) , (1.195)
(TT
(x)((TT )
1
= i
CPT

(x) , (1.196)
where
CPT

C
T
and x (t ; x). Again, we observe that
when = , consistency of the CPT transformation laws implies that for
self-conjugate neutral fermion elds,
CPT
= 1. Note that when (TT
is applied twice,
30
((TT )
2
(x)[((TT )
2
]
1
= (
CPT
)
2
(x) , (1.197)
((TT )
2
(x)[((TT )
2
]
1
= (
CPT
)
2
(x) . (1.198)
Thus, for self-conjugate neutral spin-1/2 fermion elds, it follows that
(CPT)
2
= 1. As before, if we also choose
CPT
= 1 in the case
of charged elds and , then one obtains simple CPT-transformation
laws when the charged fermion elds are rewritten in terms of
1
and
2
[eqs. (1.103) and (1.104)]:
(TT
i
(x)((TT )
1
= i
CPT
i
(x) , i = 1, 2 . (1.199)
However, the choice of the phase
CPT
is more than just a convenience.
In general, in order to guarantee that (TT L(x)((TT )
1
= L(x), we
demand that
CPT
is independent of the particle species. For integral
spin s particles, one must choose
CPT
= (1)
s
. For half-integral spin
particles, all fermions must possess the same value of
CPT
, which must
therefore be equal to one of the two allowed values
CPT
= 1 for neutral
self-conjugate fermions. However, in the latter case, the overall sign choice
for
CPT
is not physically meaningful, since an even number of fermion
elds always appears in each term of L. Nevertheless, one is free to choose
a phase convention such that for any half-integer spin particle,
CPT

C
T
= +1 . (1.200)
Consider the bilinear covariants of the form
i
O
j
where O =
I
2
,
or
(and bars should appear over the appropriate
i
depending on the choice of O). These can be used to construct
29
In principle, there are six possible orderings of C, P and T; the eect of the
corresponding operators dier by at most a phase which is easily determined from
the results previously obtained above.
30
Using the results quoted above, when applied to the spin-1/2 fermions elds and
[see footnote 28], (CPT)
2
= (
CPT
)
2
C
2
(PT)
2
= (
CPT
)
2
, where we have noted
that C
2
= 1 and (PT)
2
= 1 independently of the phase choices.
hermitian bilinear quantities that are either scalar, vectors or second-rank
antisymmetric tensors with respect to proper Lorentz transformations.
Denote these by B, B
and B
, respectively. Then, starting from

eq. (1.199) and using eqs. (1.164) and (1.170) and the anti-commutativity
of the fermion elds, it follows that:
(TT B(x) ((TT )
1
= B(x) , (1.201)
(TT B
(x) ((TT )
1
= B
(x) , (1.202)
(TT B
(x) ((TT )
1
= B
(x) , (1.203)
provided that the phase
CPT
is chosen to be the same for every two-
component fermion eld (as noted above). Thus, we see that all Lorentz
tensors of the same (integer) rank behave the same way under a CPT
transformation. This ensures that the Lagrangian for any local quantum
eld theory (which is a hermitian Lorentz scalar) is CPT-invariant.
Problems
1. Consider a proper orthochronous Lorentz transformation that is a pure
boost. We follow the notation of eqs. (1.2) and (1.3).
(a) Prove the following two relations:
= exp[i
k] = I
4
i v
ksinh + ( v
k)
2
[1 cosh ] , (1.204)
exp
_
1
2

_
= cosh(/2) + v sinh(/2) , (1.205)
where

= v tanh
1
and [
[.
(b) Prove the following two identities:
p
E +m p
_
2(E +m)
, (1.206)
_
p
E +m+ p
_
2(E +m)
, (1.207)
where p
(E ; p). The matrix square root of p [or p] as dened

here is the unique hermitian matrix with non-negative eigenvalues whose
square is equal to p [or p].
(c) Using the results of parts (a) and (b), and noting that cosh = E/m,
show that for a pure boost (where
ij
= 0 and
i
=
i0
=
0i
),
the Lorentz transformations for the (
1
2
, 0) and (0,
1
2
) representations,
respectively, are given by:
exp
_
i
2
_
=
exp
_
1
2

_
=
_
p
m
, for (
1
2
, 0) ,
exp
_
1
2

_
=
_
p
m
, for (0,
1
2
) .
(1.208)
2. Starting from the free-eld Dirac equations in two-component notation
given in eq. (1.106), show that these equations are form invariant under
a proper orthochronous Lorentz transformation, x
. That is,
by writing
(x
) = M
(x) and

(x
) = (M
1
)

(x), and noting

that
, show that
and

satisfy the Dirac equation in the
transformed reference frame if eq. (1.75) is satised.
3. (a) Show that there is no solution for M in eq. (1.75) for =
P
,
where
P
is given by eq. (1.131).
(b) Starting from the free-eld Dirac equations in two-component
notation given in eqs. (1.106) and (1.107), show that these equations
are form invariant under a parity transformation, x
= (
P
)
.
That is, by writing
(x
) = iP
(x) and

= iP

(x), where
P

(P
, show that
and
satisfy the Dirac equation in the

space-reected reference frame if:
P
1
P = (
P
)
. (1.209)
Determine P (up to an overall phase factor).
4. (a) Show that there is no solution for M in eq. (1.75) for =
T
,
where
T
is given by eq. (1.154).
(b) Starting from the free-eld Dirac equations in two-component
notation given in eqs. (1.106) and (1.107), show that these equations
are form invariant under a time-reversal transformation, x
= (
T
)
.
That is, by writing
(x
) = T
(x) and

= T

(x), where
T

(T
, show that
and
satisfy the Dirac equation in

the time-reversed reference frame if:
T
1
T = (
T
)
. (1.210)
Determine T (up to an overall phase factor).
5. Consider the following Lagrangian of two free two-component fermion
elds,
1
and
2
:
L = i
1
1
+i
2
2
m(
1
2
+
1
2
)
1
2
M(
2
2
+
2
2
) .
(1.211)
(a) Determine the mass-eigenstates and nd the corresponding masses
of the two fermions.
(b) In the seesaw mechanism for light neutrino masses, one assumes
that m M. From the results of part (a), nd the mass of the lightest
fermion (keeping terms of order m
2
/M). Determine the numerical value
of M, assuming m = m
= 1.777 GeV and m
5 10
2
eV.
6. Consider the following Lagrangian of n free two-component fermion
elds,
i
:
L = iK
i
j

1
2
M
ij
1
2
M
ij
j
, (1.212)
(a) What are the constraints on K
i
j
, assuming that the Lagrangian is
hermitian?
(b) Dene a new set of elds
i
in terms of the

i
, such that the kinetic
energy term is canonical (i.e., K
i
j
=
i
j
). Re-express the Lagrangian in
terms of the new elds
i
and show that the resulting expression takes
the form given in eq. (1.110).
(c) How does the presence of K
i
j
aect the mass diagonalization
procedure of Section 1.6?
References
[1] S. Weinberg, The Quantum Theory of Fields, Volume I: Foundations
(Cambridge University Press, Cambridge, UK, 1995).
[2] References to the Lorentz algebra and the interpretations of boosts.
[3] R.A. Horn and C.R. Johnson, Matrix Analysis (Cambridge University Press,
Cambridge, England, 1990);
[4] R.N. Mohapatra and P.B. Pal, Massive Neutrinos in Physics and Astro-
physics, 2nd edition (World Scientic, Singapore, 1998).
[5] The singular value decomposition of a complex matrix is discussed in many
linear algebra textbooks. See, e.g., ref. [3].
[6] D. Bailin, Weak Interactions, second edition (Adam Hilger Ltd., Bristol,
England, 1982).
48
2
Feynman Rules for Fermions
In this chapter, we devise a set of Feynman rules to describe matrix
elements of processes involving fermions. The rules are developed for two-
component fermions and compared to the usual rules for four-component
fermions that can be found in any textbook on quantum eld theory.
2.1 Fermion creation and annihilation operators
We begin by describing the properties of a free neutral massive anti-
commuting spin-1/2 eld, denoted
(x), which transforms as (

1
2
, 0)
under the Lorentz group. The Lagrangian density is given by eq. (1.99).
The corresponding eld equations (i.e., the two-component version of the
Dirac equation) are given by eqs. (1.100) and (1.101). By virtue of the
eld equations, the eld
(x) can be expanded in a Fourier series:
(x) =
_
d
3
p
(2)
3/2
(2E
p
)
1/2
_
x
( p, )a( p, )e
ip x
+y
( p, )a
( p, )e
ip x
_
, (2.1)
where E
p
([ p[
2
+ m
2
)
1/2
, and the creation and annihilation operators
a
and a satisfy anticommutation relations:

a( p, ), a
( p
) =
3
( p p
, (2.2)
and all other anticommutators vanish. Applying eq. (1.100) to eq. (2.1),
we nd that the x
and y
satisfy momentum space Dirac equations.

We shall write these equations out explicitly and study the propereties of
these two-component spinor wave functions in Section 2.2.
49
50 2 Feynman Rules for Fermions
Similarly,

(x) (
_
d
3
p
(2)
3/2
(2E
p
)
1/2
_
x

( p, )a
( p, )e
ip x
+ y

( p, )a( p, )e
ip x
_
. (2.3)
We employ covariant normalization of the one particle states, i.e., we act
with one creation operator on the vacuum with the following convention
[ p, ) (2)
3/2
(2E
p
)
1/2
a
( p, ) [0) , (2.4)
so that

p, [ p
_
= (2)
3
(2E
p
)
3
( p p
. Therefore,
0[
(x) [ p, ) = x
( p, )e
ip x
, 0[

(x) [ p, ) = y

( p, )e
ip x
,(2.5)
p, [
(x) [0) = y
( p, )e
ip x
, p, [

(x) [0) = x

( p, )e
ip x
. (2.6)
It should be emphasized that
(x) is an anticommuting spinor eld,

whereas x
and y
are commuting two-component spinor wave functions.

The anticommuting properties of the elds are carried by the creation
and annihilation operators a
( p, ) and a( p, ).
2.2 Properties of the two-component spinor wave functions
In this section, we explore in some detail the properties of the two-
component spinor wave functions x
( p, ) and y
( p, ). Applying
eq. (1.100) to eq. (2.1), x
and y
satisfy momentum space Dirac

equations:
(p)

x
= m y

, (p)
= mx
, (2.7)
(p)
= my
, (p)

y
= m x

, (2.8)
x
(p)
= m y
, y

(p)

= mx
, (2.9)
x

(p)

= my
, y
(p)
= m x
, (2.10)
where x
( p, ), y
( p, ), etc. These equations imply that both

x
and y
must satisfy the mass-shell condition, p

2
= m
2
(or equivalently,
p
0
= E
p
).
The quantum number labels the spin or helicity of the spin-1/2
fermion. In order to construct the spin-1/2 helicity states, consider a
basis of two-component spinors
that are eigenstates of

1
2
p, i.e.,
1
2
p
, =
1
2
. (2.11)
If p is a unit vector with polar angle and azimuthal angle with respect
to a xed z-axis, then the two-component spinors are
1/2
( p) =
_
cos

2
e
i
sin

2
_
,
1/2
( p) =
_
e
i
sin

2
cos

2
_
. (2.12)
The two-component helicity spinors satisfy:
( p) = 2
( p) , (2.13)
( p) = 2e
2i
( p) , (2.14)
where is the 22 matrix whose matrix elements are
, and p is a unit
vector with polar angle and azimuthal angle + with respect to the
xed z-axis. Alternatively, we could construct spin states where the spin
is quantized in the particles rest frame along a xed axis, pointing along
the unit three-vector s. If we denote
s
to be an eigenstate of
1
2
s, then
we may use the above formulas, where the angles and are now the
polar and azimuthal angles of s. In relativistic scattering processes, it is
usually more convenient to employ helicity states. Note that for massless
particles, there is no rest frame and one must use helicity states.
For massive fermions, it is possible to dene the spin four-vector s
,
which satises sp = 0 and ss = 1. In the rest frame of the particle,
s
() = 2(0; s), and = 1/2 corresponds to spin-up and spin-down

with respect to the spin quantization axis that points in the direction of
the unit three-vector s. For helicity states, the spin four-vector is dened
as
1
s
() =
2
m
([ p[ ; E p) , (2.15)
where 2 = 1 is twice the spin-1/2 particle helicity. Note that in the rest
frame, s
= 2(0 ; p), whereas in the high energy limit (where E m),

s
= 2p
/m + O(m/E). For a massless fermion, the spin four-vector

does not exist (there is no rest frame). Nevertheless, one can obtain
consistent results by working with massive helicity states and taking the
m 0 limit at the end of the computation. In this case, we can simply
use s
= 2p
/m + O(m/E); in practical computations the nal result

will be well-dened in the zero mass limit.
The two-component spinors x and y can now be given explicitly in
terms of the
dened in eq. (2.12):

x
( p, ) =
, x
( p, ) = 2
_
p , (2.16)
y
( p, ) = 2
, y
( p, ) =
_
p , (2.17)
1
The overall sign of s
depends on the choice of . Thus, we write s
() to emphasize
this dependence. Later on in the text, we will often write s
for s
() when there is
no specic need to highlight the dependence on .
or equivalently
x

( p, ) = 2
_
p
, x

( p, ) =
p , (2.18)
y

( p, ) =
_
p
, y

( p, ) = 2
p . (2.19)
In the above equations, p
0
= E
p
is satised, and the (hermitian) matrices
p and

p are given in eqs. (1.206) and (1.207).
The phase choices employed in eqs. (2.16)(2.19) are conventional
and consistent with the phase choices for four-component spinor wave
functions [see Section 3.3]. We again emphasize that in eqs. (2.16)(2.19),
one may either choose
to be an eigenstate of p (which yields the

helicity spinor wave functions), or choose
to be an eigenstate of s,
where the spin is measured in the rest frame along the quantization axis s.
The following equations can now be veried by explicit computation:
(s)

x
= y

, (s)
= x
, (2.20)
(s)
= y
, (s)

y
= x

, (2.21)
x
(s)
= y
, y

(s)

= x
, (2.22)
x

(s)

= y
, y
(s)
= x
, (2.23)
where s
(), x
( p, ), y
( p, ), etc. Eqs. (2.20)(2.23)

are the analogues of the momentum space Dirac equations [eqs. (2.7)
(2.10)]. From these equations, one can check that both x
and y
must
satisfy s s = 1 and p s = 0.
It is useful to combine the results of eqs. (2.7)(2.10) and eqs. (2.20)
(2.23) as follows:
(p
ms
= 0 , (p
ms
= 0 , (2.24)
(p
+ms
= 0 , (p
+ms
= 0 , (2.25)
x
(p
ms
) = 0 , x

(p
ms
) = 0 , (2.26)
y
(p
+ms
) = 0 , y

(p
+ms
) = 0 . (2.27)
The above results are applicable only for massive fermions (where the spin
four-vector s
exists). However, in the case of a massless fermion we can

obtain valid results for helicity spinors simply by setting s
= 2p
/m
and then taking the m 0 limit. In particular, replacing ms
= 2p
in
eqs. (2.24)(2.27), and using the results of eqs. (2.7)(2.10) [before taking
the m 0 limit] yields
(1 + 2)x( p, ) = 0 , (1 2)y( p, ) = 0 , (2.28)
where is the helicity. Finally, we take the m 0 limit. The meaning of
the end result is clear; for massless fermions, only one helicity component
of x and y is non-zero. Applying this result to neutrinos, we nd that
massless neutrinos are left-handed ( = 1/2), while anti-neutrinos are
right-handed ( = +1/2).
Having dened explicit forms for the two-component spinor wave
functions, we can now write down the spin projection operators. These
operators can be easily obtained by making use of the following result:
1
2
_
1 +
1
m
p
_
s()
p
_
. (2.29)
We then have for example, with both spinor indices assumed to be in the
lowered position,
x( p, ) x( p, ) =
p
=
1
2
p
_
1 +
1
m
ps
p
_
p
=
1
2
_
p +
1
m
psp
_
=
1
2
[p ms] , (2.30)
where s s(). In deriving eq. (2.30), we made use of eq. (2.29) and the
completeness of the
. The product of three dot-products was simplied

by noting that ps = 0 implies s p = p s. The other spin
projection formulas can be similarly derived. For massive fermions, they
are:
x
( p, ) x
( p, ) =
1
2
(p
ms
, (2.31)
y

( p, )y
( p, ) =
1
2
(p
+ms
, (2.32)
x
( p, )y
( p, ) =
1
2
_
m
[s p]
_
, (2.33)
y

( p, ) x
( p, ) =
1
2
_
m

+ [s p]

_
, (2.34)
or equivalently,
x

( p, )x
( p, ) =
1
2
(p
ms
, (2.35)
y
( p, ) y
( p, ) =
1
2
(p
+ms
, (2.36)
y
( p, )x
( p, ) =
1
2
_
m
+ [s p]
_
, (2.37)
x

( p, ) y
( p, ) =
1
2
_
m

[s p]

_
. (2.38)
For the case of massless spin-1/2 fermions, we must use helicity spinor
wave functions. Setting s = 2p/m in the above formulas and letting
m 0 yields
x
( p, ) x
( p, ) = (
1
2
)p
, x
( p, )y
( p, ) = 0 , (2.39)
y

( p, )y
( p, ) = (
1
2
+)p

, y

( p, ) x
( p, ) = 0 , (2.40)
x

( p, )x
( p, ) = (
1
2
)p

, y
( p, )x
( p, ) = 0 , (2.41)
y
( p, ) y
( p, ) = (
1
2
+)p
, x

( p, ) y
( p, ) = 0 . (2.42)
Having listed the projection operators for denite spin projection or
helicity, we may now sum over spins to derive the spin-sum identities.
These arise when computing squared matrix elements for unpolarized
scattering and decay. There are only four basic identities, but for
convenience we list each of them with the two index height permutations
that can occur in squared amplitudes by following the rules given in this
paper. The results can be derived by inspection of the spin projection
operators, since summing over = 1/2 simply removes all terms linear
in the spin four-vector (which is proportional to ).
( p, ) x
( p, ) = p
x

( p, )x
( p, ) = p

, (2.43)
y

( p, )y
( p, ) = p

,
( p, ) y
( p, ) = p
, (2.44)
( p, )y
( p, ) = m
( p, )x
( p, ) = m
, (2.45)
y

( p, ) x
( p, ) = m

x

( p, ) y
( p, ) = m

. (2.46)
These results hold for both massive and massless spin-1/2 fermions.
2.3 Charged two-component fermion elds
Consider a collection of free anti-commuting two-component spin-1/2
elds,
i
(x), which transform as (
1
2
, 0) elds under the Lorentz group.
As shown in Section 1.6, one can diagonalize the fermion mass matrix.
As a result, the free-eld Lagrangian in terms of mass-eigenstate elds is
given by eq. (1.115). Each
i
can now be expanded in a Fourier series,
2.3 Charged two-component fermion elds 55
as in Section 2.2:
i
(x) =
_
d
3
p
(2)
3/2
(2E
ip
)
1/2
_
x
( p, )a
i
( p, )e
ip x
+y
( p, )a
i
( p, )e
ip x
_
, (2.47)
where E
ip
([ p[
2
+m
2
i
)
1/2
, and the creation and annihilation operators,
a
i
and a
i
satisfy anticommutation relations:
a
i
( p, ), a
j
( p
) =
3
( p p

ij
, (2.48)
and all other anticommutators vanish. We employ covariant normaliza-
tion of the one particle states (separately for each avor i) as in eq. (2.4).
In the case where some of the elds are massive Dirac fermions carrying
a conserved charge, it is more convenient to work in terms of mass-
eigenstate elds of denite charge. If
is a charged massive eld, then

there must be an associated independent two-component spinor eld
of equal mass with the opposite charge. The corresponding Lagrangian is

given by eq. (1.105). Together, and constitute a single Dirac spin-1/2
fermion. We can then write:
(x) =
_
d
3
p
(2)
3/2
(2E
p
)
1/2
_
x
( p, )a( p, )e
ip x
+y
( p, )b
( p, )e
ip x
_
, (2.49)
(x) =
_
d
3
p
(2)
3/2
(2E
p
)
1/2
_
x
( p, )b( p, )e
ip x
+y
( p, )a
( p, )e
ip x
_
, (2.50)
where E
p
([ p[
2
+ m
2
)
1/2
, and the creation and annihilation operators,
a
, b
, a and b satisfy anticommutation relations:

a( p, ), a
( p
) = b( p, ), b
( p
) =
3
( p p
, (2.51)
and all other anticommutators vanish. We now must distinguish between
two types of one particle states, which we can call fermion (F) and anti-
fermion (F):
[ p, ; F) (2)
3/2
(2E
p
)
1/2
b
( p, ) [0) , (2.52)
p, ; F
_
(2)
3/2
(2E
p
)
1/2
a
( p, ) [0) . (2.53)
Note that both (x) and (x) can create [ p, ; F) from the vacuum, while
(x) and (x) can create

p, ; F
_
. The one-particle wave functions are
given by:
0[
(x) [ p, ; F) = x
( p, )e
ip x
, 0[

(x) [ p, ; F) = y

( p, )e
ip x
,
(2.54)
F; p, [
(x) [0) = y
( p, )e
ip x
, F; p, [

(x) [0) = x

( p, )e
ip x
,
(2.55)
0[
(x)
p, ; F
_
= x
( p, )e
ip x
, 0[

(x)
p, ; F
_
= y

( p, )e
ip x
,
(2.56)
F; p,
(x) [0) = y
( p, )e
ip x
,
F; p,

(x) [0) = x

( p, )e
ip x
,
(2.57)
and the eight other single-particle matrix elements vanish.
Note that the two-component wave functions, x
( p) and y
( p) that
appear in the Fourier expansions of the two-component fermion elds
[eqs. (2.1), (2.47), (2.49) and (2.50)] do not depend on the avor indices
(or whether the eld is neutral or charged). The properties of these
functions have been given in Section 2.2.
2.4 Feynman rules for external two-component fermion lines
Let us consider a general Feynman diagram in which fermions of denite
mass (i.e., the so-called fermion mass-eigenstates) can appear as initial
and nal states. The rules for assigning two-component external state
spinors are then as follows.
For an initial-state left-handed (
1
2
, 0) fermion of momentum p and
helicity : x( p, ).
For an initial-state right-handed (0,
1
2
) fermion of momentum p and
helicity : y( p, ).
For a nal-state left-handed (
1
2
, 0) fermion of momentum p and
helicity : x( p, ).
For a nal-state right-handed (0,
1
2
) fermion of momentum p and
helicity : y( p, ).
Note that the two-component external state fermion wave functions are
distinguished by their Lorentz group transformation properties, rather
than by their particle or antiparticle status as in four-component Feynman
rules. This helps to explain why two-component notation is especially
convenient for either theories with Majorana particles, in which there
is no fundamental distinction between particles and antiparticles, or
chiral theories where the left and right-handed fermions transform under
dierent representations of the gauge group.
These rules are summarized in the following mnemonic diagram:
x
x
y y
L (
1
2
, 0) fermion
R (0,
1
2
) fermion
Initial State Final State
Fig. 2.1. The external wave-function spinors should be assigned as indicated
here, for initial-state and nal-state left-handed (
1
2
, 0) and right-handed (0,
1
2
)
fermions.
In contrast to four-component Feynman rules, the direction of the
arrows do not correspond to the ow of charge or fermion number.
Nevertheless, the above choice is convenientthe arrows of (
1
2
, 0) fermions
always point in the direction of their momenta while the arrows of (0,
1
2
)
fermions always point opposite to their momenta. These rules simply
correspond to the formulas for the one-particle wave functions given in
eqs. (2.5) and (2.6) [with the convention that [ p, ) is an initial-state
fermion and p, [ is a nal-state fermion].
The rules above apply to any mass eigenstate two-component fermion
external wave functions. In particular, the same rules apply for the
two-component fermions governed by the Lagrangians of eq. (1.115)
[Majorana] and eq. (1.121) [Dirac].
2.5 Feynman rules for two-component fermion propagators
Next we examine the fermion propagators for two-component fermions.
These are the Fourier transforms of the free-eld vacuum expectation
values of time-ordered products of two fermion elds. They are easily
obtained by inserting the free-eld expansion of the two-component
fermion eld and evaluating the spin sums using the formulas given in
eqs. (2.43)(2.46). For the case of a single neutral two-component fermion
eld of mass m
0[ T
(x)
(y) [0)
FT
=
i
p
2
m
2
+i
p
, (2.58)
0[ T

(x)
(y) [0)
FT
=
i
p
2
m
2
+i
p

, (2.59)
0[ T
(x)
(y) [0)
FT
=
i
p
2
m
2
+i
m
, (2.60)
0[ T

(x)
(y) [0)
FT
=
i
p
2
m
2
+i
m

, (2.61)
where the subscript FT indicates the Fourier transform from position
to momentum space. These results have an obvious diagrammatic
representation as shown in Fig. 2.2.
(a) (b)
p

p

ip
p
2
m
2
+i
ip

p
2
m
2
+i
(c) (d)

im
p
2
m
2
+i

im
p
2
m
2
+i

Fig. 2.2. Feynman rules for propagator lines of a neutral two-component spin-
1/2 fermion.
Note that the direction of the momentum ow p
here is determined
by the creation operator that appears in the evaluation of the free-eld
propagator. Arrows on fermion lines always run away from dotted indices
at a vertex and toward undotted indices at a vertex.
There are two types of fermion propagators. The rst type preserves
the direction of arrows, so it has one dotted and one undotted index. For
this type of propagator, it is convenient to establish a convention where
p
in the diagram is dened to be the momentum owing in the direction

of the arrow on the fermion propagator. With this convention, the two
rules above for propagators of the rst type can be summarized by one
rule, as shown in Figure 2.3. Here the choice of the or the version of
the rule is uniquely determined by the height of the indices on the vertex
p
ip
p
2
m
2
+i
or
ip

p
2
m
2
+i
Fig. 2.3. This one rule summarizes the results of Fig. 2.2(a) and (b).

im
im

Fig. 2.4. Fermion mass insertions can be treated as a type of interaction vertex,
using the Feynman rules shown here.
to which the propagator is connected. These heights should always be
chosen so that they are contracted as in eq. (1.52). The second type of
propagator shown above does not preserve the direction of arrows, and
corresponds to an odd number of mass insertions. The indices on
and
are staggered as shown to indicate that or are to be contracted

with an expression to the left, while or

are to be contracted with an
expression to the right, in accord with eq. (1.52).
Mass insertions on fermion lines can instead be handled as interaction
vertices, as shown in Figure 2.4. By summing up an innite chain of
such mass insertions between massless fermion propagators, one can easily
reproduce the massive fermion propagators of both types.
It is convenient to treat separately the case of charged massive fermions.
Consider a charged Dirac fermion of mass m, which is described by two
two-component elds and , whose free-eld Lagrangian is given by
eq. (1.104). Using the free eld expansions given by eqs. (2.49) and (2.50),
and the appropriate spin-sums [eqs. (2.43)(2.46)], the two-component
free-eld propagators are easily obtained:
0[ T
(x)
(y) [0)
FT
= 0[ T
(x)
(y) [0)
FT
=
i
p
2
m
2
p
, (2.62)
0[ T

(x)
(y) [0)
FT
= 0[ T

(x)
(y) [0)
FT
=
i
p
2
m
2
p

, (2.63)
0[ T
(x)
(y) [0)
FT
= 0[ T
(x)
(y) [0)
FT
=
i
p
2
m
2
m
, (2.64)
0[ T

(x)
(y) [0)
FT
= 0[ T

(x)
(y) [0)
FT
=
i
p
2
m
2
m

. (2.65)
For all other combinations of fermion bilinears, the corresponding two-
point functions vanish. These results again have a simple diagrammatic
representation, as shown in Figure 2.5.
2
(a)
p

ip
p
2
m
2
+i
or
ip
p
2
m
2
+i
(b)
p
p
ip
p
2
m
2
+i
or
ip
p
2
m
2
+i
(c) (d)

im
p
2
m
2
+i
im
p
2
m
2
+i
Fig. 2.5. Feynman rules for propagator lines of a charged two-component spin-
1/2 fermion.
Note that for Dirac fermions, the propagators with opposing arrows
(proportional to a mass) necessarily change the identity ( or ) of the
two-component fermion, while the single-arrow propagators never do. In
processes involving such a charged fermion, one must of course carefully
distinguish between the and elds.
2
In Fig. 2.5, the diagrams are drawn with the momentum along the arrow direction,
as in Fig. 2.3.
2.6 Feynman rules for two-component fermion interactions
We next examine the possible interaction vertices. Renormalizable
Lorentz invariant interactions involving fermions must consist of bilinears
in the fermion elds, which transform as a Lorentz scalar or vector,
coupled to the appropriate bosonic Lorentz scalar or vector eld to make
an overall Lorentz scalar quantity. Here, we shall rst consider the case
of fermion pairs coupled to a real scalar eld and a real vector eld
A
. We assume that the two-component fermion mass matrix has been

diagonalized as discussed in Section 1.6, so that the fermions consist
of a collection of two-component (
1
2
, 0) fermion mass-eigenstate elds.
These may include both neutral two-component elds and/or pairs of
oppositely-charged elds and . The interaction Lagrangian is given by
L
int
=
1
2
(
ij
j
+
ij

i

j
) (
ij
j
+
ij
i

j
)
(G
)
i
j

i
j
A
[(G
)
i
j
j
+ (G
)
i
j

i
j
]A
, (2.66)
where is a complex symmetric matrix, is an arbitrary complex matrix
and G
, G
and G
are hermitian matrices. We have suppressed the spinor

indices in eq. (2.66); the product of two component spinors is always
performed according to the index convention indicated in eq. (1.52).
We also have employed the convention concerning the avor labels
i and j described in Section A.2. That is, ipping the heights of all
avor indices of an object corresponds to complex conjugation. In this
convention, raised indices can only be contracted with lowered indices
and vice versa. The Feynman rules for the vertices that arise from this
interaction Lagrangian [eq. (2.66)] are shown in Fig. 2.6.
One clarication in the labeling is helpful. Consider a line in Fig. 2.6
labeled
i
. This means that the corresponding state is given by [
i
) [as
in eqs. (2.4), (2.52) or (2.53)], independent of the direction of the arrow.
The fact that there are two separate rules corresponding to the same pair
of outgoing states (dierentiated by the arrow directions) is a consequence
of the two terms proportional to
i
j
and

j
in eq. (2.66).
In Fig. 2.6, two versions are given for each of the boson-fermion-fermion
Feynman rules. The correct version to use depends in a unique way on
the heights of indices used to connect each fermion line to the rest of
the diagram. For example, the way of writing the vector-fermion-fermion
interaction rule depends on whether we used
j
i
, or its equivalent
form

j
, from eq. (2.66). Note the dierent heights of the spinor
indices and on
and
. The choice of which rule to use is thus

dictated by the height of the indices on the lines that connect to the vertex.
These heights should always be chosen so that they are contracted as in
eq. (1.52). Similarly, for the scalar-fermion-fermion vertices, one should
i
ij
or i
ij

i
ij
or i
ij
i
ij
or i
ij

i
ij
or i
ij
j
A

i(G
)
i
j
or i(G
)
i
j
j
Fig. 2.6. The form of the Feynman rules for two-component fermion interactions
with a neutral boson in a general renormalizable eld theory. In the diagram
with the external vector boson, one may take = , or , respectively.
choose the rule which correctly matches the indices with the rest of the
diagram. (However, when all indices are suppressed, the scalar-fermion-
fermion rules will have an identical appearance for both cases anyway,
since they are just proportional to the identity matrix on the 22 spinor
i
i(
2
)
ij
i
i(
1
)
ij
W
i
i(G
1
)
i
j
or i(G
1
)
i
j
i
i(G
2
)
i
j
or i(G
2
)
i
j
Fig. 2.7. The form of the Feynman rules for two-component fermion interactions
with a charged boson in a general renormalizable eld theory. For each diagram
shown above, there is a charge-conjugated diagram in which all arrows are
reversed. The corresponding rules are obtained simply by raising all lowered
avor indices and lowering all raised avor indices [c.f. Section A.2]. Spinor
indices are suppressed [c.f. Fig. 2.6]. The arrows above on the charged scalar
lines and above the vector boson lines indicate the ow of charge. On both
charged and neutral fermion lines, the arrows indicate the ow of chirality.
space.) These comments will be claried by examples below.
We can also treat the interactions of fermions to complex scalar and
vector elds. As in eq. (2.66), we assume that the fermion mass matrices
have been diagonalized (see Section 1.6). We denote a set of neutral
fermion mass-eigenstates elds by
i
and a set of charged fermion mass-
eigenstate elds by pairs of oppositely charged elds
j
and
j
. The
charged scalar and vector bosons are complex elds denoted by and
W, respectively. Here, we shall only consider the simplest case where
I
k
j
iU
m
j
y
Imn
U
n
k
I
k
j
iU
m
j
y
Imn
U
n
k
, a
j
i
igU
i
k
(T
a
)
k
m
U
m
j
igU
i
k
(T
a
)
k
m
U
m
j
or
Fig. 2.8. Feynman rules for Yukawa [gauge] couplings of scalars [vector bosons]
to two-component fermions. Spinor indices are suppressed [c.f. Fig. 2.6]. The
y
Imn
are the Yukawa coupling matrices in the interaction-eigenstate basis, and
the matrices U achieve the rotation to the mass eigenstate basis. The avor
index conenvtions of Section A.2 imply that y
Ijk
(y
Ijk
)
and U
i
j
(U
i
j
)
.
the charges of , W and are assumed to be equal. In this case, the
interaction Lagrangian is given by:
L
int
=
1
2
[
ij
1

i
j
+ (
2
)
ij

i

j
]
1
2
[
ij
2

i
j
+ (
1
)
ij
i

j
]
1
2
W
[(G
1
)
i
j
j
+ (G
2
)
i
j

i
j
]
1
2
W
[(G
1
)
i
j

j
i
+ (G
2
)
i
j

j
i
] , (2.67)
where
1
and
2
are complex symmetric matrices and G
1
and G
2
are
hermitian matrices. The corresponding Feynman rules are given in
Fig. 2.7.
If the interaction Lagrangian is given in terms of interaction-eigenstate
elds, then the unitary matrices used to rotate to the mass-eigenstate
basis will appear in the Feynman rules. Suppoe that the two-component
fermion elds that appear in the Lagrangian are written originally in
terms of (
1
2
, 0)-fermion interaction eigenstates,

i
, which are related to
2.7 General structure and rules for Feynman graphs 65
mass eigenstate elds by

= U [see eq. (1.122)]. In any theory, the
most general set of fermion interaction vertices with a (possibly complex)
scalar
I
and gauge eld A
a
can be written as follows:

L
int
=
1
2
y
Ijk
1
2
y
Ijk
k
gA
a
(T
a
)
i
j
j
, (2.68)
where the complex Yukawa couplings y
Ijk
are symmetric under inter-
change of j and k, g is a real gauge coupling and the T
a
are the hermitian
representation matrices
3
corresponding to the two-component fermions
elds. As before, we have employed the avor index conventions of
Section A.2. The mass eigenstate Feynman rules then take the form
shown in Fig. 2.8.
2.7 General structure and rules for Feynman graphs
When computing an amplitude for a given process, all possible diagrams
should be drawn that conform with the rules given above for external
wave functions, propagators, and interactions. Starting from any external
wave function spinor, or from any vertex on a fermion loop, factors
corresponding to each propagator and vertex should be written down
from left to right, following the line until it ends at another external state
wave function or at the original point on the fermion loop. If one starts
a fermion line at an x or y external state spinor, it should have a raised
undotted index in accord with eq. (1.52). Or, if one starts with an x or
y, it should have a lowered dotted spinor index. Then, all spinor indices
should always be contracted as in eq. (1.52). If one ends with an x or
y external state spinor, it will have a lowered undotted index, while if
one ends with an x or y spinor, it will have a raised dotted index. For
arrow-preserving fermion propagators and gauge vertices, the preceding
determines whether the or rule should be used. With only a little
experience, one can write down amplitudes immediately with all spinor
indices suppressed.
Symmetry factors for identical particles are implemented in the usual
way. Fermi-Dirac statistics are implemented by the following rules:
Each closed fermion loop gets a factor of 1.
A relative minus sign is imposed between terms contributing to
a given amplitude whenever the ordering of external state spinors
(written left-to-right) diers by an odd permutation.
3
For a U(1) gauge group, the T
a
are replaced by real numbers corresponding to the
U(1) charges of the left-handed (
1
2
, 0) fermion.
Amplitudes generated according to these rules will contain objects of the
form:
/ = z
1
z
2
(2.69)
where z
1
and z
2
are each commuting external spinor wave functions x, x,
y, or y, and is a sequence of alternating and matrices. The complex
conjugate of this quantity is given by
/
= z
2
z
1
(2.70)
where

is obtained from by reversing the order of all the and
matrices, and using the same rule for suppressed spinor indices. (Notice
that this rule for taking complex conjugates has the same form as for
anticommuting spinors.) We emphasize that in principle, it does not
matter in what direction a diagram is traversed while applying the rules.
However, one must associate a sign with each diagram that depends on
the ordering of the external fermions. This sign can be xed by rst
choosing some canonical ordering of the external fermions. Then for any
graph that contributes to the process of interest, the corresponding sign
is positive (negative) if the ordering of external fermions is an even (odd)
permutation with respect to the canonical ordering. If one chooses a
dierent canonical ordering, then the resulting amplitude changes by an
overall sign (is unchanged) if this ordering is an odd (even) permutation
of the original canonical ordering.
4
This is consistent with the fact that
the amplitude is only dened up to an overall sign, which is not physically
observable.
Note that dierent graphs contributing to the same process will often
have dierent external state wave function spinors, with dierent arrow
directions, for the same external fermion. In particular, there are no
arbitrary choices to be made for arrow directions, since one must add
together all Feynman graphs that obey the rules.
4
For a process with exactly two external fermions, it is convenient to apply the
Feynman rules by starting from the same fermion external state in all diagrams.
That way, all terms in the amplitude have the same canonical ordering of fermions
and there are no additional minus signs between diagrams. Four a process with four
or more external fermions, it may happen that there is no way to choose the same
ordering of external state spinors for all graphs when the amplitude is written down.
Then the relative signs between dierent graphs must be chosen according to the
relative sign of the permutation of the corresponding external fermion spinors. This
guarantees that the total amplitude is antisymmetric under the interchange of any
pair of external fermions.
2.8 Simple examples of Feynman diagrams and amplitudes
Some simple examples based on the interaction vertices of Section 2.6 will
help clarify the rules of Section 2.7.
2.8.1 Tree-level decays
Let us rst consider a theory with a single, uncharged, massive (
1
2
, 0)
fermion , and a real scalar , with interaction
L
int
=
1
2
( +
) . (2.71)
Consider the decay ( p
1
, s
1
)( p
2
, s
2
), where by we mean the one
particle state given by eq. (2.4). Two diagrams contribute to this process,
as shown in Figure 2.9.
(p
2
, s
2
)
(p
1
, s
1
)
(p
2
, s
2
)
(p
1
, s
1
)
Fig. 2.9. The two tree-level Feynman diagrams contributing to the decay of a
scalar into a Majorana fermion pair.
The matrix element is then given by
i/= y( p
1
, s
1
)
(i
)y( p
2
, s
2
)
+ x( p
1
, s
1
)

(i
) x( p
2
, s
2
)
= iy( p
1
, s
1
)y( p
2
, s
2
) i
x( p
1
, s
1
) x( p
2
, s
2
). (2.72)
The second line above could be written down directly by recalling that
the sum over suppressed spinor indices is taken according to eq. (1.52).
Note that if we reverse the ordering for the external fermions, the overall
sign of the amplitude changes sign.
5
Of course, this overall sign is not
signicant and depends on the order used in constructing the two particle
state. One could even make the unconventional (but correct) choice of
starting the rst diagram from fermion 1, and the second diagram from
fermion 2:
i/= iy( p
1
, s
1
)y( p
2
, s
2
) (1)i
x( p
2
, s
2
) x( p
1
, s
1
) . (2.73)
5
This is easily checked, since for the commuting spinor wave functions ( x and y), the
spinor products in eq. (2.72) change sign when the order is reversed [see eqs. (B.42)
and (B.43)].
A
(p
1
, s
1
)
(p
2
, s
2
)
A
(p
1
, s
1
)
(p
2
, s
2
)
massive vector boson A
into a pair of Majorana fermions .

Here the rst term establishes the canonical ordering of fermions (1,2),
and the contribution from the second diagram therefore includes the
relative minus sign in parentheses. It is easily seen that equations
(2.72) and (2.73) are indeed equal. As usual, when a total decay rate
is computed, one must multiply the integral over the total phase space by
1/2 to account for the identical particles.
Consider next the decay of a massive neutral vector A
into a Majorana
fermion pair A
( p
1
, s
1
)( p
2
, s
2
), following from the interaction
L
int
= GA
, (2.74)
where G is a real coupling parameter. The two diagrams shown in Figure
2.10 contribute.
Following the rules of Fig. 2.6, we start from the fermion with
momentum p
1
and spin vector s
1
, and end at the fermion with momentum
p
2
and spin vector s
2
. The resulting amplitude for the decay is
i/=
[iG x( p
1
, s
1
)
y( p
2
, s
2
) +iGy( p
1
, s
1
)
x( p
2
, s
2
)] (2.75)
where
is the vector boson polarization vector. As illustrated in Fig. 2.6,

we have used the -version of the vector-fermion-fermion rule for the
rst diagram of Fig. 2.10 and the -version for the second diagram of
Fig. 2.10, as dictated by the implicit spinor indices, which we have
suppressed. However, we could have chosen to evaluate the second
diagram of Fig. 2.10 using the -version of the vector-fermion-fermion
rule by starting from the fermion with momentum p
2
. In that case, the
term +iGy( p
1
, s
1
)
x( p
2
, s
2
) in eq. (2.75) is replaced by
(1)[iG x( p
2
, s
2
)
y( p
1
, s
1
)] . (2.76)
In eq. (2.76), the factor of iG arises from the use of the -version of the
vector-fermion-fermion rule, whereas the overall factor of 1 is due to the
fact that the order of the fermion wave functions has been reversed; i.e
(21) is an odd permutation of (12). This is in accord with the ordering
rule stated at the end of Section 2.7. Thus, the resulting amplitude for
the decay of the vector boson into the pair of Majorana fermions now
takes the form:
i/=
[iG x( p
1
, s
1
)
y( p
2
, s
2
) +iG x( p
2
, s
2
)
y( p
1
, s
1
)] . (2.77)
By using y
x = x
y, one trivially shows that eqs. (2.75) and (2.77) are

identical. The form given in eq. (2.77) is especially convenient because it
explicitly exhibits the fact that the amplitude is antisymmetric under the
interchange of the two external identical fermions. Again, the absolute
sign of the total amplitude is not signicant and depends on the choice
of ordering of the outgoing states. When computing the total decay rate,
one must again multiply the total integral over phase space by 1/2 to
account for identical particles in the nal state.
Next, we consider the decay of a neutral vector boson into a charged
fermion-antifermion pair. We denote and as (
1
2
, 0) elds with charges
Q = 1 and Q = 1, respectively, with couplings to the neutral vector
boson as follows:
L
int
= A
[G
+G
]. (2.78)
There are two contributing graphs to vector boson decay as shown in
Figure 2.11. To evaluate the amplitude, we start from the charge Q = +1
fermion (with momentum p
1
and spin vector s
1
), and end at the charge
Q = 1 fermion (with momentum p
2
and spin vector s
2
). The charge
ow follows the direction of the arrow on the fermion line. Note that for
the nal state fermion lines, the outgoing with arrow pointing outward
from the vertex and the outgoing with arrow pointing inward to the
vertex both correspond to outgoing Q = +1 states. The amplitude for
the decay is
i/=
[iG
x( p
1
, s
1
)
y( p
2
, s
2
) +iG
y( p
1
, s
1
)
x( p
2
, s
2
)]
=
[iG
x( p
1
, s
1
)
y( p
2
, s
2
) +iG
x( p
2
, s
2
)
y( p
1
, s
1
)] . (2.79)
As in the case of the decay to a pair of Majorana fermions, we have
exhibited two forms for the amplitude in eq. (2.79) that depend on
whether the -version or the -version of the Feynman rule has been
employed. Of course, the resulting amplitude is the same in each method
(up to an overall sign of the total amplitude which is not determined).
2.8.2 Tree-level scattering processes
The next level of complexity consists of diagrams that involve fermion
propagators. For our rst example of this type, consider the tree-level
A
(p
1
, s
1
)
(p
2
, s
2
)
A
(p
1
, s
1
)
(p
2
, s
2
)
massive neutral vector boson A
into a Dirac fermion-antifermion pair.

k k
Fig. 2.12. Tree-level Feynman diagrams contributing to the elastic scattering
of a neutral scalar and a neutral two-component fermion. There are four more
diagrams, obtained from these by crossing the initial and nal scalar lines.
matrix element for the scattering of a neutral scalar and a two-component
neutral massive fermion ( ), with the interaction Lagrangian given
by eq. (2.71). Using the corresponding Feynman rules, there are eight
contributing diagrams. Four are depicted in Fig. 2.12; there are another
four diagrams (not shown) where the initial and nal state scalars are
crossed (so that the initial [nal] state scalar is now attached to the same
vertex as the nal [initial] state fermion).
We shall write down the amplitudes for these diagrams starting with
the nal state fermion line and moving toward the initial state. Then,
i/=
i
s m
2
_
[[
2
[ x( p
2
, s
2
) k x( p
1
, s
1
) +y( p
2
, s
2
) k y( p
1
, s
1
)]
+m
2
y( p
2
, s
2
)x( p
1
, s
1
) + (
)
2
x( p
2
, s
2
) y( p
1
, s
1
)
_
+(crossed) , (2.80)
where k
is the sum of the two incoming (or outgoing) four-momenta,

s = k
2
, (p
1
, s
1
) are the momentum and spin four-vectors of the incoming
fermion, and (p
2
, s
2
) are those of the outgoing fermion. Crossed
indicates the same contribution but with the initial and nal scalars
interchanged. Note that we could have evaluated the diagrams above
by starting with the initial vertex and moving toward the nal vertex.
It is easy to check that the resulting amplitude is the negative
6
of the
one obtained in eq. (2.80); the overall sign change simply corresponds to
swapping the order of the two fermions and has no physical consequence.
Next, we compute the tree-level matrix element for the scattering of a
vector boson and a neutral massive two-component fermion with the
interaction Lagrangian of eq. (2.74). Again there are eight diagrams: the
four diagrams depicted in Fig. 2.13 plus another four (not shown) where
the initial and nal state vector bosons are crossed. Starting with the
nal state fermion line and moving toward the initial state, we obtain
i/=
iG
2
s m
2
_
x( p
2
, s
2
)
2
k
1
x( p
1
, s
1
) +y( p
2
, s
2
)
2
k
1
y ( p
1
, s
1
)
m
_
y( p
2
, s
2
)
1
x( p
1
, s
1
) + x( p
2
, s
2
)
1
y( p
1
, s
1
)
_
+(crossed) , (2.81)
where
1
and
2
are the initial and nal vector boson polarization four-
vectors, respectively. As before, k
is the sum of the two incoming (or

outgoing) four-momenta and s = k
2
, and (p
1
, s
1
) are the momentum and
spin four-vectors of the incoming fermion, and (p
2
, s
2
) are those of the
outgoing fermion. Crossed indicates the same contribution but with
the initial and nal vector bosons swapped. If one evaluates the diagrams
above by starting with the initial vertex and moving toward the nal
6
The opposite sign is a consequence of eqs. (B.42)(B.44) and the minus sign dierence
between the two ways of evaluating the propagator that preserves the arrow direction.
k k
of a neutral vector boson and a neutral two-component fermion. There are four
more diagrams, obtained from these by crossing the initial and nal scalar lines.
k
Fig. 2.14. Tree-level Feynman diagrams contributing to the elastic scattering of

a neutral scalar and a charged fermion. There are four more diagrams, obtained
from these by crossing the initial and nal scalar lines.
vertex, the resulting amplitude is the negative of the one obtained in
eq. (2.81), as expected.
We now consider the scattering of a charged Dirac fermion with a
k

k

Fig. 2.15. Tree-level Feynman diagrams contributing to the scattering of an
initial charged scalar and a charged fermion into its charge-conjugated nal state.
The unlabeled intermediate state is a neutral fermion. There are four more
diagrams, obtained from these by crossing the initial and nal scalar lines.
neutral scalar. The (
1
2
, 0) elds and have opposite charges Q = +1
and 1 respectively, and interact with the scalar according to
L
int
= [ +
] , (2.82)
where is a coupling parameter. Then, for the elastic scattering of a
Q = +1 fermion and a scalar, the diagrams of Fig. 2.14 contribute at
tree-level plus another four diagrams (not shown) where the initial and
nal state scalars are crossed. Since these diagrams match precisely those
of Fig. 2.12, one obtains the same matrix element, eq. (2.80), previously
obtained for the scattering of a neutral scalar and neutral two-component
fermion, with the replacement of with and m
with the mass of the

charged fermion, m.
Consider next the scattering of a charged Dirac fermion and a charged
scalar, where both the scalar and fermion have the same absolute value of
the charge. As usual, we denote the charged Q = +1 fermion by the pair
of two-component fermions and and the (intermediate state) neutral
two-component fermion by . The charged Q = 1 scalar is represented
by the scalar eld and its complex conjugate, and the corresponding
interaction Lagrangian takes the form:
L
int
=
[
1
+
2
] [
2
+
] . (2.83)
k

of a neutral vector boson and a charged Dirac fermion. There are four more
diagrams, obtained from these by crossing the initial and nal vector lines.
One scattering process that deserves special attention is the scattering
of an initial boson-fermion state into its charge-conjugated nal state via
the exchange of a neutral fermion. The relevant diagrams are shown in
Fig. 2.15 plus the corresponding diagrams with the intial and nal scalars
crossed. The derivation is similar to the ones given previously, and we
end up with
i/=
i
s m
2
2
[ x( p
2
, s
2
) k x( p
1
, s
1
) +y( p
2
, s
2
) k y( p
1
, s
1
)]
+m
_
2
1
y( p
2
, s
2
)x( p
1
, s
1
) + (
2
)
2
x( p
2
, s
2
) y( p
1
, s
1
)
_
+(crossed) , (2.84)
where m is the mass of the charged fermion and the four-momentum k is
dened as shown in Fig. 2.15.
The scattering of a charged fermion and a neutral spin-1 vector boson
can be similarly treated. For example, consider the amplitude for the
elastic scattering of a charged fermion and a neutral vector boson. Using
the interaction Lagrangian given in eq. (2.78), the relevant diagrams are
those shown in Fig. 2.16, plus four diagrams (not shown) obtained from
these by crossing the initial and nal state vectors. Applying the Feynman
1
2
3
4
1
2
3
4
1
2
3
4
1
2
3
4
of identical neutral Majorana fermions via scalar exchange in the t-channel.
Additionally, there are four u-channel diagrams obtained from these by crossing
either the initial or nal fermion lines. Finally, one must also evaluate four s-
channel diagrams in which the two-component fermions 1 and 2 annihilate into
an intermediate scalar which subsequently decays into two-component fermions
3 and 4.
rules following from eq. (2.74) as before, one obtains
i/=
i
s m
2
_
G
2
x( p
2
, s
2
)
2
p
1
x( p
1
, s
1
)
+G
2
y( p
2
, s
2
)
2
p
1
y ( p
1
, s
1
)
mG
_
y( p
2
, s
2
)
1
x( p
1
, s
1
) + x( p
2
, s
2
)
1
y( p
1
, s
1
)
_
+(crossed) (2.85)
and the masses and the assignments of momenta and spins are as before.
The computation of the amplitude for the scattering of a charged fermon
and a charged vector boson is straightforward and will not be given
explicitly here.
Finally, let us work out an example with four external-state fermions.
Consider the case of elastic scattering of two identical Majorana fermions
due to scalar exchange, governed by the interaction of eq. (2.71). The
t-channel diagrams for scattering initial fermions labeled 1, 2 into nal
state fermions labeled 3, 4 are shown in Fig. 2.17. There are also four
u-channel and four s-channel annihilation diagrams (not shown). The
resulting matrix element is:
i/=
i
s m
2
2
(x
1
x
2
)(y
3
y
4
) + (
)
2
( y
1
y
2
)( x
3
x
4
)
+[[
2
[(x
1
x
2
)( x
3
x
4
) + ( y
1
y
2
)(y
3
y
4
)]
_
+(1)
i
t m
2
2
(y
3
x
1
)(y
4
x
2
) + (
)
2
( x
3
y
1
)( x
4
y
2
)
+[[
2
[( x
3
y
1
)(y
4
x
2
) + (y
3
x
1
)( x
4
y
2
)]
_
+
i
u m
2
2
(y
4
x
1
)(y
3
x
2
) + (
)
2
( x
4
y
1
)( x
3
y
2
)
+[[
2
[( x
4
y
1
)(y
3
x
2
) + (y
4
x
1
)( x
3
y
2
)]
_
, (2.86)
where x
i
x( p
i
, s
i
), y
i
y( p
i
, s
i
), m
is the mass of the exchanged

scalar, s = (p
1
+ p
2
)
2
, t = (p
1
p
3
)
2
and u = (p
1
p
4
)
2
. The relative
minus sign (in parentheses) between the t-channel diagram and the s
and u-channel diagrams is obtained by observing that 3142 is an odd
permutation and 4132 is an even permutation of 1234.
7
2.9 Conventions for fermion and anti-fermion names and elds
Dirac fermions present a problem for labeling Feynman diagrams. In the
two-component language, a Dirac fermion is described by two distinct
(
1
2
, 0) elds and . This eld represents both the particle and anti-
particle which are not identical. This is best illustrated by considering
the electron which is a Dirac fermion. Let us denote the corresponding
two-component elds by e and e
c
.
8
We shall also denote the
one-particle electron and positron states by e
and e
+
respectively. In a
given Feynman diagram, one has the option of labelling graphs by particle
names or eld names; each choice has advantages and disadvantages. It is
convenient to display a translation between the two labeling conventions:
Note that the two-component elds e and e
c
both represent the negatively
charged electron, conventionally denoted by e
, whereas both e
c
and e
7
Note that we would have obtained the same sign for the u-channel diagram had we
crossed the initial state fermion lines instead of the nal state fermion lines.
8
Equivalently, if one considers the four-component Dirac eld e, then e = PLe and
e
c
= PL
c
e
can be thought of as two left-handed elds that are combined to form the
Dirac electron eld.
(a)
e
e
(b)
e
e
c
(c)
e
+
e
c
(d)
e
+
e
Fig. 2.18. The translation between the particle name and the two-component
eld name conventions for external lines in a Feynman diagram. The diagrams
represent an electron or positron whose momentum points from left to right (as
specied by the arrow above each line). The corresponding two-component eld
label is indicated to the right of each line.
represent the positively charged positron, conventionally denoted by e
+
.
One also needs to establish a convention for labeling the two-component
fermion elds that appear in the Feynman rules. As an example, consider
the two-component Feynman rules for the QED coupling of electrons and
positrons to the photon, which are exhibited in Fig. 2.19.
In general, two-component fermion lines with dotted indices always
correspond to arrows going away from the vertex, and two-component
fermion lines with undotted indices always correspond to arrows going
toward the vertex. When employing the Feynman rules of Fig. 2.19,
the choice of the two-component eld label depends on the direction of
momentum ow of the corresponding fermion, following the prescription
of Fig. 2.18. For example, if the direction of the momentum ow
in Fig. 2.19 follows the direction of the arrows of the two-component
fermion elds, then one should label the two-component fermion lines
with unbarred elds. That is, the Feynman rules are labeled with two-
component (unbarred) elds; they can be applied to processes involving
either fermions or anti-fermions (following the prescription of Fig. 2.18).
A simple example should make this clear. Consider the s-channel
tree-level Feynman diagrams that contribute to Bhabha scattering
(e
e
+
e
e
+
). If we label the external fermion lines accoridng to
the corresponding particle names, the result is shown in Figure 2.20.
However, when using the particle name convention, one must discern
ie

or ie

e or e
e or e
(a)

ie

or ie

e
c
or e
c
e
c
or e
c
(b)
Fig. 2.19. The two-component Feynman rules for the QED vertex. The choice of
the two-component eld label depends on the direction of momentum ow (which
is independent of the direction of the arrow on the fermion lines), following the
prescription of Fig. 2.18.
e
e
+
e
e
+
e
e
+
e
e
+
e
e
+
e
e
+
e
e
+
e
e
+
Fig. 2.20. Tree-level s-channel Feynman diagrams for e
e
+
e
e
+
, with the
external lines labeled according to the particle names. The momentum ow of
the external particles is indicated by the arrows above the corresponding fermion
lines in the upper left diagram.
the identity of the external two-component fermion elds by carefully
observing the direction of the arrow of each fermion line. If the arrow
coincides with the direction of propagation, then we identify the electron
[positron] line with e [e
c
]. If the arrow is opposite to the direction of
e
e
e
c
e
c
e
c
e
c
e
c
e
c
e
e
e
e
e
c
e
c
e
e
Fig. 2.21. Tree-level s-channel Feynman diagrams for e
+
e
e
+
e
, with the
external lines labeled according to the two-component fermion elds. Note that
all momenta ow from left to right.
propagation, then we identify the electron [positron] line with e
c
[e], in
accord with the prescription of Fig. 2.18. Thus, in order to employ the
two-component QED Feynman rules given in Fig. 2.19, we relabel the
graphs of eq. (2.20) by employing the two-component fermion eld labels,
as shown in Fig. 2.21. One can now employ the Feynman rules of Fig. 2.19
directly to compute the invariant amplitude. Note that the choice of rule
involving either the elds e and e in Fig. 2.19(a) or the elds e
c
and e
c
in
Fig. 2.19(b) is unambiguous.
Problems
1. In eq. (2.14), we introduced the eigeinstates of the helicity operator.
Derive the following explicit formula for
( p):
( p) = exp (i n/2)
( z) , (2.87)
where n=(sin , cos , 0 ) .
HINT: Obtain
( p) from
( z) by employing the spin-1/2 rotation

operator corresponding to a rotation from the z-direction to the direction
of p (characterized by polar angle and azimuthal angle ).
2. Suppose that x
( p, ) and y
( p, ) satisfy eqs. (2.7)(2.10) and

eqs. (2.20)(2.23).
(a) Using the identity (p)(p) = (p)(p) = p
2
, show that both
x
and y
must satisfy the mass-shell condition, p

2
= m
2
(or equivalently,
p
0
= E
p
).
(b) Show that eqs. (2.20)(2.23) imply that
(s)

(s)

=
, (s)

(s)
. (2.88)
and conclude that ss = 1.
(c) Using the identity: p s + s p = p s + s p = 2ps,
show that both x
and y
must satisfy the condition ps = 0.

3. (a) Verify that

p
p = m.
(b) The two-component helicity spinor
is dened by eq. (2.11). Using

the explicit forms for
and s
(), prove that:
p
_
s()
= m
. (2.89)
Show that the same result can be obtained from the rst equation in
eq. (2.20) by using the explicit forms for x
and y

.
(c) Noting that s() is proportional to , use the result of part (b) to
prove eq. (2.29).
4. Consider a massive fermion of mass m with four momentum (E ; p).
Dene a set of three four-vectors s
a
(a = 1, 2, 3) such that the s

a
and p/m
form an orthonormal set of four-vectors. That is,
p s
a
= 0 , (2.90)
s
a
s
b
=
ab
, (2.91)
s
a
s
a
= g
+
p
m
2
, (2.92)
where the sum over the repeated index a is implicitly assumed. A
convenient choice for the s
a
is
s
1
= (0 ; cos cos , cos sin, sin ) ,
s
2
= (0 ; sin, cos , 0) ,
s
3
=
_
[ p[
m
;
E
m
p
_
, (2.93)
in a coordinate system where p = (sin cos , sin sin , cos ). Note that
s
3
is identical to the positive helicity spin vector; that is, s 2s
3
[see
eq. (2.15)].
(a) Derive the following properties:
s
1
s
2
s
3
= m, (2.94)
s
a
s
b
s
a
s
b
=

abc
(s
c
)
m
, (2.95)
[( s
a
) ( s
b
)]
=
ab
i
abc
m
[( p) ( s
c
)]
, (2.96)
[( s
a
) ( s
b
)]

=
ab
+
i
abc
m
[( p) ( s
c
)]

. (2.97)
(b) Prove that the helicity two-component spinors, x( p, ) and y( p, )
satisfy the following equations:
(s
a
)

x
( p,
) =
a
y

( p, ) , (s)
( p,
) =
a
( p, ) ,
(s)
( p,
) =
a
( p, ) , (s)

y
( p,
) =
a
x

( p, ) ,
x
( p,
)(s)
=
a
( p, ) , y

( p,
)(s)

=
a
( p, ) ,
x

( p,
)(s)

=
a
( p, ) , y
( p,
)(s)
=
a
( p, ) ,
where the
a
are the Pauli matrices,
9
and there is an implicit sum over
the repeated label =
1
2
. These equations generalize the results of
eqs. (2.20)(2.23).
9
The rst (second) row and column of the -matrices correspond to = 1/2 (1/2).
Thus, for example,
3
= 2
(no sum over ).

(c) Using the results of part (b), derive the following generalizations of
eqs. (2.31)(2.34):
x
( p,
) x
( p, ) =
1
2
(p
ms
a
, (2.98)
y

( p,
)y
( p, ) =
1
2
(p
+ms
a
)

, (2.99)
x
( p,
)y
( p, ) =
1
2
_
m
[(s
a
) (p)]
_
, (2.100)
y

( p,
) x
( p, ) =
1
2
_
m
+ [(s
a
) (p)]

_
. (2.101)
These equations are the two-component versions of the Bouchiat-Michel
formulas. (See Chapter 3, problem 6 for the four-component spinor
version of these formulas.)
(d) Take the m 0 limit and derive the relevant Bouchiat-Michel
formulas for a massless helicity two-component spinor. [HINT: Recall
that s
3
= p/m+O(m/E) as m 0.]
3
From Two-Component to
Four-Component Spinors
To motivate the introduction of four component spinors for spin-1/2
particles, it is instructive to compare real and complex scalar elds that
describe spin-0 particles. Consider a free scalar eld theory with two
real, mass-degenerate scalar elds
1
(x) and
2
(x). The corresponding
Lagrangian is given by:
L =
1
2
(
1
)(
1
) +
1
2
(
2
)(
2
)
1
2
m
2
(
2
1
+
2
2
) , (3.1)
where m
2
is assumed to be positive. In this case, we can rewrite the
theory in terms of a single complex scalar eld:
=
1
2
(
1
+i
2
) ,
=
1
2
(
1
i
2
) . (3.2)
In terms of the complex eld, eq. (3.1) takes the following form:
L = (
) m
2
. (3.3)
Note that eq. (3.1) exhibits a global O(2) symmetry,
i
C
ij
j
.
Corresponding to this symmetry is a conserved Noether current
J
=
1
1
, (3.4)
with a corresponding conserved charge, Q =
_
J
0
d
3
x. The complex eld
(x) creates and annihilates particles of denite charge Q. Comparing
these results to eq. (1.102) with m
1
= m
2
and eqs. (1.103)(1.105), we
see that in a theory of mass degenerate spin-1/2 fermions, the role of
the real elds
i
are taken by the
i
. The fermion elds of denite (and
opposite) charge are and . Thus, we combine and into one complex
four-component spinor eld , which plays the same role as the complex
scalar eld .
83
84 3 From Two-Component to Four-Component Spinors
3.1 Four-component spinors
We now spell out in more detail the four-component Dirac fermion
notation. A four component Dirac spinor eld, (x), is made up of two
mass-degenerate two-component spinor elds, (x) and (x) as follows:
(x)
_
(x)

(x)
_
. (3.5)
The eld equation satised by (x) is obtained from eq. (1.106). This is
the free-eld Dirac equation:
(i
m) = 0 , (3.6)
where the 4 4 Dirac gamma matrices,
, can be expressed in 2 2
block form as follows:
=
_
0

0
_
. (3.7)
Using eqs. (1.67) and (1.68), one immediately obtains the dening
property of the Dirac matrices:
= 2g
. (3.8)
There are many other representations for the Dirac matrices that satisfy
eq. (3.8). The explicit form given in eq. (3.7) is called the chiral
representation, which corresponds to a basis choice in which
5
is diagonal:
5
i
0
3
=
_
0
0

_
. (3.9)
It is convenient to dene chiral projections operators:
P
L

1
2
(1
5
) , P
R

1
2
(1 +
5
) , (3.10)
so that
L
(x) P
L
(x) =
_
(x)
0
_
,
R
(x) P
R
(x) =
_
0

(x)
_
.(3.11)
In addition, we introduce:
1
1
2
i
4
[
] =
_
0
0

_
. (3.12)
1
In most textbooks,
is called
. Here, we use the former symbol so that there

is no confusion with the two-component denition of
given in eq. (1.76).

The duality conditions satised by
and
[eq. (1.78)] imply that:
=
1
2
i
. (3.13)
Given a four-component spinor eld , we introduce four related spinor
elds: the Dirac adjoint eld , the space-reected eld
p
, the time-
reversed eld
t
and the charge conjugate eld
c
, which are respectively
given by
(x)
(x)A =
_
(x) ,

(x)
_
, (3.14)
p
(x) i
0
(x) =
_
i
0
(x)
i
0

(x)
_
, (3.15)
t
(x)
0
B
1
T
(x) =
0
B
1
A
T
(x) =
(x)
(x)
, (3.16)
c
(x) C
T
(x) = CA
T
(x) =
_
(x)

(x)
_
. (3.17)
That is, in the chiral representation, A, B and C are explicitly given by
A =
_
0

0
_
, B =
_
0
0

_
, C =
_
0
0

_
. (3.18)
Note the numerical equalities:
2
A =
0
, B =
1
3
, C = i
0
2
, (3.19)
although these identications do not respect the structure of the undotted
and dotted indices specied in eq. (3.18) In calculations that involve
translations between two-component and four-component notation, the
expressions given in eq. (3.18) should be used. In calculations involving
only four-component notation, there is no harm in using the numerical
values for the matrices noted above.
Although the explicit forms for A, B, and C were given in the chiral
representation, they can be dened independently of the gamma matrix
representation. In general, these matrices must satisfy[1]
A
A
1
=
, (3.20)
B
B
1
=
T
, (3.21)
C
1
C =
T
. (3.22)
2
These identications have been obtained specically in the chiral representation of
the Dirac matrices. As shown in Appendix A, eq. (3.19) also holds in the Dirac
representation but not in the Majorana representation.
We now impose the following additional conditions:
3
= A
1
, (
t
)
t
= , (
c
)
c
= . (3.23)
The rst of these conditions together with eq. (3.14) is equivalent to the
statement that is hermitian. The second condition corresponds to the
result previously noted that when the time-reversal operator is applied
twice to a fermion eld it yields T
2
= 1. Finally, the third condition
corresponds to the statement that the charge conjugation operator applied
twice is equal to the identity operator. Using eqs. (3.20)(3.23) and the
dening property of the gamma matrices [eq. (3.8)], one can show that
(independently of the gamma matrix representation) the matrices A, B
and C must satisfy the following relations:
A
= A, B
T
= B, C
T
= C , (3.24)
CB =
5
, BA
1
= (AB
1
)
, (AC)
1
= (AC)
. (3.25)
We now examine the Dirac bilinear covariants, which are quantities
that are quadratic in the Dirac spinor eld and transform irreducibly as
Lorentz tensors. These are easily constructed from the corresponding
quantities that are quadratic in the two-component fermion elds. To
construct a translation table between the two-component form and the
four-component forms for the bilinear covariants, we rst introduce two
Dirac spinor elds:
1
(x)
_
1
(x)

1
(x)
_
,
2
(x)
_
2
(x)

2
(x)
_
, (3.26)
where spinor indices have been suppressed on the two-component elds
i
(x) and
i
(x). The results are then exhibited in Table 3.1.
It then follows that:
2
=
1
2
+

1

2
(3.27)
2
=
1
2
+

1

2
(3.28)
2
=

2

2
1
(3.29)
2
=
2

2
1
(3.30)
2
= 2(
1
2
+

2
) (3.31)
2
= 2(
1

2
) . (3.32)
3
The dening relations [eqs. (3.20)(3.22)] and the conditions given in eq. (3.23) does
not quite x the values of the matrices A, B and C [see eq. (A.39)]. Nevertheless,
the remaining freedom in dening these matrices has no eect on any of the results
in this section.
Table 3.1. A translation table relating bilinear covariants in two-component
and four-component notation.
1
P
L
2
=
1
2

c
1
P
L
c
2
=
1
1
P
R
2
=

1

2

c
1
P
R
c
2
=
1
c
1
P
L
2
=
1
2

1
P
L
c
2
=
1
1
P
R
c
2
=

2

c
1
P
R
2
=
1

2
P
L
2
=

2

c
1
P
L
c
2
=
1
c
1
P
R
c
2
=
1
2

1
P
R
2
=
1
P
L
2
= 2
1
2

c
1
P
L
c
2
= 2
1
P
R
2
= 2

2

c
1
P
R
c
2
= 2
1
2
Note that the Lorentz vector bilinear covariants are written in terms of
.
Of course, one could also rewrite these expressions by eliminating
in
favor of
by using eq. (1.72). We expect a total of sixteen independent

components among the bilinear covariants corresponding to the sixteen-
dimensional space of 4 4 matrices. From eq. (3.13), we see that
and
5
are not independent. That is, the sixteen-dimensional space
is spanned by I
4
,
5
,
5
and
, which matches the counting

presented at the end of section 1.4.
When
2
=
1
, the bilinear covariants listed in eqs. (3.27)(3.32) are
either hermitian or anti-hermitian. We shall always work in a convention
where A
= A. Then,

O is hermitian if
AOA
1
= O
. (3.33)
Thus, one obtains hermitian bilinear covariants if O = I
4
, i
5
,
5
,
or i
5
.
One can also describe self-conjugate fermions in four-component
fermion notation. These are the Majorana fermion elds, which are
the analogues of the real scalar elds. That is, for a complex scalar
eld, one can reduces the number of degrees of freedom by a factor of
two by imposing the reality condition
= . For a Dirac fermion

, one can likewise reduce the number of degrees of freedom by a
factor of two by imposing the condition
c
= .
4
Thus, we dene a
4
Note that
c
= is a Lorentz covariant condition since it sets equal two quantities
with the same Lorentz transformation properties [see eq. (3.44)]. The same is not
true in general for the condition
= , except in special representations of the

gamma matrices (such as the Majorana representation).
Majorana four-component fermion by imposing the Majorana condition:
M
=
c
M
= C
T
M
. Explicitly, this condition in the chiral representation
implies that
M
takes the following form:
M
(x)
_
(x)

(x)
_
. (3.34)
The results of Table 3.1 and eqs. (3.27)(3.32) also apply to Majorana
fermions. However, due to the Majorana condition, the bilinear covariants
of eqs. (3.27)(3.32) are either hermitian or anti-hermitian even if
1
,=
2
. In general,

M1
O
M2
is hermitian if
ACO
T
(AC)
1
= O
. (3.35)
To derive eq. (3.35), we have made use of eqs. (3.24) and (3.25), the
Majorana condition and the anti-commutativity of the fermion elds.
Thus, one obtains (
M1
O
M2
)
M1
O
M2
if O = I
4
, i
5
, i
5
,
i
or
5
. In addition, we obtain the following relations among the
Majorana bilinear covariants:
M1
M2
=
M2
M1
, (3.36)
M1
M2
=
M2
M1
, (3.37)
M1
M2
=
M2
M1
, (3.38)
M1
M2
=
M2
M1
, (3.39)
M1
M2
=
M2
M1
, (3.40)
M1
M2
=
M2
M1
. (3.41)
We can combine eqs. (3.38) and (3.39) to obtain:
M1
P
L
M2
=
M2
P
R
M1
. (3.42)
In particular, if
M1
=
M2

M
, then eqs. (3.36)(3.41) imply that:
M
=
M
M
=
M
M
= 0 , (3.43)
leaving only six independent components of the non-vanishing bilinear
covariants:
M
M
,
M
M
and
M
M
. This matches the
counting exhibited at the end of section 1.4.
The behavior of the four-component fermion elds under proper
and improper Lorentz transformations is easily obtained from the
corresponding results for the two-component fermion elds. Consider
rst the transformation of the four-component spinor eld under proper
orthochronous Lorentz transformations. The transformations of the two-
component spinors in the (
1
2
, 0) and (0,
1
2
) representations are given by
eqs. (1.31) and (1.32) and eqs. (1.40) and (1.41) respectively. Using the
results of eqs. (1.82)(1.84), it follows that when x

=
(x) = S(
1
x) ,
(x) = (
1
x)S
1
, (3.44)
where
S
_
M
0
0 (M
1
)

_
= exp
_
i
4

_
. (3.45)
The transformation law for is easily derived from that of by making
use of the relation A
1
S
A = S
1
[the latter is a consequence of
eq. (3.20)]. It then follows immediately that (x)(x) is a Lorentz scalar.
Similarly, one can prove that the other bilinear covariants transform like
denite Lorentz tensors. This requires one further result:
S
1
S =
, (3.46)
which is the four-component version of eqs. (1.70) and (1.75). It then
follows that
transforms as a Lorentz four-vector,
transforms
as an antisymmetric second rank tensor, etc.
The behavior of the four-component fermion eld (x) under P, T and
C is likewise easily obtained from the corresponding behavior of the two
component fermion elds (written in the basis of the elds of denite
charge). Using the results of eqs. (1.147) and (1.148) for P, eqs. (1.175)
and (1.176) for T and eqs. (1.186) and (1.187) for C, it follows that:
5
T(x)T
1
=
P
i
0
(x
P
) =
P
p
(x
P
) , (3.47)
T (x)T
1
=
T
(
0
A
1
)
T
B(x
T
) =
T
t
(x
T
) , (3.48)
((x)(
1
=
C
C
T
(x) =
C
CA
T
(x) =
C
c
(x) . (3.49)
For convenience, we also list the corresponding transformation laws for
the Dirac adjoint eld:
T(x)T
1
= i
P
(x
p
)
0
, (3.50)
T (x)T
1
=
T
(x
T
)B
1
(A
0
)
T
, (3.51)
((x)(
1
=
T
(x)C
1
. (3.52)
Thus, e.g., the hermitian bilinear covariant (x)(x) is a scalar quantity
under P, T and C, respectively. These results apply to both Dirac and
5
Once again, we remind the reader that we have deviated from the standard textbook
conventions in eq. (3.47) by separating out the factor of i from the phase P . As a
result, in our notation P = 1 for a Majorana fermion, whereas in the standard
convention P = i after the factor of i is absorbed into the denition of P .
Majorana four-component fermion elds. For the transformation of Dirac
elds, the phases
P
,
T
and
C
are arbitrary. When applied to Majorana
elds, the relation
M
= C
T
M
imposes constraints on these phases. In
particular, it is straightforward to show that the transformation of
c
(x)
under P, T and C is given by eqs. (3.47)(3.52) with the replacement
of the phases
P
,
T
and
C
by their complex conjugates. Hence, for
Majorana elds, which satisfy
c
M
=
M
, it follows that the phases
P
,
T
and
C
are real (and equal to 1).
The transformation law under CPT is particularly simple (after some
algebraic manipulation):
6
(TT (x)((TT )
1
=
T
(
0
A
1
)
T
B(T(x
T
)((T)
1
= i
CPT
(
0
A
1
)
T
B
0
CA
T
(x)
= i
CPT
(ACBA
1
)
T
(x) , (3.53)
where as before
CPT

C
T
. Recalling that CB =
5
and using
eq. (3.20), we end up with:
(TT (x)((TT )
1
= i
CPT
[
5
(x)]
, (3.54)
(TT (x)((TT )
1
= i
CPT
[A
5
(x)]
T
. (3.55)
It is now straightforward to work out the properties of the bilinear
covariants under C, P, T, CP and CPT. For example, consider the CPT-
transformation properties of the hermitian bilinear covariants

O and
M1
O
M2
where is a Dirac eld and
Mi
(i = 1, 2) are Majorana
elds:
(TT

O((TT )
1
=

5
O
5
, (3.56)
(TT

M1
O
M2
((TT )
1
=

M1
5
O
5
M2
, (3.57)
where O satises eq. (3.33) [eq. (3.35)] in the case of the Dirac [Majorana]
bilinear covariants. In deriving these results, we have used the anti-
unitary property of (TT and the anti-commutativity of the fermion
elds.
7
In addition, eq. (3.57) was derived under the assumption that
CPT
is the same for all fermion elds [see eq. (1.200) and the associated
discussion]. Indeed, eqs. (3.56) and (3.57) are consistent with the results
of eqs. (1.201)(1.203).
6
Recall that for any c-number quantity z (in contrast to an operator acting on the
Hilbert space), PzP
1
= z, CzC
1
= z and T zT
1
= z
.
7
Strictly speaking, the bilinear covariants are always assumed to be normal-ordered
expressions. As a result, it is legitimate to treat the fermion elds as anti-
commuting (that is, by ignoring the delta function that appears in the equal-time
anti-commutation relations).
3.2 Lagrangians for free four-component fermions 91
3.2 Lagrangians for free four-component fermions
We briey look at the Lagrangians for a free fermion eld theory in four-
component notation. These can be obtained from the two-component
free-fermion eld Lagrangians given in section 1.5 using the results of
Table 3.1. For example, for a theory of a single two-component fermion,
we employ the Majorana four-component fermion [eq. (3.34)] and obtain
the following Lagrangian:
L =
1
2
i
M
M

1
2
m
M

M
. (3.58)
This result was obtained from eq. (1.99) under the assumption that the
mass parameter m is real.
8
The corresponding eld equations [eqs. (1.100)
and (1.101)] in four-component notation are the free-eld Dirac equation
and its conjugate:
(i
m)
M
= 0 ,
M
(i
+m) = 0 . (3.59)
In a theory of a pair of two-component fermions of equal mass, we
work in the basis of denite charged elds and . Combining these into
a single Dirac eld , the corresponding free-eld Lagrangian [eq. (1.105)]
yields the free-eld Dirac Lagrangian in four-component notation:
L = i
m. (3.60)
The corresponding eld equations are again the free-eld Dirac equation
[eq. (3.6)] and its conjugate. In section 1.5, we learned that a charged
Dirac fermion was equivalent to a pair of mass-degenerate Majorana
fermions. This identication carries over to four-component notation
as follows. Given the four-component Dirac spinor (x) [eq. (3.5)], we
construct a pair of four-component Majorana fermions:
M1
=
1
2
( +
c
) , (3.61)
M2
=
i
2
(
c
) . (3.62)
From the denition of the charge conjugated spinor [eq. (3.17)], it follows
that (i)
c
= i
c
. Thus we see that
c
Mi
=
Mi
(for i = 1, 2)
are self-conjugate four-component spinors.
9
One can rewrite the Dirac
8
In the case of a complex mass parameter m, the term
1
2
mMM in eq. (3.59)
would be replaced by
1
2
(Re m)M M i(Im m)M

5
M
.
9
Moreover, if CC
1
= C
c
, with the phase C chosen real (|C| = 1), then
CM1C
1
= CM1 and CM2C
1
= CM2 [cf. eq. (1.193) for the corresponding
result in two-component notation].
Lagrangian [eq. (3.60)] in terms of the Majorana elds
Mi
:
L =
1
2
i(
M1
M1
+
M2
M2
)
1
2
m(
M1
M1
+
M2
M2
) .
(3.63)
In deriving eq. (3.63), we have used eqs. (3.36) and (3.38) and dropped a
term that is a total divergence (which does not contribute to the action).
The complex four-component Dirac eld, which represents a charged
spin-1/2 particle, possesses eight o-shell degrees of freedom. In
particular, and are initially independent degrees of freedom. After
imposing the eld equations [eq. (3.6)], the four complex components
are no longer independent, resulting in four physical on-shell degrees of
freedom. As noted above, the complex four-component Dirac eld can
also be employed to describe a neutral self-conjugate spin-1/2 Majorana
fermion. In this case, one must impose an additional constraint:
c
(x) =
(x). This Majorana constraint reduces the number of degrees of freedom
of the Dirac fermion by a factor of two, resulting in four o-shell and two
physical on-shell degrees of freedom. Of course, the counting of degrees
of freedom above reproduces the results of the previous analysis based on
two-component fermion elds [see section 1.5].
Models of N free four-component fermions, with non-diagonal mass
matrices are easily constructed. The diagonalization procedure to identify
the mass eigenstates is easily obtained by converting the two-component
fermion mass diagonalization methods of Section 1.6 to four-component
notation. We leave the details as an exercise for the reader.
3.3 Properties of the four-component spinor wave functions
The results of Sections 2.1 and 2.3 can easily be translated into four-
component fermion notation using the results of Section 3.1. For
example, the free-eld Lagrangian for a massive (neutral) Majorana four-
component spinor eld
M
(x) is given by eq. (3.58), and yields the eld
equations given by eq. (3.59). As a result,
M
(x) can be expanded in a
Fourier series:
10
M
(x) =
_
d
3
p
(2)
3/2
(2E
p
)
1/2
_
u( p, )a( p, )e
ip x
+v( p, )a
( p, )e
ip x
_
, (3.64)
10
One can also obtain the results of this section by considering the free-eld Lagrangian
for a massive Dirac four-component spinor eld (x) [see eq. (3.60)]. By virtue of
the eld equations, the Fourier mode expansion of (x) is given by eq. (3.64), with
a
( p, ) replaced by b
( p, ) [c.f. eqs. (2.49) and (2.50)].

3.3 Properties of the four-component spinor wave functions 93
where as in Section 2.1 E
p
([ p[
2
+ m
2
)
1/2
, and the creation and
annihilation operators a
and a satisfy anticommutation relations given

by eq. (2.2). We again emphasize that the spinor wave functions u and v
are commuting quantities.
The results of Sections 2.1 and 2.2 can then be used to relate
the two-component spinor wave functions x
( p, ) and y
( p, ) to the
more traditional four-component spinor wave functions u( p) and v( p).
Specically,
u( p, s) =
_
x
( p, s)
y

( p, s)
_
, u( p, s) = (y
( p, s), x

( p, s)) , (3.65)
v( p, s) =
_
y
( p, s)
x

( p, s)
_
, v( p, s) = (x
( p, s), y

( p, s)) , (3.66)
where v( p, s) = C u( p, s)
T
and C is the charge conjugation matrix [see
eq. (3.18)]. Likewise, u( p, s) = C v( p, s)
T
One can check that u and v
satisfy the Dirac equations
11
(/ p m) u( p, s) = (/ p +m) v( p, s) = 0 , (3.67)
u( p, s) (/ p m) = v( p, s) (/ p +m) = 0 , (3.68)
corresponding to eqs. (2.7)(2.10), and
(
5
s / 1) u( p, s) = (
5
s / 1) v( p, s) = 0 , (3.69)
u( p, s) (
5
s / 1) = v( p, s) (
5
s / 1) = 0 , (3.70)
corresponding to eqs. (2.20)(2.23). For massive fermions, eqs. (2.31)
(2.34) correspond to
u( p, s) u( p, s) =
1
2
(1 +
5
s /) (/ p +m) , (3.71)
v( p, s) v( p, s) =
1
2
(1 +
5
s /) (/ p m) . (3.72)
To apply the above formulas to the massless case, recall that in the m 0
limit, s = 2p/m+O(m/E). Inserting this result in eqs. (3.67) and (3.69),
it follows that the massless helicity spinors are eigenstates of
5
5
u( p, s) = 2u( p, s) , (3.73)
5
v( p, s) = 2v( p, s) . (3.74)
Applying the same limiting procedure to eqs. (3.71) and (3.72) and
using the mass-shell condition (/ p/p = p
2
= m
2
), one obtains the helicity
11
We use the standard Feynman slash notation: /p p
.
projection operators for a massless spin-1/2 particle
u( p, s) u( p, s) =
1
2
(1 + 2
5
) / p , (3.75)
v( p, s) v( p, s) =
1
2
(1 2
5
) / p , (3.76)
which correspond to eqs. (2.39)(2.42). Finally, the spin-sum identities
s
u( p, s) u( p, s) = / p +m, (3.77)
s
v( p, s) v( p, s) = / p m, (3.78)
s
u( p, s)v
T
( p, s) = (/ p +m)C
T
, (3.79)
s
u
T
( p, s) v( p, s) = C
1
(/ p m) , (3.80)
s
v
T
( p, s) u( p, s) = C
1
(/ p +m) , (3.81)
s
v( p, s)u
T
( p, s) = (/ p m)C
T
, (3.82)
correspond to eqs. (2.43)(2.46).
Finally, we note the following useful properties satised by the four-
component spinor wave functions. Starting from eqs. (3.65) and (3.66)
and applying the identities eqs. (B.42)(B.49) for commuting spinor wave
functions, one easily veries that:
v( p, s)v( p
, s
) =
u( p
, s
)u( p, s) ,
v( p, s)u( p
, s
) =
v( p
, s
)u( p, s) ,
u( p, s)v( p
, s
) =
u( p
, s
)v( p, s) , (3.83)
where
= +1 for = 1,
5
,
5
and
= 1 for =
5
.
3.4 Feynman rules for four-component Majorana fermions
Starting with the two-component fermion Feynman rules developed in
Sections 2.42.7, it is strightforward to generate the corresponding set
of rules for four-component fermions. The resulting rules for Dirac
fermions simply reproduce the well-known Feynman rules developed in
most quantum eld theory textbooks. However, one also can develop
Feynman rules for four-component (neutral) Majorana fermions, which
are much less known. It is rather odd that these rules are not more
common in the standard textbooks. After all, there are no signicant
complications that arise when comparing Feynman rules for charged
and neutral scalars. A possible explanation is that Feynman rules for
(charged) Dirac fermions were slightly simplied by making use of the
fermion-number conservation that is connected to the underlying charge
conservation. In this section, we shall see that a minor adjustment of the
Feynamn rules for Dirac fermions will allow us to treat neutral Majorana
fermions on an equal footing.
We begin by considering a set of neutral and charged fermions
interactiong with a neutral scalar or vector boson. Starting from the
two-component fermion Lagrangian given by eq. (2.66), we combine the
i
elds into four-component Majorana elds
Mi
and the oppositely
charged
j
and
j
elds into four-component Dirac elds
j
. Converting
eq. (2.66) to four-component notation yields
12
L
int
=
1
2
(
ij
Mi
P
L
Mj
+
ij
Mi
P
R
Mj
)
(
ji
i
P
L
j
+
ij
i
P
R
j
)
_
(G
)
i
j
Mi
P
L
Mj
+ (G
)
i
j
P
L
j
(G
)
j
i
P
R
,
(3.84)
where is a complex symmetric matrix, is an arbitrary complex matrix
and G
c
hi, G
and G
are hermitian matrices. It is convenient to use

eq. (3.42) to rewrite the term proportional to (G
)
i
j
in eq. (3.84) as
follows
(G
)
i
j
Mi
P
L
Mj
=
1
2
Mi
_
(G
)
i
j
P
L
(G
)
j
i
P
R
Mj
. (3.85)
Using standard four-component methods, the Feynman rules for the
vertices are easily obtained and displayed in Fig. 3.1. Note that the
arrows on the Dirac fermion lines depict the ow of the conserved charge.
A Majorana fermion is self-conjugate, so its arrow simply reects the
structure of the interaction Lagrangian; i.e.,
M
[
M
] is represented
by an arrow pointing out of [into] the vertex. The arrow directions
12
The convention for avor indices [see Section A.2] in which raising all lowered indices
(or vice versa) is equivalent to complex conjugation is less useful in this context, since
a four-component fermion eld is made up of an unbarred two-component fermion
eld with a lowered avor index and a barred two-component fermion eld with a
raised avor index. Henceforth, we shall (arbitrarily) take all avor indices attached
to four-component fermion elds as lowered indices. Nevertheless, we continue to
distinguish ij and
ij

ij
, etc. However, in this case one must expand the rule for
the summation over repeated indices by performing the sums irrespective of whether
an index is in the lowered or raised position.
Mj
Mi
i(
ij
P
L
+
ij
P
R
)
i
i(
ji
P
L
+
ij
P
R
)
A
Mj
Mi
i
[(G
)
i
j
P
L
(G
)
j
i
P
R
]
A
i
i
[(G
)
i
j
P
L
(G
)
j
i
P
R
]
Fig. 3.1. Feynman rules for four-component fermion interactions with neutral
scalar and vector bosons.
determine the placement of the u and v spinors in an invariant amplitude
(as specied at the end of this section).
One can also treat the interaction of fermions with charged bosons.
We may then rewrite the interaction Lagrangian given in eq. (2.67) in
four-component notation:
L
int
=
1
2
_
(
2
)
ij
i
P
L
Mj
+ (
1
)
ij
i
P
R
Mj
_
(G
1
)
i
j
P
L
Mj
(G
2
)
j
i
P
R
Mj
+ h.c. (3.86)
There is an equivalent form of eq. (3.86) where the interaction
Lagrangian is written in terms of charge-conjugated elds. In general,
13
c
i
c
j
=
j
C
T
C
1
i
=
i
, (3.87)
13
In deriving eq. (3.87), we used C
T
= C and noted that the fermion elds
anticommute.
Mj
i
or
Mj
c
i
i(
2ij
P
L
+
1ij
P
R
)
Mj
i
or
Mj
c
i
i(
1ij
P
L
+
2ij
P
R
)
W
Mj
i
i
(G
1i
j
P
L
G
2j
i
P
R
)
W
Mj
c
i
i
(G
1i
j
P
R
G
2j
i
P
L
)
W
Mj
i
i
(G
1j
i
P
L
G
2i
j
P
R
)
W
Mj
c
i
i
(G
1i
j
P
R
G
2j
i
P
L
)
Fig. 3.2. Feynman rules for four-component fermion interactions with charged
scalar and vector bosons. The arrows on the boson and Dirac fermion lines
indicate the direction of charge ow.
where the sign
= +1 for = 1,
5
,
5
and
= 1 for =
5
. Noting that Majorana fermions are self-conjugate, the Feynman
rules for the interactions of neutral and charged fermions with charged
bosons can take two possible forms, as shown in Fig. 3.2. One is free to
choose either a or
c
line to represent a Dirac fermion at any place
in a given Feynman graph. The direction of the arrow on the or
c
line indicates the corresponding direction of charge ow.
14
Moreover, the
structure of the interaction Lagrangian implies that the arrow directions
on fermion lines ow continuously through the diagram. This requirement
then determines the direction of the arrows on Majorana fermion lines.
For a given process, there may be a number of distinct choices for
the arrow directions on the Majorana fermion lines, which may depend
on whether one represents a given Dirac fermion by or
c
. However,
dierent choices do not lead to independent Feynman diagrams.
15
When
computing an invariant amplitude, one rst writes down the relevant
Feynman diagrams with no arrows on any Majorana fermion line. The
number of distinct graphs contributing to the process is then determined.
Finally, one makes some choice for how to distribute the arrows on the
Majorana fermion lines and how to label Dirac fermion lines (either as the
eld or its conjuagate) in a manner consistent with the rules of Figs. 3.1
and 3.2. The end result for the invariant amplitude (apart from an overall
unobservable phase) does not depend on the choices made for the direction
of the fermion arrows.
Using the above procedure, the Feynman rules for the external fermion
wave functions are the same for Dirac and Majorana fermions:
u( p, s): incoming [or
c
] with momentum p parallel to the arrow
direction,
u( p, s): outgoing [or
c
] with momentum p parallel to the arrow
direction,
v( p, s): outgoing [or
c
] with momentum p anti-parallel to the
arrow direction,
v( p, s): incoming [or
c
] with momentum p anti-parallel to the
arrow direction.
The proof that the above rules for external wave functions apply
unambiguously to Majorana fermions is straightforward. Simply insert
14
Since the charge of
c
is opposite to that of , the corresponding arrow direction of
the two lines point in opposite directions.
15
In contrast, the two-component Feynman rules developed in Sections 2.42.7 require
that two vertices diering by the direction of the arrows on the two-component
fermion lines must both be included in the calculation of the matrix element.
the plane wave expansion of the Majorana eld [eq. (3.64)] into eq. (3.84),
and evaluate matrix elements for, e.g., the decay of a scalar or vector
particle into a pair of Majorana fermions.
Finally, we note that either or
c
can be used to represent the
propagation of a (virtual) Dirac fermion. Here, there is no ambiguity
in the propagator Feynman rule, since free Dirac fermion elds satisfy
0[T(
(x)
(y))[0
_
=
0[T(
c
(x)
c
(y))[0
_
. (3.88)
As a result, the Feynman rules for the propagator of a and
c
line are
identical.
The Feynman rule for a four-component Majorana or Dirac fermion
takes the following form:
p
i(/ p +m)
p
2
m
2
+i
Fig. 3.3. The Feynman rule for the virtual propagation of either a Majorana
or Dirac fermion. The four-component spinor labels and are explicitly
indicated.
We reiterate that the directions of the arrows that appear in both
internal and external fermion lines ow continuously through any
Feynman diagram, if the procedures outline above are followed. Then,
one may determine the contribution to an invariant amplitude from a
given Feynman diagram by traversing the diagram in a direction opposite
to the arrow directions. By this procedure, all 4 4 spinor matrix
quantities appear in their natural order, and one may drop the explicit
four-component spinor labels.
3.5 Simple examples of Feynman diagrams revisited
We now reconsider the matrix elements for scalar and vector particle
decays into fermion pairs and 2 2 elastic scattering of a fermion o
a scalar and vector boson, respectively. We shall compute the matrix
elements using the Feynman rules of g. 3.1, and check that the results
agree with the ones obtained by two-component methods of Sections 2.4
2.7.
The matrix element for the decay
M
( p
1
, s
1
)
M
( p
2
, s
2
) is given
by
i/= i u( p
1
, s
1
)(P
L
+
P
R
)v( p
2
, s
2
) . (3.89)
One can easily check that this result matches with eq. (2.72), which was
derived using two-component techniques. Note that if one had chosen
to switch the two nal states (equivalent to switching the directions of
the Majorana fermion arrows), then the resulting matrix element would
simply exhibit an overall sign change [due to the results of eq. (3.83)].
16
Similarly, for
Mi
Mj
(i ,= j) or for the decay into a pair of Dirac
fermions, , one again obtains the invariant matrix element given
in eq. (3.89).
For the decay A

M
( p
1
, s
1
)
M
( p
2
, s
2
), one obtains:
i/= iG
u( p
1
, s
1
)
5
v( p
2
, s
2
)
. (3.90)
One can easily check that this result matches with eq. (2.75). For the
decay into non-identical Majorana fermions, A

Mi
Mj
(i ,= j), we
can use the Feynman rules of Fig. 3.1 to obtain:
i/= i u( p
i
, s
i
)
_
(G
)
i
j
P
L
(G
)
j
i
P
R
v( p
j
, s
j
)
, (3.91)
Again, we note that if one had chosen to switch the two nal states
(equivalent to switching the directions of the Majorana fermion arrows),
then the resulting matrix element would simply exhibit an overall sign
change [due to the results of eq. (3.83)]. Finally, for the decay of the
vector particle into a Dirac fermion-antifermion pair, A
, the
matrix element is given by:
i/= i u( p
1
, s
1
)
(G
P
L
G
P
R
)v( p
2
, s
2
)
, (3.92)
which matches the result of eq. (2.79).
Turning to the elastic scattering of a neutral Majorana fermion and a
neutral scalar, we shall examine two equivalent ways for computing the
amplitude. Following the rules previously stated, there are two possible
choices for the direction of arrows on the Majorana fermion lines. Thus,
one may evaluate either one of the following two diagrams:
p p
plus a second diagram in each case (not shown) where the initial and nal
state scalars are crossed. The contribution of the rst diagram above to
16
The overall sign change is a consequence of the Fermi-Dirac statistics, and
corresponds to changing which order one uses to construct the two particle nal
state.
the matrix element for
M

M
is given by:
i
s m
2
u( p
2
, s
2
)(P
L
+
P
R
)(/ p +m)(P
L
+
P
R
)u( p
1
, s
1
)
=
i
s m
2
u( p
2
, s
2
)
_
[[
2
/p +
_
2
P
L
+ (
)
2
P
R
_
m
u( p
1
, s
1
) , (3.93)
where m is the Majorana fermion mass, s is the centre-of-mass energy
squared. Using eqs. (3.7) and (3.65), one recovers the results of eq. (2.80).
Had we chosen to evaluate the second diagram instead, the resulting
contribution to the invariant amplitude would have been given by:
i
s m
2
v( p
1
, s
1
)
_
[[
2
/p +
_
2
P
L
+ (
)
2
P
R
_
m
v( p
2
, s
2
) . (3.94)
Using eq. (3.83), one can quickly verify that the amplitude computed in
eq. (3.94) is just the negative of eq. (3.93). This is expected, since the
order of spinor wave functions (12) in eq. (3.94) is an odd permuation
(21) of the order of spinor wave functions in eq. (3.93). As in the
two-component Feynman rules, the overall sign of the amplitude is
arbitrary, but the relative signs of either of the diagrams above and the
corresponding crossed diagram is not ambiguous. This relative sign is
given by the relative sign of the permuation of spinor wave functions
appearing in the contributions to the invariant amplitude.
Next, we consider the elastic scattering of a charged fermion and a
neutral scalar. Again, we examine two equivalent ways for computing the
amplitude. Following the rules previously stated, there are two possible
choices for the direction of arrows on the fermion lines, depending on
whether we represent the fermion by or
c
. Thus, we may evaluate
either one of the following two diagrams:
p

p
c
plus a second diagram in each case (not shown) where the initial and nal
state scalars are crossed. Evaluating the rst diagram above, the matrix
element for is given by eq. (3.93), with replaced by . Had we
chosen to evaluate the second diagram instead, the resulting amplitude
would have been given by eq. (3.94), with replaced by . Thus, the
discussion above in the case of neutral fermion scattering processes also
applies to charged fermion scattering processes.
In processes that only involve vertices with two Dirac elds, one can
always choose to avoid employing charge-conjugated Dirac fermion lines.
This may no longer hold true for processes that involve a vertex with one
Dirac and one Majorana fermion. For example, consider the scattering
of a (charged) Dirac fermion and a charged scalar via the exchange of a
neutral Majorana fermion, in which the charge of the outgoing fermion
is opposite to that of the incoming fermion. If one attempts to draw
the relevant Feynman diagram with no charge-conjuagted Dirac fermion
lines, one nds that there is no possible choice of arrow directions for the
Majorana fermion that is consistent with the the vertex rules of Fig. 3.2.
The resolution is simple: one can choose the incoming line to be and
the outgoing line to be
c
or vice versa. Thus, the two possible choices
are given by:
p

c
p
plus a second diagram in each case (not shown) in which the intial and
nal scalars are crossed. The contribution of the rst diagram above to
the matrix element for
c
is given by:
i
s m
2
u( p
2
, s
2
)(
1
P
L
+
2
P
R
)(/ p +m)(
1
P
L
+
2
P
R
)u( p
1
, s
1
)
=
i
s m
2
u( p
2
, s
2
)
_
2
/p +
_
2
1
P
L
+ (
2
)
2
P
R
_
m
u( p
1
, s
1
) , (3.95)
where m is the mass of the s-channel exchanged Majorana fermion. One
can check that this is equivalent to eq. (2.84) obtained via the two-
component methods. Had we evaluated the second diagram above, then
after using eq. (3.83), one nds that the resulting invariant amplitude is
just the negative of eq. (3.95), as expected. As before, the relative sign
between either diagram and its crossed version is not ambiguous.
In the case of elastic scattering of a fermion and a neutral vector boson,
the two contributing diagrams are
p
plus a second diagram (not shown) where the initial and nal state vector
bosons are crossed. Consider rst the scattering of a neutral Majorana
fermion of mass m. Using the Feynman rules of Fig. 3.1 and noting that
G
is real (here, the avor index runs over only one component), we see
that the Feynman rule for the A
M
vertex is given by iG
5
.
Hence, the corresponding matrix element is given by
i/=
iG
2
s m
2
u( p
2
, s
2
)
2
(/ p m)
1
u( p
1
, s
1
) + (crossed), (3.96)
where we have used
5
(/ p + m)
5
=
(/ p m)
. Using eqs. (3.7)

and (3.65), one easily recovers the results of eq. (2.81).
Next, consider the elastic scattering of a Dirac fermion and a neutral
vector boson (Compton scattering). The corresponding matrix element
is given by
i/=
i
s m
2
u( p
2
, s
2
)
2
(G
P
L
G
P
R
)(/ p +m)
1
(G
P
L
G
P
R
)u( p
1
, s
1
) + (crossed)
=
i
s m
2
u( p
2
, s
2
)
2
_
(G
2
P
L
+G
2
P
R
)/ p G

1
u( p
1
, s
1
)
+(crossed) . (3.97)
One can easily check that this result coincides with that of eq. (2.85).
Finally, we examine the elastic scattering of two identical Majorana
fermions via scalar exchange. The three contributing diagrams are:
and the corresponding matrix element is given by
i/=
i
s m
2
[ v
1
(P
L
+
P
R
)u
2
u
3
(P
L
+
P
R
)v
4
]
+ (1)
i
t m
2
[ u
3
(P
L
+
P
R
)u
1
u
4
(P
L
+
P
R
)u
2
]
+
i
u m
2
[ u
4
(P
L
+
P
R
)u
1
u
3
(P
L
+
P
R
)u
2
] , (3.98)
where u
i
u( p
i
, s
i
), v
j
u( p
j
, s
j
) and m
is the exchanged scalar mass.

The relative minus sign of the t-channel graph relative to the other two is
obtained by noting that 3142 [4132] is an odd [even] permutation of 1234.
Using eqs. (3.7) and (3.65), one easily recovers the results of eq. (2.86).
Problems
1. (a) Translate the results of problem 3(a) of Chapter 1 to four-
component notation. Show that the resulting condition for P takes the
form:
S
1
P

S
P
= (
P
)
, (3.99)
where
S
P

_
0 P
P 0
_
. (3.100)
Verify eq. (3.99) directly in the four-component language by noting that
p
(x) = iS
P
(
x), where S
P

0
[see eq. (3.15)].
(b) Translate the results of problem 4(a) of Chapter 1 to four-
component notation. Show that the resulting condition for T takes the
form:
S
1
T

S
T
= (
T
)

, (3.101)
where
S
T

_
T 0
0 T
_
. (3.102)
Verify eq. (3.101) directly in the four-component language by noting that
t
(x) = S
T
(x), where S
T

0
B
1
A
T
[see eq. (3.16)].
2. (a) The Racah time-reversed spinor is dened by
(x
) = [
c
(x)]
t
,
where x
= (
T
)
. Without assuming the chiral representation, show

that [
c
(x)]
t
= S
CT
(x), where S
CT
=
0
5
. Then, verify that:
S
1
CT

S
CT
= (
T
)
. (3.103)
This is the direct analogue of the parity transformation given in eq. (3.99).
(b) What is the two-component version of Racah time reection?
Modify the results of problem 3(b) accordingly.
3. (a) Explain what is wrong with the following computation:
(TT (x)((TT )
1
= (T[
T
(
0
A
1
)
T
BPsi(x
T
)]((T)
1
= ([i
P
0
(
0
A
1
)
T
B(x)](
1
=
C
CA
T
[i
P
0
(
0
A
1
)
T
B(x)]
,
= i
C
T
CA
T
0
(
0
A
1
)
(x) ,
where is a four-component Dirac eld. In particular, by writing out
the various matrices in the chiral representation and keeping the two-
component spinor labels explicit, show that the last line above does not
make any sense. Compare with eq. (3.53) in the text and explain why the
latter provides the correct method of computation.
(b) The eld transforms under charge conjugation according to
eq. (3.49). Compare the following two computations:
(i(
1
= (( i (
1
)(((
1
) = i
C
CA
T
, (3.104)
(i(
1
=
C
CA
T
(i)
= i
C
CA
T
. (3.105)
Which computation is correct? Explain.
4. (a) Work out the transformation laws for
t
(x) and
t
(x) under P, T
and C, respectively.
(b) Work out the transformation laws for (x) and (x) under a CP
transformation.
5. In Section 1.6, a eld theory of n free two-component fermion elds
is constructed with an arbitrary mass matrix M
ij
. Convert the results
of this section to four-component spinor notation. Write down the most
general model of n four-component free fermion elds and discuss the
mass diagonalization procedure.
6. (a) Using the results of problem 4(c), prove the four-component
Bouchiat-Michel formulas:
u( p,
) u( p, ) =
1
2
[
+
5
s /
a
] (/ p +m) , (3.106)
v( p,
) v( p, ) =
1
2
[
+
5
s /
a
] (/ p m) , (3.107)
where the s
a
are dened byeqs. (2.93)(2.93) and the
a
are the usual
Pauli matrices.
(b) Take the m 0 limit in part (a) and derive the relevant Bouchiat-
Michel formulas for a massless helicity four-component spinor. [HINT:
Recall that s
3
= p/m+O(m/E) as m 0.]
References
[1] J.M. Jauch and F. Rohrlich, The Theory of Photons and Electrons, second
expanded edition (Springer-Verlag, New York, 1976), Appendix A2-2;
D. Bailin, in ref. [6], chapter 2.
106
4
Gauge Theories and the Standard Model
In the rst chapter, we focused on quantum eld theories of free
fermions. If we wish to construct renormalizable interacting quantum
eld theories, one must introduce additional elds. The requirement
of renormalizability imposes two constraints. First, the terms of the
interaction Lagrangian must be no higher than dimension-four. Thus,
no (perturbatively) renormalizable interacting theory that consists only
of spin-1/2 elds exists, since the simplest interaction term involving
fermions is a dimension-six four-fermion interaction. Renormalizable
interacting theories consisting of scalars, fermions and spin-one bosons
can be constructed. The vector bosons must either be abelian vector elds
or non-abelian gauge elds. This exhausts all possible renormalizable eld
theories.
The Standard Model is a spontaneously broken non-abelian gauge
theory containing elementary scalars, fermions and spin-one gauge bosons.
Typically, one refers to the spin-0 and spin-1/2 elds (which are either
neutral or charged with respect to the underlying gauge group) as matter
elds, whereas the spin-1 gauge bosons are called gauge elds. In this
chapter we review the ingredients for constructing non-abelian (Yang-
Mills) gauge theories and its breaking via the dynamics of a sector
of self-interacting scalar elds. The Standard Model of fundamental
particles and interactions is then exhibited, and some of its properties
are described.
4.1 Abelian Gauge eld theory
The rst (and simplest) known gauge theory is quantum electrodynamics
(QED). This was a very successful theory that described the interactions
107
108 4 Gauge Theories and the Standard Model
of electrons, positrons and photons. The Lagrangian of QED is given by:
L =
1
4
F
+i
+i
m( + ) , (4.1)
where the electromagnetic eld strength tensor F
is dened in terms of
the gauge eld A
as
F
, (4.2)
and the covariant derivative D
is dened as:
D
(x) (
+ieq
)(x) , (4.3)
where (x) = (x) or (x) with q
= 1 and q
= +1. Note that we

have written eq. (4.1) in terms of the two-component fermion elds and
. The identication of these elds with the electron and positron (with
corresponding electric charges in units of e > 0 are given by q
= 1 and
q
= +1) has been given in Section 2.9.

1
The QED Lagrangian consists of a sum of the kinetic energy term
for the gauge (photon) eld,
1
4
F
and the Dirac Lagrangian with

the ordinary derivative
replaced by a covariant derivative D
. This
Lagrangian is invariant under a local U(1) gauge transformation:
A
(x) A
(x) +
(x) , (4.4)
(x) exp[ie(x)](x) , (4.5)
(x) exp[ie(x)](x) , (4.6)
relecting the fact that the U(1)-charges of and elds are e and +e,
respectively. The Feynman rule for electrons interacting with photons is
obtained by taking G
= eq
for = and . That is, we employ G
=
G
= e in the two-component Feynamn rules displayed in Fig. 2.6.

One can easily check that the corresponding four-component Feynman
rule of Fig. 3.1 yields the well-known rule for the e
+
e
vertex of QED.
One can extend the theory above by including charged scalars among
the possible matter elds. For example, a scalar eld (x) of denite
U(1) charge q
will transform under the local U(1) gauge transformation:

(x) exp[ieq
(x)](x) . (4.7)
A gauge invariant Lagrangian involving the scalar elds can be obtained
from the free-eld scalar Lagrangian of eq. (3.3), by replacing
(x) with
1
It is a simple matter to rewrite eq. (4.1) in terms of the four-component spinor electron
eld. Simply replace (x) with D(x) in the Dirac Lagrangian [eq. (3.60)] with
q = 1 and add the kinetic energy term for the gauge elds.
D
(x) (
+ ieq
)(x). One may also add gauge-invariant Yukawa

interactions of the form
L
Y
= y
ijk
k
+ h.c. , (4.8)
where
i
consist of either neutral or charged scalar elds and
i
consist of
of neutral Majorana () or charged Dirac pairs ( and ) two-component
fermion elds. The (complex) Yukawa couplings y
ijk
vanish unless the
condition q
i
+q
j
+q
k
= 0 is satised, as required by gauge invariance.
4.2 Non-abelian gauge groups and their Lie algebras
Abelian gauge eld theory can be generalized by replacing the abelian
U(1) gauge group of QED with a non-abelian gauge group G. We again
consider possible matter eldsmultiplets of scalar elds
i
(x) and two-
component fermion elds
i
(x) [or equivalently, four component fermions
i
(x)] that are either neutral or charged with respect to G.
The symmetry group G can be expressed in general as a direct product
of simple gauge group factors and some number of U(1) factors. The list
of all possible simple Lie groups are known and consist of SU(n), SO(n),
Sp(n) and ve exceptional groups (G
2
, F
4
, E
6
, E
7
and E
8
). Given some
matter eld (either scalar or fermion), which we generically designate by
i
(x), the gauge transformation under which the Lagrangian is invariant,
is given by:
i
(x) U
i
j
(g)
j
(x) , i, j = 1, 2, . . . , d
R
, (4.9)
where g is an element of G (that is, g is a specic gauge transformation)
and U(g) is a (possibly reducible) unitary representation of G of dimension
d
R
. One is always free to redene the elds via (x) V (x), where
V is any xed unitary matrix (independent of the choice of g). The
gauge transformation law for the redened (x) now has U(g) replaced
by V
1
U(g)V . If U(g) is a reducible representation, then it is possible
to nd a V such that the U(g) for all group lements g assume a
block diagonal form. Otherwise, the representation U(g) is irredicible.
Henceforth, we will assume that matter eld representations are broken
down into their irreducible pieces. Irreducible representations imply that
the corresponding multiplets transform only among themselves, and thus
we can focus on these pieces separately without loss of generality.
The local gauge transformation U(g) is also a function of space-time
position, x
. Explicitly,
U(g(x)) = exp[ig
a
(x)T
a
] , (4.10)
where there is an implicit sum over the repeated index a = 1, 2, . . . , d
G
.
The T
a
are a set of d
G
linearly independent hermitian matrices
2
called
generators of the Lie group, and the corresponding
a
(x) are arbitrary
x-dependent functions. The constant g (which is analogous to e of the
abelian theory) is called the gauge coupling.
In the next section, we will see that the gauge elds transform under
the adjoint representation of the (global) gauge group. If G is semi-
simple (that is, a direct product of simple Lie groups), then the adjoint
representation is irreducible. The explicit matrix elements of the adjoint
representation generators are given by
(T
a
)
bc
= if
abc
a, b, c = 1, 2, . . . , d
G
. (4.11)
Thus, the dimension of the adjoint representation matrices coindices with
the number of generators, d
G
. Consequently, we shall often refer to the
indices a, b, c as adjoint indices.
Lie group theory teaches us that the number of linearly independent
generators, d
G
, depends only on the abstract denition of G (and not on
the choice of representation). Thus, d
G
is also called the dimension of
the Lie group G. Moreover, the commutator of two generators is a linear
combination of generators:
[T
a
, T
b
] = if
abc
T
c
, (4.12)
where the f
abc
are called the structure constants of the Lie group. In
studying the stucture of gauge eld theories, nearly all the information
of interest can be ascertained by focusing on innitesimal gauge
transformations. In this case,
U(g(x)) I
d
R
ig
a
(x)T
a
, (4.13)
and the innitesimal gauge transformation takes the form
i
(x)
i
(x)+
i
(x), where
i
(x) = ig
a
(x)(T
a
)
i
j
j
(x) . (4.14)
From the generators T
a
, one can reconstruct the group elements U(g(x)),
so it is sucient to focus on the innitesimal group transformations. The
group generators T
a
span a real vector space, whose general element is
c
a
T
a
, where the c
a
are real numbers.
3
Given, eq. (4.12), the commutator
2
The condition of linear independence means that c
a
T
a
= 0 (implicit sum over a)
implies that c
a
= 0 for all a.
3
With the T
a
hermitian, we require the c
a
to be real in order that U(g) be unitary.
Then, the T
a
span a real Lie algebra. Mathematicians consider the elements of
the real Lie algebra to be ic
a
T
a
, with anti-hermitian generators iT
a
. Note that for
real Lie algebras, the representation matrices for T
a
(or iT
a
) may be complex or
quaternionic.
of any two vectors (e.g., [c
a
T
a
, d
b
T
b
]) is well dened. Thus, one can
formally dene a vector product of any two elements of the vector space
as the commutator of the two vectors. Then, this vector space is also an
algebra, called a Lie algebra. Henceforth, the Lie algebra corresponding to
the Lie group G will be designated by g. A Lie algebra has one additional
important property:
_
T
a
, [T
b
, T
c
]
+
_
T
b
, [T
c
, T
a
]
+
_
T
c
, [T
a
, T
b
]
= 0 . (4.15)
This is called the Jacobi identity, and it is clearly satised by any three
elements of the Lie algebra.
The choice of basis vectors (or generators) T
a
is arbitrary. Moreover,
the values of the structure constants f
abc
also depend on this choice of
basis. Nevertheless, there is a canonical choice which we now adopt. The
generators are chosen such that:
Tr (T
a
T
b
) = T
R
ab
, (4.16)
where T
R
depends on the irreducible representation of the T
a
. Having
chosen this basis, there is no distinction between upper and lower adjoint
indices.
4
This basis choice is also convenient since in this basis, the
f
abc
are completely antisymmetric under the interchange of a, b and
c.
5
One can show that once T
R
is chosen for any one non-trivial
irreducible representation R, then the value of T
R
for any other irreducible
representation is xed. Corresponding to each simple real (compact)
Lie algebra, one can identify one particular irreducible representation,
called the dening representation (sometimes, but less accurately, called
the fundamental representation); the most useful examples are listed in
Table 4.1. For the dening (or fundamental) representation (which is
indicated by R = F), the conventional value for T
R
is taken to be:
T
F
=
1
2
. (4.17)
As noted above, the basis choice of eq. (4.16) with the normalization
convention given by eq. (4.17) xes the value of T
R
for an arbitrary
irreducible representation R. The quantity T
R
/T
F
is called the index
of the representation R.
For the record, we mention two other properties of Lie algebras that
will be useful in this book. First, a Casimir operator is dened to be
4
More generally, in an arbitrary basis, Tr (T
a
T
b
) = TRg
ab
, where g
ab
is the Cartan-
Killing form (which can be used to raise and lower adjoint indices).
5
By denition, f
abc
is antisymmetric under the interchange of a and b. But the
complete antisymmetry under the interchange of all three indices requires eq. (4.16)
to be satised.
Table 4.1. Simple real compact Lie algebras, g, of dimension d
G
and rank
r
G
. Note that [
1
2
n] indicates the greatest integer less than
1
2
n. The denining
representation refers to arbitrary linear combinations ic
a
T
a
, where the c
a
are
real and the T
a
are the generators of G.
g d
G
r
G
dening represnetation
so(n)
1
2
n(n 1) [
1
2
n] n n real antisymmetric
su(n) n
2
1 n 1 n n traceless complex anti-hermtian
sp(n) n(2n + 1) n n n quaternionic anti-hermitian
an operator that commutes with all the generators T
a
of G. Then, there
exists a quadratic Casimir operator that is dened by:
(T
a
)
i
k
(T
a
)
k
j
= C
R
i
j
. (4.18)
Any operator that commutes with all the generators must be a multiple
of the identity (by Schurs lemma). The coecient of
i
j
depends on the
representation R and is denoted by C
R
. The following theorem is then
easily proven:
T
R
d
G
= C
R
d(R) , (4.19)
where d(R) is the dimension of the representation R. Note that for the
adjoint representation (R = A), d(A) = d
G
, so that C
A
= T
A
. As
an example, for SU(n), Table 4.1 yields d
G
= n
2
1 and d(F) = n.
Using eq. (4.17), one the obtains C
F
= (n
2
1)/(2n). From an explicit
representation of the f
abc
for SU(n), one can also derive C
A
= T
A
= n.
Second, for any semi-simple Lie algebra g (that is, a direct sum of
simple Lie algebras), given an irreducible representation of the hermitian
generators T
a
, one can always nd an equivalent representation V
1
T
a
V
for some unitary matrix V . There exists some choice of V (not necessarily
unique) that maximizes the number of diagonal generators, V
1
T
a
V .
This number, called the rank of g, is independent of the choice of
representation, and is a property of the abstract Lie algebra. The ranks
of the classical Lie algebras are given in Table 4.1.
4.3 Non-abelian gauge eld theory
In order to construct a non-abelian gauge theory, we follow the recipe
introducted in the case of abelian gauge theory. Namely, we introduce
a gauge eld A
and a covariant derivative D
. By replacing
with
D
in the kinetic energy terms of the matter elds and introducing an

appropriate transformation law for A
, the resulting matter kinetic energy

terms are invariant under local gauge transformations.
As an example, consider a scalar eld theory with Lagrangian
L = (
i
)(
i
) V (,
) , (4.20)
where the scalar potential V is invariant under gauge transformations,
i
(x) U
i
j
(g)
j
(x); that is,
V (U, (U)
) = V (,
) . (4.21)
Although L is invariant under global (i.e., x-independent) gauge
transformations, the kinetic energy term is not invariant under local
gauge transformations due to the presence of the derivative. In particular,
under local gauge transformations
(U) = U
+(
U). We
therefore introduce the covariant derivative acting on a matter eld that
transforms according to some representation R of the symmetry group G:
(D
)
i
j
=
I
j
+igA
a
(x)(T
a
)
i
j
a = 1, 2, . . . , d
G
, (4.22)
where the avor indices i, j = 1, 2, . . . , d(R). By making a suitable
transformation law for A
m
u
a
(x), one can arrange D
to transform under
a local gauge transformation as
D
UD
, (4.23)
in which case,
L = (D
i
)
(D
i
) V (,
) (4.24)
is invariant under local gauge transformations.
The transformation law for A
a
(x) is most easily expressed for the

matrix-valued gauge eld
6
A
m
u(x) A
a
(x)T
a
. (4.25)
Under local gauge transformations, the matrix-valued gauge eld trans-
forms as
A
UA
U
1
i
g
U(
U
1
) . (4.26)
Using eq. (4.26), one quickly shows that D
transforms as expected:
D
+igA
)
_
+igUA
U
1
+U(
U
1
)
U
= UD
+
_
U +U(
U
1
)U
= UD
. (4.27)
6
The matrix-valued gauge eld that one employs in the covariant derivative depends
on the representation of the matter elds on which the covariant derivative acts.
In the last step, we noted that
U +U(
U
1
)U =
_
(
U)U
1
+U(
U
1
)
U (4.28)
=
_
(UU
1
)
U = 0 , (4.29)
since UU
1
= I.
It is also useful to exhibit the innitesimal version of the transformation
law, by taking the ininitesimal form for U [eq. (4.13)]. This allows us to
directly write down the transformation law for A
a
(x). Namely, A
a
(x)
A
a
(x) +A
a
(x), where
A
a
= gf
abc
b
A
c
a
D
ab

b
, (4.30)
where
D
ab

ab
+gf
abc
A
c
(4.31)
is the covariant derivative operator applied to elds in the adjoint
representation (i,e,, insert eq. (4.11) for the T
a
in eq. (4.22)). In
particular, under global gauge transformations, the gauge elds A
a
(x)
transform under the adjoint representation of the gauge group.
To complete the construction of the non-abelian gauge eld theory, we
must introduce a gauge invaraint kinetic energy term for the gauge elds.
To motivate the denition of the gauge eld strength tensor, we consider
[D
, D
] acting on ,
[D
, D
] = [
+igA
+igA
]
= ig
_
+ig[A
, A
]
_
. (4.32)
Thus, we dene the matrix-valued gauge eld strength tensor F

F
a
T
a
as follows:
[D
, D
] igF
. (4.33)
Using eq. (4.32) and the commutation relations of the Lie group
generators, it follows that
F
a
A
a
A
a
gf
abc
A
b
A
c
. (4.34)
Under gauge transformations, the transformation law for F
is easily
obtained. Starting with U, D
UD
(the latter is the essential

property of the covariant derivative), and [D
, D
] U[D
, D
], one
easily derives F
UF
= (UF
U
1
)U. That is,
F
UF
U
1
, (4.35)
which is the transformation law for an adjoint eld. Namely, under an
innitesimal gauge transformation,
F
a
(x) = gf
abc
b
FA
c
(x) . (4.36)
Note that inAbelian gauge theory, UF
U
1
= UU
1
F
= F
so that
F
is gauge invariant (i.e., neutral under the gauge group). For non-
Abelian gauge group, F
transforms non-trivially; i.e., it carries non-

trivial gauge charge.
We can now construct a gauge-invariant kinetic energy term for the
gauge elds:
L
gauge
=
1
4T
R
Tr (F
) (4.37)
Using eq. (4.35), it is clear that L
gauge
is gauge invarinat due to the
invariance of the trace under cyclic permutation of the quantities inside
the trace. Using Tr (F
) = F
a
F
b
Tr (T
a
T
b
) == T
R
F
a
F
a
, the
form for L
gauge
does not depend on R and we end up with:
L
gauge
=
1
4
F
a
F
a
=
1
4
(
A
a
A
a
gf
abc
A
b
A
c
)(
A
a
A
a
gf
ade
A
d
A
e
) .
(4.38)
Thus, for non-abelian gauge theory (in contrast to abelian gauge theory),
the gauge kinetic energy term generates three-point and four-point self-
interactions among the gauge elds.
To summarize, if L
matter
is invariant under a group G of global (gauge)
transformations, then
L = L
gauge
+L
matter
(
) (4.39)
is invariant under a group G of local gauge transformations. Above,
L
matter
contains a sum of kinetic energy terms of the various scalar
and fermion matter multiplets, each of which transforms under some
irreducible representation of the gauge group. In these terms, we replace
the ordinary derivative with D
+ igA
a
T
a
and use the matrix
representation T
a
appropriate for each of the matter eld multiplets.
Note that there is no mass term for the gauge eld, since the term:
L
mass
=
1
2
m
2
A
a
A
a
(4.40)
would violate the local gauge invariance. This is a tree-level result; in the
next section we will discusee whether this result persists to all orders in
perturbation theory.
4.4 Feynman rules for Gauge theories
The Feynamn rules for the self-interactions of the gauge elds and for
the interactions of matter are simple to obtain. The triple and quartic
gauge boson self-couplings follow from the the form of the gauge kinetic
energy term [eq. (4.38)]. The interactions of the gauge bosons with
matter are derived from the matter kinetic energy terms. Specically,
after replacing the ordinary derivatives with covariant derivatives, the
gauge eld dependence of D
generates cubic and quartic terms in the

Lagrangian that are linear and quadratic in A
. For example, the

Feynman rules for the interactions of gauge bosons and fermions have
been given in Figs. 2.62.8 [see also Figs. 3.1 and 3.2].
However, an apparent problem is encountered when one tries to obtain
the Feynman rule for the the gauge boson propagator. In general the rule
for the tree-level propagator is obtained by inverting the operator that
appears in the part of the Lagrangian that is quadratic in the elds. In
the case of gauge elds, this is the kinetic energy term, which we can
rewrite as follows
1
4
(
A
a
A
a
)(
A
a
A
a
)
=
1
2
A
a
(g
)A
a
+ total divergence . (4.41)

The total divergence does not contribute to the action. But, note that
(g
= 0, which implies that g
has a zero eigenvalue

and therefore is not an invertible operator.
The solution to this problem is to add the so-called gauge-xing
term and Faddeev-Popov ghost elds (adjoint elds
a
and
a
). The
justication of this procedure can be found in the standard quantum
eld theory textbooks (and is most easily explained using path integral
techniques). Here, we take a more practical view and simply note that
the following Yang-Mills (YM) Lagrangian
L
YM
=
1
4
F
a
F
a
1
2
(
a
)
2
+
a
D
ab
w
b
(4.42)
is invariant under a BRS extended gauge symmetry, whose innitesimal
transformation laws arge given by:
A
a
= D
ab
w
b
(4.43)
w
a
=
1
2
gf
abc
w
b
w
c
(4.44)
w
a
=
1
a
(4.45)
where is an innitesimal anticommuting parameter and w, w
are
independent (unphysical) anticommuting scalar elds. Note that the
4.4 Feynman rules for Gauge theories 117
gauge transformation function
a
(x) [see eq. (4.30)]has been promoted to
a eld, whose transformation law is given above. Eq. (4.42) generates new
interaction terms involving the gauge elds and Faddeev-Popov ghosts.
The Faddeev-Popov ghosts can therefore appear inside loops of Feynman
diagrams. One can show that any scattering amplitudes that involve only
physical particles as external states satisfy unitarity. Hence the theory
based on eq. (4.42) is consistent.
In particular, due to the gauge-xing term (the term that involves
the gauge-xing parameter ), one observes that the past of the
Lagrangian quadratic in the gauge elds has now changed, and the
propagator can now be dened. Converting to momentum space,
the Feynman rule for a non-abelian gauge boson propagator is:
b, a,
q
i
ab
q
2
+i
_
g
+ (1 )
q
q
2
_
The Feynman gauge ( = 1) and the Landau gauge ( = 0) are two
of the more common gauge choices made in practical computations. Of
course, any physical quantity must be independent of . This provides a
good check of Feynman diagram computations of graphs in which internal
gauge bosons propagate. Note that the above considerations also apply to
abelian gauge theories such as QED. In this case, one can can introduce
ghost elds to exhibit the extended BRS symmetry. However, within
the class of gauge xing terms considered here, the ghost elds are non-
interacting (since the photon does not carry any U(1)-charge) and hence
the ghosts can be dropped. The photon propagator still takes the form
above (but with the factor of
ab
removed).
Finally, let us examine the question of the gauge boson mass. We know
that the gauge boson is massless at tree-level. But, can one generate
mass via radiative corrections? One must compute the gauge boson
two-point function (which corresponds to the radiatively-corrected inverse
propagator) and check to see whether the zero at q
2
= 0 (corresponding
to a zero mass gauge boson) is shifted. Summing up the geometric series
yields:
T
(q) = D
(q) +D
(q)i
(q)T
(q) , (4.46)
where D
(q) is the tree-level gauge eld propagator and i
(q) is the
sum over all one-particle irreducible (1PI) diagrams (these are the graphs
cannot be split into two separate graphs by cutting through one internal
line). The Ward identity of the theory (a consequence of gauge invariance)
implies that q
= q
= 0. It follows that one can write:

i
ab
(q) = i(q
2
g
)(q
2
)
ab
. (4.47)
After Multiplying on the left of eq. (4.46) by D
1
and on the right by
T
1
(where, e.g., D
1
= g
), one obtains:
T
1
(q) = D
1
(q) = i
(q) . (4.48)
It is convenient to decompose D
(q) Decompose into transverse and

longitudinal pieces:
D
(q) = D(q
2
)
_
g
q
2
_
+D
()
(q
2
)
q
q
2
. (4.49)
A similar decomposition of the inverse D
1
(q) and T
1
(q) are easily

obtained; for example,
D
1
(q) =
1
D(q
2
)
_
g
q
2
_
+
1
D
()
(q
2
)
q
q
2
. (4.50)
Since
(q) is transverse [eq. (4.47)], one easily concludes from eq. (4.48)
that
T
()
(q
2
) = D
()
(q
2
) =
i
q
2
+i
(4.51)
T
1
(q
2
) = D
1
(q
2
) +iq
2
(q
2
) . (4.52)
Using D
1
(q
2
) = iq
2
, we conclude that
7
T
(q) =
i
q
2
[1 + (q
2
)]
_
g
q
2
_
iq
q
4
. (4.53)
Thus, the pole at q
2
= 0 is not shifted. That is, the gauge boson mass
remains zero to all orders in perturbation theory.
This elegant argument has a loophole. Namely, if (q
2
) develops a pole
at q
2
= 0, then the pole of T
(q) shifts away from zero: (q

2
) m
2
v
/q
2
as q
2
0 implies that T(q
2
) i/(q
2
m
2
v
). This requires some non-
trivial dynamics to generates a massless intermediate state in
(q).
Such a massless state is called the Goldstone boson. The Standard Model
employs the dynamics of elementary scalar elds in order to generate the
Goldstone mode. We thus turn our attention to the vector boson mass
generation mechanism of the Standard Model.
7
We have dropped the explicit +i that is associated with the pole of the propagator.
4.5 Spontaneously broken gauge theories
Start with a non-abelian gauge theory with scalar (and fermion) matter,
given by eq. (4.39). The corresponding scalar potential function must
be gauge invariant [eq. (4.21)]. In order to identify the physical scalar
degrees of freedom of this model, one must minimize the scalar potential
and determine the corresponding values of the scalar elds at the potential
minimum. These are the scalar vacuum expectation values. Expanding
the scalar elds about their vacuum expectation values yields the tree-
level scalar masses and self-couplings. However, in general the scalar
elds are charged under the global symmetry group G, in which case a
non-zero vacuum expectation value would be incompatible with the global
symmetry.
4.5.1 Goldstones theorem
In the absence of the gauge elds, Goldstone proved the following theorem:
Theorem: If the Lagrangian is invariant under a continuous global
symmetry group G (with dimension d
G
), but the vacuum state of the
theory does not respect all G-transformations, then the theory exhibits
spontaneous symmetry breaking. If the vacuum state respects H-
transformations, then we say that the group G breaks to a subgroup H
(with dimension d
H
). The physical spectrum will then contain n massless
scalar excitations (called Goldstone bosons) where n d
G
d
H
.
This theorem can be proved independent of perturbation theory. How-
ever, it can be demonstrated rather easily with a tree-level computation.
Let
i
(x) be a set of N real scalar elds. The Lagrangian
L =
1
2
(
i
)(
i
) V (
i
) (4.54)
is assumed to be invariant under O(N), under which the scalar elds
transform innitesimally as
i
(x) = ig
a
(T
a
)
j
i
i
(x) (4.55)
where the T
a
are imaginary antisymmetric matrices. Since under the
symmetry transformation,
i
O
i
j
j
(where O is orthogonal), the
kinetic term is automatically invariant.
We also assume that V () is invariant, which implies that
V ( +) V () +
V
i
= V () . (4.56)
Step one above is simply a Taylor expansion to rst order in the eld
variation, while step two imposes the invariance assumption. Inserting
the result for
i
(x) from eq. (4.55), it follows that:
V
i
(T
a
)
i
j
j
= 0 . (4.57)
Suppose V () has a minimum at
i
) = v
i
, which is not invariant under
the global symmetry group. That is,
O
i
j
v
j

_
i
j
ig
a
(T
a
)
i
j
_
v
j
,= v
i
. (4.58)
It follows that there exists at least one a such that
(T
a
v)
j
,= 0 (4.59)
and we conclude that the global symmetry is spontaneously broken. Now,
shift the eld by its vacuum expectation value:
i
v
i
+
i
(4.60)
and express the scalar Lagrangian in terms of the
i
L =
1
2
(
i
)(
i
)
1
2
M
2
ij
j
+O(
3
) , (4.61)
where
M
2
ij

_

2
V
j
_
i
=v
i
. (4.62)
The terms cubic and higher in the
i
do not concern us here. Since V
must satisfy eq. (4.57), we can dierentiate the latter with respect to
k
and then set all
i
= v
i
. Since (V/
i
)
i
=v
i
= 0 as a consequence of the
minimum condition, the end result is given by
M
2
ki
(T
a
v)
i
= 0 . (4.63)
Either T
a
v = 0 or (T
a
v)
i
is an eigenvector of M
2
with zero eigenvalue.
That is, G
a
=
i
T
a
ij
v
j
is precisely the linear combination of scalar elds
whose mass is zero. These are the Goldstone bosons, and there are d
G
d
H
independent Goldstone modes, where H is the residual symmetry group
(corresponding to the maximal number of linearly independent elements
of the Lie algebra that annihilate the vacuum).
4.5.2 Massive gauge bosons
If a spontaneously broken global symmetry is made local, then a
remarkable mechanism called the Higgs mechanism takes place. The
Goldstone bosons disappear from the spectrum, and the formerly massless
gauge bosons become massive. Roughly speaking, the Goldstone bosons
become the longitudinal degrees of freedom of the massive gauge bosons.
This is easily demonstrated in a generalization of the tree-level analysis
given above. If the O(N) scalar sector above is coupled to gauge elds,
then one must replace the ordinary derivative with a covariant derivative
in the scalar kinetic energy term:
L
KE
=
1
2
[
i
+igA
a
(T
a
)
i
j
j
][
i
+igA
a
(T
a
)
i
k
k
] . (4.64)
As above, the
i
(x) are real elds at the T
a
are pure imaginary
antisymmetric matrix generators corresponding to the representation of
the scalar multiplet. It is convenient to dene real antisymmetric matrices
L
a
iT
a
. (4.65)
If we expand the
i
around its vacuum expectation value [eq. (4.60)], then
one immediately observes a term quadratic in the gauge elds:
L
mass
=
1
2
M
2
ab
A
a
A
b
, (4.66)
where
M
2
ab
= g
2
(L
a
v, L
b
v) (4.67)
is the gauge boson squared-mass matrix. Here, we have employed a
convenient notation where:
(x, y)
i
x
i
y
i
. (4.68)
If L
a
v ,= 0 for at least one a, then the gauge symmetry is broken and
the gauge bosons acquire mass. The gauge boson squared-mass matrix is
real symmetric, so it can be diagonalized with an orthogonal similarity
transformation:
OM
2
O
T
= diag (0, 0, . . . , 0, m
2
1
, m
2
2
, . . .) . (4.69)
If we dene
L
a
O
ab
L
b
(4.70)
then
(OM
2
O
T
)
ab
= (
L
a
v,
L
b
v) = m
2
a
ab
(4.71)
where m
2
a
= 0,

L
a
v = 0 a = 1, . . . , d
H
correspond to the unbroken
generators and m
2
a
,= 0,

L
a
v ,= 0 a = d
H
+ 1, . . . , d
G
correspond to
the broken generators. That is, there are d
G
d
H
massive gauge bosons
(corresponding to the dimension of the coset space G/H). The gauge
boson mass eigenstates are:
A
a
O
ab
A
b
. (4.72)
Note that
L
a
A
a
= O
ac
O
ab
L
c
A
b
= L
c
A
c
. (4.73)
It is instructive to revisit the scalar Lagrangian:
L =
1
2
(
i
+ (L
a
)
i
j
A
a
j
)(
i
+ (L
b
)
i
k
A
b
k
) V () . (4.74)
Expanding the scalar elds around their vacuum expectation values, and
identifying the massive gauge boson mass eigenstates,
L =
1
2
(
i
)(
i
) +
1
2
m
2
a
A
a
A
a
+
1
2
(
L
a
A
a
v,
) +
1
2
(
L
a
A
a
v) +. . .
=
1
2
(
i
)(
i
) +
1
2
m
2
a
A
a
A
a
+m
a
A
a
G
a
+. . . , (4.75)
where
G
a
=
1
m
a
(
L
a
v, ) . (4.76)
In eq. (4.75), we have not displayed terms cubic or higher in the elds.
Note that the sum over the repeated index a is assumed. We recognize
G
a
as the Goldstone boson which appeared in the spontaneously broken
global theory without gauge bosons.
The coupling m
a
A
a
G
a
provides an explanation for the vector
boson mass generation mechanism. Namely, this coupling yields a new
interaction vertex in the theory. We can then evaluate the contribution
of an intermediate Goldstone line to i
(q):
i
(k) = m
2
a
k
(k
)
i
k
2
+. . .
= im
2
a
k
k
2
+. . . (4.77)
This is the only source for a pole at k
2
= 0 at this order in perturbation
theory. Gauge invariance ensures that
i
(k) = i(k
k
2
g
)(k
2
) (4.78)
so that
(k
2
)
m
2
a
k
2
(4.79)
and
T(k
2
) =
i
k
2
[1 + (k
2
)]
=
i
k
2
m
2
a
(4.80)
We say that the gauge boson eats or absorbs the corresponding
Goldstone boson and thereby acquires mass via the Higgs mechanism.
4.5.3 The unitary and R
gauges
The spontaneously broken non-abelian gauge theory Lagrangian contains
the Goldstone boson eld. However, as we shall now demonstrate, the
Goldstone bosons are gauge artifacts that can be removed by a gauge
transformation. Consider the transformation law for the shifted scalar
eld,
i
(x)
i
(x) v
i
. Using eqs. (4.55) and (4.65) and noting that
v
i
= 0,
i
= ( +v) , (4.81)
where = L
a
a
=

L
a
a
if we dene

a
= O
ab
b
. Then,
G
a
=
1
m
a
(
L
a
v, ) =
1
m
a
(
L
a
v, + v) (4.82)
=
1
m
a
(
L
a
v, )
1
m
a
(
L
a
v,
L
b
v)
b
=
1
m
a
(
L
a
v, ) m
a
a
, (4.83)
after using eq. (4.71) for the gauge boson masses. Note that the last term
above, m
a
a
is an inhomogeneous term independent of . Thus, one can
simply choose

a
such that G
a
= 0. This is called the unitary gauge.
So far, we have not mentioned the gauge xing term and Faddeev-
Popov ghosts. In spontaneously-broken non-abelian gauge theories, the
R
-gauge turns out to be particularly useful.

8
The R
-gauge is dened
8
The R stands for renormalizable, and is the gauge xing parameter. In the R-
gauges, the theory is manifestly renormalizable although unitarity is not manifest
and must be separately proved. The exact opposite is true for the unitary gauge.
by the following gauge-xing term:
L
GF
=
1
2
_
A
a
i
(
L
a
)
i
_
2
=
1
2
(
A
a
)
2
+
1
2
_
L
a
(
A
a
)
_
+
1
2
_
,
L
a
(
A
a
)
_
i
(
L
a
)
i
_
2
=
1
2
(
A
a
)
2
+m
a
G
a
(
A
a
)
m
2
a
2
G
a
G
a
. (4.84)
At this point, we notice that the term m
a
G
a
(
A
a
) of eq. (4.84) combines

with m
a
A
a
G of eq. (4.75) to yield

m
a
[G
a
A
a
+

A
a
G
a
] = m
a
(G
a
A
a
) , (4.85)
which is a total divergence which can be dropped from the Lagrangian.
One must also include Faddeev-Popov ghosts:
L
FP
=
a
D
ab
w
b
w
a
M
2
ab
w
b
g
2
w
a
w
b
(, T
a
T
b
v) , (4.86)
D
ab
is dened in eq. (4.31) and M

2
ab
is the gauge boson squared-mass
matrix.
In the R
gauges, the Feynman rules for the massless and massive

gauge boson propagators take on the following form, respectively:
b, a,
q
i
ab
q
2
+i
_
g
+ (1
0
)
q
q
2
_
b, a,
q
i
ab
q
2
m
2
a
+i
_
g
+ (1 )
q
q
2
m
2
a
_
Above, the massless gauge bosons are indicated by wavy lines, and the
massive gauge bosons are indicated by curly lines. In principle
0
,= is
possible since for

L
a
v = 0 we are free to choose an arbitrary -parameter.
In practice, one usually sets
0
= .
In addition, we note from eq. (4.84) that the Goldstone bosons have
acquired a squared-mass equal to m
2
a
. This is an artifact of the gauge
choice. Nevertheless, for a consistent computation in the R
-gauge, both
the Goldstone bosons and the Faddeev-Popov ghosts must be included
as possible internal lines in Feynman graphs. As noted previously, any
physical quantity must ultimately be independent of .
As in the unbroken non-abelian gauge theory, the two most useful
gauges are = 1 (now called the t Hooft-Feynman gauge) and
= 0 (still called the Landau gauge). The Landau gauge has an
additional extra benet that the Goldstone bosons are massless. Finally,
we note that one can attempt to take the limit of . This
corresponds to the unitary gauge, since the Goldstone boson masses
become innite and thus decouple from all Feynman graphs. Moreover,
one can check that the massive gauge boson propagator reduces to
b, a,
q
i
ab
q
2
m
2
a
+i
_
g
+
q
m
2
a
_
which is the expected form for a massive gauge boson propagator in
a gauge-noninvariant model. The fact that the unitary gauge is a
limiting case of the R
-gauge played an essential role in the proof

that spontaneously-broken non-abelian gauge theories are unitary and
renormalizable.
4.5.4 The physical Higgs bosons
An important check of the formalism is the counting of bosonic degrees of
freedom. Assume that the multiplet of scalar elds transforms according
to some representation R of dimension d(R) under the transformation
group G (which has dimension d
G
). Prior to spontaneous symmetry
breaking, the models contains d(R) scalar degrees of freedom and 2d
G
vector-boson degrees of freedom. The latter consists of d
G
massless gauge
bosons (one for each possible value of the adjoint index a), with each
massless gauge boson contributing two degrees of freedom corresponding
to the two possible transverse helicities. After spontaneous symmetry
breaking of G to a subgroup H (which has dimension d
H
), there are
d
G
d
H
Goldstone bosons which are unphysical (and can be removed
from the spectrum by going to the unitary gauge). This leaves
d(R) d
G
+d
H
scalar degrees of freedom (4.87)
which correspond to the physical Higgs bosons of the theory. We also
found that there are d
H
massless gauge bosons (one for each unbroken
generator) and d
G
d
H
massive gauge bosons (one for each broken
generator). But, for massive gauge bosons which possess a longitudinal
helicity state, we must count three degrees of freedom. Thus, we end up
with
3d
G
d
H
vector boson degrees of freedom (4.88)
Adding the two yields a total of d(R) + 2d
G
bosonic degrees of freedom,
in agreement with our previous counting.
It is instructive to check that the physical Higgs bosons cannot be
removed by a gauge transformation. We divide the scalars into two
classes: (i) the Goldstone bosons G
a
, a = d
H
+ 1, . . . , d
G
[see eq. (4.76)]
and (ii) the scalar states orthogonal to G
a
. These are the Higgs bosons:
H
k
= c
(k)
j

j
, (4.89)
where the c
(k)
j
satisfy:
j
c
(k)
j
(
L
a
v)
j
= 0 . (4.90)
Under a gauge transformation,
(c
(k)
j

j
) = c
(k)
j

j
= c
(k)
j
( + v)
j
= c
(k)
j
()
j
, (4.91)
where we have noted that c
j
(v)
j
= c
j
a
(
L
a
v)
j
= 0 by orthogonality.
Thus the transformation law for

H
k
is homogeneous in the scalar
elds, and thus one cannot remove the Higgs boson eld by a gauge
transformation.
The states

H
k
are in general not mass-eigenstates. Employing the
_
G
a
,

H
k
_
basis for the scalar elds, we note that the scalar boson squared-
mass matrix
/
2
ij
=
_

2
V
j
_
i
=v
(4.92)
satises (M
2
)
ij
(
L
a
v)
j
= 0. Thus, the non-derivative quadratic terms in
the scalar part of the Lagrangian is
L
scalar mass
=
1
2
(/
2
)
k
H
k
. (4.93)
Diagonalizing /
2
yields the Higgs boson eigenstates, H
k
, and their
respective squared-masses.
4.6 Complex representations of scalar elds 127
4.6 Complex representations of scalar elds
We now have nearly all the ingredients necessary to construct the
Standard Model. However, in our treatment of spontaneously broken
non-abelian gauge elds, we took the scalar elds to be real elds that
transformed under a simple orthogonal symmetry group. The Standard
Model employs complex scalars that transform under semi-simple product
of unitary symmetry groups. In this section, we provide the necessary
information which will allow one to treat the complex and semi-simple
cases.
4.7 The Standard Model of particle physics
The Standard Model is a spontaneously broken non-abelian gauge theory
based on the symmetry group SU(3)SU(2)U(1).
4.8 Parameter count of the Standard Model
The Standard Model Lagrangian appears to contain many parameters.
However, not all these parameters can be physical. One is always free to
redene the Standard Model elds in an arbitrary manner. By suitable
redenitions, one can remove some of the apparent parameter freedom and
identify the true physical independent parametric degrees of freedom.
To illustrate the procedure, let us rst make a list of the parameters
of the Standard Model. First, the gauge sector consists of three real
gauge couplings (g
3
, g
2
and g
1
) and the QCD vacuum angle (
QCD
). The
Higgs sector consists of one Higgs squared-mass parameter and one Higgs
self-coupling (m
2
and ). Traditionally, one trades in the latter two real
parameters for the vacuum expectation value (v = 246 GeV) and the
physical Higgs mass. The fermion sector consists of three Higgs-Yukawa
coupling matrices y
u
, y
d
, and y
e
. Initially, y
u
, y
d
, and y
e
are arbitrary
complex 33 matrices, which in total depend on 27 real and 27 imaginary
degrees of freedom.
But, most of these degrees of freedom are unphysical. In particular,
in the limit where y
u
= y
d
= y
e
= 0, the Standard Model possesses a
global U(3)
5
symmetry corresponding to three generations of the ve
SU(3)SU(2)U(1) multiplets: (
m
, e
m
)
L
, (e
c
m
)
L
, (u
m
, d
m
)
L
, (u
c
m
)
L
,
(d
c
m
)
L
, where m is the generation label. Thus, one can make global
U(3)
5
rotations on the fermion elds of the Standard Model to absorb
the unphysical degrees of freedom of y
u
, y
d
, and y
e
. A U(3) matrix can
be parameterized by three real angles and six phases, so that with the
most general U(3)
5
rotation, we can apparently remove 15 real angles
and 30 phases from y
u
, y
d
, and y
e
. However, the U(3)
5
rotations include
four exact U(1) global symmetries of the Standard Model, namely B and
the three separate lepton numbers L
e
, L
and L
. Thus, one can only

remove 26 phases from y
u
, y
d
, and y
e
. This leaves 12 real parameters
(corresponding to six quark masses, three lepton masses,
9
and three
CKM mixing angles) and one imaginary degree of freedom (the phase of
the CKM matrix). Adding up to get the nal result, one nds that the
Standard Model possesses 19 independent parameters (of which 13 are
associated with the avor sector).
4.9 Grand Unication and Running Couplings
The gauge couplings of the Standard Model are functions of the energy
scale.
9
The neutrinos in the Standard Model are automatically massless and are not counted
as independent degrees of freedom in the parameter count.
Part 2
Constructing Supersymmetric Theories
5
Introduction to Supersymmetry
5.1 Motivation: the Hierarchy Problem
Despite the successes of the Standard Model, it seems very likely that new
physics is present at the TeV scale. The mere fact that the ratio M
P
/M
W
is so huge is already a powerful clue to the character of physics beyond
the Standard Model, because of the infamous hierarchy problem. This
is not really a diculty with the Standard Model itself, but rather a
disturbing sensitivity of the Higgs potential to new physics in almost any
imaginable extension of the Standard Model. The electrically neutral part
of the Standard Model Higgs eld is a complex scalar H with a classical
potential given by
V = m
2
H
[H[
2
+[H[
4
. (5.1)
The Standard Model requires a non-vanishing vacuum expectation value
(VEV) for H at the minimum of the potential. This will occur if m
2
H
< 0,
resulting in H) =
_
m
2
H
/2. Since we know experimentally that H) =
174 GeV from measurements of the properties of the weak interactions,
it must be that m
2
H
is very roughly of order (100 GeV)
2
. However, m
2
H
receives enormous quantum corrections from the virtual eects of every
particle that couples, directly or indirectly, to the Higgs eld.
For example, in Fig. 5.1a we have a correction to m
2
H
from a loop
containing a Dirac fermion f with mass m
f
. If the Higgs eld couples to
f with a term in the Lagrangian
f
Hff, then the Feynman diagram in
Fig. 5.1a yields a correction
m
2
H
=
[
f
[
2
16
2
_
2
2
UV
+ 6m
2
f
ln(
UV
/m
f
) +. . .
. (5.2)
Here
UV
is an ultraviolet momentum cuto used to regulate the loop
integral; it should be interpreted as the energy scale at which new
131
132 5 Introduction to Supersymmetry
H
f
(a)
S
H
(b)
Fig. 5.1. One-loop quantum corrections to the Higgs (mass)
2
parameter m
2
H
,
due to (a) a virtual fermion f, and (b) a virtual scalar S.
physics enters to alter the high-energy behavior of the theory. The
ellipses represent terms that depend on the precise manner in which the
momentum cuto is applied, and that approach a constant as
UV
.
Each of the leptons and quarks of the Standard Model can play the role
of f; for quarks, eq. (5.2) should be multiplied by 3 to account for color.
The largest correction comes when f is the top quark with
f
1. The
problem is that if
UV
is of order M
P
, say, then this quantum correction
to m
2
H
is some 30 orders of magnitude larger than the aimed-for value of
m
2
H
(100 GeV)
2
. This is only directly a problem for corrections to
the Higgs scalar boson (mass)
2
, because quantum corrections to fermion
and gauge boson masses do not have the quadratic sensitivity to
UV
found in eq. (5.2). However, the quarks and leptons and the electroweak
gauge bosons Z
0
, W
of the Standard Model all owe their masses to H),

so that the entire mass spectrum of the Standard Model is directly or
indirectly sensitive to the cuto
UV
.
One could imagine that the solution is to simply pick an ultraviolet
cuto
UV
that is not too large. However, one still has to concoct some
new physics at the scale
UV
that not only alters the propagators in the
loop, but actually cuts o the loop integral. This is not easy to do in a
theory whose Lagrangian does not contain more than two derivatives, and
higher-derivative theories generally suer from a failure of either unitarity
or causality [2]. In string theories, this problem is ignored, and loop
integrals are nevertheless cut o at high Euclidean momentum p by factors
e
p
2
/
2
UV
. However, then
UV
is a string scale that is usually
2
thought to
be not very far below M
P
. Furthermore, there is a contribution similar
to eq. (5.2) from the virtual eects of any arbitrarily heavy particles that
might exist.
For example, suppose there exists a heavy complex scalar particle
S with mass m
S
that couples to the Higgs with a Lagrangian term
2
Some recent attacks on the hierarchy problem have assumed that the ultimate cuto
scale is actually much smaller than the apparent Planck scale.
5.1 Motivation: the Hierarchy Problem 133
(b)
H
F
(a)
H
F
Fig. 5.2. Two-loop corrections to the Higgs (mass)
2
due to a heavy fermion.
S
[H[
2
[S[
2
. Then the Feynman diagram in Fig. 5.1b gives a correction
m
2
H
=

S
16
2
_
2
UV
2m
2
S
ln(
UV
/m
S
) +. . .
. (5.3)
If one rejects the possibility of a physical interpretation for
UV
and uses
dimensional regularization on the loop integral instead of a momentum
cuto, then there will be no
2
UV
piece. However, even then the
term proportional to m
2
S
cannot be eliminated without the physically
unjustiable tuning of a counter-term specically for that purpose. So
m
2
H
is sensitive to the masses of the heaviest particles that H couples to;
if m
S
is very large, its eects on the Standard Model do not decouple,
but instead make it very dicult to understand why m
2
H
is so small.
This problem arises even if there is no direct coupling between the
Standard Model Higgs boson and the unknown heavy particles. For
example, suppose there exists a heavy fermion F that, unlike the quarks
and leptons of the Standard Model, has vector-like quantum numbers and
therefore gets a large mass m
F
without coupling to the Higgs eld. [In
other words, an arbitrarily large mass term of the form m
F
FF is not
forbidden by any symmetry, including SU(2)
L
.] In that case, no diagram
like Fig. 5.1a exists for F. Nevertheless there will be a correction to m
2
H
as
long as F shares some gauge interactions with the Standard Model Higgs
eld; these may be the familiar electroweak interactions, or some unknown
gauge forces that are broken at a very high energy scale inaccessible to
experiment. In either case, the two-loop Feynman diagrams in Fig. 5.2
yield a correction
m
2
H
= x
_
g
2
16
2
_
2
_
a
2
UV
+ 48m
2
F
ln(
UV
/m
F
) +. . .
, (5.4)
where g is the gauge coupling in question, and x is a group theory factor of
order 1. (Specically, x is the product of the quadratic Casimir invariant
of H and the Dynkin index of F for the gauge group in question.) The
coecient a depends on the precise method of cutting o the momentum
integrals. It does not arise at all if one uses dimensional regularization, but
the m
2
F
contribution is always present. The numerical factor (g
2
/16
2
)
2
may be quite small (of order 10
5
for electroweak interactions), but the
important point is that these contributions to m
2
H
are sensitive to the
largest masses and/or ultraviolet cuto in the theory, presumably of order
M
P
. The natural (mass)
2
of a fundamental Higgs scalar, including
quantum corrections, therefore seems to be more like some small but
appreciable fraction of M
2
P
rather than the experimentally favored value!
Even very indirect contributions from Feynman diagrams with three or
more loops can give unacceptably large contributions to m
2
H
. The
argument above applies not just for heavy particles, but for arbitrary high-
scale physical phenomena such as condensates or additional compactied
space-time dimensions.
If the Higgs boson is a fundamental particle, and there really is physics
far above the electroweak scale, then we have two remaining options:
either we must make the rather bizarre assumption that there do not exist
any heavy particles that couple (even indirectly or extremely weakly) to
the Higgs scalar eld, or else some rather striking cancellation is needed
between the various contributions to m
2
H
.
The systematic cancellation of the dangerous contributions to m
2
H
can only be brought about by the type of conspiracy that is better known
to physicists as a symmetry. The form of eqs. (5.2) and (5.3) suggests that
the new symmetry ought to relate fermions and bosons, because of the
relative minus sign between fermion loop and boson loop contributions
to m
2
H
. (Note that
S
must be positive if the scalar potential is to be
bounded from below.) If each of the quarks and leptons of the Standard
Model is accompanied by two complex scalars with
S
= [
f
[
2
, then the
2
UV
contributions of Figs. 5.1a and 5.1b will neatly cancel. Clearly, more
restrictions on the theory will be necessary to ensure that this success
persists to higher orders, so that, for example, the contributions in Fig. 5.2
and eq. (5.4) from a very heavy fermion are cancelled by the two-loop
eects of some very heavy bosons. Fortunately, the cancellation of all
such contributions to scalar masses is not only possible, but is actually
unavoidable, once we merely assume that a symmetry relating fermions
and bosons, called a supersymmetry, exists.
5.2 Enter supersymmetry
A supersymmetry transformation turns a bosonic state into a fermionic
state, and vice versa. The operator Q that generates such transformations
must be an anticommuting spinor, with
Q[Boson) = [Fermion); Q[Fermion) = [Boson). (5.5)
Spinors are intrinsically complex objects, so

Q (the hermitian conjugate
of Q) is also a symmetry generator. Because Q and

Q are fermionic
operators, they carry spin angular momentum 1/2, so it is clear that
supersymmetry must be a spacetime symmetry. The possible forms
for such symmetries in an interacting quantum eld theory are highly
restricted by the Haag-Lopuszanski-Sohnius extension of the Coleman-
Mandula theorem. For realistic theories that, like the Standard Model,
have chiral fermions (i.e., fermions whose left- and right-handed pieces
transform dierently under the gauge group) and thus the possibility
of parity-violating interactions, this theorem implies that the generators
Q and

Q must satisfy an algebra of anticommutation and commutation
relations with the schematic form
Q,

Q = P
, (5.6)
Q, Q =
Q,

Q = 0, (5.7)
[P
, Q] = [P
,

Q] = 0, (5.8)
where P
is the momentum generator of spacetime translations. Here

we have ruthlessly suppressed the spinor indices on Q and

Q; we will,
in Chapter 6, derive the precise version of eqs. (5.6)-(5.8) with indices
restored. In the meantime, we simply note that the appearance of P
on
the right-hand side of eq. (5.6) is unsurprising, since it transforms under
Lorentz boosts and rotations as a spin-1 object while Q and

Q on the
left-hand side each transform as spin-1/2 objects.
The single-particle states of a supersymmetric theory fall into irre-
ducible representations of the supersymmetry algebra, called supermul-
tiplets. Each supermultiplet contains both fermion and boson states,
which are commonly known as superpartners of each other. By denition,
if [) and [
) are members of the same supermultiplet, then [
) is
proportional to some combination of Q and

Q operators acting on [),
up to a spacetime translation or rotation. The (mass)
2
operator P
2
commutes with the operators Q,

Q, and with all spacetime rotation and
translation operators, so it follows immediately that particles that inhabit
the same irreducible supermultiplet must have equal eigenvalues of P
2
,
and therefore equal masses.
The supersymmetry generators Q,

Q also commute with the generators
of gauge transformations. Therefore particles in the same supermultiplet
must also be in the same representation of the gauge group, and so must
have the same electric charges, weak isospin, and color degrees of freedom.
Each supermultiplet contains an equal number of fermion and boson
degrees of freedom. To prove this, consider the operator (1)
2s
where s is
the spin angular momentum. By the spin-statistics theorem, this operator
has eigenvalue +1 acting on a bosonic state and eigenvalue 1 acting on
a fermionic state. Any fermionic operator will turn a bosonic state into
a fermionic state and vice versa. Therefore (1)
2s
must anticommute
with every fermionic operator in the theory, and in particular with Q
and

Q. Now consider the subspace of states [i) in a supermultiplet that
have the same eigenvalue p
of the four-momentum operator P
. In view
of eq. (5.8), any combination of Q or

Q acting on [i) will give another
state [i
) that has the same four-momentum eigenvalue. Therefore one

has a completeness relation

i
[i)i[ = 1 within this subspace of states.
Now one can take a trace over all such states of the operator (1)
2s
P
(including each spin helicity state separately):
i
i[(1)
2s
P
[i) =
i
i[(1)
2s
Q

Q[i) +
i
i[(1)
2s

QQ[i)
=
i
i[(1)
2s
Q

Q[i) +
i,j
i[(1)
2s

Q[j)j[Q[i)
=
i
i[(1)
2s
Q

Q[i) +
j
j[Q(1)
2s

Q[j)
=
i
i[(1)
2s
Q

Q[i)
j
j[(1)
2s
Q

Q[j)
= 0. (5.9)
The rst equality follows from the supersymmetry algebra relation
eq. (5.6); the second and third from use of the completeness relation;
and the fourth from the fact that (1)
2s
must anticommute with Q. Now
i
i[(1)
2s
P
[i) = p
Tr[(1)
2s
] is just proportional to the number of
bosonic degrees of freedom n
B
minus the number of fermionic degrees of
freedom n
F
in the trace, so that
n
B
= n
F
(5.10)
must hold for a given p
,= 0 in each supermultiplet.
The simplest possibility for a supermultiplet consistent with eq. (5.10)
has a single Weyl fermion (with two spin helicity states, so n
F
= 2) and
two real scalars (each with n
B
= 1). It is natural to assemble the two real
scalar degrees of freedom into a complex scalar eld; as we will see below
this provides for convenient formulation of the supersymmetry algebra,
Feynman rules, supersymmetry violating eects, etc. This combination
of a two-component Weyl fermion and a complex scalar eld is called a
chiral or matter or scalar supermultiplet.
The next-simplest possibility for a supermultiplet contains a spin-1
vector boson. If the theory is to be renormalizable, this must be a
gauge boson that is massless, at least before the gauge symmetry is
spontaneously broken. A massless spin-1 boson has two helicity states,
so the number of bosonic degrees of freedom is n
B
= 2. Its superpartner
is therefore a massless spin-1/2 Weyl fermion, again with two helicity
states, so n
F
= 2. (If one tried instead to use a massless spin-3/2 fermion,
the theory would not be renormalizable.) Gauge bosons must transform
as the adjoint representation of the gauge group, so their fermionic
partners, called gauginos, must also. Since the adjoint representation
of a gauge group is always its own conjugate, the gaugino fermions must
have the same gauge transformation properties for left-handed and for
right-handed components. Such a combination of spin-1/2 gauginos and
spin-1 gauge bosons is called a gauge or vector supermultiplet.
If we include gravity, then the spin-2 graviton (with 2 helicity states, so
n
B
= 2) has a spin-3/2 superpartner called the gravitino. The gravitino
would be massless if supersymmetry were unbroken, and so it has n
F
= 2
helicity states.
There are other possible combinations of particles with spins that can
satisfy eq. (5.10). However, these are always reducible to combinations of
chiral and gauge supermultiplets if they have renormalizable interactions,
except in certain theories with extended supersymmetry. Theories
with extended supersymmetry have more than one distinct copy of
the supersymmetry generators Q,

Q. Such models are mathematically
amusing, but evidently do not have any phenomenological prospects.
The reason is that extended supersymmetry in four-dimensional eld
theories cannot allow for chiral fermions or parity violation as observed
in the Standard Model. So we will not discuss such possibilities further,
although extended supersymmetry in higher-dimensional eld theories
might describe the real world if the extra dimensions are compactied,
and extended supersymmetry in four dimensions provides interesting toy
models. The ordinary, non-extended, phenomenologically-viable type of
supersymmetric model is sometimes called N = 1 supersymmetry, with
N referring to the number of supersymmetries (the number of distinct
copies of Q,

Q).
In a supersymmetric extension of the Standard Model, each of the
known fundamental particles is therefore in either a chiral or gauge
supermultiplet, and must have a superpartner with spin diering by 1/2
unit. The rst step in understanding the exciting phenomenological
consequences of this prediction is to decide exactly how the known
particles t into supermultiplets, and to give them appropriate names.
A crucial observation here is that only chiral supermultiplets can contain
fermions whose left-handed parts transform dierently under the gauge
group than their right-handed parts. All of the Standard Model fermions
(the known quarks and leptons) have this property, so they must be
members of chiral supermultiplets.
3
The names for the spin-0 partners
of the quarks and leptons are constructed by prepending an s, which
is short for scalar. So, generically they are called squarks and sleptons
(short for scalar quark and scalar lepton). The left-handed and right-
handed pieces of the quarks and leptons are separate two-component Weyl
fermions with dierent gauge transformation properties in the Standard
Model, so each must have its own complex scalar partner. The symbols for
the squarks and sleptons are the same as for the corresponding fermion,
but with a tilde () used to denote the superpartner of a Standard Model
particle. For example, the superpartners of the left-handed and right-
handed parts of the electron Dirac eld are called left- and right-handed
selectrons, and are denoted e
L
and e
R
. It is important to keep in mind
that the handedness here does not refer to the helicity of the selectrons
(they are spin-0 particles) but to that of their superpartners. A similar
nomenclature applies for smuons and staus:
L
,
R
,
L
,
R
. The Standard
Model neutrinos (neglecting their very small masses) are always left-
handed, so the sneutrinos are denoted generically by , with a possible
subscript indicating which lepton avor they carry:
e
,
. Finally, a
complete list of the squarks is q
L
, q
R
with q = u, d, s, c, b, t. The gauge
interactions of each of these squark and slepton elds are the same as for
the corresponding Standard Model fermions; for instance, the left-handed
squarks u
L
and

d
L
couple to the W boson while u
R
and

d
R
do not.
It seems clear that the Higgs scalar boson must reside in a chiral
supermultiplet, since it has spin 0. Actually, it turns out that just one
chiral supermultiplet is not enough. One way to see this is to note that
if there were only one Higgs chiral supermultiplet, the electroweak gauge
symmetry would suer a gauge anomaly, and would be inconsistent as a
quantum theory. This is because the conditions for cancellation of gauge
anomalies include Tr[T
2
3
Y ] = Tr[Y
3
] = 0, where T
3
and Y are the third
component of weak isospin and the weak hypercharge, respectively, in a
normalization where the ordinary electric charge is Q
EM
= T
3
+ Y . The
traces run over all of the left-handed Weyl fermionic degrees of freedom
in the theory. In the Standard Model, these conditions are already
satised, somewhat miraculously, by the known quarks and leptons. Now,
a fermionic partner of a Higgs chiral supermultiplet must be a weak
isodoublet with weak hypercharge Y = 1/2 or Y = 1/2. In either case
alone, such a fermion will make a non-zero contribution to the traces and
spoil the anomaly cancellation. This can be avoided if there are two Higgs
supermultiplets, one with each of Y = 1/2, so that the total contribution
3
In particular, one cannot attempt to make a spin-1/2 neutrino be the superpartner
of the spin-1 photon; the neutrino is in a doublet, and the photon is neutral, under
weak isospin.
to the anomaly traces from the two fermionic members of the Higgs chiral
supermultiplets vanishes by cancellation. As we will see in section 8.1,
both of these are also necessary for another completely dierent reason:
because of the structure of supersymmetric theories, only a Y = +1/2
Higgs chiral supermultiplet can have the Yukawa couplings necessary to
give masses to charge +2/3 up-type quarks (up, charm, top), and only a
Y = 1/2 Higgs can have the Yukawa couplings necessary to give masses
to charge 1/3 down-type quarks (down, strange, bottom) and to the
charged leptons. We will call the SU(2)
L
-doublet complex scalar elds
with Y = 1/2 and Y = 1/2 by the names H
u
and H
d
, respectively.
4
The
weak isospin components of H
u
with T
3
= (+1/2, 1/2) have electric
charges 1, 0 respectively, and are denoted (H
+
u
, H
0
u
). Similarly, the
SU(2)
L
-doublet complex scalar H
d
has T
3
= (+1/2, 1/2) components
(H
0
d
, H
d
). The neutral scalar that corresponds to the physical Standard
Model Higgs boson is in a linear combination of H
0
u
and H
0
d
; we will
discuss this further in section 8.5. The generic nomenclature for a spin-
1/2 superpartner is to append -ino to the name of the Standard Model
particle, so the fermionic partners of the Higgs scalars are called higgsinos.
They are denoted by

H
u
,

H
d
for the SU(2)
L
-doublet left-handed Weyl
spinor elds, with weak isospin components

H
+
u
,

H
0
u
and

H
0
d
,

H
d
.
We have now found all of the chiral supermultiplets of a minimal
phenomenologically viable extension of the Standard Model. They are
summarized in Table 1, classied according to their transformation prop-
erties under the Standard Model gauge group SU(3)
C
SU(2)
L
U(1)
Y
,
which combines u
L
, d
L
and , e
L
degrees of freedom into SU(2)
L
doublets.
Here we follow a standard convention that all chiral supermultiplets are
dened in terms of left-handed Weyl spinors, so that the conjugates of
the right-handed quarks and leptons (and their superpartners) appear
in Table 1. This protocol for dening chiral supermultiplets turns out
to be very useful for constructing supersymmetric Lagrangians, as we
will see in Chapter 6. It is also useful to have a symbol for each of
the chiral supermultiplets as a whole; these are indicated in the second
column of Table 5.1. Thus, for example, Q stands for the SU(2)
L
-doublet
chiral supermultiplet containing u
L
, u
L
(with weak isospin component
T
3
= +1/2), and

d
L
, d
L
(with T
3
= 1/2), while u stands for the
SU(2)
L
-singlet supermultiplet containing u
R
, u
R
. There are three families
for each of the quark and lepton supermultiplets; Table 5.1 lists the
rst-family representatives. Below, a family index i = 1, 2, 3 will be
axed to the chiral supermultiplet names (Q
i
, u
i
, . . .) when needed,
4
Other notations appearing in the literature have H1, H2 or H, H instead of Hu, H
d
.
The notation used here has the virtue of making it easy to remember which Higgs is
responsible for giving masses to which type of quarks.
Table 5.1. Chiral supermultiplets in the Minimal Supersymmetric Standard
Model. The spin-0 elds are complex scalars, and the spin-1/2 elds are left-
handed two-component Weyl fermions.
Names spin 0 spin 1/2 SU(3)
C
, SU(2)
L
,
U(1)
Y
squarks, quarks Q ( u
L

d
L
) (u
L
d
L
) ( 3, 2 ,
1
6
)
(3 families) u u
R
u
R
( 3, 1,
2
3
)
d

d
d
R
( 3, 1,
1
3
)
sleptons, leptons L ( e
L
) ( e
L
) ( 1, 2 ,
1
2
)
(3 families) e e
R
e
R
( 1, 1, 1)
Higgs, higgsinos H
u
(H
+
u
H
0
u
) (
H
+
u

H
0
u
) ( 1, 2 , +
1
2
)
H
d
(H
0
d
H
d
) (
H
0
d
d
) ( 1, 2 ,
1
2
)
e.g. (e
1
, e
2
, e
3
) = (e, , ). The bar on u, d, e elds is part of the name,
and does not denote any kind of conjugation.
It is interesting to note that the Higgs chiral supermultiplet H
d
(containing H
0
d
, H
d
,

H
0
d
,

H
d
) has exactly the same Standard Model
gauge quantum numbers as the left-handed sleptons and leptons L
i
,
e.g. ( , e
L
, , e
L
). Naively, one might therefore suppose that we could
have been more economical in our assignment by taking a neutrino and
a Higgs scalar to be superpartners, instead of putting them in separate
supermultiplets. This would amount to the proposal that the Higgs boson
and a sneutrino should be the same particle. This is a nice try that
played a key role in some of the rst attempts to connect supersymmetry
to phenomenology, but it is now known to not work. Even ignoring
the anomaly cancellation problem mentioned above, many insoluble
phenomenological problems would result, including lepton number non-
conservation and a mass for at least one of the neutrinos in gross violation
of experimental bounds. Therefore, all of the superpartners of Standard
Model particles are really new particles, and cannot be identied with
some other Standard Model state.
The vector bosons of the Standard Model clearly must reside in gauge
supermultiplets. Their fermionic superpartners are generically referred to
as gauginos. The SU(3)
C
color gauge interactions of QCD are mediated
by the gluon, whose spin-1/2 color-octet supersymmetric partner is the
Table 5.2. Gauge supermultiplets in the Minimal Supersymmetric Standard
Model.
Names spin 1/2 spin 1 SU(3)
C
, SU(2)
L
, U(1)
Y
gluino, gluon g g ( 8, 1 , 0)
winos, W bosons

W

W
0
W
W
0
( 1, 3 , 0)
bino, B boson

B
0
B
0
( 1, 1 , 0)
gluino. As usual, a tilde is used to denote the supersymmetric partner of
a Standard Model state, so the symbols for the gluon and gluino are g
and g respectively. The electroweak gauge symmetry SU(2)
L
U(1)
Y
has
associated with it spin-1 gauge bosons W
+
, W
0
, W
and B
0
, with spin-
1/2 superpartners

W
+
,
W
0
,
and

B
0
, called winos and bino. After
electroweak symmetry breaking, the W
0
, B
0
gauge eigenstates mix to
give mass eigenstates Z
0
and . The corresponding gaugino mixtures of
W
0
and

B
0
are called zino (
Z
0
) and photino ( ); if supersymmetry were
unbroken, they would be mass eigenstates with masses m
Z
and 0. Table
2 summarizes the gauge supermultiplets of a minimal supersymmetric
extension of the Standard Model.
The chiral and gauge supermultiplets in Tables 1 and 2 make up
the particle content of the Minimal Supersymmetric Standard Model
(MSSM). The most obvious and interesting feature of this theory is that
none of the superpartners of the Standard Model particles has been
discovered as of this writing. If supersymmetry were unbroken, then
there would have to be selectrons e
L
and e
R
with masses exactly equal
to m
e
= 0.511... MeV. A similar statement applies to each of the other
sleptons and squarks, and there would also have to be a massless gluino
and photino. These particles would have been extraordinarily easy to
detect long ago. Clearly, therefore, supersymmetry is a broken symmetry
in the vacuum state chosen by Nature.
A very important clue as to the nature of supersymmetry breaking can
be obtained by returning to the motivation provided by the hierarchy
problem. Supersymmetry forced us to introduce two complex scalar
elds for each Standard Model Dirac fermion, which is just what is
needed to enable a cancellation of the quadratically divergent (
2
UV
)
pieces of eqs. (5.2) and (5.3). This sort of cancellation also requires
that the associated dimensionless couplings should be related (e.g.
S
=
[
f
[
2
). The necessary relationships between couplings indeed occur in
unbroken supersymmetry, as we will see in Chapter 6. In fact, unbroken
supersymmetry guarantees that the quadratic divergences in scalar
squared masses must vanish to all orders in perturbation theory.
5
Now,
if broken supersymmetry is still to provide a solution to the hierarchy
problem, then the relationships between dimensionless couplings that hold
in an unbroken supersymmetric theory must be maintained. Otherwise,
there would be quadratically divergent radiative corrections to the Higgs
scalar masses of the form
m
2
H
=
1
8
2
(
S
[
f
[
2
)
2
UV
+. . . . (5.11)
We are therefore led to consider soft supersymmetry breaking. This
means that the eective Lagrangian of the MSSM can be written in the
form
L = L
SUSY
+L
soft
, (5.12)
where L
SUSY
contains all of the gauge and Yukawa interactions and
preserves supersymmetry invariance, and L
soft
violates supersymmetry
but contains only mass terms and couplings with positive mass dimension.
Without further justication, soft supersymmetry breaking might seem
like a rather arbitrary requirement. Fortunately, we will see in section
9 that theoretical models for supersymmetry breaking do indeed yield
eective Lagrangians with just such terms for L
soft
. If the largest
mass scale associated with the soft terms is denoted m
soft
, then the
additional non-supersymmetric corrections to the Higgs scalar (mass)
2
must vanish in the m
soft
0 limit, so by dimensional analysis they cannot
be proportional to
2
UV
. More generally, these models maintain the
cancellation of quadratically divergent terms in the radiative corrections of
all scalar masses, to all orders in perturbation theory. The corrections also
cannot go like m
2
H
m
soft
UV
, because in general the loop momentum
integrals always diverge either quadratically or logarithmically, not
linearly, as
UV
. So they must be of the form
m
2
H
= m
2
soft
_

16
2
ln(
UV
/m
soft
) +. . .
_
. (5.13)
Here is schematic for various dimensionless couplings, and the ellipses
stand both for terms that are independent of
UV
and for higher loop
corrections (which depend on
UV
through powers of logarithms).
5
A simple way to understand this is to note that unbroken supersymmetry requires
the degeneracy of scalar and fermion masses. Radiative corrections to fermion masses
are known to diverge at most logarithmically, so the same must be true for scalar
masses in unbroken supersymmetry.
Because the mass splittings between the known Standard Model
particles and their superpartners are just determined by the parameters
m
soft
appearing in L
soft
, eq. (5.13) tells us that the superpartner masses
cannot be too huge. Otherwise, we would lose our successful cure for the
hierarchy problem since the m
2
soft
corrections to the Higgs scalar (mass)
2
would be unnaturally large compared to the square of the electroweak
breaking scale of 174 GeV. The top and bottom squarks and the winos
and bino give especially large contributions to m
2
Hu
and m
2
H
d
, but
the gluino mass and all the other squark and slepton masses also feed
in indirectly, through radiative corrections to the top and bottom squark
masses. Furthermore, in most viable models of supersymmetry breaking
that are not unduly contrived, the superpartner masses do not dier from
each other by more than about an order of magnitude. Using
UV
M
P
and 1 in eq. (5.13), one nds that roughly speaking m
soft
, and
therefore the masses of at least the lightest few superpartners, should
be at the most about 1 TeV or so, in order for the MSSM scalar potential
to provide a Higgs VEV resulting in m
W
, m
Z
= 80.4, 91.2 GeV without
miraculous cancellations. This is the best reason for the optimism among
many theorists that supersymmetry will be discovered at the Fermilab
Tevatron or the CERN Large Hadron Collider, and can be studied at a
future e
+
e
linear collider.
However, it is useful to keep in mind that the hierarchy problem
was not the historical motivation for the development of supersymmetry
in the early 1970s. The supersymmetry algebra and supersymmetric
eld theories were originally concocted independently in various disguises
bearing little resemblance to the MSSM. It is quite impressive that a
theory that was developed for quite dierent reasons, including purely
aesthetic ones, can later be found to provide a solution for the hierarchy
problem.
One might also wonder whether there is any good reason why all of the
superpartners of the Standard Model particles should be heavy enough
to have avoided discovery so far. There is. All of the particles in the
MSSM that have been detected so far have something in common; they
would necessarily be massless in the absence of electroweak symmetry
breaking. In particular, the masses of the W
, Z
0
bosons and all quarks
and leptons are equal to dimensionless coupling constants times the
Higgs VEV 174 GeV, while the photon and gluon are required to
be massless by electromagnetic and QCD gauge invariance. Conversely,
all of the undiscovered particles in the MSSM have exactly the opposite
property; each of them can have a Lagrangian mass term in the absence
of electroweak symmetry breaking. For the squarks, sleptons, and Higgs
scalars this follows from a general property of complex scalar elds that
a mass term m
2
[[
2
is always allowed by all gauge symmetries. For the
higgsinos and gauginos, it follows from the fact that they are fermions in
a real representation of the gauge group. So, from the point of view of
the MSSM, the discovery of the top quark in 1995 marked a quite natural
milestone; the already-discovered particles are precisely those that had to
be light, based on the principle of electroweak gauge symmetry. There is
a single exception: one neutral Higgs scalar boson should be lighter than
about 150 GeV if supersymmetry is correct, for reasons to be discussed
in section 8.5.
A very important feature of the MSSM is that the superpartners listed
in Tables 1 and 2 are not necessarily the mass eigenstates of the theory.
This is because after electroweak symmetry breaking and supersymmetry
breaking eects are included, there can be mixing between the electroweak
gauginos and the higgsinos, and within the various sets of squarks and
sleptons and Higgs scalars that have the same electric charge. The lone
exception is the gluino, which is a color octet fermion and therefore
does not have the appropriate quantum numbers to mix with any other
particle. The masses and mixings of the superpartners are obviously of
paramount importance to experimentalists. It is perhaps slightly less
obvious that these phenomenological issues are all quite directly related
to one central question that is also the focus of much of the theoretical
work in supersymmetry: How is supersymmetry broken? The reason
for this is that most of what we do not already know about the MSSM
has to do with L
soft
. The structure of supersymmetric Lagrangians allows
very little arbitrariness, as we will see in Chapter 6. In fact, all of the
dimensionless couplings and all but one mass term in the supersymmetric
part of the MSSM Lagrangian correspond directly to parameters in the
ordinary Standard Model that have already been measured by experiment.
For example, we will nd out that the supersymmetric coupling of a gluino
to a squark and a quark is determined by the QCD coupling constant
s
.
In contrast, the supersymmetry-breaking part of the Lagrangian contains
many unknown parameters and, apparently, a considerable amount of
arbitrariness. Each of the mass splittings between Standard Model
particles and their superpartners correspond to terms in the MSSM
Lagrangian that are purely supersymmetry-breaking in their origin and
eect. These soft supersymmetry-breaking terms can also introduce a
large number of mixing angles and CP-violating phases not found in the
Standard Model. Fortunately, as we will see in section 8.4, there is already
strong evidence that the supersymmetry-breaking terms in the MSSM are
actually not arbitrary at all. Furthermore, the additional parameters will
be measured and constrained as the superpartners are detected. From a
theoretical perspective, the challenge is to explain all of these parameters
with a predictive model for supersymmetry breaking.
5.3 Historical analogies 145
5.3 Historical analogies
In this section, we recount some entertaining examples of historical
analogies to supersymmetry. Dirac predicted the positron. Gell-Mann
predicted the
. However, nobody predicted the Spanish Inquisition!

6
Supersymmetric Lagrangians
In this chapter we will describe the construction of supersymmetric
Lagrangians. Our aim is to arrive at a sort of recipe that will allow
us to write down the allowed interactions and mass terms of a general
supersymmetric theory, so that later we can apply the results to the
special case of the MSSM. We will not use the supereld language, which
is often more elegant and ecient for those who know it, but which might
seem rather cabalistic to some readers. Our approach is therefore intended
to be rather complementary to the supereld derivations given in Chapter
7. We begin by considering the simplest example of a supersymmetric
theory in four dimensions; the free Wess-Zumino Model.
6.1 A free chiral supermultiplet
The minimum fermion content of any theory in four dimensions consists
of a single left-handed two-component Weyl fermion . Since this is an
intrinsically complex object, it seems sensible to choose as its superpartner
a complex scalar eld . The simplest action we can write down for these
elds just consists of kinetic energy terms for each:
S =
_
d
4
x (L
scalar
+L
fermion
) (6.1)
L
scalar
=
, L
fermion
= i
. (6.2)
This is called the massless, non-interacting Wess-Zumino model, and
it corresponds to a single chiral supermultiplet as discussed in the
Introduction.
A supersymmetry transformation should turn the scalar boson eld
into something involving the fermion eld
. The simplest possibility

146
for the transformation of the scalar eld is
= ;
, (6.3)
where
is an innitesimal, anticommuting, two-component Weyl fermion

object that parameterizes the supersymmetry transformation. Until
section 9.2, we will be discussing global supersymmetry, which means that
is a constant, satisfying
= 0. Since has dimensions of (mass)

3/2
and has dimensions of (mass), it must be that has dimensions of
(mass)
1/2
. Using eq. (6.3), we nd that the scalar part of the Lagrangian
transforms as
L
scalar
=
. (6.4)
We would like for this to be cancelled by L
fermion
, at least up to a total
derivative, so that the action will be invariant under the supersymmetry
transformation. Comparing eq. (6.4) with L
fermion
, we see that for this
to have any chance of happening, should be linear in and in and
contain one spacetime derivative. Up to a multiplicative constant, there
is only one possibility to try:
= i(

= i(
. (6.5)
With this guess, one immediately obtains
L
fermion
=
. (6.6)
This can be put in a slightly more useful form by employing the Pauli
matrix identities
_
= 2
;
_

= 2

(6.7)
and using the fact that partial derivatives commute (
).
Equation (6.6) then becomes
L
fermion
=
_
. (6.8)
The rst two terms here just cancel against L
scalar
, while the remaining
contribution is a total derivative. So we arrive at
S =
_
d
4
x (L
scalar
+L
fermion
) = 0, (6.9)
justifying our guess of the numerical multiplicative factor made in
eq. (6.5).
148 6 Supersymmetric Lagrangians
We are not quite nished in demonstrating that the theory described by
eq. (6.1) is supersymmetric. We must also show that the supersymmetry
algebra closes; in other words, that the commutator of two supersymmetry
transformations parameterized by spinors
1
and
2
is another symmetry
of the theory. Using eq. (6.5) in eq. (6.3), one nds
(
2
)
2
(
1
)
1
(
2
)
= i(
1
1
)
. (6.10)
This is a remarkable result; in words, we have found that the commutator
of two supersymmetry transformations gives us back the derivative of the
original eld. Since
just corresponds to the generator of spacetime

translations P
, eq. (6.10) implies the form of the supersymmetry algebra

that was foreshadowed in eq. (5.6) of the Introduction. (We will make
this statement more explicit before the end of this chapter.)
All of this will be for nothing if we do not nd the same result for the
fermion , however. Using eq. (6.3) in eq. (6.5), we nd
(
2
)
= i(

1
)
+i(

2
)
. (6.11)
We can put this into a more useful form by applying the Fierz identity
() =
()
() (6.12)
with =
1
, =
2
, =
, and again with =

2
, =
1
,
=
, followed in each case by an application of the identity eq. (1.72).

The result is
(
2
)
= i(
1

1
)
+i
1

2
i
2

1
. (6.13)
The last two terms in (6.13) vanish on-shell; that is, if the equation of
motion
= 0 following from the action is enforced. The remaining

piece is exactly the same spacetime translation that we found for the
scalar eld.
The fact that the supersymmetry algebra only closes on-shell (when the
classical equations of motion are satised) might be somewhat worrisome,
since we would like the symmetry to hold even quantum mechanically.
This can be xed by a trick. We invent a new complex scalar eld F,
which does not have a kinetic term. Such elds are called auxiliary, and
they are really just book-keeping devices that allow the symmetry algebra
to close o-shell. The Lagrangian density for F and its complex conjugate
is just
L
auxiliary
= F
F . (6.14)
The dimensions of F are (mass)
2
, unlike an ordinary scalar eld, which
has dimensions of (mass). Equation (6.14) implies the not-very-exciting
equations of motion F = F
= 0. However, we can use the auxiliary elds

to our advantage by including them in the supersymmetry transformation
rules. In view of eq. (6.13), a plausible thing to do is to make F transform
into a multiple of the equation of motion for :
F = i
; F
= i
. (6.15)
Once again we have chosen the overall factor on the right-hand sides by
virtue of foresight. Now the auxiliary part of the Lagrangian density
transforms as
L
auxiliary
= i
+i
F, (6.16)
which vanishes on-shell, but not for arbitrary o-shell eld congurations.
It is easy to see that by adding an extra term to the transformation law
for and

:
= i(
F;

= i(
+

F
(6.17)
one obtains an additional contribution to L
fermion
, which just cancels
with L
auxiliary
up to a total derivative term. So our modied
theory with L = L
scalar
+ L
fermion
+ L
auxiliary
is still invariant under
supersymmetry transformations. Proceeding as before, one now obtains
for each of the elds X = ,
, ,

, F, F
,
(
2
)X = i(
1

1
)
X (6.18)
using eqs. (6.3), (6.15), and (6.17), but without resorting to any of
the equations of motion. So we have succeeded in showing that
supersymmetry is a valid symmetry of the Lagrangian o-shell.
In retrospect, one can see why we needed to introduce the auxiliary
eld F in order to get the supersymmetry algebra to work o-shell. On-
shell, the complex scalar eld has two real propagating degrees of
freedom, which match with the two spin polarization states of . O-
shell, however, the Weyl fermion is a complex two-component object,
so it has four real degrees of freedom. (Going on-shell eliminates half of
the propagating degrees of freedom for , because the Lagrangian is linear
in time derivatives, so that the canonical momenta can be reexpressed in
terms of the conguration variables without time derivatives and are not
independent phase space coordinates.) To make the numbers of bosonic
and fermionic degrees of freedom match o-shell as well as on-shell, we
had to introduce two more real scalar degrees of freedom in the complex
eld F, which are eliminated when one goes on-shell. This counting is
Table 6.1. Counting of real degrees of freedom in the Wess-Zumino model.
F
on-shell (n
B
= n
F
= 2) 2 2 0
o-shell (n
B
= n
F
= 4) 2 4 2
summarized in Table 6.1. The auxiliary eld formulation is especially
useful when discussing spontaneous supersymmetry breaking, as we will
see in section 9.
Invariance of the action under a symmetry transformation always
implies the existence of a conserved current, and supersymmetry is
no exception. The supercurrent J
is an anticommuting four-vector,
which also carries a spinor index, as bets the current associated with
a symmetry with fermionic generators. By the usual Noether procedure,
one nds for the supercurrent (and its hermitian conjugate) in terms of
the variations of the elds X = ,
, ,

, F, F
:
J
+

J
X
X
L
(
X)
K
, (6.19)
where K
is an object whose divergence is the variation of the Lagrangian

density under the supersymmetry transformation, L =
. Note that
K
is not unique; one can always replace K
by K
+ k
, where k
is
any vector satisfying
= 0, for example k
. A
little work reveals that, up to the ambiguity just mentioned,
J
= (
;

J

= (
. (6.20)
The supercurrent and its hermitian conjugate are separately conserved:
= 0;

= 0, (6.21)
as can be veried by use of the equations of motion. From these currents
one constructs the conserved charges
Q
2
_
d
3
x J
0
;

Q

=
2
_
d
3
x

J
0

, (6.22)
which are the generators of supersymmetry transformations. (The factor
of

2 normalization is included to agree with an arbitrary historical
convention.) As quantum mechanical operators, they satisfy
_
Q+

Q, X
= i
2 X (6.23)
for any eld X, up to terms that vanish on-shell. This can be
veried explicitly by using the canonical equal-time commutation and
anticommutation relations
[(x), (y)] = [
(x),
(y)] = i
(3)
(x y); (6.24)

(x),
(y) = (
0
)

(3)
(x y) (6.25)
derived from the free eld theory Lagrangian eq. (6.1). Here =
0
and
=
0
are the momenta conjugate to and
respectively. Now the

content of eq. (6.18) can be expressed in terms of canonical commutators
as
_
2
Q+
2

Q,
_
1
Q+
1

Q, X
1
Q+
1

Q,
_
2
Q+
2

Q, X
_
=
2(
2

2
) i
X (6.26)
up to terms that vanish on-shell. The spacetime momentum operator
P
= (H,

P), where H is the Hamiltonian and

P is the three-momentum
operator, is given in terms of the canonical variables by
H =
+ (
) (
) +i

(6.27)
P =
. (6.28)
It generates spacetime translations on the elds X according to
[P
, X] = i
X. (6.29)
Rearranging the terms in eq. (6.26) using the Jacobi identity, we therefore
have
_
_
2
Q+
2

Q,
1
Q+
1

Q
, X
_
= 2(
2
2
) [P
, X], (6.30)
for any X, so it must be that
_
2
Q+
2

Q,
1
Q+
1

Q
= 2(
2

2
) P
, (6.31)
up to terms that vanish on-shell. Now by expanding out eq. (6.31), one
obtains the non-schematic form of the supersymmetry algebra relations
Q
,

Q

= 2
, (6.32)
Q
, Q
Q

,

Q
= 0 (6.33)
as promised in the Introduction. [The commutator in eq. (6.31) turns into
anticommutators in eqs. (6.32) and (6.33) in the process of extracting
the anticommuting spinors
1
and
2
.] The results [Q
, P
] = 0 and
[

Q

, P
] = 0 follow immediately from eq. (6.29) and the fact that the
supersymmetry transformations are global (independent of position in
spacetime). This demonstration of the supersymmetry algebra in terms
of the canonical generators Q and

Q requires the use of the Hamiltonian
equations of motion, but the symmetry itself is valid o-shell at the level
of the Lagrangian, as we have already shown.
6.2 Interactions of chiral supermultiplets
In a realistic theory like the MSSM, there are many chiral supermultiplets
that have both gauge and non-gauge interactions. In this section, our task
is to construct the most general possible theory of masses and non-gauge
interactions for particles that live in chiral supermultiplets. In the MSSM
these are the quarks, squarks, leptons, sleptons, Higgs scalars and higgsino
fermions. We will nd that the form of the non-gauge couplings, including
mass terms, is highly restricted by the requirement that the action is
invariant under supersymmetry transformations. (Gauge interactions will
be dealt with in the following sections.)
Our starting point is the Lagrangian density for a collection of free
chiral supermultiplets labeled by an index i, which runs over all gauge and
avor degrees of freedom. Since we will want to construct an interacting
theory with supersymmetry closing o-shell, each supermultiplet contains
a complex scalar
i
and a left-handed Weyl fermion
i
as physical degrees
of freedom, plus a complex auxiliary eld F
i
, which does not propagate.
The results of the previous section tell us that the free part of the
Lagrangian is
L
free
=
i
+i
i
+F
i
F
i
(6.34)
where we sum over repeated indices i (not to be confused with the
suppressed spinor indices), with the convention that elds
i
and
i
always carry lowered indices, while their conjugates always carry raised
indices. It is invariant under the supersymmetry transformation
i
=
i
,
i
=
i
, (6.35)
(
i
)
= i(
i
+
F
i
, (6.36)
(

i
)

= i(
i
+

F
i
, (6.37)
F
i
= i
i
, F
i
= i
. (6.38)
We will now nd the most general set of renormalizable interactions for
these elds that is consistent with supersymmetry. To begin, note that
in order to be renormalizable by power counting, each term must have
dynamical eld content with mass dimension 4. So, the only candidates
are:
L
int
=
_
1
2
W
ij
j
+W
i
F
i
+x
ij
F
i
F
j
_
+ c.c. +U, (6.39)
6.2 Interactions of chiral supermultiplets 153
where W
ij
, W
i
, x
ij
, and U are polynomials in the scalar elds
i
,
i
,
with degrees 1, 2, 0, and 4, respectively. [Terms of the form F
i
F
j
can be
absorbed, by a redenition of the auxiliary elds, into the last term in
equation (6.34).]
We must now require that L
int
is invariant under the supersymmetry
transformations, since L
free
was already invariant by itself. This
immediately requires that the candidate term U(
i
,
i
) must vanish.
If there were such a term, then under a supersymmetry transformation
eq. (6.35) it would transform into another function of the scalar elds
only, multiplied by
i
or
i
, and with no spacetime derivatives or F
i
,
F
i
elds. It is easy to see from eqs. (6.35)-(6.40) that nothing of this form
can possibly be cancelled by the supersymmetry transformation of any
other term in the Lagrangian. Similarly, the dimensionless coupling x
ij
must be zero, because its supersymmetry transformation likewise cannot
possibly be cancelled by any other term. So, we are left with
L
int
=
_
1
2
W
ij
j
+W
i
F
i
_
+ c.c. (6.40)
as the only possibilities. At this point, we are not assuming that W
ij
and W
i
are related to each other in any way whatsoever. However, soon
we will nd out that they are related, which is why we have chosen the
same letter for them. Notice that eq. (1.54) tells us that W
ij
is symmetric
under i j.
It is easiest to divide the variation of L
int
into several parts, which must
cancel separately. First, we consider the part that contains four spinors:
L
int
[
4spinor
=
_
1
2
W
ij
k
(
k
)(
i
j
)
1
2
W
ij
k
(
k
)(
i
j
)
_
+ c.c. (6.41)
The term proportional to (
k
)(
i
j
) cannot cancel against any other
term. Fortunately, however, the Fierz identity eq. (6.12) implies
(
i
)(
j
k
) + (
j
)(
k
i
) + (
k
)(
i
j
) = 0, (6.42)
so this contribution to L
int
vanishes identically if and only if W
ij
/
k
is totally symmetric under interchange of i, j, k. There is no such identity
available for the term proportional to (
k
)(
i
j
). Since that term cannot
cancel with any other, requiring it to be absent just tells us that W
ij
cannot contain
k
. In other words, W
ij
is analytic (or holomorphic) in
the complex elds
k
.
Combining what we have learned so far, we can write
W
ij
= M
ij
+y
ijk
k
(6.43)
where M
ij
is a symmetric mass matrix for the fermion elds, and y
ijk
is
a Yukawa coupling of a scalar
k
and two fermions
i
j
that must be
totally symmetric under interchange of i, j, k. It is convenient to write
W
ij
=

2
j
W (6.44)
where we have introduced a very useful object
W =
1
2
M
ij
j
+
1
6
y
ijk
k
, (6.45)
called the superpotential. This is not a scalar potential in the ordinary
sense; in fact, it is not even real. It is instead an analytic function of the
scalar elds
i
treated as complex variables.
Continuing on our vaunted quest, we next consider the parts of L
int
that contain a spacetime derivative:
L
int
[
=
_
iW
ij
j

i
+iW
i
_
+ c.c. (6.46)
Here we have used the identity eq. (1.72) on the second term, which came
from (F
i
)W
i
. Now we can use eq. (6.44) to observe that
W
ij
j
=
_
W
i
_
. (6.47)
Therefore, eq. (6.46) will be a total derivative if and only if
W
i
=
W
i
= M
ij
j
+
1
2
y
ijk
k
, (6.48)
which explains why we chose its name as we did. The remaining terms in
L
int
are all linear in F
i
or F
i
, and it is easy to show that they cancel,
given the results for W
i
and W
ij
that we have already found.
To recap, we have found that the most general non-gauge interactions
for chiral supermultiplets are determined by a single analytic function of
the complex scalar elds, the superpotential W. The auxiliary elds F
i
and F
i
can be eliminated using their classical equations of motion. The
part of L
free
+ L
int
that contains the auxiliary elds is F
i
F
i
+ W
i
F
i
+
W
i
F
i
, leading to the equations of motion
F
i
= W
i
; F
i
= W
i
. (6.49)
Thus the auxiliary elds are expressible algebraically (without any
derivatives) in terms of the scalar elds. After making the replacement
eq. (6.49) in L
free
+L
int
, we obtain the Lagrangian density
L =
i
+i
1
2
_
W
ij
j
+W
ij

j
_
W
i
W
i
. (6.50)
(Since F
i
and F
i
appear only quadratically in the action, the result of
instead doing a functional integral over them at the quantum level has
precisely the same eect.) Now that the non-propagating elds F
i
, F
i
have been eliminated, it is clear from eq. (6.50) that the scalar potential
for the theory is just given in terms of the superpotential by (recall L
contains V ):
V (,
) = W
i
W
i
= F
i
F
i
= M
ik
M
kj
j
+
1
2
M
in
y
jkn
k
+
1
2
M
in
y
jkn
k
+
1
4
y
ijn
y
kln
l
. (6.51)
This scalar potential is automatically bounded from below; in fact, since
it is a sum of squares of absolute values (of the W
i
), it is always
non-negative. If we substitute the general form for the superpotential
eq. (6.45) into eq. (6.50), we obtain for the full Lagrangian density
L =
i
+i
1
2
M
ij
j

1
2
M
ij

j
V (,
1
2
y
ijk
1
2
y
ijk
k
. (6.52)
Now we can compare the masses of the fermions and scalars by looking
at the linearized equations of motion:
i
= M
ik
M
kj
j
+. . . , (6.53)
i
i
= M
ij

j
+. . . , i
i
= M
ij
j
+. . . . (6.54)
One can eliminate in terms of

and vice versa in eq. (6.54), obtaining
[after use of the identity eq. (6.7)]
i
= M
ik
M
kj
j
+. . . ,
j
=

i
M
ik
M
kj
+. . . . (6.55)
Therefore, the fermions and the bosons satisfy the same wave equation
with exactly the same (mass)
2
matrix with real non-negative eigenvalues,
namely (M
2
)
i
j
= M
ik
M
kj
. It follows that diagonalizing this matrix
gives a collection of chiral supermultiplets each of which contains a mass-
degenerate complex scalar and Weyl fermion, in agreement with the
general argument in the Introduction.
6.3 Supersymmetric Gauge Theories
The propagating degrees of freedom in a gauge supermultiplet are a
massless gauge boson eld A
a
and a two-component Weyl fermion gaugino
a
. The index a here runs over the adjoint representation of the gauge
group (a = 1 . . . 8 for SU(3)
C
color gluons and gluinos; a = 1, 2, 3 for
Table 6.2. Counting of real degrees of freedom for each gauge supermultiplet.
A
D
on-shell (n
B
= n
F
= 2) 2 2 0
o-shell (n
B
= n
F
= 4) 3 4 1
SU(2)
L
weak isospin; a = 1 for U(1)
Y
weak hypercharge). The gauge
transformations of the vector supermultiplet elds are then
gauge
A
a
a
+gf
abc
A
b
c
, (6.56)
gauge
a
= gf
abc
c
, (6.57)
where
a
is an innitesimal gauge transformation parameter, g is the
gauge coupling, and f
abc
are the totally antisymmetric structure constants
that dene the gauge group. (The special case of an abelian group like
U(1)
Y
is obtained by just setting f
abc
= 0; the corresponding gaugino is
a gauge singlet in that case.)
The on-shell degrees of freedom for A
a
and
a
amount to two
bosonic and two fermionic helicity states (for each a), as required
by supersymmetry. However, o-shell
a
consists of two complex,

or four real, fermionic degrees of freedom, while A
a
only has three

real bosonic degrees of freedom; one degree of freedom is removed by
the inhomogeneous gauge transformation eq. (6.56). So, we will need
one real bosonic auxiliary eld, traditionally called D
a
, in order for
supersymmetry to be consistent o-shell. This eld also transforms as an
adjoint of the gauge group [i.e., like eq. (6.57) with D] and satises
(D
a
)
= D
a
. Like the chiral auxiliary elds F
i
, the gauge auxiliary eld
D
a
has dimensions of (mass)
2
and thus no kinetic term, so that it can be
eliminated on-shell using its algebraic equation of motion. The counting
of degrees of freedom is summarized in Table 6.2.
Therefore, the Lagrangian density for a gauge supermultiplet ought to
be
L
gauge
=
1
4
F
a
F
a
+i
a
+
1
2
D
a
D
a
, (6.58)
where
F
a
A
a
A
a
+gf
abc
A
b
A
c
(6.59)
is the usual Yang-Mills eld strength, and
D
a
=
a
+gf
abc
A
b
c
(6.60)
is the covariant derivative of the gaugino eld. One can infer the
appropriate form for the supersymmetry transformation of the elds, up
to multiplicative constants, from the requirements that they should be
linear in the innitesimal parameters , with dimensions of (mass)
1/2
,
that A
a
is real, and that D

a
should be real and proportional to the eld
equations for the gaugino, in analogy with the role of the auxiliary eld F
in the chiral supermultiplet case. Thus one can guess, up to multiplicative
factors,
A
a
=
1
2
_

a
+

_
, (6.61)
=
i
2
2
(
F
a
+
1
D
a
, (6.62)
D
a
=
i
2
_
a
D
_
. (6.63)
The factors of

2 are chosen so that the action obtained by integrating
L
gauge
is invariant, and the phase of
a
is chosen for future convenience
in treating the MSSM. It is now a little bit tedious, but straightforward,
to check that eq. (6.18) is modied to
(
2
)X = i(
1

1
)D
X (6.64)
for X equal to any of the gauge-covariant elds F
a
,
a
,

a
, D
a
, as well as
for arbitrary covariant derivatives acting on them. This ensures that the
supersymmetry algebra eqs. (6.32)-(6.33) is realized on gauge-invariant
combinations of elds in gauge supermultiplets, as they were on the chiral
supermultiplets.
2
These calculations require the use of identities
= (
= (
; (6.65)
; (6.66)
= 2

. (6.67)
If we had not included the auxiliary eld D
a
, then the supersymmetry
algebra eq. (6.64) would hold only after using the equations of motion
for
a
and

a
. The auxiliary elds just satisfy the equations of motion
D
a
= 0, but this is no longer true if one couples the gauge supermultiplets
to chiral supermultiplets, as we now do.
2
The supersymmetry transformations eqs. (6.61)-(6.63) are non-linear for non-abelian
gauge symmetries, since there are gauge elds contained in the covariant derivatives
acting on the gaugino elds and in the eld strength F
a
. By adding even more

auxiliary elds besides D
a
, one can make the supersymmetry transformations linear
in the elds. The version given here in which those extra auxiliary elds have been
removed by gauge transformations is called Wess-Zumino gauge.
6.4 Gauge interactions for chiral supermultiplets
Finally we are ready to consider a general Lagrangian density for
a supersymmetric theory with both chiral and gauge supermultiplets.
Suppose that the chiral supermultiplets transform under the gauge group
in a representation with hermitian matrices (T
a
)
i
j
satisfying [T
a
, T
b
] =
if
abc
T
c
. [For example, if the gauge group is SU(2), then f
abc
=
abc
,
and the T
a
are 1/2 times the Pauli matrices for a chiral supermultiplet
transforming in the fundamental representation.] Thus
gauge
X
i
= ig
a
(T
a
X)
i
(6.68)
for X
i
=
i
,
i
, F
i
; since supersymmetry and gauge transformations
commute, the scalar, fermion, and auxiliary elds must be in the same
representation of the gauge group. To have a gauge-invariant Lagrangian,
we need to replace the ordinary derivatives in eq. (6.34) with covariant
derivatives:
i
D
i
=
i
igA
a
(T
a
)
i
(6.69)
i
D
i
=
i
+igA
a
T
a
)
i
(6.70)
i
D
i
=
i
igA
a
(T
a
)
i
. (6.71)
Naively, this simple procedure achieves the goal of coupling the vector
bosons in the gauge supermultiplet to the scalars and fermions in the
chiral supermultiplets. However, we also have to consider whether there
are any other interactions, allowed by gauge invariance and involving
the gaugino and D
a
elds, that might have to be included to make a
supersymmetric Lagrangian. (If A
a
couples to
i
and
i
, then it should
makes sense that its superpartners
a
and D
a
do as well.)
In fact, there are three such possible interaction terms that are
renormalizable (of mass dimension 4), namely
(
T
a
)
a
,

a
(
T
a
), and (
T
a
)D
a
. (6.72)
Now one can add them, with unknown dimensionless coupling coecients,
to the Lagrangians for the chiral and gauge supermultiplets and demand
that the whole mess be real and invariant under supersymmetry
transformations, up to a total derivative. Not surprisingly, this is possible
only if the supersymmetry transformation laws for the matter elds are
modied to include gauge-covariant rather than ordinary derivatives (and
to include one strategically chosen extra term in F
i
):
i
=
i
(6.73)
i
= i(
i
+
F
i
(6.74)
F
i
= i
i
+

2g(T
a
)
i

a
. (6.75)
6.4 Gauge interactions for chiral supermultiplets 159
After some algebra one can now x the coecients for the terms in
eq. (6.72), so that the full Lagrangian density for a renormalizable
supersymmetric theory is
L = L
gauge
+L
chiral
2g
_
(
T
a
)
a
+

a
(
T
a
)
+g(
T
a
)D
a
. (6.76)
Here L
chiral
means the chiral supermultiplet Lagrangian found in section
6.2 [e.g., eq. (6.50) or (6.52)], but with ordinary derivatives replaced
everywhere by gauge-covariant derivatives, and L
gauge
was given in
eq. (6.58). To prove that eq. (6.76) is invariant under the supersymmetry
transformations, one must use the identity
W
i
(T
a
)
j
i
j
= 0. (6.77)
This is precisely the condition that must be satised anyway in order for
the superpotential (and thus L
chiral
) to be gauge invariant, since the left
side is proportional to
gauge
W.
The last two lines in eq. (6.76) are interactions whose strengths are xed
to be gauge couplings by the requirements of supersymmetry, even though
they are not gauge interactions from the point of view of an ordinary eld
theory. The second line is a direct coupling of gauginos to matter elds,
and is the supersymmetrization of the usual gauge boson coupling to
matter elds. The last line combines with the (1/2)D
a
D
a
term in L
gauge
to provide an equation of motion
D
a
= g(
T
a
). (6.78)
Thus, like the auxiliary elds F
i
and F
i
, the D
a
are expressible purely
algebraically in terms of the scalar elds. Replacing the auxiliary elds
in eq. (6.76) using eq. (6.78), one nds that the complete scalar potential
is (recall L contains V ):
V (,
) = F
i
F
i
+
1
2
a
D
a
D
a
= W
i
W
i
+
1
2
a
g
2
a
(
T
a
)
2
. (6.79)
The two types of terms in this expression are called F-term and D-
term contributions, respectively. In the second term in eq. (6.79), we
have now written an explicit sum

a
to cover the case that the gauge
group has several distinct factors with dierent gauge couplings g
a
. [For
instance, in the MSSM the three factors SU(3)
C
, SU(2)
L
and U(1)
Y
have
dierent gauge couplings g
3
, g and g
.] Since V (,
) is a sum of squares,
it is always greater than or equal to zero for every eld conguration. It
is a very interesting and unique feature of supersymmetric theories that
the scalar potential is completely determined by the other interactions in
the theory. The F-terms are xed by Yukawa couplings and fermion mass
terms, and the D-terms are xed by the gauge interactions.
By using Noethers procedure [see eq. (6.19)], one nds the conserved
supercurrent
J
= (
i
)
i
+i(
i
)
1
2
2
(
a
)
F
a
+
i
2
g
T
a
(
a
)
, (6.80)
generalizing the expression given in eq. (6.20) for the Wess-Zumino
model. This expression will be useful when we discuss certain aspects
of spontaneous supersymmetry breaking in section 9.2.
6.5 Summary: How to build a supersymmetric model
In a renormalizable supersymmetric eld theory, the interactions and
masses of all particles are determined just by their gauge transformation
properties and by the superpotential W. By construction, we found that
W had to be an analytic function of the complex scalar elds
i
, which are
always dened to transform under supersymmetry into left-handed Weyl
fermions. We should mention that in an equivalent language, W is said
to be a function of chiral superelds. A supereld is a single object that
contains as components all of the bosonic, fermionic, and auxiliary elds
within the corresponding supermultiplet, e.g.
i
(
i
,
i
, F
i
). (This is
analogous to the way in which one often describes a weak isospin doublet
or color triplet by a multicomponent eld.) The gauge quantum numbers
and the mass dimension of a chiral supereld are the same as that of its
scalar component. In the supereld formulation, one writes instead of
eq. (6.45)
W =
1
2
M
ij
j
+
1
6
y
ijk
k
, (6.81)
which means exactly the same thing. While this entails no dierence in
practical results, the fancier version eq. (6.81) at least serves to remind
us that W determines not only the scalar interactions in the theory, but
the fermion masses and Yukawa couplings as well. The derivation of
all of our preceding results can be obtained somewhat more elegantly
using supereld methods, which have the advantage of making invariance
under supersymmetry transformations manifest. We have avoided this
extra layer of notation on purpose, in favor of the more pedestrian, but
hopefully more familiar, component eld approach. The latter is at least
more appropriate for making contact with phenomenology in a universe
6.5 Summary: How to build a supersymmetric model 161
i
k
j
(a) (b)
j
i
l
k
Fig. 6.1. The dimensionless non-gauge interaction vertices in a supersymmetric
theory: (a) scalar-fermion-fermion Yukawa interaction y
ijk
, (b) quartic scalar
interaction y
ijn
y
kln
.
i
k
j
(a) (b)
i j
(c)
i j
Fig. 6.2. Supersymmetric dimensionful couplings: (a) (scalar)
3
interaction
vertex M
in
y
jkn
, (b) fermion mass term M
ij
, (c) scalar (mass)
2
term M
ik
M
kj
.
with supersymmetry breaking. The only (occasional) use we will make
of supereld notation is the purely cosmetic one of following the common
practice of specifying superpotentials like eq. (6.81) rather than (6.45).
The specication of the superpotential is really a code for the terms that
it implies in the Lagrangian, so the reader may feel free to think of the
superpotential either as a function W(
i
) of the scalar elds
i
or as the
same function W(
i
) of the superelds
i
which contain them.
Given the supermultiplet content of the theory, the form of the
superpotential is restricted by gauge invariance. In any given theory,
only a subset of the couplings M
ij
and y
ijk
will be allowed to be non-
zero. The entries of the mass matrix M
ij
can only be non-zero for i and
j such that the supermultiplets
i
and
j
transform under the gauge
group in representations which are conjugates of each other. (In fact,
in the MSSM there is only one such term, as we will see.) Likewise,
the Yukawa couplings y
ijk
can only be non-zero when
i
,
j
, and
k
transform in representations which can combine to form a singlet.
The interactions implied by the superpotential eq. (6.81) are shown
3
in
Figs. 6.1 and 6.2. Those in Fig. 6.1 are all determined by the dimensionless
parameters y
ijk
. The Yukawa interaction in Fig. 6.1a corresponds to the
next-to-last term in eq. (6.52). For each particular Yukawa coupling of
3
Here, the auxiliary elds have been eliminated using their equations of motion
(integrated out) as in eq. (6.52). One can also give Feynman rules which include the
auxiliary elds, although this tends to be less useful in phenomenological applications.
(a) (b) (c) (d)
(e) (f) (g) (h)
Fig. 6.3. Supersymmetric gauge interaction vertices.
k
with strength y
ijk
, there must be equal couplings of
j
k
and
j
, since y
ijk
is completely symmetric under interchange of any two of
its indices as shown in section 6.2. There is also a dimensionless coupling
for
i
l
, with strength y
ijn
y
kln
as required by supersymmetry [see
the last term in eq. (6.51)]. The arrows on both the fermion and scalar
lines follow the chirality; i.e., one direction for propagation of and
and the other for the propagation of
and

. Thus there is a
vertex corresponding to the one in Fig. 6.1a but with all arrows reversed,
corresponding to the complex conjugate [the last term in eq. (6.52)]. The
relationship between the interactions in Figs. 6.1a and 6.1b is exactly of
the special type needed to cancel the quadratic divergences in quantum
corrections to scalar masses, as discussed in the Introduction (compare
Fig. 5.1).
In Fig. 6.2, we show the only interactions corresponding to renor-
malizable and supersymmetric vertices with dimensions of (mass) and
(mass)
2
. First, there are (scalar)
3
couplings which are entirely determined
by the superpotential mass parameters M
ij
and Yukawa couplings y
ijk
,
as indicated by the second and third terms in eq. (6.51). The propagators
of the fermions and scalars in the theory are constructed in the usual way
using the fermion mass M
ij
and scalar (mass)
2
M
in
M
nj
. Of particular
interest is the fact that the fermion mass term M
ij
leads to a chirality-
changing insertion in the fermion propagator; note the directions of the
arrows in Fig. 6.2b. There is no such arrow-reversal for a scalar propagator
in a theory with exact supersymmetry; as shown in Fig. 6.2c, if one treats
the scalar (mass)
2
term as an insertion in the propagator, the arrow
direction is preserved. Again, for each of Figures 6.2a and 6.2b there
is an interaction with all arrows reversed.
In Fig. 6.3 we show in a similar manner the gauge interactions in a
supersymmetric theory. Figures 6.3a,b,c occur only when the gauge group
is non-abelian (e.g. for SU(3)
C
color and SU(2)
L
weak isospin in the
MSSM). Figures 6.3a and 6.3b are the interactions of gauge bosons which
derive from the rst term in eq. (6.58). In the MSSM these are exactly
the same as the well-known QCD gluon and electroweak gauge boson
vertices of the Standard Model. (We do not show the interactions of ghost
elds, which are necessary only for consistent loop amplitudes.) Figures
6.3c,d,e,f are just the standard interactions between gauge bosons and
fermion and scalar elds which must occur in any gauge theory because of
the form of the covariant derivative; they come from eqs. (6.60) and (6.69)-
(6.71) inserted in the kinetic part of the Lagrangian. Figure 6.3c shows
the coupling of a gaugino to a gauge boson; the gaugino line in a Feynman
diagram is traditionally drawn as a solid fermion line superimposed on a
gauge boson wavy line. In Fig. 6.3g we have the coupling of a gaugino
to a chiral fermion and a complex scalar [the rst term in the second
line in eq. (6.76)]. One can think of this as the supersymmetrization
of Figure 6.3e or 6.3f; any of these three vertices may be obtained from
any other (up to a factor of
2) by replacing two of the particles by their

supersymmetric partners. There is also an interaction like Fig. 6.3g but
with all arrows reversed, corresponding to the complex conjugate term in
the Lagrangian [the second term in the second line in eq. (6.76)]. Finally
in Fig. 6.3h we have a scalar quartic interaction vertex [the last term in
eq. (6.79)] which is also determined by the gauge coupling.
The results of this chapter can be used as a recipe for constructing the
supersymmetric interactions for any renormalizable model. In the case
of the MSSM, we already know the gauge group, particle content and
the gauge transformation properties, so it only remains to decide on the
superpotential. This we will do in section 8.1.
6.6 Soft supersymmetry-breaking interactions
A realistic phenomenological model must contain supersymmetry break-
ing. From a theoretical perspective, we expect that supersymmetry, if it
exists at all, should be an exact symmetry which is spontaneously broken.
In other words, the ultimate model should have a Lagrangian density
which is invariant under supersymmetry, but a vacuum state which is
not. In this way, supersymmetry is hidden at low energies in a manner
exactly analogous to the fate of the electroweak symmetry in the ordinary
Standard Model.
Many models of spontaneous symmetry breaking have indeed been
proposed and we will mention the basic ideas of some of them in section
9. These always involve extending the MSSM to include new particles
and interactions at very high mass scales, and there is no consensus on
exactly how this should be done. However, from a practical point of view,
it is extremely useful to simply parameterize our ignorance of these issues
by just introducing extra terms which break supersymmetry explicitly
in the eective MSSM Lagrangian. As was argued in the Introduction,
the extra supersymmetry-breaking couplings should be soft (of positive
mass dimension) in order to be able to naturally maintain a hierarchy
between the electroweak scale and the Planck (or some other very large)
mass scale. This means in particular that we should not consider any
dimensionless supersymmetry-breaking couplings.
In the context of a general renormalizable theory, the possible soft
supersymmetry-breaking terms in the Lagrangian are
L
soft
=
1
2
(M
a
+ c.c.) (m
2
)
i
j
_
1
2
b
ij
j
+
1
6
a
ijk
k
+ c.c.
_
, (6.82)
L
maybe soft
=
1
2
c
jk
i

i
k
+ c.c. (6.83)
They consist of gaugino masses M
for each gauge group, scalar (mass)

2
terms (m
2
)
j
i
and b
ij
, and (scalar)
3
couplings a
ijk
and c
jk
i
. One might
wonder why we have not included possible soft mass terms for the
chiral supermultiplet fermions. The reason is that including such terms
would be redundant; they can always be absorbed into a redenition
of the superpotential and the terms (m
2
)
j
i
and c
jk
i
. It has been shown
rigorously that a softly-broken supersymmetric theory with L
soft
as given
by eq. (6.82) is indeed free of quadratic divergences in quantum corrections
to scalar masses, to all orders in perturbation theory. The situation
is slightly more subtle if one tries to include the non-analytic (scalar)
3
couplings in L
maybe soft
. If any of the chiral supermultiplets in the theory
are completely uncharged under all gauge symmetries, then non-zero c
jk
i
terms can lead to quadratic divergences, despite the fact that they are
formally soft. Now, this constraint need not apply to the MSSM, which
does not have any gauge-singlet chiral supermultiplets. Nevertheless, the
possibility of c
jk
i
terms is nearly always neglected. The real reason for
this is that it is extremely dicult to construct any model of spontaneous
supersymmetry breaking in which the c
jk
i
are not utterly negligibly small.
Equation (6.82) is therefore usually taken to be the most general soft
supersymmetry-breaking Lagrangian.
It should be clear that L
soft
indeed breaks supersymmetry, since it
involves only scalars and gauginos, and not their respective superpartners.
In fact, the soft terms in L
soft
are capable of giving masses to all of the
scalars and gauginos in a theory, even if the gauge bosons and fermions
(a) (b)
i j
(c)
i j
(d)
i
j
k
Fig. 6.4. Soft supersymmetry-breaking terms: (a) Gaugino mass insertion
M
; (b) non-analytic scalar (mass)

2
(m
2
)
i
j
; (c) analytic scalar (mass)
2
b
ij
; (d)
(scalar)
3
coupling a
ijk
.
in chiral supermultiplets are massless (or relatively light). The gaugino
masses M
are always allowed by gauge symmetry. The (m

2
)
i
j
terms
are allowed for i, j such that
i
,
j
transform in complex conjugate
representations of each other under all gauge symmetries; in particular
this is true of course when i = j, so every scalar is eligible to get a mass
in this way if supersymmetry is broken. The remaining soft terms may
or may not be allowed by the symmetries. In this regard it is useful
to note that the b
ij
and a
ijk
terms have the same form as the M
ij
and y
ijk
terms in the superpotential [compare eq. (6.82) to eq. (6.45)
or eq. (6.81)], so they will be allowed by gauge invariance if and only if
a corresponding superpotential term is allowed. The Feynman diagram
interactions corresponding to the allowed soft terms in eq. (6.82) are
shown in Fig. 6.4. As before, for each of the interactions in Figs. 6.4a,c,d
there is one with all arrows reversed, corresponding to the complex
conjugate term in the Lagrangian. We will apply these general results
to the specic case of the MSSM in the next chapter.
7
Superelds
Nothing here yet.
166
Part 3
Realistic Supersymmetric Models
8
The Minimal Supersymmetric Standard
Model
In chapter 6, we have found a general recipe for constructing Lagrangians
for softly broken supersymmetric theories. We are now ready to apply
these general results to the MSSM. The particle content for the MSSM
was described in chapter 5. In this section we will complete the model by
specifying the superpotential and the soft supersymmetry-breaking terms.
8.1 The superpotential and supersymmetric interactions
The superpotential for the MSSM is given by
W
MSSM
= uy
u
QH
u
dy
d
QH
d
ey
e
LH
d
+H
u
H
d
. (8.1)
The objects H
u
, H
d
, Q, L, u, d, e appearing in eq. (8.1) are chiral
superelds corresponding to the chiral supermultiplets in Table 1.
(Alternatively, they can be just thought of as the corresponding scalar
elds, as was done in section 6, but we prefer not to put the tildes on Q,
L, u, d, e in order to reduce clutter.) The dimensionless Yukawa coupling
parameters y
u
, y
d
, y
e
are 33 matrices in family space. Here we have
suppressed all of the gauge [SU(3)
C
color and SU(2)
L
weak isospin] and
family indices. The term, as it is traditionally called, can be written
out as (H
u
)
(H
d
)
, where
is used to tie together SU(2)

L
weak
isospin indices , = 1, 2 in a gauge-invariant way. Likewise, the term
uy
u
QH
u
can be written out as u
i
a
(y
u
)
i
j
Q
a
j
(H
u
)
, where i = 1, 2, 3
is a family index, and a = 1, 2, 3 is a color index which is raised (lowered)
in the 3 (3) representation of SU(3)
C
.
The term in eq. (8.1) is the supersymmetric version of the Higgs
boson mass in the Standard Model. It is unique, because terms H
u
H
u
or H
d
H
d
are forbidden in the superpotential, which must be analytic
in the chiral superelds (or equivalently in the scalar elds) treated as
169
170 8 The Minimal Supersymmetric Standard Model
complex variables, as shown in section 6.2. We can also see from the
form of eq. (8.1) why both H
u
and H
d
are needed in order to give
Yukawa couplings, and thus masses, to all of the quarks and leptons.
Since the superpotential must be analytic, the uQH
u
Yukawa terms
cannot be replaced by something like uQH
d
. Similarly, the dQH
d
and
eLH
d
terms cannot be replaced by something like dQH
u
and eLH
u
.
The analogous Yukawa couplings would be allowed in a general non-
supersymmetric two Higgs doublet model, but are forbidden by the
structure of supersymmetry. So we need both H
u
and H
d
, even without
invoking the argument based on anomaly cancellation that was mentioned
in section 5.2.
The Yukawa matrices determine the masses and CKM mixing angles
of the ordinary quarks and leptons, after the neutral scalar components
of H
u
and H
d
get VEVs. Since the top quark, bottom quark and tau
lepton are the heaviest fermions in the Standard Model, it is often useful
to make an approximation that only the (3, 3) family components of each
of y
u
, y
d
and y
e
are important:
y
u

0 0 0
0 0 0
0 0 y
t
; y
d

0 0 0
0 0 0
0 0 y
b
; y
e

0 0 0
0 0 0
0 0 y
. (8.2)
In this limit, only the third family and Higgs elds contribute to the
MSSM superpotential. It is instructive to write the superpotential in
terms of the separate SU(2)
L
weak isospin components [Q
3
= (t b); L
3
=
(
); H
u
= (H
+
u
H
0
u
); H
d
= (H
0
d
H
d
); u
3
= t; d
3
= b; e
3
= ], so:
W
MSSM
y
t
(ttH
0
u
tbH
+
u
) y
b
(btH
d
bbH
0
d
) y
d
H
0
d
)
+(H
+
u
H
d
H
0
u
H
0
d
). (8.3)
The minus signs inside the parentheses appear because of the antisymme-
try of the
symbol used to tie up the SU(2)

L
indices. The other minus
signs in eq. (8.1) were chosen for convenience so that the terms y
t
ttH
0
u
,
y
b
bbH
0
d
, and y
H
0
d
, which will become the top, bottom and tau masses
when H
0
u
and H
0
d
get VEVs, have positive signs in eq. (8.3).
Since the Yukawa interactions y
ijk
in a general supersymmetric theory
must be completely symmetric under interchange of i, j, k, we know that
y
u
, y
d
and y
e
imply not only Higgs-quark-quark and Higgs-lepton-lepton
couplings as in the Standard Model, but also squark-Higgsino-quark
and slepton-Higgsino-lepton interactions. To illustrate this, we show
in Figs. 8.1a,b,c some of the interactions which involve the top-quark
Yukawa coupling y
t
. Figure 8.1a is the Standard Model-like coupling of
H
u
0
t
L
t
R
(a)
H
u
0
t
L
t
R
(a)
H
u
0
t
L
t
R
*
(c)
Fig. 8.1. The top-quark Yukawa coupling (a) and its supersymmetrizations
(b),(c), all of strength y
t
.
(a)
t
R
*
t
R
t
L
t
L
*
(b)
H
u
0
H
u
0*
t
L
t
L
*
(c)
H
u
0
H
u
0*
t
R
*
t
R
Fig. 8.2. Some of the (scalar)
4
interactions with strength proportional to y
2
t
.
the top quark to the neutral complex scalar Higgs boson, which follows
from the rst term in eq. (8.3). For variety, here we have used t
L
and
t
R
in place of their synonyms t and t in Fig. 8.1. In Fig. 8.1b, we have
the coupling of the left-handed top squark

t
L
to the neutral higgsino
eld

H
0
u
and right-handed top quark, while in Fig. 8.1c the right-handed
top-squark eld (known either as

t or

t
R
depending on taste) couples
to

H
0
u
and t
L
. For each of the three interactions, there is another with
H
0
u
H
+
u
and t
L
b
L
, with tildes where appropriate, corresponding
to the second part of the rst term in eq. (8.3). All of these interactions
are required by supersymmetry to have the same strength y
t
. This is
also an incontrovertible prediction of softly-broken supersymmetry at
tree-level, since these interactions are dimensionless and can be modied
by the introduction of soft supersymmetry breaking only through nite
(and small) radiative corrections. A useful mnemonic is that each of
Figs. 8.1a,b,c can be obtained from any of the others by changing two of
the particles into their superpartners.
There are also scalar quartic interactions with strength proportional
to y
2
t
, as can be seen e.g. from Fig. 6.1b or the last term in eq. (6.51).
Three of them are shown in Fig. 8.2. The reader is invited to check, using
eq. (6.51) and eq. (8.3), that there are ve more, which can be obtained
by replacing

t
L

b
L
and/or H
0
u
H
+
u
in each vertex. This illustrates
the remarkable economy of supersymmetry; there are many interactions
determined by only a single parameter! In a similar way, the existence
q
g
q
(a)
q
L
, l
L
, H
u
, H
d
W
q
L
, l
L
, H
u
, H
d
(b)
q, l, H
u
, H
d
B
q, l, H
u
, H
d
(c)
Fig. 8.3. Couplings of the gluino, wino, and bino to MSSM (scalar, fermion)
pairs.
of all the other quark and lepton Yukawa couplings in the superpotential
eq. (8.1) leads not only to Higgs-quark-quark and Higgs-lepton-lepton
Lagrangian terms as in the ordinary Standard Model, but also to squark-
higgsino-quark and slepton-higgsino-lepton terms, and scalar quartic
couplings [(squark)
4
, (slepton)
4
, (squark)
2
(slepton)
2
, (squark)
2
(Higgs)
2
,
and (slepton)
2
(Higgs)
2
]. If needed, these can all be obtained in terms of
the Yukawa matrices y
u
, y
d
, and y
e
as outlined above.
However, it is useful to note that the dimensionless interactions
determined by the superpotential are often not the most important
ones of direct interest for phenomenology. This is because the Yukawa
couplings are already known to be very small, except for those of the
third family (top, bottom, tau). Instead, decay and especially production
processes for superpartners in the MSSM are typically dominated by the
supersymmetric interactions of gauge-coupling strength, as we will explore
in more detail below. The couplings of the Standard Model gauge bosons
(photon, W
, Z
0
and gluons) to the MSSM particles are determined
completely by the gauge invariance of the kinetic terms in the Lagrangian.
The gauginos also couple to (squark, quark) and (slepton, lepton) and
(Higgs, higgsino) pairs as illustrated in the general case in Fig. 6.3g
and the second line in eq. (6.76). For instance, each of the squark-
quark-gluino couplings is given by

2g
3
( q T
a
q g + c.c.) where T
a
=
a
/2
(a = 1 . . . 8) are the matrix generators for SU(3)
C
, with
a
the Gell-
Mann matrices. The Feynman diagram for this interaction is shown in
Fig. 8.3a. In Figs. 8.3b,c we show in a similar way the couplings of
(squark, quark), (lepton, slepton) and (Higgs, higgsino) pairs to the winos
and bino, with strengths proportional to the electroweak gauge couplings
g and g
respectively. The winos only couple to the left-handed squarks

and sleptons, and the (lepton, slepton) and (Higgs, higgsino) pairs of
course do not couple to the gluino. The bino couplings for each (scalar,
fermion) pair are also proportional to the weak hypercharges Y as given
in Table 1. The interactions shown in Fig. 8.3 provide for decays q q g
and q

Wq
and q

Bq when the nal states are kinematically allowed
to be on-shell. However, a complication is that the

W and

B states are
not mass eigenstates, because of mixing due to electroweak symmetry
breaking, as we will see in section 8.6.
There are also various scalar quartic interactions in the MSSM
which are uniquely determined by gauge invariance and supersymmetry,
according to the last term in eq. (6.79) illustrated in Fig. 6.3h. Among
them are (Higgs)
4
terms proportional to g
2
and g
2
in the scalar potential.
These are the direct generalization of the last term in the Standard Model
Higgs potential, eq. (5.1), to the case of the MSSM. We will have occasion
to identify them explicitly when we discuss the minimization of the MSSM
Higgs potential in section 8.5.
The dimensionful terms in the supersymmetric part of the MSSM
Lagrangian are all dependent on . Following the general result of
eq. (6.52), we nd that provides for higgsino fermion mass terms
L
higgsino mass
= (
H
+
u

H
d

H
0
u
H
0
d
) + c.c., (8.4)
as well as Higgs squared-mass terms in the scalar potential
L
supersymmetric Higgs mass
= [[
2
_
[H
0
u
[
2
+[H
+
u
[
2
+[H
0
d
[
2
+[H
d
[
2
_
. (8.5)
Since eq. (8.5) is non-negative denite with a minimum at H
0
u
= H
0
d
= 0,
it is clear that we cannot understand electroweak symmetry breaking
without including supersymmetry-breaking (mass)
2
soft terms for the
Higgs scalars, which can be negative. An explicit treatment of the Higgs
scalar potential will therefore have to wait until we have introduced the
soft terms for the MSSM. However, we can already see a puzzle: we
expect that should be roughly of order 10
2
or 10
3
GeV, in order to
allow a Higgs VEV of order 174 GeV without too much miraculous
cancellation between [[
2
and the negative soft (mass)
2
terms that we
have not written down yet. But why should [[
2
be so small compared to,
say, M
2
P
, and in particular why should it be roughly of the same order as
m
2
soft
? The scalar potential of the MSSM seems to depend on two types
of dimensionful parameters which are conceptually quite distinct, namely
the supersymmetry-respecting mass and the supersymmetry-breaking
soft mass terms. Yet the observed value for the electroweak breaking scale
suggests that without miraculous cancellations, both of these apparently
unrelated mass scales should be within an order of magnitude or so
of 100 GeV. This puzzle is called the problem. Several dierent
solutions to the problem have been proposed, involving extensions of
the MSSM of varying intricacy. They all work in roughly the same way;
the parameter is required or assumed to be completely absent at tree-
level, and then arises from the VEV(s) of some new eld(s). The latter
are in turn determined by minimizing a potential which depends on soft
H
d
0*
t
R
*
t
L
(a)
H
u
0*
b
R
*
b
L
(b)
H
u
0*
R
*
L
(c)
Fig. 8.4. Some of the supersymmetric (scalar)
3
couplings proportional to
y
t
,
y
b
, and
. When H
0
u
and H
0
d
get VEVS, these contribute to (a)

t
L
,
t
R
mixing, (b)

b
L
,
b
R
mixing, and (c)
L
,
R
mixing.
supersymmetry-breaking terms. In this way, the value of the eective
parameter is no longer conceptually distinct from the mechanism of
supersymmetry breaking; if we can explain why m
soft
M
P
, we will also
be able to understand why is of the same order. In section 12.1 we will
describe one such mechanism. Some other attractive solutions for the
problem are proposed in Refs.[43, 44, 45] From the point of view of the
MSSM, however, we can just treat as an independent parameter.
The -term and the Yukawa couplings in the superpotential eq. (8.1)
combine to yield (scalar)
3
couplings [see the second and third terms on
the right-hand side of eq. (6.51)] of the form
L
supersymmetric (scalar)
3 =
uy
u
uH
0
d
+
dy
d
dH
0
u
+
ey
e
eH
0
u
+
uy
u
dH
d
+
dy
d
uH
+
u
+
ey
e
H
+
u
_
+ c.c. (8.6)
In Fig. 8.4 we show some of these couplings proportional to
y
t
,
y
b
,
and
respectively. These play an important role in determining the

mixing of top squarks, bottom squarks, and tau sleptons, as we will see
in section 8.8.
8.2 R-parity (also known as matter parity) and its
consequences
The superpotential eq. (8.1) is minimal, in the sense that it is sucient
to produce a phenomenologically viable model. However, there are other
terms that one could write down that are gauge-invariant and analytic
in the chiral superelds, but are not included in the MSSM because they
violate either baryon number (B) or total lepton number (L). The most
general gauge-invariant and renormalizable superpotential would include
s or b
d
u
u u
L
Q

Fig. 8.5. Squarks can mediate disastrously rapid proton decay if R-parity is
violated by both B = 1 and L = 1 interactions.
not only eq. (8.1), but also the terms
W
L=1
=
1
2
ijk
L
i
L
j
e
k
+
ijk
L
i
Q
j
d
k
+
i
L
i
H
u
(8.7)
W
B=1
=
1
2
ijk
u
i
d
j
d
k
(8.8)
where we have restored family indices i = 1, 2, 3. The chiral
supermultiplets carry baryon number assignments B = +1/3 for Q
i
;
B = 1/3 for u
i
, d
i
; and B = 0 for all others. The total lepton number
assignments are L = +1 for L
i
, L = 1 for e
i
, and L = 0 for all others.
Therefore, the terms in eq. (8.7) violate total lepton number by 1 unit (as
well as the individual lepton avors) and those in eq. (8.8) violate baryon
number by 1 unit.
The possible existence of such terms might seem rather disturbing,
since corresponding B- and L-violating processes have not been seen
experimentally. The most obvious experimental constraint comes from
the non-observation of proton decay, which would violate both B and L
by 1 unit. If both
and
couplings were present and unsuppressed,

then the lifetime of the proton would be extremely short. For example,
the Feynman diagram in Fig. 8.5 would lead to p
+
e
+
0
or e
+
K
0
or
0
or
+
K
0
or
+
or K
+
etc. depending on which components of
are largest. (The coupling
must be antisymmetric in its last two avor

indices, since the color indices are contracted antisymmetrically. That is
why the squark in Fig. 8.5 is
s or
b but not
d, for u, d quarks in the initial

state.) As a rough estimate based on dimensional analysis, for example,
pe
+
0 m
5
proton
i=2,3
[
11i
11i
[
2
/m
4
e
d
i
, (8.9)
which would be a fraction of a second if the couplings were of order unity
and the squarks have masses of order 1 TeV. In contrast, the decay time
of the proton is measured to be in excess of 10
33
years. Therefore, at least
one of
11i
or
11i
for i = 2, 3 must be extremely small. Many other
processes also give very strong constraints on the violation of lepton and
baryon numbers; these are reviewed in these are reviewed in Chapter 11.
One could simply try to take B and L conservation as a postulate in the
MSSM. However, this is clearly a step backwards from the situation in
the Standard Model, where the conservation of these quantum numbers
is not assumed, but is rather a pleasantly accidental consequence of
the fact that there are no possible renormalizable Lagrangian terms that
violate B or L. Furthermore, there is a quite general obstacle to treating
B and L as fundamental symmetries of nature, since they are known
to be necessarily violated by non-perturbative electroweak eects (even
though those eects are calculably negligible for experiments at ordinary
energies). Therefore, in the MSSM one adds a new symmetry which has
the eect of eliminating the possibility of B and L violating terms in the
renormalizable superpotential, while allowing the good terms in eq. (8.1).
This new symmetry is called R-parity or equivalently matter parity.
Matter parity is a multiplicatively conserved quantum number dened
as
P
M
= (1)
3(BL)
(8.10)
for each particle in the theory. It is easy to check that the quark and
lepton supermultiplets all have P
M
= 1, while the Higgs supermultiplets
H
u
and H
d
have P
M
= +1. The gauge bosons and gauginos of course
do not carry baryon number or lepton number, so they are assigned
matter parity P
M
= +1. The symmetry principle to be enforced is
that a term in the Lagrangian (or in the superpotential) is allowed only
if the product of P
M
for all of the elds in it is +1. It is easy to
see that each of the terms in eqs. (8.7) and (8.8) is thus forbidden,
while the good and necessary terms in eq. (8.1) are allowed. This
discrete symmetry commutes with supersymmetry, as all members of
a given supermultiplet have the same matter parity. The advantage of
matter parity is that it can in principle be an exact and fundamental
symmetry, which B and L themselves cannot, since they are known to
be violated by non-perturbative electroweak eects. So even with exact
matter parity conservation in the MSSM, one expects that baryon number
and total lepton number violation can occur in tiny amounts, due to
nonrenormalizable terms in the Lagrangian. However, the MSSM does not
have renormalizable interactions that violate B or L, with the standard
assumption of matter parity conservation.
It is sometimes useful to recast matter parity in terms of R-parity,
dened for each particle as
P
R
= (1)
3(BL)+2s
(8.11)
where s is the spin of the particle. Now, matter parity conservation
and R-parity conservation are precisely equivalent, since the product of
(1)
2s
for the particles involved in any interaction vertex in a theory
that conserves angular momentum is equal to +1. However, particles
within the same supermultiplet do not have the same R-parity. In general,
symmetries with the property that particles within the same multiplet
have dierent charges are called R symmetries; they do not commute with
supersymmetry. Continuous U(1) R symmetries are often encountered in
the model-building literature; they should not be confused with R-parity,
which is a discrete Z
2
symmetry. In fact, the matter parity version of R-
parity makes clear that there is really nothing intrinsically R about it;
in other words it secretly does commute with supersymmetry, so its name
is somewhat suboptimal. Nevertheless, the R-parity assignment is very
useful for phenomenology because all of the Standard Model particles and
the Higgs bosons have even R-parity (P
R
= +1), while all of the squarks,
sleptons, gauginos, and higgsinos have odd R-parity (P
R
= 1).
The R-parity odd particles are known as supersymmetric particles
or sparticles for short, and they are distinguished by a tilde (see
Tables 1 and 2). If R-parity is exactly conserved, then there can be no
mixing between the sparticles and the P
R
= +1 particles. Furthermore,
every interaction vertex in the theory contains an even number of P
R
=
1 sparticles. This has three extremely important phenomenological
consequences:
The lightest sparticle with P
R
= 1, called the lightest
supersymmetric particle or LSP, must be absolutely stable. If the
LSP is electrically neutral, it interacts only weakly with ordinary
matter, and so can make an attractive candidate for the non-
baryonic dark matter that seems to be required by cosmology.
Each sparticle other than the LSP must eventually decay into a
state that contains an odd number of LSPs (usually just one).
In collider experiments, sparticles can only be produced in even
numbers (usually two-at-a-time).
We dene the MSSM to conserve R-parity or equivalently matter parity.
While this decision seems to be well-motivated phenomenologically by
proton decay constraints and the hope that the LSP will provide a
good dark matter candidate, it might appear somewhat ad hoc from a
theoretical point of view. After all, the MSSM would not suer any
internal inconsistency if we did not impose matter parity conservation.
Furthermore, it is fair to ask why matter parity should be exactly
conserved, given that the known discrete symmetries in the Standard
Model (ordinary parity P, charge conjugation C, time reversal T, etc.)
are all known to be inexact symmetries. Fortunately, it is sensible to
formulate matter parity as a discrete symmetry that is exactly conserved.
In general, exactly conserved, or gauged discrete symmetries can exist
provided that they satisfy certain anomaly cancellation conditions (much
like continuous gauged symmetries). One particularly attractive way this
could occur is if BL is a continuous U(1) gauge symmetry which is
spontaneously broken at some very high energy scale. From eq. (8.10), we
observe that P
M
is actually a discrete subgroup of the continuous U(1)
BL
group. Therefore, if gauged U(1)
BL
is broken by scalar VEVs (or other
order parameters) that carry only even integer values of 3(BL), then P
M
will automatically survive as an exactly conserved remnant. A variety of
extensions of the MSSM in which exact R-parity arises in just this way
have been proposed. It may also be possible to have gauged discrete
symmetries which do not owe their exact conservation to an underlying
continuous gauged symmetry, but rather to some other structure such as
can occur in string theory. It is also possible that R-parity is broken, or is
replaced by some alternative discrete symmetry. We will briey consider
these as variations on the MSSM in Chapter 11.
8.3 Soft supersymmetry breaking in the MSSM
To complete the description of the MSSM, we need to specify the soft
supersymmetry breaking terms. In section 6.6, we learned how to write
down the most general set of such terms in any supersymmetric theory.
Applying this recipe to the MSSM, we have:
L
MSSM
soft
=
1
2
_
M
3
g g +M
2
W +M
1
B
_
+ c.c.
+
_
ua
u

QH
u
d a
d

QH
d
e a
e
LH
d
_
+ c.c.
+
m
2
Q

Q+

L
m
2
L

L +
um
2
u
d m
2
d
e m
2
e
+m
2
Hu
H
u
H
u
m
2
H
d
H
d
H
d
+ (bH
u
H
d
+ c.c.) . (8.12)
In eq. (8.12), M
3
, M
2
, and M
1
are the gluino, wino, and bino mass terms.
Here, and from now on, we suppress the adjoint representation gauge
indices on the wino and gluino elds, and the gauge indices on all of the
chiral supermultiplet elds. The second line in eq. (8.12) contains the
(scalar)
3
couplings [of the type a
ijk
in eq. (6.82)]. Each of a
u
, a
d
, a
e
is a
complex 3 3 matrix in family space, with dimensions of (mass). They
are in one-to-one correspondence with the Yukawa coupling matrices in
the superpotential. The third line of eq. (8.12) consists of squark and
slepton mass terms of the (m
2
)
j
i
type in eq. (6.82). Each of m
2
Q
, m
2
u
, m
2
d
,
m
2
L
, m
2
e
is a 3 3 matrix in family space that can have complex entries,
but they must be hermitian so that the Lagrangian is real. (To avoid
clutter, we do not put tildes on the Q in m
2
Q
, etc.) Finally, in the last
line of eq. (8.12) we have supersymmetry-breaking contributions to the
Higgs potential; m
2
Hu
and m
2
H
d
are (mass)
2
terms of the (m
2
)
j
i
type, while
b is the only (mass)
2
term of the type b
ij
in eq. (6.82) that can occur in
the MSSM.
2
Schematically, we can write
M
1
, M
2
, M
3
, a
u
, a
d
, a
e
m
soft
; (8.13)
m
2
Q
, m
2
L
, m
2
u
, m
2
d
, m
2
e
, m
2
Hu
, m
2
H
d
, b m
2
soft
, (8.14)
with a characteristic mass scale m
soft
that is not much larger than 10
3
GeV, as argued in the Introduction. The expression eq. (8.12) is the most
general soft supersymmetry-breaking Lagrangian of the form eq. (6.82)
that is compatible with gauge invariance and matter parity conservation.
Unlike the supersymmetry-preserving part of the Lagrangian, L
MSSM
soft
introduces many new parameters that were not present in the ordinary
Standard Model. A careful count reveals that there are 105 masses,
phases and mixing angles in the MSSM Lagrangian that cannot be
rotated away by redening the phases and avor basis for the quark
and lepton supermultiplets, and have no counterpart in the ordinary
Standard Model. Thus, in principle, supersymmetry (or more precisely,
supersymmetry breaking) appears to introduce a tremendous arbitrariness
in the Lagrangian.
8.4 Hints of an Organizing Principle
Fortunately, there is already good experimental evidence that some sort
of powerful organizing principle must govern the soft terms. This is
because most of the new parameters in eq. (8.12) involve avor mixing or
CP violation of the type that is already severely restricted by experiment.
For example, suppose that m
2
e
is not diagonal in the basis ( e
R
,
R
,
R
) of
sleptons whose superpartners are the right-handed pieces of the Standard
Model mass eigenstates e, , . In that case slepton mixing occurs, and
the individual lepton numbers will not be conserved. This is true even for
processes that only involve the sleptons as virtual particles. A particularly
strong limit on this possibility comes from the experimental constraint on
e, which can occur via the one-loop diagram in Fig. 8.6(a) featuring
a virtual bino and slepton. The symbol represents an insertion of
(m
2
e
)
21
e
R

R
in L
MSSM
soft
, and the slepton-bino vertices are determined
by the weak hypercharge gauge coupling [see Fig. 6.3(g) and eq. (6.76)].
2
The parameter we call b is often seen in the literature as m
2
12
or m
2
3
or B.
(a)
e
e
B
(b)
d s
s d
g g
d
s
s
d
Fig. 8.6. Feynman diagrams that contribute to avor violation in models with
arbitrary soft masses: (a) e and (b) K
0
K
0
mixing.
There are similar diagrams if the left-handed slepton mass matrix m
2
L
has arbitrary o-diagonal entries. If m
2
L
or m
2
e
were random, with
all entries of comparable size, then the contributions to BR( e)
would be about much larger than the present experimental upper limit
of 1.2 10
11
, even if the sleptons are as heavy as 1 TeV. Therefore the
form of the slepton mass matrices must be severely constrained to avoid
large e
R
,
R
and e
L
,
L
mixings.
There are also important experimental constraints on the squark
(mass)
2
matrices. The strongest of these come from the neutral kaon
system. The eective hamiltonian for K
0
K
0
mixing gets contributions
from the diagram in Fig. 8.6b, among others, if L
MSSM
soft
contains (mass)
2
terms that mix down squarks and strange squarks. The gluino-squark-
quark vertices in Fig. 8.6b are all xed by supersymmetry to be of
strong interaction strength; there are similar diagrams in which the bino
and winos are exchanged.[56] If the squark and gaugino masses are of
order 1 TeV or less, one nds that limits on the parameters m
K
and
and
/ appearing in the neutral kaon eective hamiltonian severely

restrict the amount of

d
L
, s
L
and

d
R
, s
R
squark mixing, and associated
CP-violating complex phases, that one can tolerate in the soft squared
masses.[57] Weaker, but still interesting, constraints come from the D
0
, D
0
system, which limits the amounts of u
L
, c
L
and u
R
, c
R
mixings from
the soft squark squared mass matrix, and constraints from the B
0
d
, B
0
d
system limit the amount of

d
L
,
b
L
and

d
R
,
b
R
mixing. More constraints
follow from rare meson decays, notably those involving the parton-level
process[58] b s. The possibility of mixings between left-handed
and right-handed squarks and sleptons with the same charges is also
constrained by the same processes. In the MSSM, these mixings can
arise from (scalar)
3
interactions involving the a
u
, a
d
, a
e
matrices, after
the Higgs scalar obtain VEVs. For example, for down-type squarks,
we have:

da
d
QH
d
+ c.c. (a
d
)
12
H
0
d
)
R
s
R
+ c.c. The experimental
constraints on avor-changing neutral currents (FCNCs) therefore imply
that the corresponding o-diagonal elements of the a
u
, a
d
, a
e
matrices
cannot be too large. There are also signicant constraints on CP-violating
phases in the gaugino masses and (scalar)
3
soft couplings following from
limits on the electric dipole moments of the neutron and electron.[59]
Detailed limits can be found in the literature, but the essential lesson from
experiment is that the soft supersymmetry-breaking Lagrangian cannot
be arbitrary or random.
All of these potentially dangerous FCNC and CP-violating eects
in the MSSM can be evaded if one assumes (or can explain!) that
supersymmetry breaking should be suitably universal. In particular,
consider an idealized limit in which the squark and slepton (mass)
2
matrices are avor-blind, each proportional to the 3 3 identity matrix
in family space:
m
2
Q
= m
2
Q
1; m
2
u
= m
2
u
1; m
2
d
= m
2
d
1;
m
2
L
= m
2
L
1; m
2
e
= m
2
e
1. (8.15)
Then all squark and slepton mixing angles are rendered trivial, because
squarks and sleptons with the same electroweak quantum numbers will
be degenerate in mass and can be rotated into each other at will.
Supersymmetric contributions to FCNC processes will therefore be very
small in such an idealized limit, up to mixing induced by a
u
, a
d
, a
e
.
Making the further assumption that the (scalar)
3
couplings are each
proportional to the corresponding Yukawa coupling matrix:
a
u
= A
u0
y
u
; a
d
= A
d0
y
d
; a
e
= A
e0
y
e
(8.16)
will ensure that only the squarks and sleptons of the third family can
have large (scalar)
3
couplings. Finally, one can avoid disastrously large
CP-violating eects with the assumption that the soft parameters do not
introduce new complex phases. This is automatic for m
2
Hu
and m
2
H
d
, and
for m
2
Q
, m
2
u
etc. if eq. (8.15) is assumed; if they were not real numbers, the
Lagrangian would not be real. One can also x in the superpotential
and b in eq. (8.12) to be real, by an appropriate phase rotation of H
u
and
H
d
. If one then assumes that
arg(M
1
), arg(M
2
), arg(M
3
), arg(A
u0
), arg(A
d0
), arg(A
e0
) = 0 or , (8.17)
then the only CP-violating phase in the theory will be the ordinary CKM
phase found in the ordinary Yukawa couplings.
Together, the conditions eqs. (8.15)-(8.17) make up a rather weak
version of what is often called the assumption of soft-breaking universality.
The MSSM with these avor- and CP-preserving relations imposed has far
fewer parameters than the most general case. There are 3 independent real
gaugino masses, only 5 real squark and slepton squared mass parameters,
3 real (scalar)
3
parameters, and 4 Higgs mass parameters (one of which
can be traded for the already-known electroweak breaking scale).
The soft-breaking universality relations eqs. (8.15)-(8.17) (or stronger
versions of them) are presumed to be the result of some specic model for
the origin of supersymmetry breaking, even though there is considerable
disagreement among theorists as to what the specic model should
actually be. In any case, they are indicative of an assumed underlying
simplicity or symmetry of the Lagrangian at some very high energy scale
Q
0
, which we will call the input scale. If we use this Lagrangian to
compute masses and cross-sections and decay rates for experiments at
ordinary energies near the electroweak scale, the results will involve large
logarithms of order ln(Q
0
/m
Z
) coming from loop diagrams. As is usual in
quantum eld theory, the large logarithms can be conveniently resummed
using renormalization group (RG) equations, by treating the couplings
and masses appearing in the Lagrangian as running parameters.
Therefore, eqs. (8.15)-(8.17) should be interpreted as boundary conditions
on the running soft parameters at the RG scale Q
0
which is very far
removed from direct experimental probes. We must then RG-evolve all
of the soft parameters, the superpotential parameters, and the gauge
couplings down to the electroweak scale or comparable scales where
humans perform experiments.
At the electroweak scale, eqs. (8.15) and (8.16) will no longer hold,
even if they were exactly true at the input scale Q
0
. However, key avor-
and CP-conserving properties remain, to a good approximation. This is
because RG corrections due to gauge interactions will respect eqs. (8.15)
and (8.16), while RG corrections due to Yukawa interactions are quite
small except for couplings involving the top, bottom, and tau avors.
Therefore, the (scalar)
3
couplings and scalar squared mass mixings should
be quite negligible for the squarks and sleptons of the rst two families.
Furthermore, RG evolution does not introduce new CP-violating phases.
Therefore, if universality can be arranged to hold at the input scale,
supersymmetric contributions to FCNC and CP-violating observables
can be acceptably small in comparison to present limits (although quite
possibly measurable in future experiments).
One good reason to be optimistic that such a program can succeed is
the celebrated apparent unication of gauge couplings in the MSSM. The
1-loop RG equations for the Standard Model gauge couplings g
1
, g
2
, g
3
are
given by
d
dt
g
a
=
1
16
2
b
a
g
3
a

d
dt
1
a
=
b
a
2
(a = 1, 2, 3) (8.18)
2 4 6 8 10 12 14 16 18
Log
10
(Q/1 GeV)
0
10
20
30
40
50
60
1
1
2
1
3
1
Fig. 8.7. RG evolution of the inverse gauge couplings
1
a
(Q) in the Standard
Model (dashed lines) and the MSSM (solid lines). In the MSSM case,
3
(m
Z
) is
varied between 0.113 and 0.123, and the sparticle mass thresholds between 250
GeV and 1 TeV. Two-loop eects are included.
where t = ln(Q/Q
0
) with Q the RG scale. In the Standard Model, b
SM
a
=
(41/10, 19/6, 7), while in the MSSM one nds instead b
MSSM
a
= (33/5,
1, 3). The latter set of coecients are larger because of the virtual
eects of the extra MSSM particles in loops. The normalization for g
1
here is chosen to agree with the canonical covariant derivative for grand
unication of the gauge group SU(3)
C
SU(2)
L
U(1)
Y
into SU(5) or
SO(10). Thus in terms of the conventional electroweak gauge couplings
g and g
with e = g sin
W
= g
cos
W
, one has g
2
= g and g
1
=
_
5/3g
.
The quantities
a
= g
2
a
/4 have the nice property that their reciprocals
run linearly with RG scale at one-loop order. Figure 8.7 we compare
the RG evolution of the
1
a
, including two-loop eects, in the Standard
Model (dashed lines) and the MSSM (solid lines). Unlike the Standard
Model, the MSSM includes just the right particle content to ensure that
the gauge couplings can unify, at a scale M
U
2 10
16
GeV. While the
apparent unication of gauge couplings at M
U
could be just an accident,
it may also be taken as a strong hint in favor of a grand unied theory
(GUT) or superstring models, both of which indeed predict gauge coupling
unication below M
P
. Furthermore, if we take this hint seriously, then
we can reasonably expect to be able to apply a similar RG analysis to the
other MSSM couplings and soft masses as well.
It must be mentioned that there are two other possible types of
explanations for the suppression of FCNCs in the MSSM, instead of the
universality hypothesis of eqs. (8.15)-(8.17). One might refer to them as
irrelevancy and alignment of the soft masses. The irrelevancy idea
is that the sparticles masses are simply extremely heavy, so that their
contributions to FCNC and CP-violating diagrams like Figs. 8.6a,b are
highly suppressed. In practice, however, the degree of suppression needed
typically requires m
soft
1 TeV for at least some of the scalar masses; this
seems to go directly against the motivation for supersymmetry as a cure
for the hierarchy problem as discussed in the Introduction. Nevertheless,
it is possible to arrange schemes where this can work in a sensible way.
The alignment idea is that the squark (mass)
2
matrices do not have
the avor-blindness indicated in eq. (8.15), but are arranged in avor
space to be aligned with the relevant Yukawa matrices in just such a way
as to avoid large FCNC eects. The alignment models typically require
rather special avor symmetries. In any case, we will not discuss these
possibilities further.
In practice, a given model for the origin of supersymmetry breaking may
make predictions for the MSSM soft terms that are even stronger than
eqs. (8.15)-(8.17). In Chapter 9 we will discuss the ideas that go into
making such predictions and their implications for the MSSM spectrum.
8.5 Electroweak symmetry breaking and the Higgs bosons
In the MSSM, the description of electroweak symmetry breaking is slightly
complicated by the fact that there are two complex Higgs doublets H
u
=
(H
+
u
, H
0
u
) and H
d
= (H
0
d
, H
d
) rather than just one in the ordinary
Standard Model. The classical scalar potential for the Higgs scalar elds
in the MSSM is given by
V = ([[
2
+m
2
Hu
)([H
0
u
[
2
+[H
+
u
[
2
) + ([[
2
+m
2
H
d
)([H
0
d
[
2
+[H
d
[
2
)
+b (H
+
u
H
d
H
0
u
H
0
d
) + c.c.
+
1
8
(g
2
+g
2
)([H
0
u
[
2
+[H
+
u
[
2
[H
0
d
[
2
[H
d
[
2
)
2
+
1
2
g
2
[H
+
u
H
0
d
+H
0
u
H
d
[
2
. (8.19)
The terms proportional to [[
2
come from the [F[
2
terms; see eq. (8.5).
The terms proportional to g
2
and g
2
are the D
2
term contributions, and
may be derived from the general formula eq. (6.79) after some rearranging.
Finally, the terms proportional to m
2
Hu
, m
2
H
d
and b are nothing but a
rewriting of the last three terms of eq. (8.12). The full scalar potential
of the theory also includes many terms involving the squark and slepton
elds that we can ignore here, since they do not get VEVs because they
have large positive (mass)
2
.
We now have to demand that the minimum of this potential should
break electroweak symmetry down to electromagnetism SU(2)
L

U(1)
Y
U(1)
EM
, in accord with experiment. We can use the freedom to
make gauge transformations to simplify this analysis. First, the freedom
to make SU(2)
L
gauge transformations allows us to rotate away a possible
VEV for one of the weak isospin components of one of the scalar elds;
so without loss of generality we can take H
+
u
= 0 at the minimum of
the potential. Then one nds that a minimum of the potential satisfying
V/H
+
u
= 0 must also have H
d
= 0. This is good, because it means that
at the minimum of the potential electromagnetism is necessarily unbroken,
since the charged components of the Higgs scalars cannot get VEVs. After
setting H
+
u
= H
d
= 0, we are left to consider the scalar potential
V = ([[
2
+m
2
Hu
)[H
0
u
[
2
+ ([[
2
+m
2
H
d
)[H
0
d
[
2
(b H
0
u
H
0
d
+ c.c.)
+
1
8
(g
2
+g
2
)([H
0
u
[
2
[H
0
d
[
2
)
2
. (8.20)
The only term in this potential that depends on the phases of the elds
is the b-term. Therefore, a redenition of the phase of H
u
or H
d
can
absorb any phase in b, so we can take b to be real and positive. Then it
is clear that a minimum of the potential V requires that H
0
u
H
0
d
is also
real and positive, so H
0
u
) and H
0
d
) must have opposite phases. We can
therefore use a U(1)
Y
gauge transformation to make them both be real
and positive without loss of generality, since H
u
and H
d
have opposite
weak hypercharges (1/2). It follows that CP cannot be spontaneously
broken by the Higgs scalar potential, since the VEVs and b can be
simultaneously chosen real, as a convention. This means that the Higgs
scalar mass eigenstates can be assigned well-dened eigenvalues of CP,
at least at tree-level. (CP-violating phases in other couplings can induce
loop-suppressed CP violation in the Higgs sector, but do not change the
fact that b, v
u
, and v
d
can always be chosen real and positive.)
Note that the b-term always favors electroweak symmetry breaking.
The combination of the b term and the terms m
2
Hu
and m
2
H
d
can allow
for one linear combination of H
0
u
and H
0
d
to have a negative (mass)
2
near
H
0
u
= H
0
d
= 0. This requires that
b
2
> ([[
2
+m
2
Hu
)([[
2
+m
2
H
d
). (8.21)
If this inequality is not satised, then H
0
u
= H
0
d
= 0 will be a stable
minimum of the potential, and electroweak symmetry breaking will not
occur. A negative value for [[
2
+m
2
Hu
will help eq. (8.21) to be satised,
but it is not necessary. Furthermore, even if m
2
Hu
< 0, there may be no
electroweak symmetry breaking if [[ is too large or if b is too small. Still,
the large negative contributions to m
2
Hu
from the RG equation (10.18)
discussed in the previous section are an important factor in ensuring that
electroweak symmetry breaking can occur in models with simple boundary
conditions for the soft terms.
In order for the MSSM scalar potential to be viable, it is not enough
that the point H
0
u
= H
0
d
= 0 is destabilized by a negative (mass)
2
direction; we must also make sure that the potential is bounded from
below for arbitrarily large values of the scalar elds, so that V will
really have a minimum. (Recall from the discussion in sections 6.2
and 6.4 that scalar potentials in purely supersymmetric theories are
automatically positive and so clearly bounded from below. But, now that
we have introduced supersymmetry breaking, we must be careful.) The
scalar quartic interactions in V will stabilize the potential for almost all
arbitrarily large values of H
0
u
and H
0
d
. However, for the special directions
in eld space [H
0
u
[ = [H
0
d
[, the quartic contributions to V [the second line
in eq. (8.20)] are identically zero. Such directions in eld space are called
D-at directions, because along them the part of the scalar potential
coming from D-terms vanishes. In order for the potential to be bounded
from below, we need the quadratic part of the scalar potential to be
positive along the D-at directions. This requirement amounts to
2b < 2[[
2
+m
2
Hu
+m
2
H
d
. (8.22)
Interestingly, if m
2
Hu
= m
2
H
d
, the constraints eqs. (8.21) and (8.22) cannot
both be satised. In models derived from the minimal supergravity
or gauge-mediated boundary conditions, m
2
Hu
= m
2
H
d
is supposed to
hold at tree level at the input scale, but the X
t
contribution to the
RG equation for m
2
Hu
naturally pushes it to negative or small values
m
2
Hu
< m
2
H
d
at the electroweak scale, as we will see in section 10.1.
Unless this eect is large, the parameter space in which the electroweak
symmetry is broken would be quite small. So in these models electroweak
symmetry breaking is actually driven purely by quantum corrections; this
mechanism is therefore known as radiative electroweak symmetry breaking.
The realization [92, 97] that this works most naturally with a large top-
quark Yukawa coupling provides additional motivation for these models.
Having established the conditions necessary for H
0
u
and H
0
d
to get non-
zero VEVs, we can now require that they are compatible with the observed
phenomenology of electroweak symmetry breaking SU(2)
L
U(1)
Y

U(1)
EM
. Let us write
v
u
= H
0
u
), v
d
= H
0
d
). (8.23)
These VEVs can be connected to the known mass of the Z
0
boson and
the electroweak gauge couplings:
v
2
u
+v
2
d
= v
2
= 2m
2
Z
/(g
2
+g
2
) (174 GeV)
2
. (8.24)
The ratio of the two VEVs is traditionally written as
tan v
u
/v
d
. (8.25)
The value of tan is not xed by present experiments, but it depends
on the Lagrangian parameters of the MSSM in a calculable way. Since
v
u
= v sin and v
d
= v cos were taken to be real and positive, we have
0 < < /2, a requirement that will be sharpened below. Now one
can write down the conditions V/H
0
u
= V/H
0
d
= 0 under which the
potential eq. (8.20) will have a minimum satisfying eqs. (8.24) and (8.25):
[[
2
+m
2
Hu
= b cot + (m
2
Z
/2) cos 2. (8.26)
[[
2
+m
2
H
d
= b tan (m
2
Z
/2) cos 2; (8.27)
It is easy to check that these equations indeed satisfy the necessary
conditions eqs. (8.21) and (8.22). They allow us to eliminate two of the
Lagrangian parameters b and [[ in favor of tan , but do not determine
the phase of .
As an aside, we note that eqs. (8.27) and (8.26) highlight the
problem already mentioned in section 8.1. Suppose we view [[
2
, b, m
2
Hu
and m
2
H
d
as input parameters, and m
2
Z
and tan as output parameters
obtained by solving these two equations. Then, without miraculous
cancellations, we expect that all of the input parameters ought to be
within an order of magnitude or two of m
2
Z
. However, in the MSSM, is
a supersymmetry-respecting parameter appearing in the superpotential,
while b, m
2
Hu
, m
2
H
d
are supersymmetry-breaking parameters. This has
lead to a widespread belief that the MSSM must be extended at very
high energies to include a mechanism that relates the eective value of
to the supersymmetry-breaking mechanism in some way; see section 12.1
and Refs.[43, 44, 45] for examples.
The Higgs scalar elds in the MSSM consist of two complex SU(2)
L
-
doublet, or eight real, scalar degrees of freedom. When the electroweak
symmetry is broken, three of them are the would-be Nambu-Goldstone
bosons G
0
, G
, which become the longitudinal modes of the Z

0
and W
massive vector bosons. The remaining ve Higgs scalar mass eigenstates

consist of two CP-even neutral scalars h
0
and H
0
, one CP-odd neutral
scalar A
0
, and a charge +1 scalar H
+
and its conjugate charge 1 scalar
H
. (Here we dene G
= G
+
and H
= H
+
.) The gauge-eigenstate
elds can be expressed in terms of the mass eigenstate elds as:
_
H
0
u
H
0
d
_
=
_
v
u
v
d
_
+
1
2
R
_
h
0
H
0
_
+
i
2
R
0
_
G
0
A
0
_
(8.28)
_
H
+
u
H
d
_
= R
_
G
+
H
+
_
(8.29)
where the orthogonal rotation matrices
R
=
_
cos sin
sin cos
_
, (8.30)
R
0
=
_
sin
0
cos
0
cos
0
sin
0
_
, R
=
_
sin
cos
cos
sin
_
(8.31)
are chosen so that the quadratic part of the potential has diagonal
squared-masses:
V =
1
2
m
2
h
0
(h
0
)
2
+
1
2
m
2
H
0
(H
0
)
2
+
1
2
m
2
G
0
(G
0
)
2
+
1
2
m
2
A
0
(A
0
)
2
+m
2
G
[G
+
[
2
+m
2
H
[H
+
[
2
+. . . . (8.32)
Then, provided that v
u
, v
d
minimize the tree-level potential,
3
one nds
that
0
=
= , and m
2
G
0
= m
2
G
= 0, and
m
2
A
0
= 2b/ sin 2 (8.33)
m
2
h
0
,H
0
=
1
2
_
m
2
A
0
+m
2
Z

_
(m
2
A
0
+m
2
Z
)
2
4m
2
Z
m
2
A
0
cos
2
2
_
, (8.34)
m
2
H
= m
2
A
0
+m
2
W
. (8.35)
The mixing angle is determined at tree-level by
sin2
sin 2
=
_
m
2
H
0
+m
2
h
0
m
2
H
0
m
2
h
0
_
,
tan 2
tan 2
=
_
m
2
A
0
+m
2
Z
m
2
A
0
m
2
Z
_
, (8.36)
and is traditionally chosen to be negative; it follows that /2 < < 0
(provided m
A
0 > m
Z
). The Feynman rules for couplings of the mass
eigenstate Higgs scalars to the Standard Model quarks and leptons and
the electroweak vector bosons, as well as to the various sparticles, have
been worked out in detail in Ref.[98, 99].
The masses of A
0
, H
0
and H
can in principle be arbitrarily large since

they all grow with b/ sin 2. In contrast, the mass of h
0
is bounded from
above. It is not hard to show from eq. (8.34) that
m
h
0 < [ cos 2[m
Z
(8.37)
3
It is sometimes useful to expand around VEVs vu, v
d
that do not minimize the tree-
level potential, for example to minimize the loop-corrected eective potential instead.
In that case, the three angles , 0, and are all slightly dierent, and m
2
G
0 and
m
2
G
are non-zero and dier from each other by a small amount.

0 50 100 150 200 250 300
H
u
[GeV]
0
20
40
60
H
d

[
G
e
V
]
Fig. 8.8. A contour map of the Higgs potential, for a typical case with tan
cot 10. The symbol + marks the minimum of the potential, and each
curve outward represents an equipotential with V V
0
doubled. Oscillations of
the neutral Higgs elds along the shallow direction with H
0
u
/H
0
d
10 correspond
to the mass eigenstate h
0
, and the orthogonal steeper direction to H
0
.
at tree-level.[100] This corresponds to the existence of a shallow direction
in the scalar potential, along the direction (H
0
u
v
u
, H
0
d
v
d
)
(cos , sin ). A contour map of the potential, for a typical case with
tan cot 10, is shown in Figure 8.8. If the tree-level inequality
(8.37) were robust, the lightest Higgs boson of the MSSM would have been
discovered at LEP2. However, the tree-level mass formulas given above for
the Higgs mass eigenstates are subject to signicant quantum corrections,
which are especially important for h
0
. The largest such contributions
typically come from top and stop loops. In the limit of stop squark masses
m
e
t
= m
e
t
1
m
e
t
2
much greater than the top quark mass m
t
, one nds a
large positive one-loop radiative correction to eq. (8.34):
(m
2
h
0
) =
3
2
2
cos
2
y
2
t
m
2
t
ln(m
e
t
/m
t
). (8.38)
Including this and other corrections [101, 102], one can obtain only a
considerably weaker, but still very interesting, bound
m
h
0
<
130 GeV (8.39)

in the MSSM. This assumes that all of the sparticles that can contribute
to (m
2
h
0
) in loops have masses that do not exceed 1 TeV. By adding
extra supermultiplets to the MSSM, this bound can be made even
weaker. However, assuming that none of the MSSM sparticles have
masses exceeding 1 TeV and that all of the couplings in the theory remain
perturbative up to the unication scale, one still nds [103]
m
h
0
<
150 GeV. (8.40)

This bound is also weakened if, for example, the top squarks are heavier
than 1 TeV, but the upper bound rises only logarithmically with the
soft masses, as can be seen from eq. (8.38). Thus it is a fairly robust
prediction of supersymmetry at the electroweak scale that at least one
of the Higgs scalar bosons must be light. (However, if one is willing to
consider arbitrary extensions of the MSSM involving non-perturbative
physics near the TeV scale, none of these bounds apply.)
An interesting limit occurs when m
A
0 m
Z
. In that case, m
h
0 can
saturate the upper bounds just mentioned with m
h
0 m
Z
[ cos 2[+ loop
corrections. The particles A
0
, H
0
, and H
are much heavier and nearly

degenerate, forming an isospin doublet which decouples from suciently
low-energy experiments. The angle is very nearly /2. In this limit,
h
0
has the same couplings to quarks and leptons and electroweak gauge
bosons as would the physical Higgs boson of the ordinary Standard Model
without supersymmetry. Indeed, model-building experiences have shown
that it is not uncommon for h
0
to behave in a way nearly indistinguishable
from a Standard Model-like Higgs boson, even if m
A
0 is not too huge.
However, it should be kept in mind that the couplings of h
0
might turn
out to deviate in important ways from those of a Standard Model Higgs
boson. For a given set of model parameters, it is always important to
take into account the complete set of one-loop corrections and even the
dominant two-loop eects in order to get reasonably accurate predictions
for the Higgs masses and mixing angles.[101, 102]
In the MSSM, the masses and CKM mixing angles of the quarks and
leptons are determined by the Yukawa couplings of the superpotential and
the parameter tan . This is because the top, charm and up quarks get
masses proportional to v
u
= v sin and the bottom, strange, and down
quarks and the charge leptons get masses proportional to v
d
= v cos .
Therefore one nds at tree-level
m
t
= y
t
v
u
= y
t
v sin , (8.41)
m
b
= y
b
v
d
= y
b
v cos , (8.42)
m
= y
v
d
= y
v cos , (8.43)
with v 175 GeV. These relations hold for the running masses of
t, b, rather than the physical pole masses, which are signicantly larger.
Including those corrections, one can relate the Yukawa couplings to tan
and the known fermion masses and CKM mixing angles. It is now clear
why we have not neglected y
b
and y
, even though m
b
, m
m
t
. To a
rst approximation,
y
b
/y
t
= (m
b
/m
t
) tan (8.44)
y
/y
t
= (m
/m
t
) tan , (8.45)
so that y
b
and y
cannot be neglected if tan is much larger than 1. In

fact, there are good theoretical motivations for considering models with
large tan . For example, models based on the GUT gauge group SO(10)
(or certain of its subgroups) can unify the running top, bottom and tau
Yukawa couplings at the unication scale; this requires tan to be very
roughly of order m
t
/m
b
.
Note that if one tries to make sin too small, then y
t
will become
nonperturbatively large. Requiring that y
t
does not blow up above the
electroweak scale, one nds that tan
>
1.2 or so, depending on the

mass of the top quark, the QCD coupling, and other ne details. In
principle, one can also determine a lower bound on cos by requiring
that y
b
and y
are not nonperturbatively large. This gives a rough upper

bound of tan
<
65. However, this is complicated slightly by the fact

that the bottom quark mass gets signicant one-loop corrections in the
large tan limit [106]. One can obtain a slightly stronger upper bound
on tan in models where m
2
Hu
= m
2
H
d
at the input scale, by requiring
that y
b
does not signicantly exceed y
t
. In the following, we will see that
the parameter tan has an important eect on the masses and mixings
of the MSSM sparticles.
8.6 Neutralinos and charginos
The higgsinos and electroweak gauginos mix with each other because of
the eects of electroweak symmetry breaking. The neutral higgsinos (
H
0
u
and

H
0
d
) and the neutral gauginos (
B,

W
0
) combine to form four neutral
mass eigenstates called neutralinos. The charged higgsinos (
H
+
u
and

H
d
)
and winos (
W
+
and

W
) mix to form two mass eigenstates with charge

1 called charginos. We will denote
4
the neutralino and chargino mass
eigenstates by

N
i
(i = 1, 2, 3, 4) and

C
i
(i = 1, 2). By convention, these
are labelled in ascending order, so that m
e
N
1
< m
e
N
2
< m
e
N
3
< m
e
N
4
and
m
e
C
1
< m
e
C
2
. The lightest neutralino,

N
1
, is usually assumed to be the
LSP, unless there is a lighter gravitino or unless R-parity is not conserved,
because it is the only MSSM particle that can make a good cold dark
matter candidate. In this section, we will describe the mass spectrum
and mixing of the neutralinos and charginos in the MSSM.
In the gauge-eigenstate basis
0
= (
B,
W
0
,

H
0
d
,

H
0
u
), the neutralino
mass terms in the Lagrangian are
L
neutralino mass
=
1
2
(
0
)
T
M
e
N
0
+ c.c. (8.46)
4
Other common notations use e
0
i
or
e
Zi for neutralinos, and e
i
or
f
W
i
for charginos.
where
M
e
N
=
M
1
0 g
v
d
/
2 g
v
u
/
2
0 M
2
gv
d
/
2 gv
u
/
2
g
v
d
/
2 gv
d
/
2 0
g
v
u
/
2 gv
u
/
2 0
. (8.47)
The entries M
1
and M
2
in this matrix come directly from the MSSM soft
Lagrangian [see eq. (8.12)] while the entries are the supersymmetric
higgsino mass terms [see eq. (8.4)]. The terms proportional to g, g
are the
result of Higgs-higgsino-gaugino couplings [see eq. (6.76) and Fig. 6.3g],
with the Higgs scalars getting their VEVs [eqs. (8.24), (8.25)]. One can
also write this as
M
e
N
=
M
1
0 c
s
W
m
Z
s
s
W
m
Z
0 M
2
c
c
W
m
Z
s
c
W
m
Z
c
s
W
m
Z
c
c
W
m
Z
0
s
s
W
m
Z
s
c
W
m
Z
0
. (8.48)
Here we have introduced abbreviations s
= sin , c
= cos , s
W
=
sin
W
, and c
W
= cos
W
. The mass matrix M
e
N
can be diagonalized by
a unitary matrix N with
N
i
= N
ij
0
j
, (8.49)
so that
M
diag
e
N
= N
M
e
N
N
1
(8.50)
has positive real entries m
e
N
1
, m
e
N
2
, m
e
N
3
, m
e
N
4
on the diagonal. These are
the absolute values of the eigenvalues of M
e
N
, or equivalently the square
roots of the eigenvalues of M
e
N
M
e
N
. The indices (i, j) on N
ij
are (mass,
gauge) eigenstate labels. The mass eigenvalues and the mixing matrix
N
ij
can be given in closed form in terms of the parameters M
1
, M
2
,
and tan , but the results are very complicated and not very illuminating.
In general, the parameters M
1
, M
2
, and can have arbitrary complex
phases. In the broad class of minimal supergravity or gauge-mediated
models satisfying the gaugino unication conditions eq. (9.27) or (9.40),
M
2
and M
1
will have the same complex phase, which is preserved by RG
evolution eq. (10.5). In that case, a redenition of the phases of

B and
W allows us to make M
1
and M
2
both real and positive. The phase of
is then really a physical parameter and cannot be rotated away. [We
have already used up the freedom to redene the phases of the Higgs
elds, since we have picked b and H
0
u
) and H
0
d
) to be real and positive,
to guarantee that the o-diagonal entries in eq. (8.48) proportional to
m
Z
are real.] However, if is not real, then there can be potentially
disastrous CP-violating eects in low-energy physics, including electric
dipole moments for both the electron and the neutron. Therefore, it
is usual (although not mandatory because of the possibility of nontrivial
cancellations) to assume that is real in the same set of phase conventions
that make M
1
, M
2
, b, H
0
u
) and H
0
d
) real and positive. The sign of is
still undetermined by this constraint.
In models which satisfy eq. (10.7), one has the nice prediction
M
1

5
3
tan
2
W
M
2
0.5M
2
(8.51)
at the electroweak scale. If so, then the neutralino masses and mixing
angles depend on only three unknown parameters. This assumption
is suciently theoretically compelling that it has been made in almost
all phenomenological studies; nevertheless it should be recognized as an
assumption, to be tested someday by experiment.
Specializing further, there is a not-unlikely limit in which electroweak
symmetry breaking eects can be viewed as a small perturbation on the
neutralino mass matrix. If
m
Z
[ M
1
[, [ M
2
[ (8.52)
then the neutralino mass eigenstates are very nearly a bino-like

N
1

B;
a wino-like

N
2

W
0
; and higgsino-like

N
3
,

N
4
(
H
0
u
H
0
d
)/
2, with
mass eigenvalues:
m
e
N
1
= M
1
m
2
Z
s
2
W
(M
1
+sin 2)
2
M
2
1
+. . . (8.53)
m
e
N
2
= M
2
m
2
W
(M
2
+sin 2)
2
M
2
2
+. . . (8.54)
m
e
N
3
, m
e
N
4
= [[ +
m
2
Z
(1 sin2)([[ +M
1
c
2
W
+M
2
s
2
W
)
2([[ +M
1
)([[ +M
2
)
+. . . , (8.55)
[[ +
m
2
Z
(1 + sin2)([[ M
1
c
2
W
M
2
s
2
W
)
2([[ M
1
)([[ M
2
)
+. . . (8.56)
where we have assumed is real with sign = 1. The labeling of
the mass eigenstates

N
1
and

N
2
assumes M
1
< M
2
< [[; otherwise the
subscripts may need to be rearranged. It turns out that a bino-like
LSP

N
1
can very easily have the right cosmological abundance to make
a good dark matter candidate, so the large [[ limit may be preferred
from that point of view. In addition, this limit tends to emerge from
minimal supergravity boundary conditions on the soft parameters, which
often require [[ to be larger than M
1
and M
2
in order to get correct
electroweak symmetry breaking.
The chargino spectrum can be analyzed in a similar way. In the gauge-
eigenstate basis
= (
W
+
,

H
+
u
,

W
,

H
d
), the chargino mass terms in
the Lagrangian are
L
chargino mass
=
1
2
(
)
T
M
e
C
+ c.c. (8.57)
where, in 2 2 block form,
M
e
C
=
_
0 X
T
X 0
_
(8.58)
with
X =
_
M
2
gv
u
gv
d

_
=
_
M
2
2s
m
W
2c
m
W

_
. (8.59)
The mass eigenstates are related to the gauge eigenstates by two unitary
22 matrices U and V according to
_
C
+
1
C
+
2
_
= V
_
W
+
H
+
u
_
;
_
2
_
= U
_
d
_
. (8.60)
Note that the mixing matrix for the positively charged left-handed
fermions is dierent from that for the negatively charged left-handed
fermions. They are chosen so that
U
XV
1
=
_
m
e
C
1
0
0 m
e
C
2
_
, (8.61)
with positive real entries m
e
C
i
. Because these are only 22 matrices, it is
not hard to solve for the masses explicitly:
m
2
e
C
1
, m
2
e
C
2
=
1
2
_
[M
2
[
2
+[[
2
+ 2m
2
W
_
([M
2
[
2
+[[
2
+ 2m
2
W
)
2
4[M
2
m
2
W
sin 2[
2
_
. (8.62)
These are the (doubly degenerate) eigenvalues of the 4 4 matrix
M
e
C
M
e
C
, or equivalently the eigenvalues of X
X, since
VX
XV
1
= U
XX
U
T
=
m
2
e
C
1
0
0 m
2
e
C
2
. (8.63)
8.7 The gluino 195
(But, they are not the squares of the eigenvalues of X.) In the limit
of eq. (8.52) with real M
2
and , one nds that the charginos mass
eigenstates consist of a wino-like

C
1
and and a higgsino-like

C
2
, with
masses
m
e
C
1
= M
2
m
2
W
(M
2
+sin2)
2
M
2
2
+. . . (8.64)
m
e
C
2
= [[ +
m
2
W
([[ +M
2
sin2)
2
M
2
2
+. . . . (8.65)
Here again the labeling assumes M
2
< [[, and is the sign of .
Amusingly, the lighter chargino

C
1
is nearly degenerate with the second
lightest neutralino

N
2
in this limit, but this is not an exact result. Their
higgsino-like colleagues

N
3
,

N
4
and

C
2
have masses of order [[. The
case of M
1
0.5M
2
[[ is not uncommonly found in viable models
following from the boundary conditions in section 9, and it has been
elevated to the status of a benchmark scenario in many phenomenological
studies. However it cannot be overemphasized that such expectations are
not mandatory.
In practice, the masses and mixing angles for the neutralinos and
charginos are best computed numerically. The corresponding Feynman
rules may be inferred in terms of N, U and V from the MSSM Lagrangian
as discussed above; they are collected in Refs.[21, 98]
8.7 The gluino
The gluino is a color octet fermion, so it cannot mix with any other
particle in the MSSM, even if R-parity is violated. In this regard, it is
unique among all of the MSSM sparticles. In the models following from
minimal supergravity or gauge-mediated boundary conditions, the gluino
mass parameter M
3
is related to the bino and wino mass parameters M
1
and M
2
by eq. (10.7)
M
3
=

s
sin
2
W
M
2
=
3
5
cos
2
W
M
1
(8.66)
at any RG scale, up to small two-loop corrections. If we use values
s
=
0.118, = 1/128, sin
2
W
= 0.23, then one nds the rough prediction
M
3
: M
2
: M
1
7 : 2 : 1 (8.67)
at the electroweak scale. In particular, we suspect that the gluino should
be much heavier than the lighter neutralinos and charginos.
For more precise estimates, one must take into account the fact that the
parameter M
3
is really a running mass which has an implicit dependence
on the RG scale Q. Because the gluino is a strongly interacting particle,
M
3
runs rather quickly with Q [see eq. (10.5)]. A more useful quantity
physically is the RG scale-independent mass m
eg
at which the renormalized
gluino propagator has a pole. Including one-loop corrections to the gluino
propagator due to gluon exchange and quark-squark loops, one nds that
the pole mass is given in terms of the running mass in the DR scheme by
m
eg
= M
3
(Q)
_
1 +

s
4
_
15 + 6 ln(Q/M
3
) +
A
e q
_
(8.68)
where
A
e q
=
_
1
0
dx x ln
_
xm
2
e q
/M
2
3
+ (1 x)m
2
q
/M
2
3
x(1 x)
. (8.69)
The sum in eq. (8.68) is over all 12 squark-quark supermultiplets, and we
have neglected small eects due to squark mixing. It is easy to check that
requiring m
eg
to be independent of Q in eq. (8.68) reproduces the one-loop
RG equation for M
3
(Q) in eq. (10.5). The correction terms proportional
to
s
in eq. (8.68) can be quite signicant, so that m
eg
/M
3
(M
3
) can exceed
unity by 25% or more. The reasons for this are that the gluino is strongly
interacting, with a large group theory factor [the 15 in eq. (8.68)] due to
its color octet nature, and that it couples to all the squark-quark pairs.
Of course, there are similar corrections which relate the running masses
of all the other MSSM particles to their physical masses. These have
been systematically evaluated at one-loop order in Ref.[107] They are
more complicated in form and usually numerically smaller than for the
gluino, but in some cases they could be quite important in future eorts
to connect a given candidate model for the soft terms to experimentally
measured masses and mixing angles of the MSSM particles.
8.8 Squarks and sleptons
In principle, any scalars with the same electric charge, R-parity, and
color quantum numbers can mix with each other. This means that with
completely arbitrary soft terms, the mass eigenstates of the squarks and
sleptons of the MSSM should be obtained by diagonalizing three 6 6
(mass)
2
matrices for up-type squarks ( u
L
, c
L
,

t
L
, u
R
, c
R
,

t
R
), down-type
squarks (
d
L
, s
L
,

b
L
,

d
R
, s
R
,

b
R
), and charged sleptons ( e
L
,
L
,
L
, e
R
,

R
,
R
), and one 33 matrix for sneutrinos (
e
,
). Fortunately, the
general hypothesis of avor-blind soft parameters eqs. (8.15) and (8.16)
predicts that most of these mixing angles are very small. The third-
family squarks and sleptons can have very dierent masses compared to
their rst- and second-family counterparts, because of the eects of large
Yukawa (y
t
, y
b
, y
) and soft (a
t
, a
b
, a
) couplings in the RG equations

(10.20)-(10.24). Furthermore, they can have substantial mixing in pairs
(
t
L
,

t
R
), (
b
L
,

b
R
) and (
L
,
R
). In contrast, the rst- and second-family
squarks and sleptons have negligible Yukawa couplings, so they end up
in 7 very nearly degenerate, unmixed pairs ( e
R
,
R
), (
e
,
), ( e
L
,
L
),
( u
R
, c
R
), (
d
R
, s
R
), ( u
L
, c
L
), (
d
L
, s
L
). As we have already discussed in
section 8.4, this avoids the problem of disastrously large virtual sparticle
contributions to FCNC processes.
Let us rst consider the spectrum of rst- and second-family squarks
and sleptons. In models tting into both of the broad categories of
minimal supergravity [eq. (9.28)] or gauge-mediated [eq. (9.42)] boundary
conditions, their running masses can be conveniently parameterized in the
following way:
m
2
Q
1
= m
2
Q
2
= m
2
0
+K
3
+K
2
+
1
36
K
1
, (8.70)
m
2
u
1
= m
2
u
2
= m
2
0
+K
3
+
4
9
K
1
, (8.71)
m
2
d
1
= m
2
d
2
= m
2
0
+K
3
+
1
9
K
1
, (8.72)
m
2
L
1
= m
2
L
2
= m
2
0
+K
2
+
1
4
K
1
, (8.73)
m
2
e
1
= m
2
e
2
= m
2
0
+ K
1
. (8.74)
In minimal supergravity models, m
2
0
is the common scalar (mass)
2
which
appears in eq. (9.28). It can be 0 in the no-scale limit, but it could
also be the dominant source of the scalar masses. The contributions
K
3
, K
2
and K
1
are due to the RG running proportional to the gaugino
masses; see eq. (10.14). They are strictly positive. A key point is that the
same K
3
, K
2
and K
1
appear everywhere in eqs. (8.70)-(8.74), since all
of the chiral supermultiplets couple to the same gauginos with the same
gauge couplings. The dierent coecients in front of K
1
just correspond
to the various values of weak hypercharge squared for each scalar. The
quantities K
1
, K
2
, K
3
depend on the RG scale Q at which they are
evaluated. Explicitly, they are found by solving eq. (10.14):
K
a
(Q) =
3/5
3/4
4/3
1
2
2
_
lnQ
0
lnQ
dt g
2
a
(t) [M
a
(t)[
2
(a = 1, 2, 3). (8.75)
Here Q
0
is the input RG scale at which the boundary condition eq. (9.28)
is applied, and Q should be taken to be evaluated near the squark and
slepton mass under consideration, presumably less than about 1 TeV or
so. The values of the running parameters g
a
(Q) and M
a
(Q) can be found
using eqs. (8.18) and (10.7). If the input scale is approximated by the
apparent scale of gauge coupling unication Q
0
= M
U
2 10
16
GeV,
one nds that numerically
K
1
0.15m
2
1/2
; K
2
0.5m
2
1/2
; K
3
(4.5 to 6.5)m
2
1/2
. (8.76)
for Q near 1 TeV. Here m
1/2
is the common gaugino mass parameter
at the unication scale. Note that K
3
K
2
K
1
; this is a direct
consequence of the relative sizes of the gauge couplings g
3
, g
2
, and g
1
.
The large uncertainty in K
3
is due in part to the experimental uncertainty
in the QCD coupling constant, and in part to the uncertainty in where
to choose Q, since K
3
runs rather quickly below 1 TeV. If the gauge
couplings and gaugino masses are unied between M
U
and M
P
, as would
occur in a GUT model, then the eect of RG running for M
U
< Q < M
P
can be absorbed into a redenition of m
2
0
. Otherwise, it adds a further
uncertainty which is roughly proportional to ln(M
P
/M
U
), compared to
the larger contributions in eq. (8.75) which go roughly like ln(M
U
/1 TeV).
In gauge-mediated models, the same parameterization eqs. (8.70)-(8.74)
holds, but m
2
0
is always 0. At the input scale Q
0
, each MSSM scalar gets
contributions to its (mass)
2
which depend only on its gauge interactions,
as in eq. (9.42). It is not hard to see that in general these contribute
in exactly the same pattern as K
1
, K
2
, and K
3
in eq. (8.70)-(8.74).
The subsequent evolution of the scalar squared masses down to the
electroweak scale again just yields more contributions to the K
1
, K
2
, and
K
3
parameters. It is somewhat more dicult to give meaningful numerical
estimates for these parameters in gauge-mediated models than in the
minimal supergravity models, because of uncertainties in the messenger
mass scale(s) and in the multiplicities of the messenger elds. However,
in the gauge-mediated case one quite generally expects that the numerical
values of the ratios K
3
/K
2
, K
3
/K
1
and K
2
/K
1
should be even larger than
in eq. (8.76). There are two reasons for this. First, the running squark
squared masses start o larger than slepton squared masses already at
the input scale in gauge-mediated models, rather than having a common
value m
2
0
. Furthermore, in the gauge-mediated case, the input scale
Q
0
is typically much lower than M
P
or M
U
, so that the RG evolution
gives relatively more weight to smaller RG scales where the hierarchies
g
3
> g
2
> g
1
and M
3
> M
2
> M
1
are already in eect.
In general, one therefore expects that the squarks should be consider-
ably heavier than the sleptons, with the eect being more pronounced
in gauge-mediated supersymmetry breaking models than in minimal
supergravity models. For any specic choice of model, this eect can be
easily quantied with an RG analysis. The hierarchy m
squark
> m
slepton
tends to hold even in models which do not really t into any of the
categories outlined in section 9, because the RG contributions to squark
masses from the gluino are always present and usually quite large, since
QCD has a larger gauge coupling than the electroweak interactions.
There is also a hyperne splitting in the squark and slepton mass
spectrum produced by electroweak symmetry breaking. Each squark
and slepton will get a contribution
to its (mass)
2
, coming from
the SU(2)
L
and U(1)
Y
D-term quartic interactions [see the last term in
eq. (6.79)] of the form (squark)
2
(Higgs)
2
and (slepton)
2
(Higgs)
2
, when the
neutral Higgs scalars H
0
u
and H
0
d
get VEVs. They are model-independent
for a given value of tan , and are given by
= (T
3
Q
EM
sin
2
W
) cos 2 m
2
Z
, (8.77)
where T
3
and Q
EM
are the third component of weak isospin and the
electric charge of the chiral supermultiplet to which belongs. [For ex-
ample,
u
= (
1
2

2
3
sin
2
W
) cos 2 m
2
Z
and
u
= (
2
3
sin
2
W
) cos 2 m
2
Z
].
These D-term contributions are typically smaller than the m
2
0
and K
1
,
K
2
, K
3
contributions, but should not be neglected. They split apart the
components of the SU(2)
L
-doublet sleptons and squarks L
1
= (
e
, e
L
),
etc. Including them, the rst-family squark and slepton masses are now
given by:
m
2
e
d
L
= m
2
0
+K
3
+K
2
+
1
36
K
1
+
d
, (8.78)
m
2
e u
L
= m
2
0
+K
3
+K
2
+
1
36
K
1
+
u
, (8.79)
m
2
e u
R
= m
2
0
+K
3
+
4
9
K
1
+
u
, (8.80)
m
2
e
d
R
= m
2
0
+K
3
+
1
9
K
1
+
d
, (8.81)
m
2
e e
L
= m
2
0
+K
2
+
1
4
K
1
+
e
, (8.82)
m
2
e
= m
2
0
+K
2
+
1
4
K
1
+
, (8.83)
m
2
e e
R
= m
2
0
+ K
1
+
e
, (8.84)
with identical formulas for the second-family squarks and sleptons. The
mass splittings for the left-handed squarks and sleptons are governed by
model-independent sum rules
m
2
e e
L
m
2
e e
= m
2
e
d
L
m
2
e u
L
= cos 2 m
2
W
. (8.85)
Since cos 2 < 0 in the allowed range tan > 1, it follows that m
e e
L
> m
e e
and m
e
d
L
> m
e u
L
, with the magnitude of the splittings constrained by
electroweak symmetry breaking.
Let us next consider the masses of the top squarks, for which there
are several non-negligible contributions. First, there are (mass)
2
terms
for

t
t
L
and

t
t
R
which are just equal to m
2
Q
3
+
u
and m
2
u
3
+
u
,
respectively, just as for the rst- and second-family squarks. Second,
there are contributions equal to m
2
t
for each of

t
t
L
and

t
t
R
. These
come from F-terms in the scalar potential of the form y
2
t
H
0
u
H
0
u
t
L
and
y
2
t
H
0
u
H
0
u
t
R
(see Figs. 8.2b and 8.2c), with the Higgs elds replaced
by their VEVs. These contributions are of course present for all of the
squarks and sleptons, but they are much too small to worry about except
in the case of the top squarks. Third, there are contributions to the scalar
potential from F-terms of the form y
t
tH
0
d
+ c.c.; see eqs. (8.6) and
Fig. 8.4a. These become vy
t
cos
t
L
+ c.c. when H
0
d
is replaced by
its VEV. Finally, there are contributions to the scalar potential from the
soft (scalar)
3
couplings a
t
Q
3
H
0
u
+ c.c. [see the rst term of the second
line of eq. (8.12) and eq. (10.8)], which become a
t
v sin
t
L
R
+ c.c. when
H
0
u
is replaced by its VEV. Putting these all together, we have a (mass)
2
matrix for the top squarks, which in the gauge-eigenstate basis (
t
L
,

t
R
)
is given by
L
_
L

t
R
_
m
2
e
t
_
t
L
t
R
_
(8.86)
where
m
2
e
t
=
_
m
2
Q
3
+m
2
t
+
u
v(a
t
sin y
t
cos )
v(a
t
sin y
t
cos ) m
2
u
3
+m
2
t
+
u
_
. (8.87)
This matrix can be diagonalized to give mass eigenstates
_
t
1
t
2
_
=
_
cos
e
t
sin
e
t
sin
e
t
cos
e
t
__
t
L
t
R
_
(8.88)
with m
2
e
t
1
< m
2
e
t
2
being the eigenvalues of eq. (8.87) and 0
e
t
.
Because of the large RG eects proportional to X
t
in eq. (10.20) and
eq. (10.21), at the electroweak scale one nds that m
2
u
3
< m
2
Q
3
, and
both of these quantities are usually signicantly smaller than the squark
squared masses for the rst two families. The diagonal terms m
2
t
in
eq. (8.87) tend to mitigate this eect somewhat, but the o-diagonal
entries will typically induce a signicant mixing which always reduces the
lighter top-squark (mass)
2
eigenvalue. For this reason, it is often found
in models that

t
1
is the lightest squark of all.
A very similar analysis can be performed for the bottom squarks and
charged tau sleptons, which in their respective gauge-eigenstate bases (
b
L
,
b
R
) and (
L
,
R
) have (mass)
2
matrices:
m
2
e
b
=
m
2
Q
3
+
d
v(a
b
cos y
b
sin )
v(a
b
cos y
b
sin ) m
2
d
3
+
d
; (8.89)
m
2
e
=
_
m
2
L
3
+
e
v(a
cos y
sin )
v(a
cos y
sin ) m
2
e
3
+
e
_
. (8.90)
These can be diagonalized to give mass eigenstates

b
1
,
b
2
and
1
,
2
in
exact analogy with eq. (8.88).
The magnitude and importance of mixing in the sbottom and stau
sectors depends on how large tan is. If tan is not too large (in practice,
this usually means less than about 10 or so, depending on the situation
under study), the sbottoms and staus do not get a very large eect from
the mixing terms and the RG eects due to X
b
and X
, because y
b
, y
y
t
from eqs. (8.44,8.45). In that case the mass eigenstates are very nearly the
same as the gauge eigenstates
b
L
,
b
R
,
L
and
R
. The latter three, and
,
will be nearly degenerate with their rst- and second-family counterparts
with the same SU(3)
C
SU(2)
L
U(1)
Y
quantum numbers. However,
even in the case of small tan ,

b
L
will feel the eects of the large top
Yukawa coupling because it is part of the doublet

Q
3
which contains

t
L
.
In particular, from eq. (10.20) we see that X
t
acts to decrease m
2
e
Q
3
as it is
RG-evolved down from the input scale to the electroweak scale. Therefore
the mass of
b
L
can be signicantly less than the masses of

d
L
and s
L
.
For larger values of tan , the mixing in eqs. (8.89) and (8.90) can be
quite signicant, because y
b
, y
and a
b
, a
are non-negligible. Just as in

the case of the top squarks, the lighter sbottom and stau mass eigenstates
(denoted
b
1
and
1
) can be signicantly lighter than their rst- and second-
family counterparts. Furthermore,
can be signicantly lighter than the

nearly degenerate
e
,
.
The requirement that the third-family squarks and sleptons should all
have positive (mass)
2
implies limits on the sizes of a
t
sin y
t
cos ,
a
b
cos y
b
sin, and a
cos y
sin . If they are too large, the

smaller eigenvalue of eq. (8.87), (8.89) or (8.90) will be driven negative,
implying that a squark or charged slepton gets a VEV, breaking SU(3)
C
or electromagnetism. Since this is clearly unacceptable, one can put
bounds on the (scalar)
3
couplings, or equivalently on the parameter A
0
in
minimal supergravity models. Even if all of the (mass)
2
eigenvalues are
Table 8.1. Undiscovered particles in the Minimal Supersymmetric Standard
Model
Names Spin P
R
Mass Eigenstates Gauge Eigenstates
Higgs bosons 0 +1 h
0
H
0
A
0
H
H
0
u
H
0
d
H
+
u
H
d
u
L
u
R

d
L

d
R

squarks 0 1 s
L
s
R
c
L
c
R

t
1

t
2

b
1

b
2

t
L

t
R

b
L

b
R
e
L
e
R

e

sleptons 0 1
L

R

1

2

L

R

neutralinos 1/2 1

N
1

N
2

N
3

N
4

B
0

W
0

H
0
u

H
0
d
charginos 1/2 1

C

H
+
u

H
d
gluino 1/2 1 g
gravitino/
goldstino
3/2 1

G
positive, the presence of large (scalar)
3
couplings can yield global minima
of the scalar potential with non-zero squark and/or charged slepton VEVs
which are disconnected from the vacuum which conserves SU(3)
C
and
electromagnetism. However, it is not always clear whether the non-
existence of such disconnected global minima should really be taken as
a constraint, because the tunneling rate from our good vacuum to the
bad vacua can easily be much longer than the age of the universe.
8.9 Summary: the MSSM sparticle spectrum
In the MSSM there are 32 distinct masses corresponding to undiscovered
particles, not including the gravitino. In this section we have explained
how the masses and mixing angles for these particles can be computed,
given an underlying model for the soft terms at some input scale.
Assuming only that the mixing of rst- and second-family squarks and
sleptons is negligible, the mass eigenstates of the MSSM are listed in
Table 8.1. A complete set of Feynman rules for the interactions of
these particles with each other and with the Standard Model quarks,
leptons, and gauge bosons can be found in Refs.[21, 98] Specic models
for the soft terms typically predict the masses and the mixing angles
angles for the MSSM in terms of far fewer parameters. For example,
in the minimal supergravity models, one has only the parameters m
2
0
,
m
1/2
, A
0
, , and b which are not already measured by experiment. On
the other hand, in gauge-mediated supersymmetry breaking models, the
free parameters include at least the scale , the typical messenger mass
scale M
mess
, the integer number N
5
of copies of the minimal messengers,
the goldstino decay constant F), and the Higgs mass parameters and
b. After RG evolving the soft terms down to the electroweak scale, one
can impose that the scalar potential gives correct electroweak symmetry
breaking. This allows us to trade [[ and b (or B
0
) for one parameter
tan , as in eqs. (8.27)-(8.26). So, to a reasonable approximation, the
entire mass spectrum in minimal supergravity models is determined by
only ve unknown parameters: m
2
0
, m
1/2
, A
0
, tan , and Arg(), while
in the simplest gauge-mediated supersymmetry breaking models one can
pick parameters , M
mess
, N
5
, F), tan , and Arg(). Both frameworks
are highly predictive. Of course, it is easy to imagine that the essential
physics of supersymmetry breaking is not captured by either of these two
scenarios in their minimal forms.
While it would be a mistake to underestimate the uncertainties in the
MSSM mass and mixing spectrum, it is also useful to keep in mind some
general lessons that recur in various dierent scenarios. Indeed, there
has emerged a sort of folklore concerning likely features of the MSSM
spectrum, which is partly based on theoretical bias and partly on the
constraints inherent in any supersymmetric theory. We remark on these
features mainly because they represent the prevailing prejudice among
supersymmetry theorists, which is certainly a useful thing for the reader
to know even if he or she wisely decides to remain skeptical. For example,
it is perhaps not unlikely that:
The LSP is the lightest neutralino

N
1
, unless the gravitino is lighter
or R-parity is not conserved. If > M
1
, M
2
, then

N
1
is likely to
be bino-like, with a mass roughly 0.5 times the masses of

N
2
and
C
1
. In the opposite case < M
1
, M
2
, then

N
1
has a large higgsino
content and

N
2
and

C
1
are not much heavier.
The gluino will be much heavier than the lighter neutralinos and
charginos. This is certainly true in the case of the standard
gaugino mass relation eq. (10.7); more generally, the running gluino
mass parameter grows relatively quickly as it is RG-evolved into the
infrared because the QCD coupling is larger than the electroweak
gauge couplings. So even if there are big corrections to the gaugino
mass boundary conditions eqs. (9.27) or (9.40), the gluino mass
parameter M
3
is likely to come out larger than M
1
and M
2
.
The squarks of the rst and second families are nearly degenerate
and much heavier than the sleptons. This is because each squark
mass gets the same large positive-denite radiative corrections from
loops involving the gluino. The left-handed squarks u
L
,

d
L
, s
L
and
c
L
are likely to be heavier than their right-handed counterparts u
R
,
d
R
, s
R
and c
R
, because of the eect of K
2
in eqs. (8.78)-(8.84).
The squarks of the rst two families cannot be lighter than about
0.8 times the mass of the gluino in minimal supergravity models,
and about 0.6 times the mass of the gluino in the simplest gauge-
mediated models as discussed in section 9.4 if the number of
messenger squark pairs is N
5
4. In the minimal supergravity
case this is because the gluino mass feeds into the squark masses
through RG evolution; in the gauge-mediated case it is because the
gluino and squark masses are tied together by eqs. (9.40) and (9.42)
[multiplied by N
5
, as explained at the end of section 9.4].
The lighter stop

t
1
and the lighter sbottom

b
1
are probably the
lightest squarks. This is because stop and sbottom mixing eects
and the eects of X
t
and X
b
in eqs. (10.20)-(10.22) both tend to
decrease the lighter stop and sbottom masses.
The lightest charged slepton is probably a stau
1
. The mass
dierence m
e e
R
m
e
1
is likely to be signicant if tan is large,
because of the eects of a large tau Yukawa coupling. For smaller
tan ,
1
is predominantly
R
and it is not so much lighter than e
R
,

R
.
The left-handed charged sleptons e
L
and
L
are likely to be heavier
than their right-handed counterparts e
R
and
R
. This is because of
the eect of K
2
in eq. (8.82). (Note also that
e

e
is positive
but very small because of the numerical accident sin
2
W
1/4.)
The lightest neutral Higgs boson h
0
should be lighter than about
150 GeV, and may be much lighter than the other Higgs scalar mass
eigenstates A
0
, H
, H
0
.
In Figure 8.9 we show a qualitative sketch of a sample MSSM mass
spectrum which illustrates these features. Variations in the model
parameters can have important and predictable eects. For example,
taking larger (smaller) m
2
0
in minimal supergravity models will tend to
move the entire spectrum of squarks, sleptons and the Higgs scalars A
0
,
H
, H
0
higher (lower) compared to the neutralinos, charginos and gluino;
taking larger values of tan with other model parameters held xed will
usually tend to lower

b
1
and
1
masses compared to those of the other
N
1
N
2
C
1
N
3
, N
4 C
2
g
e
R
e
, e
L
,
L

2
,
d
R
, u
R
u
L
, d
L
s
R
, c
R
c
L
, s
L
1
t
1
b
1
b
2
, t
2
h
0
A
0
, H
0
, H
+
Mass
Fig. 8.9. A schematic sample spectrum for the undiscovered particles in the
MSSM. This spectrum is presented for entertainment purposes only. No
warranty, expressed or implied, guarantees that this spectrum looks anything
like the real world.
sparticles, etc. The important point is that by measuring the masses
and mixing angles of the MSSM particles we will be able to gain a great
deal of information which can rule out or bolster evidence for competing
proposals for the origin of supersymmetry breaking. Testing the various
possible organizing principles will provide the high-energy physicists of
the next millennium with an exciting challenge.
9
Origins of supersymmetry breaking
9.1 General considerations for supersymmetry breaking
In the MSSM, supersymmetry breaking is simply introduced explicitly.
However, we have seen that the soft parameters cannot be arbitrary. In
order to understand how patterns like eqs. (8.15), (8.16) and (8.17) can
emerge, it is necessary to consider models in which supersymmetry is
spontaneously broken. By denition, this means that the vacuum state
[0) is not invariant under supersymmetry transformations, so Q
[0) ,= 0
and Q

[0) ,= 0. Now, in global supersymmetry, the Hamiltonian operator
H can be related to the supersymmetry generators through the algebra
eq. (6.32):
H = P
0
=
1
4
(Q
1
Q
1
+Q
1
Q
1
+Q
2
Q
2
+Q
2
Q
2
). (9.1)
If supersymmetry is unbroken in the vacuum state, it follows that
H[0) = 0 and the vacuum has zero energy. Conversely, if supersymmetry
is spontaneously broken in the vacuum state, then the vacuum must have
positive energy, since
0[H[0) =
1
4
_
|Q
1
[0)|
2
+|Q
1
[0)|
2
+|Q
2
[0)|
2
+|Q
2
[0)|
2
_
> 0 (9.2)
if the Hilbert space is to have positive norm. If spacetime-dependent
eects and fermion condensates can be neglected, then 0[H[0) = 0[V [0),
where V is the scalar potential in eq. (6.79). Therefore supersymmetry
will be spontaneously broken if F
i
and/or D
a
does not vanish in the
ground state. Note that if any state exists in which all F
i
and D
a
vanish,
then it will have zero energy, implying that supersymmetry cannot be
spontaneously broken in the true ground state. Therefore the way to
achieve spontaneous supersymmetry breaking is to look for models in
206
which the equations F
i
= 0 and D
a
= 0 cannot be simultaneously satised
for any values of the elds.
Supersymmetry breaking with non-zero D-terms can be achieved
through the Fayet-Iliopoulos mechanism. If the gauge symmetry includes
a U(1) factor, then one can introduce a term linear in the corresponding
auxiliary eld of the gauge supermultiplet:
L
FayetIliopoulos
= D (9.3)
where is a constant parameter with dimensions of (mass)
2
. This
term is gauge-invariant and supersymmetric by itself. [Note that the
supersymmetry transformation D in eq. (6.63) is a total derivative for a
U(1) gauge symmetry.] If we include it in the Lagrangian, then D may get
a non-zero VEV, depending on the other interactions of the scalar elds
that are charged under the U(1). To see this, we can write the relevant
part of the scalar potential using eqs. (6.58) and (6.76) as
V =
1
2
D
2
D +gD
i
q
i
i
(9.4)
where the q
i
are the charges of the scalar elds
i
under the U(1) gauge
group in question. The presence of the Fayet-Iliopoulos term modies the
equation of motion eq. (6.78) to
D = g
i
q
i
i
. (9.5)
Now suppose that the scalar elds
i
have other interactions (such as
large superpotential mass terms) which prevent them from getting VEVs.
Then the auxiliary eld D will be forced to get a VEV equal to , and
supersymmetry will be broken. This mechanism cannot work for non-
abelian gauge groups, however, since the analog of eq. (9.3) would not be
gauge-invariant.
In the MSSM, one can imagine that the D term for U(1)
Y
has a Fayet-
Iliopoulos term which is the principal source of supersymmetry breaking.
Unfortunately, this would be an immediate disaster, because at least
some of the squarks and sleptons would just get non-zero VEVs (breaking
color, electromagnetism, and/or lepton number, but not supersymmetry)
in order to satisfy eq. (9.5), because they do not have superpotential
mass terms. This means that a Fayet-Iliopoulos term for U(1)
Y
must be
subdominant compared to other sources of supersymmetry breaking in
the MSSM, if not absent altogether. One could also attempt to trigger
supersymmetry breaking with a Fayet-Iliopoulos term for some other U(1)
gauge symmetry which is as yet unknown because it is spontaneously
208 9 Origins of supersymmetry breaking
broken at a very high mass scale or because it does not couple to the
Standard Model particles. However, if this is the ultimate source for
supersymmetry breaking, it proves dicult to give appropriate masses
to all of the MSSM particles, especially the gauginos. In any case, we
will not discuss D-term breaking as the ultimate origin of supersymmetry
violation any further, although it may not be ruled out.
Models where supersymmetry breaking is due to non-zero F-terms,
called ORaifeartaigh models, may have brighter phenomenological
prospects. The idea is to pick a set of chiral supermultiplets
i

(
i
,
i
, F
i
) and a superpotential W in such a way that the equations
F
i
= W
/
i
= 0 have no simultaneous solution. Then V =

i
[F
i
[
2
will have to be positive at its minimum, ensuring that supersymmetry
is broken. The simplest example which does this has three chiral
supermultiplets with
W = k
1
+m
2
3
+
y
2
2
3
. (9.6)
Note that W contains a linear term, with k having dimensions of (mass)
2
.
This is only possible if
1
is a gauge singlet. In section 6 we cheated and
did not mention such a term, because we knew that the MSSM contains
no such singlet chiral supermultiplet. Nevertheless, it should be clear
from retracing the derivation in section 6.2 that such a term is allowed
if a gauge-singlet chiral supermultiplet is added to the theory. In fact,
a linear term is absolutely necessary to achieve F-term breaking, since
otherwise setting all
i
= 0 will always give a supersymmetric global
minimum with all F
i
= 0. Without loss of generality, we can choose k,
m, and y to be real and positive (by a phase rotation of the elds). The
scalar potential following from eq. (9.6) is
V = [F
1
[
2
+[F
2
[
2
+[F
3
[
2
; (9.7)
F
1
= k
y
2
2
3
; F
2
= m
3
; F
3
= m
2
y
3
. (9.8)
Clearly, F
1
= 0 and F
2
= 0 are not compatible, so supersymmetry must
indeed be broken. If m
2
> yk (which we assume from now on), then it
is easy to show that the absolute minimum of the potential is at
2
=
3
= 0 with
1
undetermined, so F
1
= k and V = k
2
at the minimum
of the potential. The fact that
1
is undetermined is an example of
a at direction in the scalar potential; this is a common feature of
supersymmetric models.
2
2
More generally, at directions are non-compact lines and surfaces in the space of
scalar elds along which the scalar potential vanishes. The classical scalar potential
of the MSSM would have many at directions if supersymmetry were not broken.
If we presciently choose to expand V around
1
= 0, the mass spectrum
of the theory consists of 6 real scalars with tree-level squared masses
0, 0, m
2
, m
2
, m
2
yk, m
2
+yk. (9.9)
Meanwhile, there are 3 Weyl fermions with masses
0, m, m. (9.10)
The non-degeneracy of scalars and fermions is a clear sign that
supersymmetry has been spontaneously broken. The 0 eigenvalues in
eqs. (9.9) and (9.10) correspond to the complex scalar
1
and its fermionic
partner
1
. However,
1
and
1
have dierent reasons for being massless.
The masslessness of
1
corresponds to the existence of the at direction,
since any value of
1
gives the same energy at tree-level. This at
direction is an accidental feature of the classical scalar potential, and
in this case it is removed (lifted) by quantum corrections. This can be
seen by computing the Coleman-Weinberg one-loop eective potential.
After some calculation, one nds the result that the global minimum is
indeed xed at
1
=
2
=
3
= 0, with the complex scalar
1
receiving a
small positive-denite (mass)
2
equal to
m
2
1
=
1
32
2
__
ym
4
k
+y
3
k
_
ln
_
m
2
+yk
m
2
yk
_
+2y
2
m
2
_
ln[1
y
2
k
2
m
4
] 1
__
. (9.11)
[In the limit yk m
2
, this reduces to m
2
1
= y
4
k
2
/(48
2
m
2
).] In
contrast, the Weyl fermion
1
remains exactly massless because of a
general feature of all models with spontaneously broken supersymmetry.
To understand this, recall that the spontaneous breaking of any global
symmetry always gives rise to a massless Nambu-Goldstone mode with the
same quantum numbers as the broken symmetry generator. In the case of
supersymmetry, the broken generator is the fermionic charge Q
, so the
Nambu-Goldstone particle must be a massless neutral Weyl fermion called
the goldstino. In the ORaifeartaigh model example,
1
is the goldstino
because it is the fermionic partner of the auxiliary eld F
1
which got a
VEV. (We will prove these statements in a more general context in section
9.2.)
The ORaifeartaigh superpotential determines the mass scale of
supersymmetry breaking

F
1
in terms of a dimensionful parameter k
which is put in by hand. This is somewhat ad hoc, since

k will have
to be much less than M
P
in order to give the right order of magnitude
for the MSSM soft terms. We would like to have a mechanism which
can instead generate such scales naturally. This can be done in models
of dynamical supersymmetry breaking. In such theories, the small
(compared to M
P
) mass scales associated with supersymmetry breaking
arise by dimensional transmutation. In other words, they generally
feature a new asymptotically-free non-Abelian gauge symmetry with a
gauge coupling g which is perturbative at M
P
and which gets strong in the
infrared at some smaller scale e
8
2
/|b|g
2
0
M
P
, where g
0
is the running
gauge coupling at M
P
with beta function [b[g
3
/16
2
. Just as in QCD, it
is perfectly natural for to be many orders of magnitude below the Planck
scale. Supersymmetry breaking may then be best described in terms of
the eective dynamics of the strongly coupled theory. One possibility is
that the auxiliary F eld for a composite chiral supermultiplet (built out
of the fundamental elds which transform under the new strongly-coupled
gauge group) obtains a VEV. Constructing models which actually break
supersymmetry in an acceptable way is a highly non-trivial business; for
more information we refer the reader to Ref.[67]
The one thing that is now clear about spontaneous supersymmetry
breaking (dynamical or not) is that it requires us to extend the MSSM.
The ultimate supersymmetry-breaking order parameter cannot belong to
any of the supermultiplets of the MSSM; a D-term VEV for U(1)
Y
does
not lead to an acceptable spectrum, and there is no candidate gauge-
singlet whose F-term could develop a VEV. Therefore one must ask
what eects are responsible for spontaneous supersymmetry breaking,
and how supersymmetry breakdown is communicated to the MSSM
particles. It is very dicult to achieve the latter in a phenomenologically
viable way working only with renormalizable interactions at tree-level.
First, it is problematic to give masses to the MSSM gauginos, because
supersymmetry does not allow (scalar)-(gaugino)-(gaugino) couplings
which could turn into gaugino mass terms when the scalar gets a VEV.
Second, at least some of the MSSM squarks and sleptons would have to
be unacceptably light, and should have been discovered already. This can
be understood in a general way from the existence of a sum rule which
governs the tree-level squared masses of scalars and chiral fermions in
theories with spontaneous supersymmetry breaking:
Tr[M
2
real scalars
] = 2Tr[M
2
chiral fermions
]. (9.12)
If supersymmetry were not broken, then eq. (9.12) would follow
immediately from the degeneracy of complex scalars [with two real scalar
components, hence the factor of 2] and their Weyl fermion superpartners.
However, eq. (9.12) still holds at tree-level when supersymmetry is broken
spontaneously by F-terms and D-terms, as one can verify in general
by explicitly computing the (mass)
2
matrices for arbitrary values of the
(Hidden sector)
(Visible sector)
Supersymmetry
breaking origin
MSSM
Flavor-blind
interactions
Fig. 9.1. The presumed schematic structure for supersymmetry breaking.
elds.
3
One can easily see, for example, that with the ORaifeartaigh
spectrum of eqs. (9.9) and (9.10), the sum rule eq. (9.12) is indeed
satised. This sum rule seems to be bad news for a phenomenologically
viable model, because the masses of all of the MSSM chiral fermions are
already known to be small (except for the top quark and the higgsinos).
Even if we could succeed in evading this, there is no reason why the
resulting MSSM soft terms in this type of model should satisfy conditions
like eqs. (8.15) or (8.16).
For these reasons, we expect that the MSSM soft terms arise indirectly
or radiatively, rather than from tree-level renormalizable couplings to
the supersymmetry-breaking order parameters. Supersymmetry breaking
evidently occurs in a hidden sector of particles which have no (or only
very small) direct couplings to the visible sector chiral supermultiplets
of the MSSM. However, the two sectors do share some interactions which
are responsible for mediating supersymmetry breaking from the hidden
sector to the visible sector, where they appear as calculable soft terms.
(See Fig. 9.1.) In this scenario, the tree-level sum rule eq. (9.12) need
not hold for the visible sector elds, so that a phenomenologically viable
superpartner mass spectrum is in principle achievable. As a bonus, if the
mediating interactions are avor-blind, then the soft terms appearing in
the MSSM may automatically obey conditions like eqs. (8.15), (8.16) and
(8.17).
There are two main competing proposals for what the mediating
interactions might be. The rst (and historically the more popular) is
that they are gravitational. More precisely, they are associated with the
new physics, including gravity, which enters at the Planck scale. In this
gravity-mediated supersymmetry breaking scenario, if supersymmetry is
broken in the hidden sector by a VEV F), then the soft terms in the
visible sector should be roughly of order
m
soft

F)
M
P
, (9.13)
3
This assumes only that the trace of the U(1) charges over all chiral supermultiplets
in the theory vanishes (Tr[T
a
] = 0). This holds for U(1)Y in the MSSM and more
generally for any non-anomalous gauge symmetry.
by dimensional analysis. This is because we know that m
soft
must
vanish in the limit F) 0 where supersymmetry is unbroken, and
also in the limit M
P
(corresponding to G
Newton
0) in which
gravity becomes irrelevant. For m
soft
of order a few hundred GeV,
one would therefore expect that the scale associated with the origin of
supersymmetry breaking in the hidden sector should be roughly
_
F)
10
10
or 10
11
GeV. Another possibility is that the supersymmetry breaking
order parameter is a gaugino condensate 0[
a
b
[0) =
ab
3
,= 0. If the
composite eld
a
b
is part of an auxiliary eld F for some (perhaps
composite) chiral supereld, then by dimensional analysis we expect
supersymmetry breaking soft terms of order
m
soft

3
M
2
P
, (9.14)
with, eectively, F)
3
/M
P
. In that case, the scale associated with
dynamical supersymmetry breaking should be more like 10
13
GeV.
The second main possibility is that the avor-blind mediating inter-
actions for supersymmetry breaking are the ordinary electroweak and
QCD gauge interactions. In this gauge-mediated supersymmetry breaking
scenario, the MSSM soft terms arise from loop diagrams involving some
messenger particles. The messengers couple to a supersymmetry-breaking
VEV F), and also have SU(3)
C
SU(2)
L
U(1)
Y
interactions which
provide a link to the MSSM. Then, using dimensional analysis, one
estimates for the MSSM soft terms
m
soft

a
4
F)
M
mess
(9.15)
where the
a
/4 is a loop factor for Feynman diagrams involving gauge
interactions, and M
mess
is a characteristic scale of the masses of the
messenger elds. So if M
mess
and
_
F) are roughly comparable, then
the scale of supersymmetry breaking can be as low as about
_
F) 10
4
or 10
5
GeV (much lower than in the gravity-mediated case!) to give m
soft
of the right order of magnitude.
9.2 The goldstino and the gravitino
As explained in the previous section, the spontaneous breaking of global
supersymmetry implies the existence of a massless Weyl fermion, the
goldstino. In the particular case of the ORaifeartaigh model, the
goldstino was identied to be
1
. More generally, we might expect that
in the case of F-term or D-term breaking, the goldstino is the fermionic
component of the supermultiplet whose auxiliary eld obtains a VEV.
Let us make this more precise by actually proving that the goldstino
exists and, in the process, identifying it. This is actually rather easy.
Consider a general supersymmetric model with both gauge and chiral
supermultiplets as in section 6. The fermionic degrees of freedom consist
of gauginos (
a
) and chiral fermions (
i
). After some of the scalar elds
in the theory obtain VEVs, the fermion mass matrix will have the form:
M
fermion
=
_
0

2g
a
(
)T
a
)
i
2g
a
(
)T
a
)
j
W
ij
)
_
(9.16)
in the (
a
,
i
) basis. [The o-diagonal entries in this matrix come from
the second line in eq. (6.76), and the lower right entry can be seen in
eq. (6.50).] Now we simply note that M
fermion
annihilates the vector
G =
_
D
a
)/
2
F
i
)
_
. (9.17)
The rst row of M
fermion
annihilates

G by virtue of the requirement
eq. (6.77) that the superpotential is gauge invariant, and the second
row annihilates

G because of the condition V/
i
) = 0 which must be
satised at the minimum of the scalar potential. Eq. (9.17) is proportional
to the goldstino wavefunction; it is non-trivial if and only if at least one
of the auxiliary elds has a VEV, breaking supersymmetry. So we have
proven that if global supersymmetry is spontaneously broken, then the
goldstino exists and has zero mass, and that its components among the
various fermions in the theory are just proportional to the corresponding
auxiliary eld VEVs.
We can derive another very important property of the goldstino by
considering the form of the conserved supercurrent eq. (6.80). Suppose
for simplicity
4
that the non-vanishing auxiliary eld VEV is F) and
that its goldstino superpartner is

G. Then the supercurrent conservation
equation tells us that
0 =
= iF)(
+. . . (9.18)
where j
is the part of the supercurrent which involves all of the other

supermultiplets, and the ellipses represent other contributions of the
goldstino supermultiplet to
which we can ignore. [The rst term

in eq. (9.18) comes from the second term in eq. (6.80), using the equation
4
More generally, if supersymmetry is spontaneously broken by VEVs for several
auxiliary elds Fi and D
a
, then one should make the replacement F (
P
i
|Fi|
2
+
1
2
P
a
D
a
2
)
1/2
everywhere in the following.
(a)
G
A
(b)
Fig. 9.2. Goldstino/gravitino interactions with superpartner pairs (, ) and
(
a
, A
a
).
of motion F
i
= W
i
for the goldstinos auxiliary eld.] This equation of
motion for the goldstino eld allows us to write an eective Lagrangian
L
goldstino
= i
G
1
F)
(
+ c.c.) (9.19)
which describes the interactions of the goldstino with all of the other
fermion-boson pairs. In particular, since j
= (
i
)
(1/2
2)
a
F
a
+. . ., there are goldstino-scalar-chiral fermion and

goldstino-gaugino-gauge boson vertices as shown in Fig. 9.2. Since
this derivation depends only on supercurrent conservation, eq. (9.19)
holds independently of the details of how supersymmetry breaking is
communicated from F) to the MSSM sector elds (
i
,
i
) and (
a
, A
a
).
It may appear strange at rst that the interaction terms in eq. (9.19)
get larger as F) goes to zero. However, the interaction term

G
contains two derivatives which turn out to always give a kinematic factor
proportional to the (mass)
2
dierence of the superpartners when they are
on-shell, i.e. m
2
i
m
2
i
and m
2
m
2
A
for Figs. 9.2a and 9.2b respectively.
These can be non-zero only by virtue of supersymmetry breaking, so they
must also vanish as F) 0, and the interaction is well-dened in that
limit. Nevertheless, for xed values of m
2
i
m
2
i
and m
2
m
2
A
, the
interaction term in eq. (9.19) can be phenomenologically important if F)
is not too large.
The above remarks apply to the breaking of global supersymmetry.
However, when one takes into account gravity, supersymmetry must be
a local symmetry. This means that the spinor parameter
which rst
appeared in section 6.1 is no longer a constant, but can vary from point
to point in spacetime. The resulting locally supersymmetric theory is
called supergravity. It necessarily unies the spacetime symmetries of
ordinary general relativity with local supersymmetry transformations. In
supergravity, the spin-2 graviton has a spin-3/2 fermion superpartner
called the gravitino, which we will denote

. The gravitino has odd

R-parity (P
R
= 1), as can be seen from the denition eq. (8.11). It
carries both a vector index () and a spinor index (), and transforms
inhomogeneously under local supersymmetry transformations:
+. . . (9.20)
Thus the gravitino should be thought of as the gauge particle of
local supersymmetry transformations [compare eq. (6.56)]. As long as
supersymmetry is unbroken, the graviton and the gravitino are both
massless, each with two spin helicity states. Once supersymmetry
is spontaneously broken, the gravitino acquires a mass by absorbing
(eating) the goldstino, which becomes its longitudinal (helicity 1/2)
components. This is called the super-Higgs mechanism. It is entirely
analogous to the ordinary Higgs mechanism for gauge theories, by
which the W
and Z
0
gauge bosons in the Standard Model gain
mass by absorbing the Nambu-Goldstone bosons associated with the
spontaneously broken electroweak gauge invariance. The counting works,
because the massive spin-3/2 gravitino now has four helicity states, of
which two were originally assigned to the would-be goldstino. The
gravitino mass is traditionally called m
3/2
, and in the case of F-term
breaking can be estimated as
m
3/2

F)
M
P
, (9.21)
This follows simply from dimensional analysis, since m
3/2
must vanish in
the limits that supersymmetry is restored (F) 0) and that gravity
is turned o (M
P
). Equation (9.21) means that one has very
dierent expectations for the mass of the gravitino in gravity-mediated
and in gauge-mediated models, because they usually make very dierent
predictions for F).
In the gravity-mediated supersymmetry breaking case, the gravitino
mass is comparable to the masses of the MSSM sparticles [compare
eqs. (9.13) and (9.21)]. Therefore m
3/2
is expected to be at least 100 GeV
or so. Its interactions will be of gravitational strength, so the gravitino
will not play any role in collider physics, but it can be a very important
consideration in cosmology. If it is the LSP, then it is stable and its
primordial density could easily exceed the critical density, causing the
universe to become matter-dominated too early. Even if it is not the
LSP, the gravitino can cause problems unless its density is diluted by
ination at late times, or it decays suciently rapidly.
In contrast, gauge-mediated supersymmetry breaking models predict
that the gravitino is much lighter than the MSSM sparticles as long
as M
mess
M
P
. This can be seen by comparing eqs. (9.15) and
(9.21). The gravitino is almost certainly the LSP in this case, and
all of the MSSM sparticles will eventually decay into nal states that
include it. Naively, one might expect that these decays are extremely
slow. However, this is not necessarily true, because the gravitino inherits
the non-gravitational interactions of the goldstino it has absorbed. This
means that the gravitino, or more precisely its longitudinal (goldstino)
components, can play an important role in collider physics experiments.
The mass of the gravitino can generally be ignored for kinematic purposes,
as can its transverse (helicity 3/2) components which really do have
only gravitational interactions. Therefore in collider phenomenology
discussions one may interchangeably use the same symbol

G for the
goldstino and for the gravitino of which it is the longitudinal (helicity
1/2) part. By using the eective Lagrangian eq. (9.19), one can compute
that the decay rate of any sparticle

X into its Standard Model partner X
plus a gravitino/goldstino

G is given by
(
X X
G) =
m
5
e
X
16F)
2
_
1
m
2
X
m
2
e
X
_
4
. (9.22)
This corresponds to either Fig. 9.2a or 9.2b, with (
X, X) = (, ) or
(, A) respectively. One factor (1 m
2
X
/m
2
e
X
)
2
came from the derivatives
in the interaction term in eq. (9.19) evaluated for on-shell nal states, and
another such factor comes from the kinematic phase space integral with
m
3/2
m
e
X
, m
X
.
If the supermultiplet containing the goldstino and F) has canonically-
normalized kinetic terms, and one requires the tree-level vacuum energy
to vanish, then the estimate eq. (9.21) may be sharpened to
m
3/2
=
F)
3M
P
. (9.23)
In that case, one can rewrite eq. (9.22) as
(
X X
G) =
m
5
e
X
48M
2
P
m
2
3/2
_
1
m
2
X
m
2
e
X
_
4
, (9.24)
and this is how the formula is sometimes presented by those who prefer to
take eq. (9.23) seriously. Note that the decay width is larger for smaller
F), or equivalently for smaller m
3/2
, if the other masses are xed. If
X is a mixture of superpartners of dierent Standard Model particles X,

then eq. (9.22) should be multiplied by a suppression factor equal to the
square of the cosine of the appropriate mixing angle. If m
e
X
is of order 100
GeV or more, and
_
F)
<
few10
6
GeV [corresponding to m
3/2
less
than roughly 1 keV according to eq. (9.23)], then the decay

X X
G can
9.3 Gravity-mediated supersymmetry breaking models 217
occur quickly enough to be observed in a modern collider detector. This
gives rise to some very interesting phenomenological signatures, which we
will discuss further in a later Chapter.
We now turn to a slightly more systematic analysis of the way in which
the MSSM soft terms arise, considering in turn the gravity-mediated and
gauge-mediated scenarios.
9.3 Gravity-mediated supersymmetry breaking models
The dening feature of these models is that the hidden sector of the
theory communicates with our MSSM only (or dominantly) through
gravitational-strength interactions. In an eective eld theory format,
this means that the supergravity Lagrangian contains nonrenormalizable
terms which communicate between the two sectors and which are
suppressed by powers of the Planck mass, since the gravitational coupling
is proportional to 1/M
P
. These will include
L
NR
=
1
M
P
F
X
a
1
2
f
a
a
+ c.c.
1
M
2
P
F
X
F
X
k
i
j
1
M
P
F
X
(
1
6
y
ijk
k
+
1
2
ij
j
) + c.c. (9.25)
where F
X
is the auxiliary eld for a chiral supermultiplet X in the
hidden sector, and
i
and
a
are the scalar and gaugino elds in the
MSSM. By themselves, the terms in eq. (9.25) are not supersymmetric,
but it is possible to show that they are part of a nonrenormalizable
supersymmetric Lagrangian (see Appendix) which contains other terms
that we may ignore. Now if one assumes that F
X
) 10
10
or 10
11
GeV,
then L
NR
will give us nothing other than a Lagrangian of the form L
soft
in eq. (6.82), with MSSM soft terms of order a few hundred GeV. [Note
that terms of the form L
maybe soft
in eq. (6.83) do not arise.]
The dimensionless parameters f
a
, k
i
j
, y
ijk
and
ij
in L
NR
are to be
determined by the underlying theory. This is a dicult enterprise in
general, but a dramatic simplication occurs if one assumes a minimal
form for the normalization of kinetic terms and gauge interactions in the
full, nonrenormalizable supergravity Lagrangian (see Appendix). In that
case, one nds that there is a common f
a
= f for the three gauginos; k
i
j
=
k
i
j
is the same for all scalars; and the other couplings are proportional
to the corresponding superpotential parameters, so that y
ijk
= y
ijk
and
ij
=
ij
with universal dimensionless constants and . Then one
nds that the soft terms in L
MSSM
soft
can all be written in terms of just four
parameters:
m
1/2
= f
F
X
)
M
P
; m
2
0
= k
[F
X
)[
2
M
2
P
; A
0
=
F
X
)
M
P
; B
0
=
F
X
)
M
P
. (9.26)
In terms of these, one can write for the parameters appearing in eq. (8.12):
M
3
= M
2
= M
1
= m
1/2
; (9.27)
m
2
Q
= m
2
u
= m
2
d
= m
2
L
= m
2
e
= m
2
0
1; m
2
Hu
= m
2
H
d
= m
2
0
; (9.28)
a
u
= A
0
y
u
; a
d
= A
0
y
d
; a
e
= A
0
y
e
; (9.29)
b = B
0
. (9.30)
It is a matter of some controversy whether the assumptions going into
this parameterization are completely well-motivated on purely theoretical
grounds,
5
but from a phenomenological perspective they are clearly very
nice. This framework successfully evades the most dangerous types
of FCNC and CP-violation as discussed in section 8.4. In particular,
eqs. (9.28) and (9.29) are just stronger versions of eqs. (8.15) and (8.16),
respectively. If m
1/2
, A
0
and B
0
all have the same complex phase, then
eq. (8.17) will also be satised.
Equations (9.27)-(9.30) also have the virtue of being highly predictive.
[Of course, eq. (9.30) is content-free unless one can relate B
0
to the other
parameters in some non-trivial way.] As discussed in section 5.4, they
should be applied as RG boundary conditions at the scale M
P
. The
RG evolution of the soft parameters down to the electroweak scale will
then allow us to predict the entire MSSM spectrum in terms of just ve
parameters m
1/2
, m
2
0
, A
0
, B
0
, and (plus the already-measured gauge
and Yukawa couplings of the MSSM). In practice, the approximation
is usually made of starting this RG running from the unication scale
M
U
2 10
16
GeV instead of M
P
. The reason for this is that the
apparent unication of gauge couplings gives us a strong hint that we
know something about how the RG equations are behaving up to M
U
,
but gives us little guidance about what to expect at scales between M
U
and M
P
. The error made in neglecting these eects is proportional to a
loop suppression factor times ln(M
P
/M
U
) and can be partially absorbed
into a redenition of m
2
0
, m
1/2
, A
0
and B
0
, but in some cases can
lead to important eects. The framework described in the above few
paragraphs has been the subject of the bulk of phenomenological studies
of supersymmetry. It is sometimes referred to as the minimal supergravity
or supergravity-inspired scenario for the soft terms.
5
The familiar avor-blindness of gravitational interactions expressed in Einsteins
equivalence principle does not, by itself, tell us anything about the form of eq. (9.25).
Particular models of gravity-mediated supersymmetry breaking can be
even more predictive, relating some of the parameters m
1/2
, m
2
0
, A
0
and
B
0
to each other and to the mass of the gravitino m
3/2
. For example,
three popular kinds of models for the soft terms are:
Dilaton-dominated: m
2
0
= m
2
3/2
; m
1/2
= A
0
=
3m
3/2
.
Polonyi: m
2
0
= m
2
3/2
; A
0
= (3
3)m
3/2
; m
1/2
= O(m
3/2
).
No-scale: m
1/2
m
0
, A
0
, m
3/2
.
The dilaton-dominated scenario arises in a particular limit of super-
string theory. While it appears to be highly predictive, it can easily
be generalized in other limits. The Polonyi model has the advantage
of being the simplest possible model for supersymmetry breaking in the
hidden sector, but it is rather ad hoc and does not seem to have a special
place in grander schemes like superstrings. The no-scale limit may
arise in a low-energy limit of superstrings in which the gravitino mass
scale is undetermined at tree-level (hence the name). It implies that only
the gaugino masses are appreciable at M
P
. As we will see in section
10.1, RG evolution feeds m
1/2
into the squark, slepton and Higgs (mass)
2
parameters with sucient magnitude to give acceptable phenomenology
at the electroweak scale. More recent versions of the no-scale scenario,
however, also can give signicant A
0
and m
2
0
at M
P
. In many cases B
0
can also be predicted in terms of the other parameters, but this is quite
sensitive to model assumptions. For phenomenological studies, m
1/2
, m
2
0
,
A
0
and B
0
are usually just taken to be convenient independent parameters
of our ignorance of the supersymmetry breaking mechanism.
9.4 Gauge-mediated supersymmetry breaking models
A strong alternative to the scenario described in the previous section
is provided by the gauge-mediated supersymmetry breaking proposal.
The basic idea is to introduce some new chiral supermultiplets, called
messengers, which couple to the ultimate source of supersymmetry
breaking, and which also couple indirectly to the (s)quarks and (s)leptons
and Higgs(inos) of the MSSM through the ordinary SU(3)
C
SU(2)
L
U(1)
Y
gauge boson and gaugino interactions. In this way, the ordinary
gauge interactions, rather than gravity, are responsible for the appearance
of soft terms in the MSSM. There is still gravitational communication
between the MSSM and the source of supersymmetry breaking, of course,
but that eect is now relatively unimportant compared to the gauge
interaction eects.
In the simplest such model, the messenger elds are a set of chiral
supermultiplets q, q, , which transform under SU(3)
C
SU(2)
L
U(1)
Y
as
q (3, 1,
1
3
); q (3, 1,
1
3
); (1, 2,
1
2
); (1, 2,
1
2
).(9.31)
These supermultiplets contain messenger quarks
q
,
q
and scalar quarks
q, q and messenger leptons
and scalar leptons , . All of these

particles must get very large masses so as not to have been discovered
already. They manage to do so by coupling to a gauge-singlet chiral
supermultiplet S through a superpotential:
W
mess
= y
2
S +y
3
Sqq. (9.32)
The scalar component of S and its auxiliary (F-term) component are each
supposed to acquire VEVs, denoted S) and F
S
) respectively. This can
be accomplished either by putting S into an ORaifeartaigh-type model, or
by a dynamical mechanism. Exactly how this happens is a very interesting
and important question. Here, we will simply parameterize our ignorance
of the precise mechanism of supersymmetry breaking by asserting that S
participates in another part of the superpotential, call it W
breaking
, which
provides for supersymmetry breakdown.
Let us now consider the mass spectrum of the messenger fermions and
bosons. The messenger part of the superpotential now eectively becomes
W
mess
= y
2
S) +y
3
S)qq. So, the fermionic messenger elds pair up to
get mass terms:
L = (y
2
S)
+y
3
S)
q
q
+ c.c.) (9.33)
as in eq. (6.52). Meanwhile, their scalar messenger partners , and qq
have a scalar potential given by (neglecting D-term contributions, which
do not aect the following discussion):
V =
W
mess
2
+
W
mess
2
+
W
mess
q
2
+
W
mess
q
2
+
W
mess
S
+
W
breaking
S
2
(9.34)
as in eq. (6.51). Now, using the supposition that
W
breaking
/S) = F
S
) (9.35)
(with W
mess
/S) = 0), and replacing S and F
S
by their VEVs, one nds
quadratic mass terms in the potential for the messenger scalar leptons:
V = [y
2
S)[
2
_
[[
2
+[[
2
_
+[y
3
S)[
2
_
[q[
2
+[q[
2
_
_
y
2
F
S
) +y
3
F
S
)qq + c.c.
_
+ quartic terms. (9.36)
S
F
S

B, W, g
Fig. 9.3. Contributions to the MSSM gaugino masses in gauge-mediated
supersymmetry breaking models arise from one-loop graphs involving virtual
messenger particles.
The rst line in eq. (9.36) represents supersymmetric mass terms that go
along with eq. (9.33), while the second line consists of soft supersymmetry-
breaking masses. The complex scalar messengers , thus obtain a (mass)
2
matrix equal to:
_
[y
2
S)[
2
y
2
F
S
)
y
2
F
S
) [y
2
S)[
2
_
(9.37)
with squared mass eigenvalues [y
2
S)[
2
[y
2
F
S
)[. In just the same way,
the scalars q, q get squared masses [y
3
S)[
2
[y
3
F
S
)[.
So far, we have found that the eect of supersymmetry breaking is to
split each messenger supermultiplet pair apart:
, : m
2
fermions
= [y
2
S)[
2
, m
2
scalars
= [y
2
S)[
2
[y
2
F
S
)[, (9.38)
q, q : m
2
fermions
= [y
3
S)[
2
, m
2
scalars
= [y
3
S)[
2
[y
3
F
S
)[. (9.39)
The supersymmetry violation apparent in this messenger spectrum for
F
S
) ,= 0 is communicated to the MSSM sparticles through radiative
quantum corrections. The MSSM gauginos obtain masses from the 1-
loop graph shown in Fig. 9.3. The scalar and fermion lines in the loop
are messenger elds. Recall that the interaction vertices in Fig. 9.3
are of gauge coupling strength even though they do not involve gauge
bosons; compare Fig. 6.3g. In this way, gauge-mediation provides that
q, q messenger loops give masses to the gluino and the bino, and ,
messenger loops give masses to the wino and bino elds. By computing
the 1-loop diagrams one nds that the resulting MSSM gaugino masses
are given by
M
a
=

a
4
, (a = 1, 2, 3), (9.40)
(in the normalization discussed in section 8.4) where we have introduced
a mass parameter
F
S
)/S) . (9.41)
Fig. 9.4. Contributions to MSSM scalar squared masses in gauge-mediated
supersymmetry breaking models arise in leading order from these two-loop
Feynman graphs.
(Note that if F
S
) were 0, then = 0 and the messenger scalars
would be degenerate with their fermionic superpartners and there would
be no contribution to the MSSM gaugino masses.) In contrast, the
corresponding MSSM gauge bosons cannot get a corresponding mass shift,
since they are protected by gauge invariance. So supersymmetry breaking
has been successfully communicated to the MSSM (visible sector). To
a good approximation, eq. (9.40) holds for the running gaugino masses at
an RG scale Q
0
corresponding to the average characteristic mass of the
heavy messenger particles, roughly of order M
mess
y
i
S). The running
mass parameters can then be RG-evolved down to the electroweak scale
to predict the physical masses to be measured by future experiments.
The scalars of the MSSM do not get any radiative corrections to their
masses at one-loop order. The leading contribution to their masses
comes from the two-loop graphs shown in Fig. 9.4, with the messenger
fermions (heavy solid lines) and messenger scalars (heavy dashed lines)
and ordinary gauge bosons and gauginos running around the loops. By
computing these graphs, one nds that each MSSM scalar gets a (mass)
2
given by:
m
2
= 2
2
_
_
3
4
_
2
C
3
+
_
2
4
_
2
C
2
+
_
1
4
_
2
C
1
_
. (9.42)
Here C
a
are the quadratic Casimir group theory invariants for the scalar
for each gauge group. They are dened by C
j
i
= (T
a
T
a
)
j
i
where the
T
a
are the group generators which act on the scalar . Explicitly, they
are:
C
3
=
4/3 for =

Q
i
,
u
i
,
d
i
;
0 for =

L
i
,
e
i
, H
u
, H
d
(9.43)
C
2
=
3/4 for =

Q
i
,
L
i
, H
u
, H
d
;
0 for =
u
i
,
d
i
,
e
i
(9.44)
C
1
= 3Y
2
/5 for each with weak hypercharge Y
. (9.45)
The squared masses in eq. (9.42) are positive (fortunately!).
The terms a
u
, a
d
, a
e
arise rst at two-loop order, and are suppressed
by an extra factor of
a
/(4) compared to the gaugino masses. So, to a
very good approximation one has, at the messenger scale,
a
u
= a
d
= a
e
= 0, (9.46)
a signicantly stronger condition than eq. (8.16). Again, eqs. (9.42) and
(9.46) should be applied at an RG scale equal to the average mass of
the messenger elds running in the loops. However, after evolving the
RG equations down to the electroweak scale, non-zero a
u
, a
d
and a
e
are
generated proportional to the corresponding Yukawa matrices and the
non-zero gaugino masses, as we will see in section 10.1. These will only
be large for the third family squarks and sleptons, in the approximation of
eq. (8.2). The parameter b may also be taken to vanish near the messenger
scale, but this is quite model-dependent, and in any case b will be non-
zero when it is RG-evolved to the electroweak scale. In practice, b is
determined by the requirement of correct electroweak symmetry breaking,
as discussed below in section 8.5.
Because the gaugino masses arise at one-loop order and the scalar
(mass)
2
contributions appear at two-loop order, both eq. (9.40) and (9.42)
correspond to the estimate eq. (9.15) for m
soft
, with M
mess
y
i
S).
Equations (9.40) and (9.42) hold in the limit of small F
S
)/y
i
S)
2
,
corresponding to mass splittings within each messenger supermultiplet
that are small compared to the overall messenger mass scale. The
subleading corrections in an expansion in F
S
)/y
i
S)
2
turn out to be quite
small unless there are very large hierarchies in the messenger sector.
The model we have described so far is often called the minimal model
of gauge-mediated supersymmetry breaking. Let us now generalize it
to a more complicated messenger sector. Suppose that q, q and , are
replaced by a collection of messengers
i
,
i
with a superpotential
W
mess
=
i
y
i
S
i
i
. (9.47)
The bar means that the chiral superelds
i
transform as the complex
conjugate representations of the
i
chiral superelds. Together they are
said to form a vector-like (real) representation of the Standard Model
gauge group. As before, the fermionic components of each pair
i
and
i
pair up to get squared masses y
i
S) and their scalar partners mix to get
squared masses [y
i
S)[
2
[y
i
F
S
)[. The MSSM gaugino mass parameters
induced are now
M
a
=

a
4
i
n
a
(i) (a = 1, 2, 3) (9.48)
where n
a
(i) is the Dynkin index for each
i
+
i
, in a normalization
where n
3
= 1 for a 3 + 3 of SU(3)
C
and n
2
= 1 for a pair of doublets of
SU(2)
L
. For U(1)
Y
, one has n
1
= 6Y
2
/5 for each messenger pair with
weak hypercharges Y . In computing n
1
one must remember to add up
the contributions for each component of an SU(3)
C
or SU(2)
L
multiplet.
So, for example, (n
1
, n
2
, n
3
) = (2/5, 0, 1) for q + q and (n
1
, n
2
, n
3
) =
(3/5, 1, 0) for + . Thus the total is

i
(n
1
, n
2
, n
3
) = (1, 1, 1) for the
minimal model, so that eq. (9.48) is in agreement with eq. (9.40). On
general group-theoretic grounds, n
2
and n
3
must be integers, and n
1
is
always an integer multiple of 1/5 if fractional electric charges are conned.
The MSSM scalar masses in this generalized gauge-mediation frame-
work are now:
m
2
= 2
2
__
3
4
_
2
C
i
n
3
(i) +
_
2
4
_
2
C
i
n
2
(i)
+
_
1
4
_
2
C
i
n
1
(i)
_
. (9.49)
In writing eqs. (9.48) and (9.49) as simple sums, we have implicitly
assumed that the messengers are all approximately equal in mass, with
M
mess
y
i
S). (9.50)
This is a good approximation if the y
i
are not too dierent from each
other, because the dependence of the MSSM mass spectrum on the
y
i
is only logarithmic (due to RG running) for xed . However, if
large hierarchies in the messenger masses are present, then the additive
contributions to the gaugino and scalar masses from each individual
messenger multiplet i should really instead be incorporated at the mass
scale of that messenger multiplet. Then RG evolution is used to run these
various contributions down to the electroweak or TeV scale; the individual
messenger contributions to scalar and gaugino masses as indicated above
can be thought of as threshold corrections to this RG running.
Messengers with masses far below the GUT scale will aect the running
of gauge couplings and might therefore be expected to ruin the apparent
unication shown in Fig. 8.7. However, if the messengers come in complete
multiplets of the SU(5) global symmetry
6
that contains the Standard
Model gauge group and are not very dierent in mass, then approximate
unication of gauge couplings will still occur when they are extrapolated
up to the same scale M
U
(but with a larger unied value for the gauge
couplings at that scale). For this reason, a popular class of models is
obtained by taking the messengers to consist of N
5
copies of the 5 +5 of
SU(5), resulting in
N
5
=
i
n
1
(i) =
i
n
2
(i) =
i
n
3
(i) . (9.51)
In terms of this integer parameter N
5
, eqs. (9.48) and (9.49) reduce to
M
a
=

a
4
N
5
(9.52)
m
2
= 2
2
N
5
3
a=1
C
a
_
a
4
_
2
, (9.53)
since now there are N
5
copies of the minimal messenger sector particles
running around the loops. For example, the minimal model in eq. (9.31)
corresponds to N
5
= 1. A single copy of 10 + 10 of SU(5) has
Dynkin indices

i
n
a
(i) = 3, and so can be substituted for 3 copies of
5+5. (Other combinations of messenger multiplets can also preserve the
apparent unication of gauge couplings.) Note that the gaugino masses
scale like N
5
, while the scalar masses scale like

N
5
. This means that
sleptons and squarks will tend to be relatively lighter for larger values of
N
5
in non-minimal models. However, if N
5
is too large, then the running
gauge couplings will diverge before they can unify at M
U
. For messenger
masses of order 10
6
GeV or less, for example, one needs N
5
4.
There are many other possible generalizations of the basic gauge-
mediation scenario as described above. An important general expectation
in these models is that the strongly-interacting sparticles (squarks, gluino)
should be heavier than weakly-interacting sparticles (sleptons, bino,
winos, higgsinos) simply because of the hierarchy of gauge couplings
3
>
2
>
1
. The common feature which makes all of these models very
attractive is that the masses of the squarks and sleptons depend only on
their gauge quantum numbers, leading automatically to the degeneracy
6
This SU(5) symmetry may or may not be promoted to a local gauge symmetry at
the GUT scale. For our present purposes, it is used simply as a classication scheme,
since the global SU(5) symmetry is only approximate below the GUT scale at the
messenger mass scale where gauge mediation takes place.
of squark and slepton masses needed for suppression of FCNC eects.
But the most distinctive phenomenological prediction of gauge-mediated
models may be the fact that the gravitino is the LSP. This can have
crucial consequences for both cosmology and collider physics, as we will
discuss further in a later Chapter.
10
From model assumptions to the
low-energy MSSM
10.1 Renormalization Group Equations
In order to translate a set of predictions at the input scale into
physically meaningful quantities which describe physics at the electroweak
scale, it is necessary to evolve the gauge couplings, superpotential
parameters, and soft terms using the RG equations. As a technical
aside, we note that when computing RG eects and other radiative
corrections in supersymmetry, it is important to choose regularization
and renormalization schemes that do not violate supersymmetry. The
most popular regularization method for discussing radiative corrections
within the Standard Model is dimensional regularization (DREG), in
which the number of spacetime dimensions is continued to d = 4
2. Unfortunately, DREG violates supersymmetry explicitly because it
introduces a mismatch between the numbers of gauge boson degrees of
freedom and the gaugino degrees of freedom o-shell. This mismatch
is only 2, but can be multiplied by factors up to 1/
n
in an n-loop
calculation. In DREG, supersymmetric relations between dimensionless
coupling constants (supersymmetric Ward identities) are therefore
disrespected by radiative corrections involving the nite parts of one-
loop graphs and by the divergent parts of two-loop graphs. Instead,
one may use the slightly dierent scheme known as regularization by
dimensional reduction, or DRED, which does respect supersymmetry.[85]
In the DRED method, all momentum integrals are still performed in
d = 42 dimensions, but the vector index on the gauge boson elds A
a
now runs over all 4 dimensions. Running couplings are then renormalized
using DRED with modied minimal subtraction (DR) rather than the
usual DREG with modied minimal subtraction (MS). In particular,
the boundary conditions at the input scale should be applied in the
supersymmetry-preserving DR scheme. (See Ref.[86] for an alternative
227
228 10 From model assumptions to the low-energy MSSM
supersymmetric scheme.) One loop -functions are always the same in
the two schemes, but it is important to realize that the MS scheme does
violate supersymmetry, so that DR is preferred
2
from that point of view.
(It is also possible to work consistently within the MS scheme, as long
as one is careful to correctly translate all DR couplings and masses into
their MS counterparts.[90, 91])
The MSSM RG equations in the DR scheme are given in Refs.[92]
[95];
they are now known for the gauge couplings and superpotential
parameters up to 3-loop order, and for the soft parameters at 2-loop order.
However, for many purposes including pedagogical ones it suces to work
in the 1-loop approximation. Here, we will also use the approximation
that only the third family Yukawa couplings are signicant; see eq. (8.2).
Then the superpotential parameters run with scale according to:
d
dt
y
t
=
y
t
16
2
_
6[y
t
[
2
+[y
b
[
2
16
3
g
2
3
3g
2
2

13
15
g
2
1
_
; (10.1)
d
dt
y
b
=
y
b
16
2
_
6[y
b
[
2
+[y
t
[
2
+[y
[
2
16
3
g
2
3
3g
2
2

7
15
g
2
1
_
; (10.2)
d
dt
y
=
y
16
2
_
4[y
[
2
+ 3[y
b
[
2
3g
2
2

9
5
g
2
1
_
; (10.3)
d
dt
=

16
2
_
3[y
t
[
2
+ 3[y
b
[
2
+[y
[
2
3g
2
2

3
5
g
2
1
_
. (10.4)
The one-loop RG equations for the gauge couplings g
1
, g
2
, g
3
have already
been listed in eq. (8.18). Note that the -functions (the quantities on
the right side of each equation) for each supersymmetric parameter are
proportional to the parameter itself. This is actually a consequence of a
general and powerful result known as the supersymmetric nonrenormaliza-
tion theorem.[96] This theorem implies that the logarithmically divergent
contributions to a given process can always be written in the form of a
wave-function renormalization, without any vertex renormalization.
3
It
is true for any supersymmetric theory, not just the MSSM, and holds
to all orders in perturbation theory. It can be proved most easily using
supereld techniques. In particular, it means that once we have a theory
which can explain why is of order 10
2
or 10
3
GeV at tree-level, we do
not have to worry about being infected (made very large) by radiative
2
Even the DRED scheme may not provide a supersymmetric regulator, because of
ambiguities which appear at ve-loop order at the latest.[87] Fortunately, this does
not seem to cause any practical diculties.[88] See also Ref.[89] for a promising
proposal which avoids doing violence to the number of spacetime dimensions.
3
Actually, there is vertex renormalization in the eld theory in which auxiliary elds
have been integrated out, but the sum of divergent contributions for a given process
always has the form of wave-function renormalization. See Ref.[25] for a discussion
of this point.
corrections involving the masses of some very heavy unknown particles;
all such RG corrections to will be directly proportional to itself.
The one-loop RG equations for the three gaugino mass parameters in
the MSSM are determined by the same quantities b
MSSM
a
which appear in
the gauge coupling RG eqs. (8.18):
d
dt
M
a
=
1
8
2
b
a
g
2
a
M
a
(b
a
= 33/5, 1, 3) (10.5)
for a = 1, 2, 3. It is therefore easy to show that the three ratios M
a
/g
2
a
are
each constant (RG-scale independent) up to small two-loop corrections.
In minimal supergravity models, we can therefore write
M
a
(Q) =
g
2
a
(Q)
g
2
a
(Q
0
)
m
1/2
(a = 1, 2, 3) (10.6)
at any RG scale Q < Q
0
, where Q
0
is the input scale which is presumably
nearly equal to M
P
. Since the gauge couplings are observed to unify at
M
U
0.01M
P
, one expects
4
that g
2
1
(Q
0
) g
2
2
(Q
0
) g
2
3
(Q
0
). Therefore,
one nds that
M
1
g
2
1
=
M
2
g
2
2
=
M
3
g
2
3
(10.7)
at any RG scale, up to small two-loop eects and possibly larger threshold
eects near M
U
and M
P
. The common value in eq. (10.7) is also equal to
m
1/2
/g
2
U
in minimal supergravity models, where g
U
is the unied gauge
coupling at the input scale where m
1/2
is the common gaugino mass.
Interestingly, eq. (10.7) is also the solution to the one-loop RG equations
in the case of the gauge-mediated boundary conditions eq. (9.40) applied
at the messenger mass scale. This is true even though there is no such
thing as a unied gaugino mass m
1/2
in the gauge-mediated case, because
of the fact that the gaugino masses are proportional to the g
2
a
times a
constant. So eq. (10.7) is theoretically well-motivated (but certainly not
inevitable) in both frameworks. The prediction eq. (10.7) is particularly
useful since the gauge couplings g
2
1
, g
2
2
, and g
2
3
are already quite well
known at the electroweak scale from experiment. Therefore they can be
extrapolated up to at least M
U
, assuming that the apparent unication
of gauge couplings is not a fake. The gaugino mass parameters feed into
the RG equations for all of the other soft terms, as we will see.
Next we consider the 1-loop RG equations for the analytic soft
parameters a
u
, a
d
, a
e
. In models obeying eq. (8.16), these matrices start
4
In a GUT model, it is automatic that the gauge couplings and gaugino masses are
unied at all scales Q > MU and in particular at Q MP , because in the unied
theory the gauginos all live in the same representation of the unied gauge group. In
many superstring models, this is also known to be a good approximation.
o proportional to the corresponding Yukawa couplings at the input scale,
and the RG evolution respects this property. With the approximation of
eq. (8.2), one can therefore also write, at any RG scale,
a
u

0 0 0
0 0 0
0 0 a
t
; a
d

0 0 0
0 0 0
0 0 a
b
; a
e

0 0 0
0 0 0
0 0 a
,(10.8)
which denes
5
running parameters a
t
, a
b
, and a
. The RG equations for

these parameters and b are given by
16
2
d
dt
a
t
= a
t
_
18[y
t
[
2
+[y
b
[
2
16
3
g
2
3
3g
2
2

13
15
g
2
1
_
+ 2a
b
y
b
y
t
+y
t
_
32
3
g
2
3
M
3
+ 6g
2
2
M
2
+
26
15
g
2
1
M
1
_
; (10.9)
16
2
d
dt
a
b
= a
b
_
18[y
b
[
2
+[y
t
[
2
+[y
[
2
16
3
g
2
3
3g
2
2
7
15
g
2
1
_
+ 2a
t
y
t
y
b
+2a
y
b
+y
b
_
32
3
g
2
3
M
3
+ 6g
2
2
M
2
+
14
15
g
2
1
M
1
_
; (10.10)
16
2
d
dt
a
= a
_
12[y
[
2
+ 3[y
b
[
2
3g
2
2
9
5
g
2
1
_
+ 6a
b
y
b
y
+y
_
6g
2
2
M
2
+
18
5
g
2
1
M
1
_
; (10.11)
16
2
d
dt
b = b
_
3[y
t
[
2
+ 3[y
b
[
2
+[y
[
2
3g
2
2

3
5
g
2
1
_
+
_
6a
t
y
t
+ 6a
b
y
b
+ 2a
+ 6g
2
2
M
2
+
6
5
g
2
1
M
1
_
(10.12)
in this approximation. The -function for each of these soft parameters
is not proportional to the parameter itself; this makes sense because
couplings which violate supersymmetry are not protected by the
supersymmetric nonrenormalization theorem. In particular, even if A
0
and B
0
appearing in eqs. (9.29) and (9.30) vanish at the input scale, the
RG corrections proportional to gaugino masses appearing in eqs. (10.9)-
(10.12) ensure that a
t
, a
b
, a
and b will still be non-zero at the electroweak

scale.
Next let us consider the RG equations for the scalar masses in the
MSSM. In the approximation of eqs. (8.2) and (10.8), the squarks and
5
We must warn the reader that rescaled soft parameters At = at/yt, A
b
= a
b
/y
b
, and
A = a /y are commonly used in the literature. We do not follow this notation,
because it cannot be generalized beyond the approximation of eqs. (8.2), (10.8)
without introducing horrible complications such as non-polynomial RG equations,
and because at, a
b
and a are the couplings that actually appear in the lagrangian
anyway.
sleptons of the rst two families have only gauge interactions. This means
that if the scalar masses satisfy a boundary condition like eq. (8.15) at
an input RG scale, then when renormalized to any other RG scale, they
will still be almost diagonal, with the approximate form
m
2
Q

m
2
Q
1
0 0
0 m
2
Q
1
0
0 0 m
2
Q
3
; m
2
u

m
2
u
1
0 0
0 m
2
u
1
0
0 0 m
2
u
3
; (10.13)
etc. The rst and second family squarks and sleptons with given gauge
quantum numbers remain very nearly degenerate, but the third family
squarks and sleptons feel the eects of the larger Yukawa couplings and
so get renormalized dierently. The one-loop RG equations for the rst
and second family squark and slepton squared masses can be written as
6
16
2
d
dt
m
2
a=1,2,3
8g
2
a
C
a
[M
a
[
2
(10.14)
for each scalar , where the

a
is over the three gauge groups U(1)
Y
,
SU(2)
L
and SU(3)
C
; M
a
are the corresponding running gaugino mass
parameters which are known from eq. (10.7); and the constants C
a
are
the same quadratic Casimir invariants which appeared in eqs. (9.43)-
(9.45). An important feature of eq. (10.14) is that the right-hand sides
are strictly negative, so that the scalar (mass)
2
parameters grow as they
are RG-evolved from the input scale down to the electroweak scale. Even
if the scalars have zero or very small masses at the input scale, as in
the no-scale boundary condition limit m
2
0
= 0, they will obtain large
positive squared masses at the electroweak scale, thanks to the eects of
the gaugino masses.
The RG equations for the (mass)
2
parameters of the Higgs scalars and
third family squarks and sleptons get the same gauge contributions as
in eq. (10.14), but they also have contributions due to the large Yukawa
(y
t,b,
) and soft (a
t,b,
) couplings. At one-loop order, these only appear in
6
There are also terms in the scalar (mass)
2
RG equations which are proportional to
Tr[Y m
2
] (the sum of the weak hypercharge times the soft (mass)
2
for all scalars
in the theory). However, these contributions vanish in both the cases of minimal
supergravity and gauge-mediated boundary conditions for the soft terms, as one can
see by explicitly calculating Tr[Y m
2
] in each case. If Tr[Y m
2
] is zero at the input
scale, then it will remain zero under RG evolution. Therefore we neglect such terms in
our discussion, although they can have an important eect in more general situations.
three combinations:
X
t
= 2[y
t
[
2
(m
2
Hu
+m
2
Q
3
+m
2
u
3
) + 2[a
t
[
2
, (10.15)
X
b
= 2[y
b
[
2
(m
2
H
d
+m
2
Q
3
+m
2
d
3
) + 2[a
b
[
2
, (10.16)
X
= 2[y
[
2
(m
2
H
d
+m
2
L
3
+m
2
e
3
) + 2[a
[
2
. (10.17)
In terms of these quantities, the RG equations for the soft Higgs (mass)
2
parameters m
2
Hu
and m
2
H
d
are
16
2
d
dt
m
2
Hu
=3X
t
6g
2
2
[M
2
[
2
6
5
g
2
1
[M
1
[
2
, (10.18)
16
2
d
dt
m
2
H
d
=3X
b
+X
6g
2
2
[M
2
[
2
6
5
g
2
1
[M
1
[
2
. (10.19)
Note that X
t
, X
b
, and X
are positive, so their eect is always to decrease

the Higgs masses as one evolves the RG equations downward from the
input scale to the electroweak scale. Since y
t
is the largest of the Yukawa
couplings because of the experimental fact that the top quark is heavy,
X
t
is typically expected to be larger than X
b
and X
. This can cause the

RG-evolved m
2
Hu
to run negative near the electroweak scale, helping to
destabilize the point H
u
= 0 and so provoking a Higgs VEV which is just
what we want.
7
Thus a large top Yukawa coupling favors the breakdown of
the electroweak symmetry breaking because it induces negative radiative
corrections to the Higgs (mass)
2
.
The third family squark and slepton (mass)
2
parameters also get
contributions which depend on X
t
, X
b
and X
. Their RG equations are

given by
16
2
d
dt
m
2
Q
3
= X
t
+X
b
32
3
g
2
3
[M
3
[
2
6g
2
2
[M
2
[
2
2
15
g
2
1
[M
1
[
2
(10.20)
16
2
d
dt
m
2
u
3
= 2X
t
32
3
g
2
3
[M
3
[
2
32
15
g
2
1
[M
1
[
2
(10.21)
16
2
d
dt
m
2
d
3
= 2X
b
32
3
g
2
3
[M
3
[
2
8
15
g
2
1
[M
1
[
2
(10.22)
16
2
d
dt
m
2
L
3
= X
6g
2
2
[M
2
[
2
3
5
g
2
1
[M
1
[
2
(10.23)
16
2
d
dt
m
2
e
3
= 2X

24
5
g
2
1
[M
1
[
2
. (10.24)
In eqs. (10.18)-(10.24), the terms proportional to [M
3
[
2
, [M
2
[
2
and [M
1
[
2
are just the same ones as in eq. (10.14). Note that the terms proportional
7
One should think of m
2
Hu
as a parameter unto itself, and not as the square of some
mythical real number m
Hu
. Thus there is nothing strange about having m
2
Hu
< 0.
However, strictly speaking m
2
Hu
< 0 is neither necessary nor sucient for electroweak
symmetry breaking; see section 8.5.
10.2 The Eective Potential 233
to X
t
appear with smaller numerical coecients in the m
2
Q
3
and m
2
u
3
RG
equations than they did for the Higgs scalars, and they do not appear
at all in the m
2
d
3
, m
2
L
3
and m
2
e
3
RG equations. Furthermore, the third-
family squark (mass)
2
get a large positive contribution proportional to
[M
3
[
2
from the RG evolution, which the Higgs scalars do not get. These
facts make it easy to understand why the Higgs scalars in the MSSM can
get VEVs, but the squarks and sleptons, having large positive (mass)
2
,
do not. An examination of the RG equations (10.9)-(10.12), (10.14), and
(10.18)-(10.24) reveals that if the gaugino mass parameters M
1
, M
2
, and
M
3
are non-zero at the input scale, then all of the other soft terms will
be generated. This is why the no-scale limit with m
1/2
m
0
, A
0
, B
0
can be phenomenologically viable even though the squarks and sleptons
are massless at tree-level. On the other hand, if the gaugino masses
were to vanish at tree-level, then they would not get any contributions to
their masses at one-loop order; in that case M
1
, M
2
, and M
3
would be
extremely small.
Now that we have reviewed the eects of RG evolution from the input
scale down to the electroweak or TeV scale, we are ready to work out the
expected features of the MSSM spectrum in some detail. We will begin
with the Higgs sector in the next section.
10.2 The Eective Potential
10.3 Radiative Corrections to Particle Masses
11
R-parity violation
Here is where we will discuss the phenomenology of R-parity violation,
and discrete symmetries that could replace R parity.
234
Part 4
Advanced Topics
12
Origin of the term
12.1 The next-to-minimal supersymmetric standard model
The simplest possible extension of the particle content of the MSSM
is to add a new gauge-singlet chiral supermultiplet. The resulting
model is often called the next-to-minimal supersymmetric standard model
(NMSSM).[130] The most general possible superpotential for this model
is given by
W
NMSSM
=
1
6
kS
3
+
1
2
S
S
2
+SH
u
H
d
+W
MSSM
, (12.1)
where S stands for both the new chiral supermultiplet and its scalar
component. (There could also be a term linear in S in W
NMSSM
, but this
can always be removed by redening S by a constant shift.)
One of the virtues of the NMSSM is that it can provide a solution to the
problem mentioned in sections 8.1 and 8.5. To understand this, suppose
we set
S
= = 0 so that there are no mass terms or dimensionful
parameters in the superpotential at all. Then an eective -term for
H
u
H
d
will still arise from the third term in eq. (12.1) if S gets a VEV,
with = S). The absence of dimensionful terms in W
NMSSM
can be
enforced by introducing a new symmetry (in various dierent ways). The
soft terms in the lagrangian give a contribution to the scalar potential
which can be written as
V
NMSSM
soft
= (
1
6
a
k
S
3
+a
SH
u
H
d
+ c.c.) +m
2
S
[S[
2
+V
MSSM
soft
, (12.2)
where a
k
and a
have dimensions of mass. One may now set b = 0

in V
MSSM
soft
, because an eective value for b will be generated, equal to
a
S). If the new parameters k, , a

k
and a
are chosen correctly, then

phenomenologically acceptable VEVs will be induced for S, H
0
u
, and H
0
d
.
A correct treatment of this requires the inclusion of one-loop radiative
237
238 12 Origin of the term
corrections. But the important point is that the scale of the VEV S),
and therefore the eective value of , is then determined by the soft terms
of order m
soft
, instead of being a free parameter which is conceptually
independent of supersymmetry breaking.
The NMSSM contains, besides the particles of the MSSM, a real P
R
=
+1 scalar, a real P
R
= +1 pseudoscalar, and a P
R
= 1 Weyl fermion
singlino. These elds have no gauge couplings of their own, so they can
only interact with Standard Model particles by mixing with the neutral
MSSM elds with the same spin and charge. The real scalar mixes with
the MSSM particles h
0
and H
0
, and the pseudo-scalar mixes with A
0
.
One of the eects of replacing the term by the dynamical eld S is to
raise the upper bound on the lightest Higgs mass, for a given set of the
other parameters in the theory. However, the bound in eq. (8.40) is still
respected in the NMSSM (and any other perturbative extension of the
MSSM), provided only that the sparticles that contribute in loops to the
Higgs mass are lighter than 1 TeV or so. The odd R-parity singlino mixes
with the four MSSM neutralinos, so there are really ve neutralinos now.
In many regions of parameter space, mixing eects involving the singlet
elds are small, and they essentially just decouple. In that case, the
phenomenology of the NMSSM is nearly indistinguishable from that of the
MSSM. However, if any of the ve NMSSM neutralinos (and especially the
LSP) has a large mixing between the singlino and the usual gauginos and
higgsinos, then the signatures for sparticles can be altered in important
ways.[131]
12.2 Non-renormalizable operators and the term
Here we put discussion of the Giudice-Masiero mechanism, and models in
which arises from non-renormalizable terms in the superpotential.
Part 5
The Appendices
Appendix A
Notation and Conventions
In this appendix, we collect our conventions for spacetime and spinor
notations.
A.1 Matrix notation and the summation convention
If A is a matrix, then
A
T
is the transpose of A,
A
is the complex conjugate of A,

A
is the hermitian conjugate of A,

I
n
is the n n identity matrix .
The summation convention can be illustrated by the rule for matrix
multiplication: (AB)
ij
= A
ik
B
kj
, where the sum over k is implicit. That
is, the summation convention states that all repeated dummy indices are
summed over. Note that a diagonal matrix can be written as
A
ij
= a
i
ij
= diag(a
1
, a
2
, . . .) , (A.1)
where the Kronecker delta
ij
= 1 if i = j, and otherwise is equal to 0.
Since i and j are held xed and are therefore not dummy indices, no sum
over the repeated index i is implied.
A.2 Complex conjugation and the avor index
In theories with a collection of complex elds,
i
(x), the avor index
i labels the individual eld components. It is convenient to designate
the complex conjugated elds [
i
(x)]
by raising the avor index i and

omitting the complex conjugation symbol (the superscript asterisk):
i
(x) [
i
(x)]
. (A.2)
241
242 Appendix A
When applied to two-component fermion elds, the (
1
2
, 0) fermion elds,
i
, always have lowered avor indices and the (0,
1
2
) fermion elds,
i
,
always have raised indices.
This convention can be generalized to objects that possess more than
one avor index. If M
ij
are the matrix elements of a matrix M, then we
can treat the multiplet
i
as a vector and consider the following quantity
T
M M
ij
j
, (A.3)
where the summation convention has been employed such that two
repeated avor indices are summed over by contracting raised indices
with lowered indices (or vice versa). In this case,
1
[
T
M]
M
ij
j
, (A.4)
where we have used eq. (A.2) and have introduced:
M
ij
(M
ij
)
. (A.5)
By the same convention, given a matrix of the form G
i
j
, its complex
conjugate would be dened by:
G
i
j
(G
i
j
)
. (A.6)
With this notation, an hermitian matrix G
i
j
satises the condition G
i
j
=
G
j
i
.
Quantities with more than two indices are similarly treated. For
example, the Yukawa couplings arise in the interactions of a scalar boson
I
with pairs of two-component fermions
j
k
. The complex conjugate
of the Yukawa couplings y
Ijk
are then given by y
Ijk
(y
Ijk
)
.
A.3 Spacetime notation
Contravariant four-vectors are denoted by: a
(a
0
; a) (a
0
; a
i
).
Here, we use Greek indices such as , , and to be spacetime indices
that run over 0, 1, 2, 3, while Latin indices such as i, j, k, and are space
indices that run over 1, 2, 3.
The spacetime metric is taken to be
g
1 0 0 0
0 1 0 0
0 0 1 0
0 0 0 1
. (A.7)
1
Note that if the elds i(x) commute, then the matrix M in eq. (A.4) must be a
complex symmetric matrix. Indeed, this is the case for anti-commuting fermions, by
virtue of eq. (B.42).
Notation and Conventions 243
The metric can be used to lower indices and convert contravariant four-
vectors into covariant four vectors: a
= (a
0
; a
i
) = (a
0
; a).
Thus, a
0
= a
0
and a
i
= a
i
. To convert covariant indices into
contravariant indices we use the inverse metric g
(g
1
)
which of
course is numerically equal to g
in at spacetime. Equivalently:
g
= g
=
_
1 , = ,
0 , ,= ,
(A.8)
where
is the Kronecker delta-function.

The totally anti-commuting four-dimensional Levi-Cevita tensor
is equal to:
+1 , an even permutation of 0123 ,

1 , an odd permutation of 0123 ,
0 otherwise .
(A.9)
That is
0123
= +1. Lowering the four indices to get
, we see that
0123
= 1 One can also dene the three-dimensional Levi-Cevita tensor
ijk

0ijk
. (A.10)
Thus,
123
= +1. When the corresponding indices are lowered, one nds
123
= 1.
A.4 Two-component spinor notation
As discussed in Chapter 1, two-component spinors come in two varieties:
undotted and dotted. The undotted spinor is denoted by
and the
dotted spinor is denoted by

. We use Greek letters such as , and
to denote undotted spinor indices and ,

and to denote dotted spinor
indices. Both types of indices take on one of two values: 1 or 2. Undotted
[dotted] spinor indices can be raised and lowered with a two dimensional
undotted [dotted] -tensors:
,

=

,

=

. (A.11)
Explicitly, the undotted -tensor is given by
=
_
0 1
1 0
_
,
=
_
0 1
1 0
_
, (A.12)
so that
12
=
21
=
21
=
12
= 1, and
=
_
0 1
1 0
_
,

=
_
0 1
1 0
_
, (A.13)
244 Appendix A
so that
2
=
1
=
1
=
2
= 1. That is,
and

are numerically
equivalent.
The -tensors satisfy:

, (A.14)
from which it follows that:

=

,

=

. (A.15)
A compendium of useful relations and identities in two-component
notation can be found in Appendix B.
We next introduce the sigma-matrices
and

, which are dened

by
0
=
0
=
_
1 0
0 1
_
,
1
=
1
=
_
0 1
1 0
_
,
2
=
2
=
_
0 i
i 0
_
,
3
=
3
=
_
1 0
0 1
_
. (A.16)
The relations between
and
are
,

=
, (A.17)

. (A.18)
From the sigma-matrices, one can construct:
2
=
i
4
(

) , (A.19)
=
i
4
(

) . (A.20)
These matrices satisfy self-duality relations:
=
1
2
i
=
1
2
i
. (A.21)
In addition, eq. (A.14) implies that
=

, (A.22)

, (A.23)
where we have used Tr(
) = Tr(
) = 0.
2
Some authors dene
and
without the factor of i.

A.5 Four-component spinors and the Dirac matrices
We begin by introducing the Dirac gamma matrices which are 4 4
matrices that satisfy:
= 2g
. (A.24)
We also dene the matrices:
5
i
0
3
,
i
2
[
]
i
2
(
) , (A.25)
and note the following relation:
=
1
2
i
. (A.26)
Finally, we introduce the chiral projection operators:
P
L

1
2
(1
5
) , P
R

1
2
(1 +
5
) . (A.27)
Any 4 4 matrix can be expressed as a complex linear combination of
sixteen linearly independent 4 4 matrices. A convenient basis to use
consists of the following sixteen matrices: I
4
,
5
,
5
and
.
Dirac matrices act on a four-component Dirac spinor eld, (x). Given
a four-component spinor (x), we dene the Dirac adjoint eld ,
the space-reected eld
p
, the time-reversed eld
t
and the charge
conjugate eld
c
as follows:
(x)
A, (A.28)
p
(x) i
0
(x) , (A.29)
t
(x)
0
B
1
T
(x) =
0
B
1
A
T
(x) , (A.30)
c
(x) C
T
(x) = CA
T
(x) , (A.31)
where the Dirac conjugation matrix A, the time-reversal matrix B and
the charge conjugation matrix C satisfy [1]:
A
A
1
=
, (A.32)
B
B
1
=
T
, (A.33)
C
1
C =
T
. (A.34)
From these denitions and the dening properties of the gamma matrices
[eq. (A.24)], one can prove that B and C are antisymmetric matrices in all
representations of the gamma matrices. However, eqs. (A.32)(A.34) do
not x the overall scale of the matrices A, B and C. Thus, there is some
246 Appendix A
freedom in the denition of these matrices (independent of the chosen
gamma matrix representation). We shall remove some of this freedom by
imposing the following additional relations:
3
(
c
)
c
= , (
t
)
t
= . (A.35)
After imposing eq. (A.35), one can prove that the following results are
satised in all gamma matrix representations:
BA
1
= (AB
1
)
, (AC)
1
= (AC)
, (A.36)
CB =
5
, A
A
1
= e
i
A
I
4
, (A.37)
where the value of
A
is conventional. The standard choice,
A
= 0,
corresponds to the requirement that (x)(x) is an hermitian quantity
in all gamma matrix representations. In this convention, A
= A.
In Chapter 1, section 3.1, we introduced the chiral (or high-energy)
representation for the gamma matrices, which is dened by:
0
C
=
_
0 I
2
I
2
0
_
,
i
C
=
_
0
i
i
0
_
. (A.38)
This representation follows naturally from the two-component form of
the Dirac Lagrangian. But, given any representation of the gamma
matrices,
[and corresponding four-component Dirac spinor (x)], a

new representation
o
1
also satises eq. (A.24), where o is
any non-singular matrix. The Dirac spinor with respect to the new
representation is given by

(x) = o(x). The matrices A, B and C
are dened according to eqs. (A.32)(A.34) with respect to the gamma
matrices appropriate to the given representation. In addition, we impose
eq. (A.35) and

A
=

A. This is still not quite sucient to x

A,

B and
C uniquely, and we nd
A = [z[(o
1
)
Ao
1
,

B = z(o
1
)
T
Bo
1
,

C = z
1
oCo
T
, (A.39)
where z is an arbitrary complex number.
4
Note that there is only a phase
ambiguity in the denition of

C

A
T
. In particular,
C

A
T
= e
i
oCA
T
o
1
. (A.40)
3
The condition (
p
)
p
= is automatically satised and provides no further
constraint.
4
Most modern textbooks impose one additional constraint by restricting to a class of
-matrices for which
0
=
0
and
i
=
i
. All three representations discussed
in this section satisfy this requirement. Representations in this class are related by
a unitary transformation, e
= S
S
1
, with S
S = I4. One can then show that

it is possible to choose A, B and C to be unitary matrices in all the gamma matrix
representations of this class. Moreover, in this convention, |z| = 1, so that in any
gamma matrix representation of this class one is left with a sign ambiguity in the
denition of A and a phase ambiguity in the denitions of B and C.
Another common representation for the gamma matrices is the Dirac
(or low-energy) representation:
D
= o
C
o
1
, o =
1
2
_
I
2
I
2
I
2
I
2
_
. (A.41)
Note that in this case, o
1
= o
T
= o
. As a result, the representations

of A, B and C in terms of gamma matrices is the same for both the chiral
and the Dirac representations. A convenient choice for A, B and C in
terms of the gamma matrices is given by:
A =
0
, B =
1
3
, C = i
0
2
. (A.42)
Finally, we mention one other representation for the Dirac matrices in
which all four gamma matrices are purely imaginary, i.e.,
.
This is the Majorana representation:
M
= o
D
o
1
, o =
1
2
_
I
2

2
2
I
2
_
. (A.43)
Here, one nds that o = o
1
= o
. A convenient choice for A, B and C

in terms of the gamma matrices is given by:
A =
0
, B =
0
5
, C =
0
. (A.44)
In particular, in this convention CA
T
= I
4
, corresponding to the choice
of e
i
= i in eq. (A.40). That is, in the Majorana representation with
the conventions above,
c
=
. (A.45)
Hence, the four-component self-conjugate Majorana fermion (which
satises
c
= =
) is a real eld in the Majorana representation.

248 Appendix A
Problems
1. (a) Show that if Q is a 4 4 matrix that commutes with all the
( = 0, 1, 2, 3) and R is a 4 4 matrix that anticommutes with all the
, then Q = c
1
I
4
and R = c
2
5
, for some complex numbers c
1
and c
2
.
(b) From the dening relations for the matrices A, B and C
[eqs. (A.32)(A.34)], show that the matrices A
1
A
, B
1
B
T
, C
1
C
T
,
(BA
1
)(BA
1
)
and AC(AC)
all commute with the
and the matrix

CB anticommutes with the
.
(c) Using parts (a) and (b), show that B
T
= B and C
T
= C in all
gamma matrix representations.
(d) By imposing (
c
)
c
= and (
t
)
t
= , derive eq. (A.36). Finally,
impose the condition = A
1
and conclude that A
= A. Verify that
the choice of A, B and C is still not unique, as specied in eq. (A.39).
2. Starting with the chiral representation of the gamma matrices,
transform to the Dirac representation while keeping track of the two-
component spinor indices. Show that in the Dirac representation, the
four-component spinor is given by
=
1
+
0
, (A.46)
and the gamma matrices are given by:
0
=
_
0
0

_
,
i
=
0
i
i
0
,
5
=
0
0
0
0
. (A.47)
Evaluate the matrices A, B and C in the Dirac representation and show
that they are given by:
A =
0
0
0
0
, B =
_
0
0

_
, C =
_
0
0

0
_
. (A.48)
3. Starting with the Dirac representation of the gamma matrices (see
problem 2), transform to the Majorana representation while keeping
track of the two-component spinor indices. Show that in the Majorana
representation, the four component spinor is given by
=
1
+
2
, (A.49)
and the gamma matrices are given by:
0
=
_
0
2
2
0
_
,
1
= 2i
_
12
0
0
12
_
,
2
=
_
0
2
2
0
_
,
3
= 2i
_
23
0
0
23
_
,
5
= 2i
_
02
0
0
02
_
, (A.50)
where the spinor labels in eq. (A.50) have been suppressed.
Evaluate the matrices A, B and C in the Majorana representation and
show that they are given by:
A = 2i
_
0
02
02
0
_
, B =
0
0
,
C = i
_
0
0

0
_
. (A.51)
The factor of i in C is conventional and has been chosen so that the
numerical value of CA
T
is equal to the identity matrix. Verify this last
result (and display the correct spinor label structure).
Appendix B
Compendium of Useful Relations for
Two-Component Notation
B.1 Sigma matrices and associated identities
Various sigma matrices
and
were dened in
eqs. (A.16), (A.19) and (A.20). The following identities involving the
sigma-matrices are useful.
= 2

(B.1)
= 2
(B.2)
= 2
(B.3)
[
= 4
(B.4)
[
= 4

(B.5)
[
= 2g
(B.6)
[
= 2g
(B.7)
[
= 4g
(B.8)
[
= 4g
(B.9)
(
= 2i(
+g
(B.10)
(
= 2i(
+g
(B.11)
(
= 2
(B.12)
(
)

= 2

(B.13)
(
)

= 0 (B.14)
[
= 3
(B.15)
[
= 3

. (B.16)
250
Compendium of Useful Relations for Two-Component Notation 251
Eqs. (B.1), (B.12) and (B.13) are the basis for the Fierz identities, which
are given in detail in Appendix A of ref. [1]. Three other useful identities
of this type are:
=
1
2
_

+i

_
(B.17)
=
1
2
_
_
(B.18)
=
1
2
g

+i

+ 2g

.
(B.19)
There are other identities that are quite useful, which we display here for
completeness. Henceforth, we suppress the spinor indices.
Tr[
] = Tr[
] = 2g
(B.20)
Tr[
] = 2 (g
+g
+i
) (B.21)
Tr[
] = 2 (g
+g
) (B.22)
= 2
(B.23)
= 2
(B.24)
= 2
(B.25)
= 2
(B.26)
= g
+g
(B.27)
= g
+g
+i
(B.28)
=
i
2
(g
+i
) (B.29)
=
i
2
(g
) (B.30)
=
i
2
(g
) (B.31)
=
i
2
(g
+i
) (B.32)
=
1
4
(g
+i
) (B.33)
+
i
2
(g
+g
=
1
4
(g
) (B.34)
+
i
2
(g
+g
) .
Next, we consider two-component spinor identities that arise when
manipulating products of two or four spinors. The heights of indices must
be consistent in the sense that lowered indices must always be contracted
with raised indices. Indices contracted like
and

, (B.35)
252 Appendix B
can be suppressed. In all spinor products given in this paper, contracted
indices are always to have heights that conform to eq. (1.52). For example,

, etc. The behavior of the spinor products

under hermitian conjugation is as follows:
()
(B.36)
(
(B.37)
(
(B.38)
(
(B.39)
(
(B.40)
Note that these relations are applicable both to anti-commuting and to
commuting spinors.
In addition to manipulating expressions containing anti-commuting
fermion elds, we often must deal with products of commuting spinor
wave functions that arise when evaluating the Feynman rules. In the
following expressions we denote the generic spinor by z
i
. In the various
identities listed below, an extra minus sign arises when manipulating a
product of anti-commuting fermion elds. Thus, we employ the notation:
(1)
A
_
+1 , commuting spinors,
1 , anti-commuting spinors.
(B.41)
The following identities hold for the z
i
:
z
1
z
2
= (1)
A
z
2
z
1
(B.42)
z
1
z
2
= (1)
A
z
2
z
1
(B.43)
z
1
z
2
= (1)
A
z
2
z
1
(B.44)
z
1
z
2
= (1)
A
z
2
z
1
(B.45)
z
1
z
2
= (1)
A
z
2
z
1
(B.46)
z
1
z
2
= (1)
A
z
2
z
1
(B.47)
z
1
z
2
= (1)
A
z
2
z
1
(B.48)
z
1
z
2
= (1)
A
z
2
z
1
(B.49)
As previously noted, eqs. (B.1), (B.12) and (B.13) can be used to derive a
series of Fierz identities for two-component spinor products. For example,
1
2
(z
1
z
2
)( z
3
z
4
) = (1)
A
(z
1
z
4
)( z
3
z
2
) (B.50)
1
2
( z
1
z
2
)( z
3
z
4
) = (1)
A
( z
1
z
3
)(z
4
z
2
) (B.51)
1
2
(z
1
z
2
)(z
3
z
4
) = (1)
A
(z
1
z
3
)( z
4
z
2
) (B.52)
Many more Fierz identities can be found in Appendix A of ref. [1].
B.2 Behavior of 2-component fermion bilinears under C, P, T
In Chapter 1, we examined the behavior of two-component fermion elds
under P, C and T. With these results, one can easily compute the behavior
of fermion bilinears under the discrete symmetries. First we examine
fermion bilinears constructed out of two component neutral fermion elds.
We summarize the fermion eld transformation laws:
T
(x)T
1
=
P
i
0
(x
P
) , (B.53)
T
(x)T
1
=
T
(x
T
) , (B.54)
(
(x)(
1
=
C
(x) , (B.55)
where x
P
(t ; x) and x
T
(t ; x).
As shown in chapter 1, the phases
P
,
T
and
C
are restricted to be
real (either 1). From these results, one easily derives:
(T
(x)((T)
1
=
CP
i
0
(x
P
) , (B.56)
(TT
(x)((TT )
1
= i
CPT

(x) , (B.57)
where
CP

C
P
and
CPT

C
T
. To be consistent with the CPT
theorem, we demand that
CPT
is the same for all the fermion species.
Without loss of generality, we may take:
CPT

C
T
= +1 . (B.58)
Using the above results, the behavior of the fermion bilinears under
the discrete symmetries is given in Table B.1. From the results of
Table B.1, one can immediately determine the transformation properties
of the fermion bilinears under CPT. These are given in Table B.2.
Next, we examine the case of a charged fermion eld, which is specied
by a pair of two-component fermion elds and of opposite charge. We
summarize the transformation laws for :
T
(x)T
1
=
P
i
0
(x
P
) , (B.59)
T
(x)T
1
=
T
(x
T
) , (B.60)
(
(x)(
1
=
C
(x) , (B.61)
with no restriction initially on the phases
P
,
T
and
C
. The
corresponding transformation laws for
are obtained from eqs. (B.59)

(B.61) by interchanging and complex conjugating the phases
P
,
T
and
C
. However, to be consistent with the CPT theorem, we must
again demand that
CPT
is independent of the fermion species.
254 Appendix B
T
a
b
l
e
B
.
1
.
T
r
a
n
s
f
o
r
m
a
t
i
o
n
p
r
o
p
e
r
t
i
e
s
o
f
2
-
c
o
m
p
o
n
e
n
t
f
e
r
m
i
o
n
b
i
l
i
n
e
a
r
s
u
n
d
e
r
t
h
e
d
i
s
c
r
e
t
e
s
y
m
m
e
t
r
i
e
s
.
T
h
e
p
h
a
s
e
s
P
,
C
a
n
d
C
a
r
e
e
i
t
h
e
r
+
1
o
r
1
.
T
h
e
f
o
l
l
o
w
i
n
g
n
o
t
a
t
i
o
n
i
s
e
m
p
l
o
y
e
d
:
d
i
a
g
(
1
,
1
,
1
,
1
)
,
d
i
a
g
(
1
,
1
,
1
,
1
)
,
x
=
(
t
;
x
)
,
x
P
=
(
t
;
x
)
a
n
d
x
T
=
(
t
;
x
)
.
P
T
2
(
x
)
P
1
P
2
2
(
x
P
)
T
1
T
2
2
(
x
T
)
2
(
x
)
P
1
P
2
2
(
x
P
)
T
1
T
2
2
(
x
T
)
2
(
x
)
P
1
P
2
(
P
)
2
(
x
P
)
T
1
T
2
(
T
)
2
(
x
T
)
2
(
x
)
P
1
P
2
(
P
)
2
(
x
P
)
T
1
T
2
(
T
)
2
(
x
T
)
2
(
x
)
P
1
P
2
(
P
)
P
)
2
(
x
P
)
T
1
T
2
(
T
)
T
)
2
(
x
T
)
2
(
x
)
P
1
P
2
(
P
)
P
)
2
(
x
P
)
T
1
T
2
(
T
)
T
)
2
(
x
T
)
C
C
P
2
(
x
)
C
1
C
2
2
(
x
)
C
P
1
C
P
2
2
(
x
P
)
2
(
x
)
C
1
C
2
2
(
x
)
C
P
1
C
P
2
2
(
x
P
)
2
(
x
)
C
1
C
2
2
(
x
)
C
P
1
C
P
2
(
P
)
2
(
x
P
)
2
(
x
)
C
1
C
2
2
(
x
)
C
P
1
C
P
2
(
P
)
2
(
x
P
)
2
(
x
)
C
1
C
2
2
(
x
)
C
P
1
C
P
2
(
P
)
P
)
2
(
x
P
)
2
(
x
)
C
1
C
2
2
(
x
)
C
P
1
C
P
2
(
P
)
P
)
2
(
x
P
)
Table B.2. Transformation properties of 2-component fermion bilinears under
CPT. The notation used below is given in the caption to Table B.1. The phase
CPT
is taken to be independent of the particle species as required by the CPT
theorem.
CPT
2
(x)
1

2
(x)

1

2
(x)
1
2
(x)

1
2
(x)
1

2
(x)

2
(x)
1
2
(x)
2
(x)
1
2
(x)

1
2
(x)
1
2
(x)
Without loss of generality, one can impose the result given in eq. (B.58).
Although no further restrictions on the phases is required, it is always
possible to rotate
P
and
C
by an arbitrary phase angle by redening
the parity and charge conjugation operators (by multiplying by an
appropriate gauge transformation). Thus, one is free to establish a
convention in which all phases are real (either 1), as in the case of the
neutral self-conjugate fermion. In this convention, CP = PC, TC = CT
and TP = PT.
Using the above results, the behavior of the charged fermion bilinears
are easily determined. Here, we nd it useful to list the behavior of certain
linear combinations of fermion bilinears, denoted generically by B
12
(x),
under P, C and T transformations. The indices 1 and 2 refer to two
dierent charged fermions. In order to be general, we have not imposed
any reality conditions on the phases
P
,
T
and
C
. The results are
given in Table B.3. From the results of Table B.3, one can immediately
determine the transformation properties of the corresponding fermion
bilinears under CPT. These are given in Table B.4. Note that by using
the translation from two-component to four-component notation given in
eqs. (3.27) and (3.32), the results of Tables B.3 and B.4 reproduce the
well known behavior of the Dirac bilinear covariants under P, T and C
transformations.
1
1
To verify this assertion, recall that T and CPT are anti-unitary operators. Thus,
any factor of i that appears in the bilinear covariant will change sign under these
anti-unitary transformations.
256 Appendix B
T
a
b
l
e
B
.
3
.
T
r
a
n
s
f
o
r
m
a
t
i
o
n
p
r
o
p
e
r
t
i
e
s
o
f
v
a
r
i
o
u
s
2
-
c
o
m
p
o
n
e
n
t
c
h
a
r
g
e
d
f
e
r
m
i
o
n
b
i
l
i
n
e
a
r
s
,
B
1
2
(
x
)
u
n
d
e
r
t
h
e
d
i
s
c
r
e
t
e
s
y
m
m
e
t
r
i
e
s
.
T
h
e
p
h
a
s
e
s
P
,
C
a
n
d
C
a
r
e
a
r
b
i
t
r
a
r
y
.
T
h
e
f
o
l
l
o
w
i
n
g
n
o
t
a
t
i
o
n
i
s
e
m
p
l
o
y
e
d
:
d
i
a
g
(
1
,
1
,
1
,
1
)
,
d
i
a
g
(
1
,
1
,
1
,
1
)
,
x
=
(
t
;
x
)
,
x
P
=
(
t
;
x
)
a
n
d
x
T
=
(
t
;
x
)
.
B
1
2
(
x
)
P
T
(
2
+
2
)
(
x
)
P
1
P
2
B
1
2
(
x
P
)
T
1
T
2
B
1
2
(
x
T
)
(
2
)
(
x
)
P
1
P
2
B
1
2
(
x
P
)
T
1
T
2
B
1
2
(
x
T
)
(
2
+
2
)
(
x
)
P
1
P
2
(
P
)
B
1
2
(
x
P
)
T
1
T
2
(
T
)
B
1
2
(
x
T
)
(
2
)
(
x
)
P
1
P
2
(
P
)
B
1
2
(
x
P
)
T
1
T
2
(
T
)
B
1
2
(
x
T
)
(
2
+
2
)
(
x
)
P
1
P
2
(
P
)
P
)
1
2
(
x
P
)
T
1
T
2
(
T
)
T
)
1
2
(
x
T
)
(
2
)
(
x
)
P
1
P
2
(
P
)
P
)
1
2
(
x
P
)
T
1
T
2
(
T
)
T
)
1
2
(
x
T
)
B
1
2
(
x
)
C
C
P
(
2
(
x
)
+
2
)
(
x
)
C
1
C
2
B
2
1
(
x
)
C
P
1
C
P
2
B
2
1
(
x
P
)
(
2
)
(
x
)
C
1
C
2
B
2
1
(
x
)
C
P
1
C
P
2
B
2
1
(
x
P
)
(
2
+
2
)
(
x
)
C
1
C
2
B
2
1
(
x
)
C
P
1
C
P
2
(
P
)
B
2
1
(
x
P
)
(
2
)
(
x
)
C
1
C
2
B
2
1
(
x
)
C
P
1
C
P
2
(
P
)
B
2
1
(
x
P
)
(
2
+
2
)
(
x
)
C
1
C
2
B
2
1
(
x
)
C
P
1
C
P
2
(
P
)
P
)
2
1
(
x
P
)
(
2
)
(
x
)
C
1
C
2
B
2
1
(
x
)
C
P
1
C
P
2
(
P
)
P
)
2
1
(
x
P
)
Table B.4. Transformation properties of various 2-component charged fermion
bilinears under a CPT transformation. The notation used below is given in the
caption to Table B.3. The phase
CPT
is taken to be independent of the particle
species as required by the CPT theorem.
CPT
(
1
2
(x) +
1

2
)(x) (
2
1
(x) +
2

1
)(x)
(
1

2
2
)(x) (
2

1
1
)(x)
(
2
+
1

2
)(x) (
1
+
2

1
)(x)
(
1
2
)(x) (
2
1
)(x)
(
1
2
+
1

2
)(x) (
2
1
+
2

1
)(x)
(
1
2
)(x) (
2
1
)(x)
References
258
References 259
[1] D. Bailin and A. Love, Supersymmetric Gauge Field Theory and String
Theory (Institute of Physics Publishing, Bristol, UK, 1994).
[2] D.A. Eliezer and R.P. Woodard, Nucl. Phys. B 325, 389 (1989).
[3] S. Weinberg, Phys. Rev. D 13, 974 (1976); Phys. Rev. D 19, 1277 (1979);
L. Susskind, Phys. Rev. D 20, 2619 (1979); G. t Hooft, in Recent
developments in gauge theories, Proceedings of the NATO Advanced
Summer Institute, Cargese 1979, ed. G. t Hooft et al. (Plenum, New York
1980).
[4] E. Witten, Nucl. Phys. B188, 513 (1981); N. Sakai, Z. Phys. C 11, 153
(1981); S. Dimopoulos and H. Georgi, Nucl. Phys. B193, 150 (1981);
R.K. Kaul and P. Majumdar, Nucl. Phys. B199, 36 (1982).
[5] S. Coleman and J. Mandula, Phys. Rev. 159, 1251 (1967); R. Haag,
J. Lopuszanski, and M. Sohnius, Nucl. Phys. B88, 257 (1975).
[6] P. Fayet, Phys. Lett. B 64, 159 (1976).
[7] P. Fayet, Phys. Lett. B 69, 489 (1977); Phys. Lett. B 84, 416 (1979).
[8] G.R. Farrar and P. Fayet, Phys. Lett. B 76, 575 (1978).
[9] P. Ramond, Phys. Rev. D 3, 2415 (1971); A. Neveu and J.H. Schwarz,
Nucl. Phys. B31, 86 (1971); J.L. Gervais and B. Sakita, Nucl. Phys. B34,
632 (1971).
[10] Yu. A. Golfand and E. P. Likhtman, JETP Lett. 13, 323 (1971).
[11] J. Wess and B. Zumino, Nucl. Phys. B70, 39 (1974).
[12] D.V. Volkov and V.P. Akulov, Phys. Lett. B 46, 109 (1973).
[13] J. Wess and J. Bagger, Supersymmetry and Supergravity, (Princeton
University Press, Princeton NJ, 1992).
[14] G.G. Ross, Grand Unied Theories, (Addison-Wesley, Redwood City CA,
1985).
[15] P.P. Srivastava, Supersymmetry and superelds, (Adam-Hilger, Bristol
England, 1986).
[16] P.G.O. Freund, Introduction to Supersymmetry, (Cambridge University
Press, Cambridge England, 1986).
[17] P.C. West, Introduction to Supersymmetry and Supergravity, (World
Scientic, Singapore, 1990).
[18] R.N. Mohapatra, Unication and Supersymmetry: The Frontiers of Quark-
Lepton Physics, Springer-Verlag, New York 1992.
[19] D. Bailin and A. Love, Supersymmetric Gauge Field Theory and String
Theory, (Institute of Physics Publishing, Bristol England, 1994).
[20] P. Ramond, Beyond the Standard Model, Frontiers in Physics series,
Addison-Wesley, to appear.
[21] H.E. Haber and G.L. Kane, Phys. Rep. 117, 75 (1985).
[22] H.P. Nilles, Phys. Rep. 110, 1 (1984).
260 References
[23] Superspace or One Thousand and One Lessons in Supersymmetry, S.J.
Gates, M.T. Grisaru, M. Rocek and W. Siegel, Benjamin/Cummings 1983.
[24] R. Arnowitt, A. Chamseddine and P. Nath, N=1 Supergravity, World
Scientic, Singapore, 1984.
[25] D.R.T. Jones, Supersymmetric gauge theories, in TASI Lectures in
Elementary Particle Physics 1984, ed. D.N. Williams, TASI publications,
Ann Arbor 1984.
[26] H.E. Haber, Introductory low-energy supersymmetry, TASI-92 lectures,
in Recent Directions in Particle Theory, eds. J. Harvey and J. Polchinski,
World Scientic, 1993.
[27] P. Ramond, Introductory lectures on low-energy supersymmetry, TASI-
94 lectures, hep-th/9412234.
[28] J.A. Bagger, Weak-scale supersymmetry: theory and practice, TASI-95
lectures, hep-ph/9604232.
[29] H. Baer et al., Low energy supersymmetry phenomenology, hep-
ph/9503479, in Report of the Working Group on Electroweak Symmetry
Breaking and New Physics of the 1995 study of the future of particle physics
in the USA, to be published by World Scientic.
[30] M. Drees and S.P. Martin, Implications of SUSY model building, as in
Ref.[29]
[31] J.D. Lykken, Introduction to Supersymmetry, TASI-96 lectures, hep-
th/9612114
[32] S. Dawson, SUSY and such, Lectures given at NATO Advanced
Study Institute on Techniques and Concepts of High-energy Physics, hep-
ph/9612229.
[33] M. Dine, Supersymmetry Phenomenology (With a Broad Brush), hep-
ph/9612389.
[34] J.F. Gunion, A simplied summary of supersymmetry, to appear in
Future High Energy Colliders, AIP Press, hep-ph/9704349.
[35] M. Shifman, Non-perturbative dynamics in supersymmetric gauge
theories, hep-ph/9704114.
[36] X. Tata, What is supersymmetry and how do we nd it?, Lectures
presented at the IX Jorge A. Swieca Summer School, Campos do Jord ao,
Brazil, hep-ph/9706307.
[37] S. Ferrara, editor, Supersymmetry, (World Scientic, Singapore, 1987).
[38] A. Salam and J. Strathdee, Nucl. Phys. B76, 477 (1974); S. Ferrara,
J. Wess and B. Zumino, Phys. Lett. B 51, 239 (1974). See Ref.[13] for
a pedagogical introduction to the supereld formalism.
[39] J. Wess and B. Zumino, Phys. Lett. B 49, 52 (1974); J. Iliopoulos and
B. Zumino, Nucl. Phys. B76, 310 (1974).
[40] J. Wess and B. Zumino, Nucl. Phys. B78, 1 (1974).
References 261
[41] L. Girardello and M.T. Grisaru Nucl. Phys. B194, 65 (1982).
[42] See, however, L.J. Hall and L. Randall, Phys. Rev. Lett. 65, 2939 (1990).
[43] J.E. Kim and H. P. Nilles, Phys. Lett. B 138, 150 (1984) J.E. Kim and
H. P. Nilles, Phys. Lett. B 263, 79 (1991); E.J. Chun, J.E. Kim and
H.P. Nilles, Nucl. Phys. B370, 105 (1992).
[44] G.F. Giudice and A. Masiero, Phys. Lett. B 206, 480 (1988); J.A. Casas
and C. Mu noz, Phys. Lett. B 306, 288 (1993).
[45] G. Dvali, G.F. Giudice and A. Pomarol, Nucl. Phys. B478, 31 (1996).
[46] See, for example, F. Zwirner, Phys. Lett. B 132, 103 (1983); R. Barbieri
and A. Masiero, Nucl. Phys. B267, 679 (1986); S. Dimopoulos and L. Hall,
Phys. Lett. B 207, 210 (1987); V. Barger, G. Giudice, and T. Han,
Phys. Rev. D 40, 2987 (1989); R. Godbole, P. Roy and X. Tata, Nucl. Phys.
B401, 67 (1992); G. Bhattacharya and D. Choudhury, Mod. Phys. Lett.
A10, 1699 (1995); G. Bhattacharya, R-parity-violating supersymmetric
Yukawa couplings: a Mini-review, hep-ph/9608415, in Supersymmetry
96, ed. R.N. Mohapatra and A. Rasin; H. Dreiner, An Introduction to
explicit R-parity violation, hep-ph/9707435, baloney.
[47] S. Dimopoulos and H. Georgi, Nucl. Phys. B193, 150 (1981); S. Weinberg,
Phys. Rev. D 26, 2878 (1982); N. Sakai and T. Yanagida, Phys. Rev. D
B197, 533 (1982); S. Dimopoulos, S. Raby and F. Wilczek, Phys. Lett. B
112, 133 (1982).
[48] H. Goldberg, Phys. Rev. Lett. 50, 1419 (1983); J. Ellis, J. Hagelin,
D.V. Nanopoulos, K. Olive, and M. Srednicki, Nucl. Phys. B238, 453
(1984).
[49] L. Krauss and F. Wilczek, Phys. Rev. Lett. 62, 1221 (1989).
[50] L.E. Ib a nez and G. Ross, Phys. Lett. B 260, 291 (1991); T. Banks and
M. Dine, Phys. Rev. D 45, 1424 (1995); L.E. Ib a nez, Nucl. Phys. B398,
301 (1993).
[51] R.N. Mohapatra, Phys. Rev. D 34, 3457 (1986); A. Font, L.E. Ib a nez and
F. Quevedo, Phys. Lett. B 228, 79 (1989); S.P. Martin Phys. Rev. D 46,
2769 (1992); Phys. Rev. D 54, 2340 (1996).
[52] R. Kuchimanchi and R.N. Mohapatra, Phys. Rev. D 48, 4352 (1993);
Phys. Rev. Lett. 75, 3989 (1995); C.S. Aulakh, K. Benakli and G. Sen-
janovic, hep-ph/9703434; Phys. Rev. Lett. 79, 2188 (1997); C.S. Aulakh,
A. Melfo and G. Senjanovic, Phys. Rev. D 57, 4174 (1998).
[53] S. Dimopoulos and D. Sutter, Nucl. Phys. B452, 496 (1995).
[54] For a comprehensive analysis, see F. Gabbiani, E. Gabrielli, A. Masiero
and L. Silvestrini, Nucl. Phys. B477, 321 (1996) and references therein.
[55] L.J. Hall, V.A. Kostalecky and S. Raby, Nucl. Phys. B267, 415 (1986);
F. Gabbiani and A. Masiero, Phys. Lett. B 209, 289 (1988); R. Barbieri
and L.J. Hall, Phys. Lett. B 338, 212 (1994).
262 References
[56] J. Ellis and D.V. Nanopoulos, Phys. Lett. B 110, 44 (1982); R. Barbieri
and R. Gatto, Phys. Lett. B 110, 343 (1982); B.A. Campbell, Phys. Rev. D
28, 209 (1983).
[57] M.J. Duncan, Nucl. Phys. B221, 285 (1983); J.F. Donahue, H.P. Nilles
and D. Wyler, Phys. Lett. B 128, 55 (1983); A. Bouquet, J. Kaplan and
C.A. Savoy, Phys. Lett. B 148, 69 (1984); M. Dugan, B. Grinstein and
L.J. Hall, Nucl. Phys. B255, 413 (1985); F. Gabbiani and A. Masiero,
Nucl. Phys. B322, 235 (1989); J. Hagelin, S. Kelley and T. Tanaka,
Nucl. Phys. B415, 293 (1994).
[58] S. Bertolini, F. Borzumati, A. Masiero and G. Ridol, Nucl. Phys. B353,
591 (1991); R. Barbieri and G.F. Giudice, Phys. Lett. B 309, 86 (1993);
J. Hewett and J.D. Wells, Phys. Rev. D 55, 5549 (1997).
[59] J. Ellis, S. Ferrara and D.V. Nanopoulos, Phys. Lett. B 114, 231 (1982);
W. Buchm uller and D. Wyler, Phys. Lett. B 121, 321 (1983); J. Polchinski
and M.B. Wise, Phys. Lett. B 125, 393 (1983); F. del Aguila, M.B. Gavela,
J.A. Grifols and A. Mendez, Phys. Lett. B 126, 71 (1983); D.V. Nanopoulos
and M. Srednicki, Phys. Lett. B 128, 61 (1983).
[60] P. Langacker, in Proceedings of the PASCOS90 Symposium, Eds. P. Nath
and S. Reucroft, (World Scientic, Singapore 1990) J. Ellis, S. Kelley,
and D. Nanopoulos, Phys. Lett. B 260, 131 (1991); U. Amaldi, W. de
Boer, and H. Furstenau, Phys. Lett. B 260, 447 (1991); P. Langacker and
M. Luo, Phys. Rev. D 44, 817 (1991); C. Giunti, C.W. Kim and U. W. Lee,
Mod. Phys. Lett. A6, 1745 (1991).
[61] A.G. Cohen, D.B. Kaplan and A.E. Nelson, Phys. Lett. B 388, 588 (1996).
[62] Y. Nir and N. Seiberg, Phys. Lett. B 309, 337 (1993).
[63] P. Fayet and J. Iliopoulos, Phys. Lett. B 51, 461 (1974); P. Fayet,
Nucl. Phys. B90, 104 (1975).
[64] However, see for example P. Binetruy and E. Dudas, Phys. Lett. B 389,
503 (1996); G. Dvali and A. Pomarol, Phys. Rev. Lett. 77, 3728 (1996);
R.N. Mohapatra and A. Riotto, Phys. Rev. D 55, 4262 (1997). A non-
zero Fayet-Iliopoulos term for an anomalous U(1) symmetry is commonly
found in superstring models: M. Green and J. Schwarz, Phys. Lett. B 149,
117 (1984); M. Dine, N. Seiberg and E. Witten, Nucl. Phys. B289, 589
(1987); J. Atick, L. Dixon and A. Sen, Nucl. Phys. B292, 109 (1987); This
may even help to explain the observed structure of the Yukawa couplings:
L.E. Ib a nez, Phys. Lett. B 303, 55 (1993); L.E. Ib a nez and G.G. Ross,
Phys. Lett. B 332, 100 (1994); P. Binetruy, S. Lavignac, P. Ramond,
Nucl. Phys. B477, 353 (1996); P. Binetruy, N.Irges, S. Lavignac and
P. Ramond, Phys. Lett. B 403, 38 (1997); N. Irges, S. Lavignac,
P. Ramond, Phys. Rev. D 58, 035003 (1998).
[65] L. ORaifeartaigh, Nucl. Phys. B96, 331 (1975).
[66] S. Coleman and E. Weinberg, Phys. Rev. D 7, 1888 (1973).
References 263
[67] E. Witten, Nucl. Phys. B202, 253 (1982); I. Aeck, M. Dine and
N. Seiberg, Nucl. Phys. B241, 493 (1981); Nucl. Phys. B256, 557
(1986). For reviews, see L. Randall, Models of Dynamical Supersymmetry
Breaking, hep-ph/9706474; A. Nelson, Dynamical Supersymmetry
Breaking, hep-ph/9707442; G.F. Giudice and R. Rattazzi, Theories with
Gauge-Mediated Supersymmetry Breaking, hep-ph/9801271.
[68] P. Fayet, Phys. Lett. B 70, 461 (1977); Phys. Lett. B 86, 272 (1979); and in
Unication of the fundamental particle interactions (Plenum, New York,
1980).
[69] N. Cabibbo, G.R. Farrar and L. Maiani, Phys. Lett. B 105, 155 (1981);
M.K. Gaillard, L. Hall and I. Hinchlie, Phys. Lett. B 116, 279 (1982);
J. Ellis and J.S. Hagelin, Phys. Lett. B 122, 303 (1983), D.A. Dicus,
S. Nandi and J. Woodside, Phys. Lett. B 258, 231 (1991); D.R. Stump,
M. Wiest, and C.P. Yuan, Phys. Rev. D 54, 1936 (1996); S. Ambrosanio,
G. Kribs, and S.P. Martin, Phys. Rev. D 56, 1761 (1997).
[70] S. Dimopoulos, M. Dine, S. Raby and S.Thomas, Phys. Rev. Lett. 76, 3494
(1996); S. Dimopoulos, S. Thomas and J. D. Wells, Phys. Rev. D 54, 3283
(1996).
[71] S. Ambrosanio et al., Phys. Rev. Lett. 76, 3498 (1996); Phys. Rev. D 54,
5395 (1996).
[72] S. Ferrara, D.Z. Freedman and P. van Nieuwenhuizen, Phys. Rev. D 13,
3214 (1976); S. Deser and B. Zumino, Phys. Lett. B 62, 335 (1976);
D.Z. Freedman and P. van Nieuwenhuizen, Phys. Rev. D 14, 912 (1976);
E. Cremmer et al., Nucl. Phys. B147, 105 (1979); J. Bagger, Nucl. Phys.
B211, 302 (1983).
[73] E. Cremmer, S. Ferrara, L. Girardello, and A. van Proeyen, Nucl. Phys.
B212, 413 (1983).
[74] S. Deser and B. Zumino, Phys. Rev. Lett. 38, 1433 (1977); E. Cremmer et
al., Phys. Lett. B 79, 1978 (231).
[75] H. Pagels, J.R. Primack, Phys. Rev. Lett. 48, 1982 (223); T. Moroi,
H. Murayama, M. Yamaguchi, Phys. Lett. B 303, 1993 (289).
[76] P. Moxhay and K. Yamamoto, Nucl. Phys. B256, 130 (1985); K. Grassie,
Phys. Lett. B 159, 32 (1985); B. Gato, Nucl. Phys. B278, 189 (1986);
N. Polonsky and A. Pomarol, Phys. Rev. Lett. 73, 2292 (1994).
[77] G.G. Ross and R.G. Roberts, Nucl. Phys. B377, 571 (1992). H. Arason
et al., Phys. Rev. Lett. 67, 2933 (1991); R. Arnowitt and P. Nath,
Phys. Rev. Lett. 69, 725 (1992) and Phys. Rev. D 46, 3981 (1992);
D.J. Casta no, E.J. Piard, P. Ramond, Phys. Rev. D 49, 4882 (1994);
V. Barger, M. Berger and P. Ohmann, Phys. Rev. D 49, 4908 (1994);
G.L. Kane, C. Kolda, L. Roszkowski, J.D. Wells, Phys. Rev. D 49, 6173
(1994); B. Ananthanarayan, K.S. Babu and Q. Sha, Nucl. Phys. B428,
19 (1994); M. Carena, M. Olechowski, S. Pokorski, C.E.M. Wagner,
Nucl. Phys. B419, 213 (1994); W. de Boer, R. Ehret and D. Kazakov,
264 References
Z. Phys. C 67, 647 (1995); M. Carena, P. Chankowski, M. Olechowski,
S. Pokorski, C.E.M. Wagner, Nucl. Phys. B491, 103 (1997).
[78] V. Kaplunovsky and J. Louis, Phys. Lett. B 306, 269 (1993); R. Barbieri,
J. Louis and M. Moretti, Phys. Lett. B 312, 451 (1993); A. Brignole,
L.E. Ib a nez and C. Mu noz, Nucl. Phys. B422, 125 (1994), erratum
Nucl. Phys. B436, 747 (1995).
[79] J. Polonyi, Hungary Central Research Institute report KFKI-77-93 (1977)
(unpublished). See Ref.[19] for a pedagogical account.
[80] For a review, see A.B. Lahanas and D.V. Nanopoulos, Phys. Rep. 145, 1
(1987).
[81] For a review, see A. Brignole, L.E. Iba nez and C. Mu noz, Soft
supersymmetry-breaking terms from supergravity and supersring models,
hep-ph/9707209, baloney.
[82] M. Dine and W. Fischler, Phys. Lett. B 110, 227 (1982); C.R. Nappi
and B.A. Ovrut, Phys. Lett. B 113, 175 (1982); L. Alvarez-Gaume, M.
Claudson, and M. B. Wise, Nucl. Phys. B207, 96 (1982).
[83] M. Dine, A. E. Nelson, Phys. Rev. D 48, 1277 (1993); M. Dine,
A.E. Nelson, Y. Shirman, Phys. Rev. D 51, 1362 (1995); M. Dine,
A.E. Nelson, Y. Nir, Y. Shirman, Phys. Rev. D 53, 2658 (1996).
[84] S. Dimopoulos, G.F. Giudice and A. Pomarol, Phys. Lett. B 389, 37 (1996);
S.P. Martin Phys. Rev. D 55, 3177 (1997); E. Poppitz and S.P. Trivedi,
Phys. Lett. B 401, 38 (1997).
[85] W. Siegel, Phys. Lett. B 84, 193 (1979); D.M. Capper, D.R.T. Jones and
P. van Nieuwenhuizen, Nucl. Phys. B167, 479 (1980).
[86] V. Novikov, M. Shifman, A. Vainshtein and V. Zakharov, Nucl. Phys.
B229, 381 (1983); Phys. Lett. B 166, 329 (1986); J. Hisano and
M. Shifman, Phys. Rev. D 56, 5475 (1997).
[87] W. Siegel, Phys. Lett. B 94, 37 (1980); L.V. Avdeev, G.A. Chochia
and A.A. Vladimirov, Phys. Lett. B 105, 272 (1981); L.V. Avdeev and
A.A. Vladimirov, Nucl. Phys. B219, 262 (1983).
[88] I. Jack and D.R.T. Jones, Regularisation of supersymmetric theories,
hep-ph/9707278, baloney.
[89] D. Evans, J.W. Moat, G. Kleppe, and R.P. Woodard, Phys. Rev. D 43,
499 (1991); G. Kleppe and R.P. Woodard, Phys. Lett. B 253, 331 (1991);
Nucl. Phys. B388, 81 (1992); G. Kleppe, Phys. Lett. B 256, 431 (1991).
[90] S.P. Martin and M.T. Vaughn, Phys. Lett. B 318, 331 (1993).
[91] I. Jack, D.R.T. Jones and K.L. Roberts, Z. Phys. C 62, 161 (1994) and
Z. Phys. C 63, 151 (1994).
[92] K. Inoue, A. Kakuto, H. Komatsu and H. Takeshita, Prog. Theor. Phys.
68, 927 (1982) and 71, 413 (1984); N. K. Falck Z. Phys. C 30, 247 (1986).
[93] V. Barger, M.S. Berger, and P.Ohmann, Phys. Rev. D 47, 1093 (1993).
References 265
[94] S.P. Martin and M.T. Vaughn, Phys. Rev. D 50, 2282 (1994); Y. Yamada,
Phys. Rev. D 50, 3537 (1994); I. Jack and D.R.T. Jones, Phys. Lett. B
333, 372 (1994); I. Jack, D.R.T. Jones, S.P. Martin, M.T. Vaughn and
Y. Yamada, Phys. Rev. D 50, 5481 (1994).
[95] P.M. Ferreira, I. Jack, D.R.T. Jones, Phys. Lett. B 387, 80 (1996)
[96] A. Salam and J. Strathdee, Phys. Rev. D 11, 1521 (1975); M.T. Grisaru,
W. Siegel and M. Rocek, Nucl. Phys. B159, 429 (1979).
[97] L.E. Ib a nez and G.G. Ross, Phys. Lett. B 110, 215 (1982); L.E. Ib a nez,
Phys. Lett. B 118, 73 (1982); J. Ellis, D.V. Nanopoulos and K. Tamvakis,
Phys. Lett. B 121, 123 (1983); L. Alvarez-Gaume, J. Polchinski, and
M. Wise, Nucl. Phys. B221, 495 (1983).
[98] J.F. Gunion and H.E. Haber, Nucl. Phys. B272, 1 (1986); Nucl. Phys.
B278, 449 (1986); Nucl. Phys. B307, 445 (1988). (E: Nucl. Phys. B402,
567 (1993)).
[99] J.F. Gunion, H.E. Haber, G.L. Kane and S. Dawson, The Higgs Hunters
Guide Addison-Wesley 1991, errata: hep-ph/9302272.
[100] K. Inoue, A. Kakuto, H. Komatsu and S. Takeshita, Prog. Theor. Phys.
67, 1889 (1982); R.A.Flores and M. Sher Ann. Phys. (NY) 148, 95, 1983.
[101] H.E. Haber and R. Hemping, Phys. Rev. Lett. 66, 1815 (1991); Y. Okada,
M. Yamaguchi and T. Yanagida, Prog. Theor. Phys. 85, 1 (1991),
Phys. Lett. B 262, 54 (1991); J. Ellis, G. Ridol and F. Zwirner,
Phys. Lett. B 257, 83 (1991), Phys. Lett. B 262, 477 (1991).
[102] G. Gamberini, G. Ridol and F. Zwirner, Nucl. Phys. B331, 331 (1990);
R. Barbieri, M.Frigeni and F. Caravaglio, Phys. Lett. B 258, 167 (1991);
A. Yamada, Phys. Lett. B 263, 233 (1991) and Z. Phys. C 61, 247
(1994); J.R. Espinosa and M. Quiros, Phys. Lett. B 266, 389 (1991);
A. Brignole, Phys. Lett. B 281, 284 (1992); M. Drees and M.M. Nojiri,
Phys. Rev. D 45, 2482 (1992) and Nucl. Phys. B369, 54 (1992); H.E. Haber
and R. Hemping, Phys. Rev. D 48, 4280 (1993); P.H. Chankowski,
S. Pokorski and J. Rosiek, Phys. Lett. B 274, 191 (1992) and Nucl. Phys.
B423, 437 (1994); R. Hemping and A.H. Hoang, Phys. Lett. B 331, 99
(1994); J. Kodaira, Y. Yasui and K. Sasaki, Phys. Rev. D 50, 7035 (1994);
J.A. Casas, J.R. Espinosa, M. Quiros and A. Riotto, Nucl. Phys. B436,
3 (1995) [E: Nucl. Phys. B439, 466 (1995)]; M. Carena, M. Quir os and
C. Wagner, Nucl. Phys. B461, 407 (1996).
[103] See G.L. Kane, C. Kolda and J.D. Wells, Phys. Rev. Lett. 70, 2686 (1993);
J.R. Espinosa and M. Quir os, Phys. Lett. B 302, 51 (1993), and references
therein.
[104] R. Tarrach, Nucl. Phys. B183, 384 (1980); H. Gray, D.J. Broadhurst,
W. Grafe and K. Schilcher, Z. Phys. C 48, 673 (1990); H. Arason et al.,
Phys. Rev. D 46, 3945 (1992).
[105] L.E. Ib a nez and C. Lopez, Phys. Lett. B 126, 54 (1983); H. Arason et al.,
Phys. Rev. Lett. 67, 2933 (1991); V. Barger, M.S. Berger and P. Ohmann,
266 References
Phys. Rev. D 47, 1093 (1993); P. Langacker and N. Polonsky, Phys. Rev. D
49, 1454 (1994); P. Ramond, R.G. Roberts, G.G. Ross, Nucl. Phys. B406,
19 (1993); M. Carena, S. Pokorski and C. Wagner, Nucl. Phys. B406, 59
(1993); G. Anderson et al, Phys. Rev. D 49, 3660 (1994).
[106] L.J. Hall, R. Rattazzi and U. Sarid, Phys. Rev. D 50, 7048 (1994);
M. Carena, M. Olechowski, S. Pokorski and C.E.M. Wagner, Nucl. Phys.
B426, 269 (1994); R. Hemping, Phys. Rev. D 49, 6168 (1994);
R. Rattazzi and U. Sarid, Phys. Rev. D 53, 1553 (1996).
[107] D. Pierce and A. Papadopoulos, Phys. Rev. D 50, 565 (1994); Nucl. Phys.
B430, 278 (1994); D. Pierce, J.A. Bagger, K. Matchev, and R.-J. Zhang,
Nucl. Phys. B491, 3 (1997).
[108] J. Fr`ere, D.R.T. Jones and S. Raby, Nucl. Phys. B222, 11 (1983);
M. Claudson, L.J. Hall, and I. Hinchlie, Nucl. Phys. B228, 501 (1983);
J.A. Casas, A. Lleyda, C. Mu noz, Nucl. Phys. B471, 3 (1996). T. Falk,
K.A. Olive, L. Roszkowski, and M. Srednicki Phys. Lett. B 367, 183 (1996).
H. Baer, M. Brhlik and D.J. Casta no, Phys. Rev. D 54, 6944 (1996). For
a review, see J.A. Casas, hep-ph/9707475, baloney.
[109] A. Kusenko, P. Langacker and G. Segre, Phys. Rev. D 54, 5824 (1996).
[110] A. Bartl, H. Fraas and W. Majerotto, Z. Phys. C 30, 411 (1986); Z. Phys. C
41, 475 (1988); Nucl. Phys. B278, 1 (1986); A. Bartl, H. Fraas, W.
Majerotto and B. M osslacher, Z. Phys. C 55, 257 (1992). For large
tan results, see H. Baer, C.-h. Chen, M. Drees, F. Paige and X. Tata,
Phys. Rev. Lett. 79, 986 (1997).
[111] H. Baer et al., Intl. J. Mod. Phys. A4, 4111 (1989).
[112] H.E. Haber and D. Wyler, Nucl. Phys. B323, 267 (1989); S. Ambrosanio
and B. Mele, Phys. Rev. D 55, 1399 (1997); S. Ambrosanio et al.,
Phys. Rev. D 55, 1372 (1997) and references therein.
[113] H. Baer et al., Phys. Lett. B 161, 175 (1985); G. Gamberini, Z. Phys. C 30,
605 (1986); H.A. Baer, V. Barger, D. Karatas and X. Tata, Phys. Rev. D
36, 96 (1987); R.M. Barnett, J.F. Gunion amd H.A. Haber, Phys. Rev. D
37, 1892 (1988).
[114] K. Hikasa and M. Kobayashi, Phys. Rev. D 36, 724 (1987).
[115] J. Ellis, J. L. Lopez, D.V. Nanopoulos Phys. Lett. B 394, 354 (1997);
J. L. Lopez, D.V. Nanopoulos Phys. Rev. D 55, 4450 (1997).
[116] M. Boulware and D. Finnell, Phys. Rev. D 44, 2054 (1991); G. Altarelli,
R. Barbieri and F. Caravaglios, Phys. Lett. B 314, 357 (1993); J.D. Wells,
C. Kolda and G.L. Kane Phys. Lett. B 338, 219 (1994); G. Kane, R. Stuart,
and J.D. Wells, Phys. Lett. B 354, 350 (1995); D. Garcia and J. Sola,
Phys. Lett. B 357, 349 (1995); J. Erler and P. Langacker, Phys. Rev. D
52, 441 (1995); X. Wang, J. Lopez and D.V. Nanopoulos, Phys. Rev. D 52,
4116 (1995); P. Chankowski and S. Pokorski, Nucl. Phys. B475, 3 (1996).
[117] J. Bagger, U. Nauenberg, X. Tata, and A. White, Summary of the
Supersymmetry Working Group, 1996 DPF/DPB Summer Study on
References 267
New Directions for High-Energy Physics (Snowmass 96), hep-ph/9612359;
J. Amundson et al., Report of the Snowmass Supersymmetry Theory
Working Group, hep-ph/9609374.
[118] T. Tsukamoto et al., Phys. Rev. D 51, 3153 (1995); H. Baer, R. Munroe
and X. Tata, Phys. Rev. D 54, 6735 (1996).
[119] F. del Aguila and L. Ametller, Phys. Lett. B 261, 326 (1991); H. Baer, C-
H. Chen, F. Paige and X. Tata, Phys. Rev. D 49, 3283 (1994). Phys. Rev. D
52, 2746 (1995).
[120] P. Harrison and C. Llewellyn-Smith, Nucl. Phys. B213, 223 (1983);
G. Kane and J.L. Leveille, Phys. Lett. B 112, 227 (1982); S. Dawson,
E. Eichten and C. Quigg, Phys. Rev. D 31, 1581 (1985); H. Baer and
X. Tata, Phys. Lett. B 160, 159 (1985). Signicant next-to-leading order
corrections have been computed by W. Beenakker, R. Hopker, M. Spira
and P.M. Zerwas, Phys. Rev. Lett. 74, 2905 (1995); Z. Phys. C 69, 163
(1995); Nucl. Phys. B492, 51 (1997).
[121] V. Barger, Y. Keung and R.J.N. Phillips, Phys. Rev. Lett. 55, 166 (1985);
R.M. Barnett, J.F. Gunion, and H.E. Haber, Phys. Lett. B 315, 349 (1993);
H. Baer, X. Tata and J. Woodside, Phys. Rev. D 41, 906 (1990).
[122] R. Arnowitt and P. Nath, Mod. Phys. Lett. A2, 331, (1987); H. Baer and
X. Tata, Phys. Rev. D 47, 2739 (1993); H. Baer, C. Kao and X. Tata
Phys. Rev. D 48, 5175 (1993); T. Kamon, J. Lopez, P. McIntyre and
J.T. White, Phys. Rev. D 50, 5676 (1994); H. Baer, C.-h. Chen, C. Kao and
X. Tata, Phys. Rev. D 52, 1565 (1995); S. Mrenna, G.L. Kane, G.D. Kribs
and J.D. Wells, Phys. Rev. D 53, 1168 (1996).
[123] H. Baer, C-H. Chen, F. Paige and X. Tata, Phys. Rev. D 52, 2746 (1995).
[124] D.O. Caldwell et al., Phys. Rev. Lett. 61, 510 (1988) and Phys. Rev. Lett.
65, 1305 (1990); D. Reuner et al., Phys. Lett. B 255, 143 (1991); M. Mori
et al. (The Kamiokande Collaboration), Phys. Rev. D 48, 5505 (1993).
[125] For reviews, see G. Jungman, M. Kamionkowski and K. Griest, Phys. Rep.
267, 195 (1996); M. Drees, Recent developments in Dark Matter
Physics, hep-ph/9703260; J.D. Wells, Mass density of neutralino dark
matter, hep-ph/9708285, baloney.
[126] L.E. Ib a nez and G. Ross, Nucl. Phys. B368, 3 (1992).
[127] D.J. Casta no and S.P. Martin, Phys. Lett. B 340, 67 (1994).
[128] C. Aulakh and R. Mohapatra, Phys. Lett. B 119, 136 (1983); G.G. Ross
and J.W.F. Valle, Phys. Lett. B 151, 375 (1985); J. Ellis et al.,
Phys. Lett. B 150, 142 (1985); D. Comelli, A. Masiero, M. Pietroni, and
A. Riotto, Phys. Lett. B 324, 397 (1994).
[129] A. Masiero and J.W.F. Valle, Phys. Lett. B 251, 273 (1990); J.C. Romao,
C.A. Santos and J.W.F. Valle, Phys. Lett. B 288, 311 (1992).
[130] H.P. Nilles, M. Srednicki and D. Wyler, Phys. Lett. B 120, 346 (1983);
J. Ellis et al., Phys. Rev. D 39, 844 (1989).
268 References
[131] See, for example, U. Ellwanger, M. Rausch de Traubenberg, and
C.A. Savoy, Phys. Lett. B 315, 331 (1993), Nucl. Phys. B492, 21 (1997);
S.A. Abel, S. Sarkar and I.B. Whittingham, Nucl. Phys. B392, 83 (1993);
F. Franke, H. Fraas and A. Bartl, Phys. Lett. B 336, 415 (1994); S.F. King
and P.L. White, Phys. Rev. D 52, 4183 (1995);
[132] M. Drees, Phys. Lett. B 181, 279 (1986); J.S. Hagelin and S. Kelley,
Nucl. Phys. B342, 85 (1990); A.E. Faraggi, J.S. Hagelin, S. Kelley,
and D.V. Nanopoulos, Phys. Rev. D 45, 3272 (1992); Y. Kawamura
and M. Tanaka, Prog. Theor. Phys. 91, 949 (1994); Y. Kawamura,
H. Murayama and M. Yamaguchi, Phys. Lett. B 324, 52 (1994);
Phys. Rev. D 51, 1337 (1995); H.-C. Cheng and L.J. Hall, Phys. Rev. D
51, 5289 (1995); C. Kolda and S.P. Martin, Phys. Rev. D 53, 3871 (1996);
T. Gherghetta, T. Kaeding and G.L. Kane, hep-ph/9701343.
[133] A. Brignole, L.E. Ib a nez and C. Mu noz, Nucl. Phys. B422, 125 (1994),
[erratum Nucl. Phys. B436, 747 (1995)]; K. Choi, J.E. Kim and H.P. Nilles,
Phys. Rev. Lett. 73, 1758 (1994); K. Choi, J.E. Kim and G.T. Park,
Nucl. Phys. B442, 3 (1995). See also N.C. Tsamis and R.P. Woodard,
Phys. Lett. B 301, 351 (1993); Nucl. Phys. B474, 235 (1996); Annals
Phys. 253, 1 (1996); Annals Phys. 267, 145 (1997), and references therein,
for an exposition of nonperturbative infrared quantum gravitational eects
on the eective cosmological constant. This work implies that it is quite
unlikely that requiring the tree-level vacuum energy to vanish is correct
or meaningful. Moreover, naive supergravity or superstring predictions for
the vacuum energy need not have any relevance to the question of whether
the observed cosmological constant is suciently small.
Index
a (scalar trilinear) terms, 4
A
0
(pseudo-scalar Higgs boson), 4
Aeck-Dine-Seiberg (ADS) superpo-
tential, 4
algebra, supersymmetry, 4
alignment mechanism for avoiding
FCNCs, 4
analyticity, 4
angular momentum operator, 4
anomaly cancellation argument for
two Higgs doublets, 4
anomaly supermultiplet, 4
anomaly-mediated supersymmetry
breaking (AMSB), 4
anti-chiral supereld, 4
auxiliary eld, 4, 66
D term, 4
F term, 4
axino, 4
axion, 4
b (scalar bilinear) term, 4
Baker-Campbell-Haussdorf formula, 4
baryogenesis, 4
baryon triality, 4
angle of Higgs sector, 4
beta functions, 4
bino, 4
Casimir invariant, quadratic, 4
central charges, 4
charge, supersymmetry, 4
charginos, 4
chiral superelds, 4
chiral supermultiplets, 4
component elds, 4
in the MSSM, 4
cold dark matter, 4
Coleman-Mandula theorem, 4
Coleman-Weinberg eective potential,
4
complex conjugation of fermion bilin-
ears, 4
component elds, 4
of chiral superelds, 4
of general superelds, 4
of vector superelds, 4
conventions, 4
cosmological constant, 4
covariant derivative, 4
gauge, 4
spinor, 4
CP violation, 4
CP-even Higgs scalars, 4
CP-odd Higgs scalars, 4
current, supersymmetry, 4
D term, 4
D-term contribution to scalar poten-
tial, 78
D-at directions, 4
dark matter, 4
direct detection, 4
indirect detection, 4
relic density, 4
269
270 Index
decays (sublist of various types), 4
decoupling, 4
dilaton, 4
dilaton dominance, 4
dimensional analysis, 4
dimensional reduction (DRED), 4
dimensional reduction with minimal
subtraction (DR), 4
dimensional regularization (DREG), 4
Dirac matrices, 4
discrete gauge symmetry, 4
discrete symmetry
anomaly cancellation
requirements, 4
baryon triality, 4
R-parity, 4
displaced vertex signals in collider
experiments, 4
dotted indices, 4
dynamical supersymmetry breaking, 4
Dynkin index, 4
e
+
e
colliders, 4
E
6
grand unied group, 4
eective potential, 4
electroweak symmetry breaking, 4
epsilon scalars, 4
extended supersymmetry, 4
F term, 4
F-term contribution to scalar poten-
tial, 78
F-at directions, 4
Fayet-Iliopoulos D term, 4
Fayet-Iliopoulos mechanism, 4
fermion mass matrix, 4
Feynman rules (sublist of various), 4
Fierz identities, 4
xed points, 4
at directions, 4
avor changing neutral currents (FC-
NCs), 4
gamma matrices, 4
gauge coupling unication, 4
gauge kinetic function, 4
gauge supereld, 4
gauge supermultiplet, 4
component elds, 4
of the MSSM, 4
gauge-mediated supersymmetry
breaking (GMSB), 4
gaugino, 4
gaugino condensation, 4
gaugino mass unication, 4
Gell-Mann matrices, 4
global (rigid) supersymmetry, 4
gluino, 4
goldstino, 4
Grand Unied Theories (GUTs), 4
Grassmann numbers, 4
gravitino, 4
gravity-mediated supersymmetry
breaking, 4
H
, 4
H
0
, 4
h
0
, 4
Haag-Lopuszanski-Sohnius theorem, 4
hadron colliders, 4
Hamiltonian, 4
helicity, 4
HERA ep collider, 4
hidden sector, 4
hierarchy problem, 4
Higgs bosons (sublist), 4
higgsino, 4
highly-ionizing tracks, 4
holomorphy, 4
hypercharge, weak, 4
indices, gauge, 4
indices, spinor, 4
indices, vector, 4
infrared-stable xed points, 4
instantons, 4
integration in superspace, 4
isospin, weak, 4
Jacobi identity, 4
jets plus missing energy signals, 4
K
0
K
0
mixing, 4
Kahler function, 4
Index 271
Kahler potential, 4
kinetic supermultiplet, 4
Lagrangians, 4
Large Hadron Collider (LHC), 4
LEP e
+
e
collider, 4
lepton colliders, 4
lepton superelds, 4
lightest supersymmetric particle
(LSP), 4
like-charge dilepton signals at hadron
colliders, 4
linear e
+
e
collider, 4
linear superelds, 4
local supersymmetry, 4
M-theory, 4
macroscopic decay lengths, 4
Majorana fermions, 4
Majoron model, 4
mass sum rules, 4
matter parity, 4
messengers, 4
minimal supergravity, 4
Minimal Supersymmetric Standard
Model (MSSM), 4
missing energy signals at lepton collid-
ers, 4
missing transverse energy signals at
hadron colliders, 4
momentum cuto, 4
momentum operator, 4
MSSM, 4
problem, 4
term, 4
muon decay, 4
naturalness, 4
neutralinos, 4
next-to-lightest supersymmetric parti-
cle (NLSP), 4
next-to-minimal supersymmetric
standard model (NMSSM), 4
no-go theorems, 4
no-scale model, 4
Noether current, 4
Noethers procedure, 4
non-linear realizations, 4
non-renormalization theorem, 4
ORaifeartaigh model, 4
on-shell symmetries, 4
parity, 4
Pauli matrices (see sigma matrices), 4
Peccei-Quinn symmetry, 4
photino, 4
Planck mass, 4
pole mass, 4
Polonyi model, 4
potential, scalar, 4
pp colliders, 4
pp colliders, 4
proton decay, 4
quadratic Casimir invariant, 4
quadratic divergences, 4
quark superelds, 4
quartic scalar couplings, 4
R-current, 4
R-parity, 4
R-parity violation, 4
R-symmetry, 4
Ramond-Neveu-Schwarz model, 4
regularization, 4
renormalization group equations, 4
rigid (global) supersymmetry, 4
running mass, 4
same-charge dilepton signals, 4
sbottoms, 4
scalar couplings, cubic, 4
scalar couplings, quartic, 4
scalar couplings, to fermions, 4
scalar couplings, to gauge bosons, 4
scalar couplings, to gauginos, 4
scalar supereld, 4
scalar supermultiplet, 4
searches for supersymmetry (sublist of
types of searches), 4
selectrons, 4
sfermions, 4
matrices, 4
272 Index
denitions, 4
identities, 4
sleptons, 4
smuons, 4
sneutrinos, 4
SO(10) grand unied group, 4
soft masses, 4
soft supersymmetry breaking, 4
soft terms, 4
sparticle spectrum, 4
sparticles, 4
spin, 4
spinor derivative, 4
spinor, 2-component, 4
spinor, 4-component, 4
spinor, conventions, 4
spinor, Dirac, 4
spinor, Majorana, 4
spinor, Weyl, 4
spontaneous breaking, 4
3-2 model, 4
dynamical models, 4
Fayet-Iliopoulos model, 4
ORaifertaigh model, 4
of electroweak symmetry, 4
spurion method, 4
squark, 4
stau, 4
stop, 4
stress tensor, 4
structure constants, 4
SU(5) grand unied group, 4
sum rules, 4
super-Higgs eect, 4
supercharge, 4
superconformal xed points, 4
superconformal symmetry, 4
supercurrent, 4, 68, 78
coupling to goldstino, 4
supermultiplet, 4
supereld, 4
supergraphs, 4
supergravity, 4
supermultiplet, 4
superpartner, 4
superpotential, 4, 72
superspace, 4
superstrings, 4
supersymmetry algebra, 4
supersymmetry generators, 4
supertrace, 4
Tevatron pp collider, 4
transformation, gauge, 4
transformation, supersymmetry, 4
trilepton signals at hadron colliders, 4
triviality bound, 4
undotted indices, 4
unication of gauge couplings, 4
unitarity gauge, 4
vacuum, 4
vacuum energy density, 4
van der Waerden notation, 4
vector supermultiplet, 4
visible sector, 4
weakly interacting massive particles
(WIMPs), 4
Wess-Zumino gauge, 4
Wess-Zumino model, 4, 64
Wess-Zumino multiplet, 4
Weyl spinors, 4
Wilsonian lagrangian, 4
wino, 4
Witten index, 4
Yang-Mills theory, supersymmetric, 4
Yukawa couplings, 4
bottom, 4
in general superpotential, 4
tau, 4
top, 4
top, importance for electroweak
symmetry breaking, 4
zino, 4

Supersymmetry: Herbi K. Dreiner Howard E. Haber Stephen P. Martin September 22, 2004

Uploaded by

Copyright:

Available Formats

Supersymmetry: Herbi K. Dreiner Howard E. Haber Stephen P. Martin September 22, 2004

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Supersymmetry: Herbi K. Dreiner Howard E. Haber Stephen P. Martin September 22, 2004

Uploaded by

Copyright:

Available Formats

What are two-component and four-component spinors?

What are two-component and four-component spinors?

What are the main components of the Standard Model?

What are the main components of the Standard Model?

Supersymmetry

Herbi K. Dreiner Howard E. Haber Stephen P. Martin

, the spacetime coordinates, x

which governs the transformation of a covariant

. One then easily derives:

. The set of all possible

is an antisymmetric 4 4 matrix, i.e., s

) = (x) , spin 0 , (1.5)

(x) , spin 1 . (1.6)

are (nite-dimensional) irreducible matrix representations

, that satises the commutation relations of the SL(2,C) Lie

is non-negative and either an integer or half-integer.

+ 1). The dimension of the

, where is given by eq. (1.2)

is a constant four-vector. For a eld of spin s, the transformation

a) and redened the dummy variable x

by removing the prime. The Poincare algebra is obtained by considering

are the dierential operators

. Thus, the Poincare algebra consists

which obey the following commutation

include an implicit factor of

, whereas the spin

depends non-trivially on and (except for the case of spin zero,

under Lorentz transformations.

). The rst one is P

= 0. The second Casimir operator is w

= 0. It is simplest to choose a frame in which

P[ ,= 0). Thus, can be used to label states. In

P[) and corresponds to the

(x), and transforms

are also matrix representations of the same dimension. For n > 2,

(x), ( = 1, 2), which

, so the distinction is somewhat pedantic. Nevertheless, it is

is related by a similarity transformation to (M

are numerically equal, but the dotted and undotted

have the same transformation law, as do

. Hence, we may equate them:

= , this convention can

and relabeled the dummy indices.

is a Lorentz transformation. Moreover, one can check

. Likewise, for the same four-

are hermitian matrices, which is expressed by the

transforms as a Lorentz vector.

is a Lorentz scalar. Under a Lorentz transformation,

transform according to eqs. (1.6), (1.35) and (1.41),

is a Lorentz scalar if the following condition

is a Lorentz scalar, we obtain

are easily evaluated:

, for the (0,

( = 0, 1, 2, 3). However, in constructing the bilinear covariants,

= 0 is similar. One consequence of this result is

by a total divergence. Hence, eq. (1.96) is a

(x). This corresponds

. Under this transformation, the kinetic energy term is

M). To compute the values of the (non-negative real)