The use of NMR methods for conformational studies of nucleic acids
Sybren S. Wijmenga*, Bernd N.M. van Buuren
Umea? University, Department of Medical Biochemistry and Biophysics, S 901 87 Umea?, Sweden
Received 10 July 1997
Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
0022-2860/98/$19.00 q 1998 Published by Elsevier Science B.V. All rights reserved
PII S0079-6565(97)00023-X
* Corresponding author. Tel: +46 9078 6500; fax: +46 9013 6310;
e-mail: sybren@indigo.chem.umu.se
Contents
1. Introduction .................................288
2. RNA and DNA synthesis and purification . . . ......................290
3. Nomenclature .................................291
4. Distances .................................291
4.1. Overview of short distances and their general characteristics . . . . . ...........292
4.2. Overview of structurally important intra-nucleotide distances . . . . . ...........294
4.3. Overview of structurally important sequential and cross-strand distances ...........295
4.4. Derivation of distances from NOESY spectra and structure characterization using distances . . . 295
4.5. Conclusion .................................304
5. J-couplings .................................305
5.1.
1
J
HC
- and
1
J
CC
-couplings . .............................307
5.2. Overview of J-couplings in the bases . . . ......................307
5.3. Ribose sugar .................................307
5.4. Determination of the b torsion angle. . . . ......................310
5.5. Determination of the e torsion angle . . . . ......................312
5.6. Torsion angle g and H59 and H599 stereo specific assignment. . . . . ...........314
5.7. x torison angle and
3
J
HC
sugar to base . . . ......................316
5.8. Measurement of homo- and heteronuclear J-coupling constants . . . . ...........316
5.8.1. Determination of J-couplings from the shape of the signal . . . ...........316
5.8.2. Determination of J-couplings from E.COSY patterns . . . . . ...........318
5.8.2.1. Homonuclear E.COSY . . ......................318
5.8.2.2. Heteronuclear E.COSY . . ......................319
5.8.2.2.1. Determination of J
HP
- and J
CP
-couplings . ...........319
5.8.2.2.2. Determination of J
HC
-couplings . . . . . ...........320
5.8.2.2.3. Determination of J
HH
-couplings via HCC-E.COSY spectra . . . . . 320
5.8.3. Determination of J-couplings from signal intensities . . . . . ...........321
5.8.3.1. Determination of J
HH
-couplings from homonuclear (H,H) TOCSY transfer . . . 321
5.8.3.2. Determination of J-couplings from heteronuclear experiments . .......321
Keywords: NMR; Conformational studies; Nucleic acids; RNA; DNA; Labeling; Assignment; Structure
1. Introduction
Nucleic acid molecules play a central role in cell
biological processes. DNA’s main role is to act as the
carrier of genetic information. Furthermore, DNA is
transcribed into RNA by a carefully regulated process,
and it is duplicated on cell division. RNA’s main role
is to communicate the genetic information for protein
synthesis to the ribosomes. RNA is, however, very
versatile. It can also take on the role of DNA as the
carrier of genetic information, and it can function as
an enzyme. It has even been hypothesized that early in
evolution, life was based entirely on RNA (see, for
example, Ref. [1]). All these different processes
require different structures. The basic structural
elements of RNA and DNA are well established, i.e.
DNA forms a B-helix, while RNA may be either
single-stranded or may form an A-type helix.
However, the alternate RNA and DNA structures,
associated with many of the different processes
mentioned above, are less well known. Only since
the early 1990s have technological advances in
sample preparation, such as isotope labeling and
developments in crystallization, made such structural
data available, and allowed the structural basis of the
biological functions of DNA and RNA to be
addressed.
In the past ten years, we have witnessed an
explosion in the number of crystal and solution struc-
tures of proteins determined by X-ray crystallography
and NMR, respectively. In comparison, the increase in
the number of nucleic acid structures determined by
either X-ray or NMR has been relatively small. This
can be attributed to the difficulties encountered when
trying to crystallize nucleic acids for detailed X-ray
analysis and to the problem of extensive resonance
6. Chemical shifts .................................322
6.1. Chemical shifts; qualitative aspects . . . . ......................324
6.2. Theory .................................324
6.3.
1
H shifts .................................327
6.4. Structurally important
1
H shifts . . . . . . ......................327
6.5.
15
N and
13
C shifts in DNA and RNA . . . ......................329
6.6.
31
P shifts .................................330
7. Assignment methods .................................330
7.1. Assignment without isotope labeling. . . . ......................330
7.2. Assignment with isotope labeling . . . . . ......................335
7.2.1. NOE-based correlation . . . . . . ......................336
7.2.2. Through-bond correlation . . . . . ......................337
7.2.2.1. Coherence transfer functions ......................337
7.2.2.2. Through-bond amino/imino to non-exchangeable proton correlation . . . . . . 337
7.2.2.3. Through-bond H2-H8 correlation . . ..................344
7.2.2.4. Through-bond base 1 sugar correlation . . . . . . ...........347
7.2.2.5. Through-bond sugar correlation. . . ..................355
7.2.2.6. Through-bond sequential backbone assignment . . . ...........357
7.2.3. X-filter techniques . .............................361
8. Relaxation and dynamics . . . .............................363
9. Calculation of structures . . . .............................375
10. Prospects for larger systems . .............................378
11. Conclusions .................................382
Acknowledgments .................................383
References .................................383
288 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
overlap in NMR spectra of these compounds.
Advances in crystallization techniques have in recent
years resulted in the structure determination of the
RNA hammerhead enzyme [2,3], one of the two fold-
ing domains of the group I intron self-splicing RNA
[4,5], and a few RNA–protein complexes [6,7]. In
addition, the structures of several DNA duplexes, as
well as of a DNA quadruplex, have been determined
by means of X-ray crystallography [8]. However,
despite the two X-ray structures of the hammerhead,
the catalytic mechanism of this ribozyme has not yet
been clarified. Crystal packing forces sometimes
affect RNA or DNA structures. For example, there
is still no crystal structure available of a DNA or
RNA hairpin, since these tend to crystallize in bio-
logically less relevant extended duplex structures
[9,10]. Solution structures, which can be determined
via NMR, are therefore particularly important in DNA
and RNA structural biology as a complement to
crystallography. In addition, nucleic acids often con-
tain regions of higher conformational flexibility.
NMR is particularly suited for identifying such
regions.
In the field of NMR of nucleic acids, advances were
made in the 1980s with the introduction of synthetic
methods for preparing well defined DNA sequences.
This development also made it possible to produce
well defined RNA sequences from DNA templates
by enzymatic synthesis via T7-polymerase. These
developments led to the determination of several solu-
tion DNA and RNA hairpin structures, from which the
main folding principles of hairpin loops could be
determined [11,12]. In addition, these developments
led to the determination of the solution structure of a
DNA quadruplex [13,14] and solution structures of
triple helix molecules [15], as well as to the determi-
nation of a new DNA multi-stranded fold, the C-motif
[16]. Still, the overlap encountered in NMR spectra
limited the size of the molecules that could be studied
and the detail by which the structures could be deter-
mined. In the early 1990s, methods were developed to
produce
13
Cor
15
N enriched RNAs, via enzymatic
synthesis, in quantities large enough for NMR studies.
This possibility enabled more detailed studies of bio-
logically relevant RNA sequences and folds. Initial
NMR studies have been performed and methods
have been developed for assignment of resonances
of
13
C and
15
N labeled RNAs. The direct result of
this is more reliable resonance assignments. In addi-
tion, more extensive constraints lists could be
obtained for subsequent structure determination. In
the past two years a number of RNA structures with
a size up to 30 to 40 nucleotides have been published
[17–30], together with RNA–peptide complexes
[31–36] and an RNA–protein complex of total mol-
ecular weight 22 kDa [37,38]. These studies have also
made it clear that the upper size limit for RNAs which
can be studied by NMR lies around 30 nucleotides
when uniform labeling is employed, a size limit
considerably below that for proteins.
Only quite recently has it become possible to enrich
DNA with
13
C and
15
N isotopes. Zimmer and Crothers
[39] demonstrated that DNA can be enriched via an
enzymatic approach, while even more recently
13
C
and
15
N labeled DNA phosphoramidites have become
available [40], so that
13
C and
15
N enriched DNAs can
now also be obtained via chemical synthesis. It is to be
expected that these possibilities will also have an
effect on NMR structural studies of DNA of larger
size. Larger DNA systems, such as those forming
three- and four-way junctions, have already been
studied [41], but these have not yet produced detailed
solution structures, again due to the extensive signal
overlap (see, for example, Refs. [42–44]). It is note-
worthy that Altona and co-workers used an extremely
interesting approach to achieve the assignments in
their studies of four-way junctions [43,44]. They
used well-determined hairpins as building blocks for
the larger four-way and three-way junctions they
studied. This made it possible to obtain resonance
assignment in very crowded spectra. The future will
reveal whether combining this approach with labeling
will allow an extension to larger systems, both for
RNAs and DNAs.
Naturally, as isotope enriched nucleic acid mol-
ecules are now used in NMR studies, we will pay
particular attention in this review to the related
NMR methods. Various other reviews [36,45–48]
have recently appeared, but they have focused gener-
ally on specific aspects of the NMR of isotope
enriched RNA. We try here to provide a broad over-
view, covering as much as possible of the various
aspects that come into play when performing NMR
structural studies of both DNA and RNA molecules.
Furthermore, the field is developing rapidly and new
aspects have been published since the appearance of
289S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
these reviews. For example, a complete overview of
J-couplings in the nucleic acid bases has been pub-
lished [49] and proton structural chemical shifts have
been calculated and compared with experimental data
[50]. We will incorporate these aspects into this
review, together with a detailed description and criti-
cal evaluation of the present state of the art NMR
methodology for determining the structure of labeled
DNA and RNA molecules. This review is divided into
eleven sections. In Section 2,
13
C and
15
N labeling, as
well as other labeling methods, are described, albeit
briefly, in view of the quite detailed descriptions that
have recently appeared. The IUPAC nomenclature is
introduced in Section 3. In Section 4, we present an
overview of the distances found in DNA and RNA
molecules and discuss their relevance for NMR
structural studies. Section 5 gives an overview of all
homonuclear and heteronuclear J-couplings and
describes their structural dependencies. We also give
an overview of the NMR methods that are or can be
used to determine these J-couplings. In Section 6, we
describe the chemical shifts and discuss their use both
for assignment purposes and as structural parameters.
Section 7 forms the heart of this review, and describes
and discusses in detail the currently available methods
for assignment both in unlabeled and
13
C and
15
N
labeled compounds. Section 8 concentrates on a
description of relaxation. Isotope enrichment has
opened up the way for detailed relaxation studies in
the field of proteins. Such relaxation studies are still
scarce in the field of nucleic acids. We place relaxa-
tion studies on nucleic acids in the context of parallel
studies on proteins, and give an overview of the
theoretical background. In Section 9 we briefly
describe the actual structure determination from
NMR data. In Section 10, we discuss the prospects
for extension of NMR studies to larger systems and
we attempt to draw some conclusions in Section 11.
2. RNA and DNA synthesis and purification
Two strategies are available for preparing large
quantities of DNA and RNA of defined sequence
and high purity for NMR studies: (1) chemical
synthesis by the phosphoramidite method, and (2)
enzymatic synthesis of RNAs via T7-polymerase
and of DNAs via DNA-polymerase. For RNA,
enzymatic synthesis is the usual method of prepara-
tion; although chemical synthesis is possible it is still
prohibitively expensive when large quantities are
required. Chemical synthesis is the usual approach
for the preparation of DNAs of defined sequence.
Zimmer and Crother [39] have shown how large quan-
tities of DNA can be made via enzymatic synthesis,
thus demonstrating the feasibility of
13
C and
15
N
labeling of DNA via this method. However,
13
C and
15
N labeled DNA phosphoramidites have also recently
become commercially available, so that labeled
DNAs can conveniently be prepared via chemical
synthesis [40]. We refer the reader to the original
papers or reviews for the detailed protocols and for
discussions of the relative merits of the various
approaches [36,45,47,48,51–59]. Here we will
concentrate on some general and qualitative aspects.
A certain amount of confusing terminology has
crept into the literature with regard to labeling. We
will use the following terms: uniform labeling, when
every atom of a certain type in the molecule is
enriched; residue-type-specific labeling, if all residues
of a certain type (e.g. all Adenines) in the molecule
are enriched; site-specific labeling, if a particular resi-
due or a number of particular residues are enriched,
e.g. A10; partial labeling, if the labeling of a certain
residue is on, say, C19 only. In order to indicate that
labeling is not 100%, we add the percentage after the
word labeling.
For the enzymatic synthesis of RNA, a DNA tem-
plate is required from which the RNA is transcribed
by T7-polymerase using NTPs as building blocks.
The
13
C and/or
15
N and/or
2
H labeled NTPs are
usually obtained from E. coli cells, which are grown
on either
13
C enriched glucose, and/or
15
N enriched
ammonium chloride. The RNA isolated from the cells
is broken down to
13
C and/or
15
N labeled NMPs,
which are subsequently converted into NTPs. This
method thus allows uniformly labeled RNAs to be
made, or residue-type-specific labeled RNA when
the in vitro transcription occurs on a mixture of
labeled and unlabeled NTPs. The method can in prin-
ciple easily be extended to achieve deuteration or par-
tial labeling. For example, Michnicka et al. [60] have
suggested partial
13
C labeling using acetate as a car-
bon source; most recently Nikonowicz et al. [57] have
demonstrated uniform
2
H/
15
N labeling via the
enzymatic approach. It is more complicated to
290 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
achieve site-specific labeling via the enzymatic
method (see, for example, Ref. [36]). Site-specific
labeling, on the other hand, can quite easily be
achieved via chemical synthesis. This would be the
method of choice for the preparation of labeled DNA
oligonucleotides.
3. Nomenclature
For atom numbering and torsion angle definitions in
nucleic acids we will follow the IUPAC/IUB guide-
lines [61]. Accordingly, the chemical structure and
atom numbering of the five common bases, the
pyrimidines C, T and U, and the purines G and A,
are given in Fig. 1(A), and of the b-D-(deoxy) riboses
in Fig. 1(B), which also indicates the torsion angles in
the sugar–phosphate backbone (a, b, g, d, ? and z)
and the glycosidic torsion angle x. Their definitions
are: O39–P–O59–C59 (a), P–O59–C59–C49 (b),
O59–C59–C49–C39 (g), C59–C49–C39–O39 (d),
C49–C39–O39–P (?), C39–O39–P–O59 (z), O49–
C19–N1–C2 (x (Py)), and O49–C19–N9–C4
(x (Pu)). Furthermore, it gives a designation of the
chain direction and the unit numbering in a poly-
nucleotide chain. Fig. 1(C) shows the two most
common conformations of the b-D-(deoxy)ribose
sugar ring, the C29-endo (
2
E) and the C39-endo (
3
E)
conformers, also referred to as S-type and N-type
conformers, respectively.
To describe the distances we will use the shorthand
notation introduced by Wijmenga et al. [62]. In this
notation the distance between the protons l and r is
given by:
d
i
(l; r) for intranucleotide distances, e:g: d
i
(8; 29)
d
s
(l; r) for internucleotide distances, e:g: d
s
(19;6)
Here, l corresponds to the proton in the 59-nucleotide
unit and r with the proton in the 39-nucleotide unit. For
methyl protons the l or r is indicated by the letter
M. To indicate that the distance is between H39 in
the 59-nucleotide and H5 or the methyl protons in
the 39-nucleotide we use d
s
(39;5/M). Cross-strand
distances are defined as:
d
ci
(l; r) for distances within a base pair,
e:g: d
ci
(T 1 NH3; A 1 NH
2
6)
d
cs
(l; r)
p
for distances between adjacent base paired
nucleotides, e:g: d
cs
(19;2)
39
The symbols NH and NH
2
represent imino and amino
protons, respectively. The directionality in the
sequential cross-strand distances has to be indicated.
Consider two adjacent base pairs, and define the 59-
and 39-nucleotides. It can be easily seen that d
cs
is
either between two 39-nucleotides or between two
59-nucleotides. This is indicated by the subscript p.
Alternatively, when two protons l and r do not fall
in any of the above categories the distance is indicated
by:
d(l; r) for long 1 range internucleotide distances,
e:g: d(T2 1 NH3; A9 1 NH
2
6)
Here, T2-NH3 indicates the imino proton of Thymine
number 2 and A9-NH
2
6 indicates the amino group of
Adenine number 9.
4. Distances
Proton to proton distances are essential parameters
for the three-dimensional structure determination of
biomolecules by NMR. Since only short distances
( , 5–6 A
?
) can be obtained by NMR, it is difficult to
determine global features, such as bending of the
helix. On the other hand, local features can be deter-
mined quite well and most NMR structural studies
have focused on these aspects. Consequently, it is of
paramount importance to have a good overview of the
short distances in the main structural elements, such
as the sugar ring, the bases, the base pairs, etc. and of
how these distances determine the structural features
of those elements. Another aspect is that several of the
short distances do not depend on conformation nor do
they take on well defined values for the two major
helical conformations, A- and B-helices. For this
reason, it is particularly useful to have at hand an
overview of these distances and their characteristics,
so that one can focus on the relevant data for the more
interesting structural aspects.
In the next sections we therefore discuss the short
distances and how they reflect structural characteris-
tics, by first giving a more general overview and
291S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
subsequently going into more detail. Finally, we
discuss their derivation from NOESY spectra and
their use as constraints in simulated annealing
protocols.
4.1. Overview of short distances and their general
characteristics
In Table 1 we have summarized the short distances
Fig. 1. Structure and atom numbering in nucleic acids, according to the IUPAC/IUB guidelines [61], of the five common bases (pyrimidines C, T
and U; purines G and A) (A), and of the b-D-(deoxy)riboses (B and C). (B) also shows the torsion angles in the sugar–phosphate backbone (a, b,
g, d, ? and z) and the glycosidic torsion angle x (the exact definition is given in the text), a designation of the chain 59 to 39 direction and the unit
numbering in a polynucleotide chain. (C) shows the puckering of the two most common b-D-(deoxy)ribose sugar ring conformations, the
C29-endo (or S-type) and the C39-endo (or N-type) conformations.
292 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
( , 5–6 A
?
) and categorized them into two main
groups, intra-nucleotide and inter-nucleotide dis-
tances, with further subdivision to reflect more
detailed conformational characteristics. The inter-
nucleotide distances fall into the two broad groups
of sequential and cross-strand distances involving
non-exchanging protons and exchanging protons,
respectively. The sequential distances involving
non-exchanging protons are again subdivided into
sugar-to-sugar distances, base-to-base distances and
sugar-to-base distances. Within each category their
dependence on conformation is indicated (A to D),
with category A referring to conformation indepen-
dent distances, category B to distances that can vary
by less than 6 0.2 A
?
, and category C to ‘structural’
distances, i.e. distances that convey structural infor-
mation since they can vary by more than 6 0.2 A
?
.
The ‘structural’ distances, category C, are subdivided
into two further categories to indicate their usefulness;
category C9 contains those ‘structural’ distances that
are reasonably well accessible by NMR, and category
D refers to NMR accessible ‘structural’ distances that
Table 1
Overview of short distances per residue
Type % M A B C C9 D
Intra-nucleotide
1. constant 5 3 3 0 0 0 0
2. sugar–sugar 16 10 0 8 2 2 1
3. sugar–59/50 13804400
4. base–sugar 12 7 0 1 6 4 4
sum 46 28 3 13 12 6 5
Inter-nucleotide
I. non-exchangeable
1. sequential sugar–base 20 12 0 0 12 12 12
2. sequential base–base 6 4 0 0 4 4 0
3. sequential sugar–sugar 20 12 0 0 12 2 2
4. cross-strand (3%) (2) 0 0 (2) (2) (2)
sum 46 28 0 0 28 18 14
II. exchangeable (imino/amino)
1. within base pair 2 1 0 0 1 1 0
2. sequential 3 2 0 0 2 2 0
3. cross-strand 3 2 0 0 2 2 2
sum 8500552
Total 100 61 3 13 45 29 20
% 10 5 217 4833
M: measurable distances , 5to6A
?
[62]; A: completely conformation independent distances; B: distances that are conformation independent
within approximately 6 0.2 A
?
; C: ‘structural distances’, i.e. conformation dependent distances with variation . 0.2 A
?
(see text); C9: NMR
accessible ‘structural’ distances; D: NMR accessible ‘structural’ distances that are different in A- and B-helices. The intra-nucleotide
distances: 1. The constant distances d
i
(29;20), d
i
(59;50), d
i
(6;5) and d
i
(6;M); 2. the sugar-to-sugar distances, d
i
(19-49;19-49); they all fall into
group B, except for d
i
(20;49) and d
i
(19;49), which fall into groups C and C9, while group D only contains d
i
(20;49); 3. the distances d
i
(29-49;59/
50); group B contains the distances d
i
(29/20;59/50); group C contains d
i
(39/49;59/50); none of them fall into groups C9 or D; 4. sugar-to-base
distances d
i
(6/8;19-50); they are subdivided according to: group B, d
i
(6/8;49), group C, d
i
(6/8;19-39,59/50), group C9, excluded d
i
(6/8;59/50),
group D, d
i
(6/8;19-39). The distances d
i
(5/M;19-50) are not taken into account since they are larger than 5 A
?
[62]. The inter-nucleotide distances
(considered are the distances to and from Cytosine in a GCG trinucleotide sequence (see Fig. 2). I. Non-exchangeable protons: 1. sequential
sugar-to-base distances, d
s
(19-39;8) and d
s
(19-39;6/5); all of them fall into categories C, C9 and D; 2. base-to-base distances, d
s
(6/5;8) and d
s
(8;6/
5); all of them fall into categories C, C9 and D; 3. sequential sugar-to-sugar distances, d
s
(19-20,49;50), d
s
(50;19-20,49), d
s
(29;39), d
s
(39;29),
d
s
(29;20) and d
s
(29;20); all of them are conformation dependent (category C), but only d
s
(29;39) and d
s
(39;29) are easily accessible and differ
between A-type and B-type helices. II. Inter-nucleotide distances involving exchangeable protons: 1. The distances within a base pair are
d
c
(NH2; NH); this distance depends on conformation (category C), is NMR accessible (C9), but does not differ between A-type and B-type
helices; 2. the sequential distances are d
s
(NH2;NH) and d
s
(NH;NH2); they both fall into category C and C9, but not into category D; 2. the cross-
strand distances are d
c
(NH2;NH2)
59
and d
c
(NH2;NH2)
39
; they fall into category C, C9 and D (however, note that the NH2 resonances of G may
be broadened making them inaccessible for NMR).
293S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
also show differences depending on whether they are
present in an A- or B-type helix.
As can be seen from Table 1 approximately 60
distances per residue can in principle be measured.
The number of distances that are constant within
6 0:2A
?
is rather high. They represent about 26%
of the total number of measurable distances. Their
percentage is even higher for the intra-nucleotide dis-
tances, of which they represent about 57% (16 out of
28). The distances that convey relevant structural
information (in helices) and are reasonably well
accessible by NMR represent less then half (48%) of
the total number of distances, while only 20 are dif-
ferent between A- and B-type helices (33%). Note
also the small number of structurally very important
cross-strand and sequential distances involving
exchanging protons which establish base pairing
(8%), and the small number of cross-strand distances
involving non-exchanging protons (3%). On the other
hand, sequential sugar-to-base and sugar-to-sugar dis-
tances, which are so important for establishing base
stacking and defining the phosphate backbone, are
both relatively large in number (20%). The former
are mostly reasonably well accessible by NMR,
whereas the latter are extremely difficult to establish.
Thus, a rather uneven spread in the short distances is
found through the chemical structure. As a conse-
quence, important structural features such as base
pairing often hinge on the presence of a particular
NOE contact reflecting one short distance.
4.2. Overview of structurally important intra-
nucleotide distances
The intra-nucleotide distances in DNA and RNA
can conveniently be subdivided according to the
categories indicated in Table 1, i.e. (1) conformation
independent distances, (2) distances between sugar
protons, (3) distances between H29/20/39/49 and H59/
50, (4) distances between H19 through H59/50 and base
protons.
1. The conformation independent distances are: the
geminal proton distances, d
i
(29;20) and d
i
(59;50),
of 1.8 A
?
, d
i
(5;6) ( ? 2.45 A
?
) in Cytosine and
Uracyl, and d
i
(6,M) in Thymidine.
2. The distances within the sugar ring are all indepen-
dent of its conformation, except for d
i
(20;49) and
d
i
(19;49). Only d
i
(20;49) differs significantly
between S-type and N-type conformers, with
d
i
(20;49) ? 4.2 A
?
for the S-type conformer
(pseudorotation angle P ? 1608) and d
i
(20;49) ?
2.8 A
?
for the N-type conformer (P ? 108).
Although it is in principle possible to determine
the sugar conformation from the d
i
(20;49) distance,
the accuracy of the determination is limited. The
d
i
(20;49) distance is difficult to determine from
NOE intensities because of spin diffusion effects,
due to the close proximity of the H29 and H20
protons. Also note that in RNA the H20 proton is
absent, so that these sugar distances cannot be used
at all to determine the puckering. The distance
d
i
(19;49) is almost identical for N-type and S-
type sugars (3.4 A
?
), but has a lower value for
sugar rings with an intermediate pseudorotation
angle, d
i
(19;49) ? 2.6 A
?
for P ? 908. Here again
spin diffusion can adversely affect the accuracy
distance of the determination.
3. The distances d
i
(39;59/50) depend only weakly on
the sugar ring conformation, but significantly on
the g torsion angle, while the distances d
i
(49;59/
50) only depend on the g torsion angle. The
distances d
i
(39/49;59/50) therefore allow the deter-
mination of the torsion angle [62,63]. This can be
done in conjunction with relevant J-couplings (see
Section 5). Given an uncertainty in these distances
of 6 0.2 A
?
, they do not discriminate well between
the different ranges of the g torsion angle, in par-
ticular when an equilibrium between g
t
and g
t
rotamers exists. The distances d
i
(29/20;59/50)
depend on both the sugar puckering and the torsion
angle g, but their dependence is weak, and they are
of the order of 5 to 6 A
?
[62].
4. The distance between H19 and H8/6, d
i
(19;6/8),
depends only on the glycosidic torsion angle x.It
thus provides a means for determining this torsion
angle. However, the maximum difference in the
values of d
i
(19;6/8) for x in the syn domain (x ?
608) and in the anti domain (x ? 2408) is only
about 1.2 A
?
. Given that in practice the uncertainty
in the distance determination from NOE data is of
the order of 6 0.2 A
?
to 6 0.5 A
?
,itistobe
expected that the use of d
i
(19;6/8) is a rather impre-
cise means to determine the x torsion angle. The
other sugar proton to base proton distances, d
i
(29/
20/39/49;6/8), depend on both the sugar puckering
294 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
and the x torsion angle. The distance d
i
(49;6/8)
does not convey useful structural information
since its dependence on these parameters is weak
[62]. The distances d
i
(29/20/39;6/8) are on the other
hand quite useful. Each of these distances defines
the x torsion angle quite well, because of their
quite strong dependence on the torsion angle x
[62]. Their dependence on the sugar puckering is
rather weak, in particular for the distances d
i
(29/
20;6/8) [62]. Despite this weak dependence on the
sugar puckering, a concerted use of d
i
(29/20/39;6/8)
makes it possible to determine the percentage N-
type or S-type pucker, but to achieve a reasonable
level of precision requires that the uncertainty in
their values should be less than 6 0.5 A
?
[62]. We
finally note that Lane and co-workers [64] have shown
the improved reliability of sugar pucker determination
using these distances together with J-couplings.
The H59/H50 to base proton distances, d
i
(59/50;6/8),
depend on three torsion angles, g, d and x. Their
dependence on the sugar pucker (d), and on the
glycosidic torsion angle (x) in the usual anti
domain (180–2408) is weak, but they depend
quite strongly on the g torsion angle. In particular,
for g
t
both d
i
(59; 6/8) and d
i
(50;6/8) are long (3.7
to 4.5 A
?
), while for g
t
the distance d
i
(50;6/8)
becomes short (2.5 to 2.9 A
?
). As has been shown,
with uncertainties in the distance estimates in the
order of 6 0.2 A
?
, they determine quite well the
torsion angle g [62]. The distances d
i
(59/50;6/8)
can be quite useful in NOESY spectra of DNA,
since the related NOE cross peaks do not reside
in a crowded spectral region. This does not hold
true for RNA where these cross peaks overlap with
the other H6/8 to H29/39 NOE cross peaks. On the
other hand, the distances d
i
(39/49;59/50) all
relate to cross peaks in crowded spectral
regions for both DNA and RNA and are thus
difficult to establish.
4.3. Overview of structurally important sequential
and cross-strand distances
Helical conformations form an important part of
nucleic acid structures. We therefore present an over-
view of the distances in the two most commonly
found helix types, A-helices and B-helices. Fig. 2,
reproduced from Wijmenga et al. [62], gives the
sequential distances, d
s
(l;r), and cross-strand
distances, d
ci
(l;r) and d
cs
(l;r), found in A-DNA,
B-DNA and RNA helices.
The cross-strand distances, d
ci
(l;r) and d
cs
(l;r),
involving exchangeable protons are indicative of
base pair formation. The sequential distances involv-
ing either exchanging or non-exchanging protons are
indicative of base stacking. However, only a limited
number depend on the type of helix conformation. In
both A- and B-type helices, short base-to-base dis-
tances, d
s
(6/8/5/M;6/8/5/M), are present, depending
on the sequence. Similarly, all distances involving
exchanging protons are very similar in A- and
B-type helices. The differences occur for the cross-
strand and sequential distances involving H2 protons,
d
cs
(2;19/2)39 and d
s
(2;19), the sequential sugar-to-
base distances, d
s
(29/20/39;6/8/5), and for a number
of sequential sugar-to-sugar distances, d
s
(29/20;59/
50), d
s
(29;39), d
s
(20;20) and d
s
(19;50). Short cross-
strand, as well as sequential H2 to H19 distances,
are present in A-type helices, but absent in B-type
helices. Short sequential H29 to H6/8 distances and
long H20 to H6/8 distances are seen in A-helices,
while in B-helices the reverse is found. The sugar-
to-sugar distances show the following pattern: Short
sequential H29/H20 to H59/50 distances in A-helices,
while in B-helices these distances are long; rather
long, but measurable, sequential H29 to H39 distances
in A-helices, which are over 7 A
?
and thus not measur-
able in B-helices; finally, long ( . 7A
?
) sequential H20
to H29 and H19 to H50 distances in A-helices, which
are relatively short in B-helices. While the distances
involving H2 and the H29/20 to base distances are
quite accessible from NMR spectra, the sugar-to-
sugar distances are difficult to determine since the
sugar proton resonances reside in quite crowded
spectral regions.
4.4. Derivation of distances from NOESY spectra and
structure characterization using distances
We will discuss here the three aspects of NMR
accessible distances that are of particular relevance
for structure determination. First, how precisely can
distances be derived from NOE data? Secondly, how
does this precision affect the precision of the deter-
mined structure? Thirdly, how does the spread and
295S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Fig. 2. Overview of short sequential and inter-strand proton–proton distances for all possible combinations of base stacking in A-DNA (A),
B-DNA (B) and RNA (C). The meaning of the symbols is: 0–2.5 A
?
(thick solid line), 2.5–3.0 A
?
(solid line), 3.0–4.0 A
?
(dashed line), 4.0–5.0 A
?
(dotted line).
296 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Fig. 2. (Continued).
297S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Fig. 2. (Continued).
298 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
number of distance constraints affect the precision of
the determined structure?
The intensities of the cross peaks in a NOESY
spectrum are related to distances between spins via
the relaxation matrix, R:
NOE ? S exp( 1 Rt
m
) ? S(1 1 Rt
m
t
1
2
(Rt
m
)
2
1 …)
(1)
Here t
m
is the mixing time and S is a scaling factor
taken to be equal to 1. The elements R
ij
of the
relaxation matrix are given by (see also Section 8):
R
ij
?
q
r
6
ij
(6J(2q) 1 J(0)) (2)
where q ? g
4
(h/2p)
2
/4, and r
ij
is the distance between
protons i and j. For a rigid isotropically tumbling
molecule, the spectral density function, J(q), can be
written as:
J(q) ?
2
5
t
c
1 t q
2
t
2
c
(3)
so that in the slow tumbling limit (qt q 1) one
obtains:
R
ij
?1
q
r
6
ij
J(0) ?1
q
r
6
ij
t
c
(4)
This is the equation used in most relaxation matrix
calculations (see below). Internal dynamics or confor-
mational flexibility implies that interconversion
between different conformers takes place, with each
conformer having a different set of distances. The
NOE intensity is then derived from the average
relaxation rate, hR
ij
i, which is either proportional to
h1/r
6
i, when the motion is slower than the overall
tumbling, or to h1/r
3
i
2
, when the averaging is faster
then the overall tumbling time [62]. When the internal
motion is of limited scope (libration motions) the
average distance may be approximated by the middle
distance. As a result, fast internal libration motions on
a ps to ns time-scale can be accounted for by using the
Lipari and Szabo approach [65,66] by introducing a
scaling factor (S
2
) in Eq. (4) (see Section 8):
R
ij
?1
q
r
6
ij
J(0) ?1
q
r
6
ij
S
2
t
c
(5)
The fast rotation of the methyl protons also leads to a
scaling down of the NOE intensity. Corrections for
fast rotation of the methyl groups and fast internal
motion (Eq. (5)) are generally incorporated into
relaxation matrix programs (see below). More
difficult is the situation for interconversion between
conformers, which have distinctly different proton–
proton distances. An example is the rapid inter-
conversion between N-type and S-type sugars
observed in nucleic acids, which leads to very
different intra-nucleotide sugar-to-base distances. In
this case the average distance derived from NOEs
tends to be heavily biased towards the shorter dis-
tances. To account for the latter effects requires
ensemble averaging, a method which has not been
implemented in relaxation matrix programs mainly
because of the major computational effort involved.
Finally, nucleic acids are not spherically but asym-
metrically shaped, leading to anisotropic instead of
isotropic tumbling. Consequently, the spectral density
function has to be replaced by a more complicated
form (see Section 8). Anisotropic tumbling is not con-
sidered in most relaxation matrix programs (see
below).
For short mixing times only the first term in the
expansion given in Eq. (1) is required. The isolated
spin pair approximation, ISPA, then applies, and the
NOE between protons i and j is proportional to their
distance to the inverse sixth power:
NOE
ij
? C
1
r
6
ij
(6)
Plotting the NOE
11/6
versus known distances,
according to
NOE
1 1=6
ij
? C
1 1=6
r
ij
(7)
allows the determination of the constant C. Distances
can then be calculated via
r
ij
? C
1 1=6
NOE
1 1=6
ij
(8)
For longer mixing times spin diffusion becomes more
effective. As a result NOE intensity is lost in cross
peaks involving spins that are close in space, while
NOE intensity may be gained in cross peaks involving
spins that are relatively far apart. The net effect is that
the NOE
11/6
versus distance curve flattens and can be
described by
NOE
1 1=6
ij
? A t B
1 1=6
r
ij
(9)
299S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
The constants A and B can be determined as before
from known distances. This ‘modified’ ISPA
approach allows the estimation of unknown distances
via
r
ij
? (NOE
1 1=6
ij
1 A)B
1=6
(10)
Ultimately, all NOE intensity will be uniformly
spread out through the network of proton spins, and,
consequently, becomes independent of the distances,
and distance estimates can no longer be made. This
method of accounting for spin diffusion is rather
crude. In effect the method more or less assumes
that each spin pair is surrounded by a uniform network
of other spins, which provide the spin diffusion path-
ways. Although this approach does take into account
that short and long distances are differently affected
by spin diffusion, it does not allow for the fact that
different short distances may be differently affected
by spin diffusion. A rigorously correct approach is to
take the full relaxation matrix into account, since ulti-
mately only the full expression correctly gives the
NOE matrix. The relaxation matrix, R, and thereby
the distances, can be calculated from the NOE matrix,
if the complete NOE matrix is known:
R ?
1 ln(NOE)
t
m
(11)
In this way the complete spin network is considered
and spin diffusion can be fully accounted for. One can
now in fact derive from the NOE matrix the confor-
mation of the spin network. In practice, not all NOE
intensities are known. Consequently, one needs to
somehow build up a complete NOE matrix. Several
approaches have been developed for this purpose
[67–74]. These approaches generally proceed as
follows. In the first step a model is chosen, from
which a model NOE matrix is calculated. Next,
those theoretical NOEs for which measured values
are available are replaced by experimental NOEs;
the rest remain at the model values. Thus, a hybrid
model/experimental NOE matrix is constructed which
is complete and from which relaxation rates can be
calculated and distances extracted. In the IRMA [73]
and the MORASS [70] approaches these distances are
immediately used in a further cycle of simulated
annealing refinement to obtain new model distances
and then a new NOE matrix. In the MARDIGRAS
approach [67] the updated set of distances is directly
used to calculate a new model NOE matrix. The whole
procedure of substitution, back-calculation, etc. is
repeated until a good fit between experimental and
calculated NOEs is obtained. In the NO2DI method
[74] ISPA is used to estimate zeroth-order distances.
Subsequently, a relaxation matrix is built of all spins
for which measured NOEs are available. To complete
the relaxation matrix the distances for which no
zeroth-order distances are obtained are set to a large
value. The NOE matrix is then back-calculated and
from the NOE
ij
(calc) a new distance estimate
obtained, r
ij
? r
6
ij
NOE
ij
(calc)=NOE
ij
(exp)
p
. The new
distance is used to calculate the NOE matrix again,
giving a new calculated distance, until NOE
ij
(calc)
and NOE
ij
(exp) are within a certain range (1%).
This process is repeated for each distance, starting
with short distances and working up to the longer
ones, thus obtaining first-order estimates for each
distance. The process can then be repeated until con-
vergence is reached. In the NO2DI method, obviously
the spin network consists of the other spins for which
an experimental NOE was observed. If the number of
experimental NOEs is too sparse the network may
become too sparse, and model distances may be
included. While in the ‘modified’ ISPA approach, uni-
form surrounding is assumed for each spin pair, here
the surrounding of each spin pair does not need to be
uniform and is built up from both estimated and/or
model distances. It turns out that the ultimate results
do not depend strongly on the choice of model [67],
although the best results are obtained with model
distances which are as close as possible to the true
distances [68].
What are the possible sources of error in the ISPA
and relaxation matrix approaches? Most easily
assessed are errors in the distances resulting from
the noise and integration errors in the NOE intensity.
The absolute noise in a spectrum can, with modern
spectrometers, be quite small and varies from 0.01%
to 0.3% depending on the quality of the spectrum
(assuming that the NOE matrix is scaled to 1). In
addition, there is a relative error from peak integration
which amounts to 10–15% of the peak volume [68].
In our studies, peak amplitudes yield good estimates
of the peak volumes, when determined from highly
digitized NOESY spectra. The advantage of using
amplitudes over integration is that errors due to base
line distortion and peak overlap are minimized. We
300 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
found that the peak amplitudes and volumes from a
3D TOCSY–NOESY spectrum agree within 10%. A
second source of possible error is that in practice,
NOESY spectra are recorded with relatively short
relaxation delays (RD) inbetween FID recordings,
i.e. RD , 5 T
1
. This will affect the signal amplitude
from different spins differently, depending on their T
1
relaxation rates. Such effects are generally not
accounted for in the above described programs. A
third, more important issue, is internal dynamics.
Fast internal libration motions on a ps to ns time-
scale can be accounted for via Lipari and Szabo’s
formalism [65,66] by introducing a scaling factor
(S
2
) (see Section 8). The fast rotation of the methyl
protons also leads to a scaling down of the NOE inten-
sity. These corrections are generally incorporated into
the programs discussed above. To account for inter-
conversion between conformers with distinctly
different proton–proton distances requires ensemble
averaging, a method which has not been implemented
in the programs discussed above, mainly because of
the enormous computational effort involved. The
fourth, and major, source of error in the derivation
of distances from NOEs seems to originate from the
fact that the more spin diffusion contributes to the
intensity of a cross peak, the more difficult it becomes
to retrieve the direct contribution, even when relaxa-
tion matrix approaches are used. This can be under-
stood qualitatively from the decreased dependence on
distance, which is found when spin diffusion plays a
role. This has a profound effect on how errors in NOE
intensities translate into errors in the derived dis-
tances. At short mixing times, when ISPA applies,
errors in the NOE intensity lead to strongly damped
errors in the derived distances, because of the inverse
sixth power relationship. On the other hand, when
spin diffusion becomes highly effective, the distance
dependence of the NOEs becomes less strong, and the
error in the NOE translates into much larger errors in
the derived distances.
How accurately can distances be obtained and how
much better does a relaxation matrix approach per-
form as compared to a simple ISPA approach? The
ultimate errors in the distance estimates are nicely
illustrated by the model calculations performed by
van de Ven et al. [74] and Borgias and James [67].
In the calculation of van de Ven et al., the NOEs of a
B-DNA duplex were calculated assuming a mixing
time of 200 ms and a tumbling time of 2 ns. In
addition, 0.1% random noise was added to the NOE
intensities and 20% of the NOEs were deleted. Figure
7(a) of van de Ven et al. [74] gives the initial ISPA
estimates of the distances, while Fig. 7(b) and (c)
represent distance estimates after five cycles of
NO2DI refinement using the ISPA as starting values
or a wrong model as source for starting values of the
distances, respectively. Van de Ven et al. found that
the initial distance estimates, obtained via the ISPA
approach, have in fact reasonably small error bounds;
the distances are correct within 6 0.3 A
?
at 2.0 A
?
up to
6 0.6 A
?
at 5 A
?
(as judged from Fig. 7(a) in Ref. [74].
As expected the shorter distances are being calculated
too large and the longer distances calculated too short
on average. A ‘modified’ ISPA estimate of the dis-
tances removes this bias and leads to narrower error
bounds of ,60.2 A
?
at 2.0 A
?
to 6 0.6 A
?
at 5 A
?
(again as judged from Fig. 7(a) in Ref. [74]). Most
importantly, we note that all distances are correctly
estimated within these bounds. Van de Ven et al. find
that after five rounds of relaxation matrix refinement
the shorter distances are now correctly estimated
(within 6 0.2 A
?
for distances up to 3.0 A
?
), while
for larger distances the majority of the estimated dis-
tances have errors up to 6 0.7 A
?
at 5 A
?
, which is the
same order of magnitude as the error when using the
ISPA estimates. Similar error bounds for ISPA and
relaxation matrix refined distances have been obtained
by Borgias and James using MARDIGRAS [67] for
NOE data with 0.3% error. Thus, it seems that rather
similar error bounds are obtained for ‘modified’ ISPA
derived and relaxation matrix refined distances, at
least under these conditions. In view of these results
we have investigated the derivation of distances from
NOE data for the case of a 3D TOCSY–NOESY spec-
trum of a 12-mer RNA duplex [75–77]. The NOE
mixing time was 200 ms, so that spin diffusion does
affect the NOEs, albeit to a limited extent. The NOE
intensities derived from the 3D TOCSY–NOESY
spectrum were corrected for T
2
effects and missing
TOCSY peaks. Distances were then estimated using
a ‘modified’ ISPA approach; the NOE
11/6
intensities
for known distances were plotted along the y axis
against the known distances along the x axis; the
known distances covered the complete 1.8 to 5.0 A
?
range. Calibration was done by fitting a straight line
through the data according to Eq. (9). The known
301S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
distances could be determined with an accuracy of
about 6 0.3 A
?
for short distances (1.8–3.0 A
?
), of
6 0.4 A
?
for distances between 3 and 4 A
?
, and
of 6 0:5A
?
for distances between 4.0 and 5.0 A
?
[77].
When considering the relaxation matrix refined
distances obtained by van de Ven et al. [74], it is
disconcerting to find that, in addition to the majority
of correctly determined distances, a number are cal-
culated to be too large, with errors ranging up to 5 A
?
,
and in the extreme case they are said to have
‘exploded’. These ‘exploded’ distances not only
occur for large true distances, but also for true
distances as small as 2.6 A
?
. These wrong distances
probably result from the difficulties in estimating the
spin diffusion contribution to the total NOE intensity.
This may be due to errors in the NOE intensities or
errors in the model distances or a combination of these
effects. To account in the calculations for the first
source of error the following method has been imple-
mented in MARDIGRAS. The calculations are
repeated at least 30 times while randomly varying
NOE intensities with a certain noise level for each
NOE 2D dataset (e.g. 0.002–0.003 for absolute inten-
sity and 5–10% integration error) [68]. The final
distances are taken as the average; for the error one
can conservatively take the maximum and minimum
distance values or, as Schmitz and James suggest [68],
some intermediate range, leading to error ranges of
6 0:25–0.4 A
?
. This procedure does not however
identify the erroneous distances with certainty and
errors resulting from a wrong starting model may
still evolve. Different starting models could be
employed for estimating unknown fixed distances.
Schmitz and James [68] note that a starting model
closer to the true model improves the estimates. The
advantage in this respect of MORASS and IRMA is
that restrained MD structure calculations are done in
each iteration step, thus improving the model
estimates in each iteration step, thus reducing this
possible source of error. On the other hand, we have
found that errors in the distances are very difficult to
detect in the restrained MD calculations (XPLOR,
[78]). The structure is often adjusted to compensate
for the erroneous distance constraint in such a way
that distance violations show up not at the site of the
erroneous constraint but elsewhere in the structure.
Thus, simulated annealing does not provide a certain
means to identify erroneous distance constraints. In
fact, it is our experience that it is of prime importance
for simulated annealing to derive distance constraints
which are assured to be correct. How to identify the
types of errors? One simple approach, which we have
not seen suggested in the literature, however, is to
compare the relaxation matrix estimates with ISPA
or ‘modified’ ISPA derived distances. Since the latter
are assuredly correct, within albeit somewhat larger
error bounds, large deviations directly pinpoint
‘suspect’ distance estimates.
In summary, the relaxation matrix approaches
result for most distances (especially the short
distances , 3.0 A
?
) in better estimates than those
obtained with a ‘modified’ ISPA approach. On the
other hand, the relaxation matrix approaches may
also give distances that are wrong by a large amount.
These erroneous distances are difficult to pinpoint. In
contrast, the simulations by van de Ven et al. [74] or
Borgias and James [67] indicate that the simple ISPA
(or a modified ISPA approach when applied) gives
distance estimates that are assuredly correct, albeit
with slightly larger error bounds. The advantage of
the ISPA method is its simplicity, which allows one
to easily identify those distances that are likely to be
affected most by spin diffusion. In addition, one can
experimentally minimize spin diffusion effects by
choosing relatively short mixing times when record-
ing the NOESY spectra. Furthermore, several factors
that may affect the distance calculations are not well
accounted for in the relaxation matrix equations.
Libration motions may lead to S
2
values varying
from values of 0.8 to 0.6, thereby affecting NOEs
proportionally. When considering that spin diffusion
leads to a lower effective distance dependence of the
NOEs, e.g. from the sixth to the fourth inverse power,
this results in relative errors of 5–8% in the distances.
Nucleic acids do not behave as isotropic tumbling
molecules, but are rather asymmetric. This also
affects the NOE calculations (see Section 8). Thus,
minimal error bounds of at least 10% corresponding
to 6 0.2 A
?
at2A
?
are called for. These issues provide
another incentive for sticking to somewhat conserva-
tive distance constraints, which may as well be
derived from the less precise ‘modified’ ISPA
approach. We suggest therefore the use of conserva-
tive error bounds for the non-exchanging protons of:
6 0.2–0.3 A
?
up to 2.6 A
?
, 6 0.3 A
?
from 2.6 A
?
to 3.3
A
?
, 6 0.4 A
?
from 3.2 A
?
to 4.0 A
?
, 6 0.5 A
?
from 4.0 A
?
302 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
to 5.0 A
?
, and .60.6 A
?
for distances . 5.0 A
?
. For
exchanging protons, one could use 0 to 5 A
?
, but see
Schmidt and James [68]. Deriving distance constraints
via a ‘modified’ ISPA with conservative error bounds
as described above prevents on the one hand putting
too tight constraints on the long distances and still
providing reasonable estimates for short distances,
while on the other hand it also prevents choosing
unduly loose error bounds.
Are conservative or ‘loose’ distance constraints
detrimental to the structure determination compared
to ‘tight’ distance constraints? The answer seems to be
that they are not. This has recently been elegantly
demonstrated by Allain and Varani [79] using NOE
data from the RNA hammerhead as a model system.
They classified distances loosely into three categories,
i.e. 0–2.9 A
?
(corresponding to d
est
6 0.55 A
?
if one
considers that the shortest distance is 1.8 A
?
any-
how), 2.9–3.5 A
?
(d
est
6 0.3 A
?
), and 3.5–5.0 A
?
(d
est
6
0.75 A
?
). These distance ranges correspond in fact
quite closely to the ones mentioned above. These
ranges are used for all protons except the exchange-
able or strongly overlapping protons, for which the
bounds were set to 0–5.0 A
?
. When tightening the
constraints to ranges of 1.8–2.4 A
?
(d
est
6 0.3 A
?
),
1.8–2.9 A
?
(d
est
6 0.55 A
?
), 2.5–3.5 A
?
(d
est
6 0.5 A
?
),
and 3.5–5.0 A
?
(d
est
6 0.75 A
?
), the precision of the
structures does not significantly improve, i.e. the
rmsd of the final structures improves by 0.2 A
?
at
best (Table 2 of Allain and Varani [79]). Van de
Ven and Hilbers [80] (see also Hilbers et al. [81])
have also investigated how the precision of the dis-
tance data affects the precision of the structures.
Employing all d
s
(6/8/19/29/20/39;6/8/5) distances and
assuming that they are determined with a precision of
6 0.25 A
?
, they find that in a G to G dinucleotide step
the twist is determined with a range of about 428 and
the rise with a range of roughly 1.5 A
?
. For the
unrealistically high precision of 6 0.1 A
?
, they find
similar values, namely, 408 for the twist and 1.2 A
?
for
the rise. The increased tightness of the distance ranges
does not considerably improve the precision of the
helix parameters. Van de Ven and Hilbers [80] and
Wijmenga et al. [62] have investigated how precisely
torsion angles within a nucleotide unit are determined
by distances given a 6 0.2 A
?
uncertainty in the dis-
tances. It is found that a combination of d
i
(6;29/20/39)
distances quite accurately determine the glycosidic
torsion angles and also allows the determination of
the fraction of N- or S-pucker of the sugar pucker,
while the distance d
i
(6;39) by itself does not determine
well the fraction of N- or S-pucker. As pointed out in
Ref. [62] uncertainties .60.5 A
?
make it virtually
impossible to determine the fraction of N- or
S-pucker. The glycosidic torsion angle remains well
determined by the distance d
i
(6/8;29) even with an
uncertainty as large as 6 0.7 A
?
. With an uncertainty
of 6 0.2 A
?
in the distance d
i
(39;59/50) the fraction of
g
t
rotamer is defined, but rather loosely. On the other
hand, a combination of d
i
(6;59/50) is sufficient to
define this fraction rather well [62]. Uncertainties
.60.5 A
?
make it virtually impossible to determine
the fraction of g
t
rotamer (see Fig. 3.1 in Ref. [62]).
The same error bounds for ISPA estimates derived
above start from about 6 0.2 A
?
for distances around
2.0 A
?
and broaden to about 6 0.5 A
?
at 5.0 A
?
; the
error in the ‘loose’ as well as the ‘tight’ constraint
sets used by Allain and Varani also increases from
roughly 6 0.3–0.5 A
?
for short distances between
2.0 and 3.0 A
?
to 6 0.7 A
?
for distances between 3.5
and 5.0 A
?
. These error bounds correspond to uncer-
tainties which fall somewhere in between the 6 0.2 A
?
and 6 0.5 A
?
range. Thus, for both sets of constraints
the error bounds on the intra-residue distances are
sufficiently narrow that the glycosidic torsion angle
and sugar pucker, as well as the g
t
rotamer, can be
determined reasonably well. The conclusion that one
is forced to draw from these investigations is that
rather conservative distance constraints (see above)
can be employed without detrimental effect on the
ultimate precision of the structure.
How does the number of constraints and their
spread through the molecular structure affect the pre-
cision of the derived structure? As can be seen from
Table 1, the spread in space of the NMR accessible
distances in a helix is rather uneven. A large propor-
tion of the total number of short distances are intra-
residue (48%), a considerable percentage of which do
not confer structural information. In addition, the
most easily measured sequential distances, the group
of H8/H6 to sugar proton distances, constitute a large
part of the sequential distances (20%). Unfortunately,
these sequential sugar-to-base distances only define
one side of the base plane. The same applies for the
base-to-base distances (6%). A large number of the
sequential distances involve sequential sugar-to-sugar
303S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
distances (20%). They define the backbone quite well,
but are difficult to measure since the protons involved
reside in very crowded regions of the NMR spectra.
Very few detectable cross-strand distances exist,
except for those involving imino and amino protons
and H2 to H19 protons in an A-helix. Furthermore, in
contrast to a-helices in proteins where one finds NH(i)
to NH(i t 3) distances, in nucleic acids no such long-
range distances are found. This rather uneven spread
through the chemical structure of the NMR detectable
distances, together with the lack of long-range
distances, is expected to detrimentally affect the
precision by which the helical structure can be deter-
mined and thereby the precision of torsion angles and/
or stacking and helical parameters such as twist and
rise. How extensive should the constraint set be in
order to be able to define structural elements
reasonably well?
We find in the case of a 12-mer RNA duplex with a
tandem GA base pair that very high precision is
achieved when about 30 ‘loose’ distance constraints
per residue are employed; the constraint set included,
besides the easily accessible constraints, sequential
sugar-to-sugar distances and torsion angle constraints
for d and g, but not for a, b, ? and z [76,77]. The pair-
wise rmsd of the center part of the duplex consisting
of four base pairs which included the tandem GA base
pairs was found to be 0.6 A
?
. This highly precise struc-
ture allowed the determination of the base pairing of
the tandem GA base pair (no base pair constraints
were applied for the tandem GA part of the structure).
This suggests that the inclusion of the sequential
sugar-to-sugar distances is quite important.
Gorenstein and co-workers [71] also found that
inclusion of sequential sugar-to-sugar distances
gives considerable improvement in the definition of
the derived structure. Van de Ven and Hilbers [80]
(see also Ref. [81]) have investigated how the spread
of the distance constraints through the molecule
affects the precision of the derived structure.
When using distance constraints with a precision
of 6 0.25 A
?
they find that in a G to G dinucleotide
step the twist is determined with a range of 428 and the
rise with a range of roughly 1.5 A
?
(see above). Better
results are obtained when the dinucleotide is C to G;
the range for the twist is then 228 and for the rise
0.8 A
?
. This improved definition results from a better
spatial spread of the protons in the CG step as
compared to that in the GG step. In the CG dinucleo-
tide one finds distances involving H6 and H5 protons,
whereas in the GG dinucleotide only distances involv-
ing H8 protons are found. Similarly, the systematic
study of Allain and Varani [79], using the hammer-
head as a model system, shows that inclusion of loose
constraints (see above) involving exchangeable pro-
tons greatly increases the definition, while adding
sugar-to-sugar constraints and torsion angles gives
further improvement but to a lesser extent. For the
so-called realistic ‘loose’ constraint set they find
approximate ranges of 108 for the twist and 0.4 A
?
for the rise (see above for the distance ranges; this
set contains no sugar-to-sugar constraints but includes
constraints involving exchangeable protons apart
from sugar-to-base and base-to-base constraints).
These ranges are narrower than those found by van
de Ven and Hilbers, which again demonstrates the
advantageous effect of a larger number of constraints
on the precision. Note that van de Ven and Hilbers
considered isolated G to G or C to G dinucleotide
steps, while in the study of Allain and Varani the
bases are part of a helix, which limits the allowed
conformational space. This also demonstrates that
the number of constraints required to obtain high
structural definition is quite context dependent. In
loop regions, and bulge regions where no base pair
constraints are present, either the precision will be
lower with the same number of constraints or a larger
number of constraints is required to achieve the same
high level of precision as in a helix. Finally, we note
that the in-depth studies of James and coworkers
[68,69] and of Luxon and Gorenstein [72] on DNA
duplexes show that using a large number ( . 10/resi-
due) of rather precise constraints (see above), which
are well spread through the molecular structure, leads
to highly defined NMR derived structures. It appears
to be possible to distinguish sequence dependent
structural effects.
4.5. Conclusion
For the precision of NMR derived structures, the
number of (structurally relevant) distance constraints
is more important than the precision of constraints.
The number of structurally relevant constraints
should be around 15 to 30 per residue. They should
as far as possible be uniformly spread through the
304 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
chemical structure of the molecule, i.e. rather than
including a large number from one category (e.g.
intra-nucleotide), constraints should be derived in
such a way that each category is represented (intra-
nucleotide and its subdivisions or sequential and
cross-strand inter-nucleotide and its subdivisions,
see Table 1). In view of the uncertainties in the dis-
tance determination from NOE data, precise distances
are prone to error and rather conservative distance
ranges should be used for structure refinement.
5. J-couplings
With the development of
13
C and
15
N labeling
techniques for nucleic acids a large number of
new heteronuclear J-couplings have become
accessible. Knowledge of the values of J-coupling
constants is essential for the rational design and
application of resonance assignment techniques
based on through-bond coherence transfer. Further-
more, these couplings provide important additional
parameters for the determination of torsion angles.
In this section we present an overview of these
heteronuclear J-couplings as well as of proton–
proton J-coupling constants, their relation to
torsion angles, and describe NMR techniques for
determining their values. The J-couplings are
discussed according to their torsion angle
dependence, except for the J-couplings in the bases
and the
1
J
CH
-couplings which are discussed
separately.
Table 2
One-bond coupling constants
1
J
HC
(Hz) in the base and sugar moieties of
13
C/
15
N labeled 59-AMP, 59-GMP, 59-UMP and 59-CMP
1
J
CH
59-AMP
a
59-GMP
b
59-UMP
a
59-CMP
b
H8 215.9 216.0
H2 203.2
H6 184.9 184.1
H5 178.7 175.8
H19 166.5 166.1 170.2 169.5
H29
c
150.9 151.4 151.7 151.2
H39
c
152.6 152.0 152.2 151.3
H49 150.9 151.1 150.3 148.9
H59
d
148.3 146.9
e
148.3 147.2
f
H50
d
145.2 146.9
e
144.9 145.0
f
a
average values derived from 1D
1
H and
13
C spectra recorded at 500 and 700 MHz.
b
average values derived from 1D
13
C spectrum recorded at 600 MHz.
c
in cyclic nucleotides, which have a defined sugar conformation, one finds for N- and S-type sugars
1
J
H29C29
values of 158.4 Hz and 149.2 Hz,
respectively, and for
1
J
H39C39
values of 149.6 Hz and 156.8 Hz, respectively. The error in the coupling constant values is given to be about
6 0:3 Hz [49].
d
for 59-AMP, 59-GMP, 59-UMP and 59-CMP the g torsion angle is a mixture of rotamers [49] 2209}; in the cyclic nucleotides, rhpApAi,
rhpGp(dG)i, the g torsion angle is in a defined conformation, gauche t ; the
1
J
H59C59
and
1
J
H50C59
are then 151 Hz and 141 Hz, respectively.
e
near isochronous resonances H59 and H50; value given is from proton-coupled 1D
13
C spectrum.
f
derived from 1D
1
H spectrum at 400 MHz. Table adapted from Ref. [49].
Table 3
One-bond coupling constants
1
J
CC
(Hz) in the base and sugar moieties of
13
C/
15
N labeled 59-AMP, 59-GMP, 59-UMP and 59-CMP
1
J
CC
59-AMP
a
59-GMP
a
59-UMP
a
59-CMP
a
C19-C29 42.2 42.6 43.0 43.4
C29-C39 38.1 37.8 37.8 37.4
C39-C49 38.0 38.3 38.5 38.7
C49-C59 42.3 42.9 42.9 43.0
a
average values derived from 1D
13
C spectra recorded at 500 and 700 MHz (59-AMP and 59-UMP) from 1D
13
C spectrum recorded at
600 MHz (59-GMP and 59-CMP). The error in the coupling constant values is given to be about 6 0.4 Hz [49]. Table adapted from Ref. [49].
305S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Fig. 3.
306 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
5.1.
1
J
HC
- and
1
J
CC
-couplings
The
1
J
CH
and
1
J
CC
-couplings are given in Table 2
and Table 3, respectively [49]. To a large extent these
J-couplings are independent of conformation, but
have often indicated a small conformation dependent
variation. For example, the
1
J
C19H19
-couplings have
been found to convey x torsion angle information
[49,82]. The J-couplings,
1
J
H39C39
and
1
J
H29C29
, depend
on the puckering [49]. Finally, the
1
J
H59C59
and
1
J
H50C59
have distinctly different values, namely 151 Hz and
141 Hz, respectively, when the torsion angle g is in
the gauche t domain; these differences are retained
when the rotamer population is not purely gauche t
[49]. Thus, the latter J-couplings can be used for
stereospecific assignment of the H59 and H50 reso-
nances. These conformational dependencies are
extremely useful, especially since the
1
J
HxCx
-
couplings are very easy to measure. They will be
discussed in more detail below in the appropriate
sections.
5.2. Overview of J-couplings in the bases
A complete overview of the homonuclear and
heteronuclear J-couplings, found in the bases A, G,
U and C, involving both newly measured as well as
redetermined J-couplings, has recently been pub-
lished by Ippel et al. [49]. They are summarized
here in Fig. 3 (and Tables 2 and 3). As can be appre-
ciated from Fig. 3 (and also Tables 2 and 3), the
homonuclear and heteronuclear J-couplings in
the bases form a complex network. Consequently,
such a detailed, accurate, and complete set of these
J-couplings such as given here should be of great
value for the rational design and improvement of
through-bond assignment strategies discussed in
Section 7.
5.3. Ribose sugar
Traditionally, the sugar pucker has been
determined mainly from the
3
J
HH
-couplings in the
sugar ring. They can be derived from the generalized
Karplus equation [62,83,84] which is given by:
3
J
HH
? P1 cos
2
f t P2 cos f
t
X
4
i ? 1
(P3 t P4 cos
2
(z
i
f t P5lDx
i
l))Dx
i
t DJ
e12T
Dx
i
? Dx
i, a
t P6
X
3
j ? 1
Dx
ij, b
(13)
Here Dx
i,a
and Dx
ij,b
are the difference in Huggins
electronegativity between hydrogen and the a and b
substituents in the HaS1a(S11b,S12b,S13b)
Fig. 3. From left to right,
1
J-,
2
J- and
3
J-coupling constants (in Hz) in the bases of 59-AMP, 59-GMP, 59-UMP and 59-CMP. This figure is
adapted from a similar one in Ref. [49]. All the given J-coupling values were (re)determined in that study, except those indicated with r1, r2, r3,
r4 and r5, which were taken from Refs. [231,112,259,233,232], respectively. J-couplings for which the assignment was, in Ref. [49], found to be
ambiguous are indicated by the lower case letters a, b, c and d, with: (a) Two equal
n
J
CN
-couplings were found on C6, with a sum value of
14.9 Hz; assigning 7.5 Hz to
1
J
C6N1
leaves ,7.5 Hz for the other J
C6N
-coupling, which can arise either from
3
J
C6N2
or
3
J
C6N3
or
2
J
C6N7
or
3
J
C6N9
.
(b) The C2 resonance exhibits a multiplet pattern consisting of 15 lines (intensity, 1:2:2:2:2:2:3:4:3:2:2:2:2:2:1) in the 1D (
1
H)-decoupled
13
C
spectrum. Since the
3
J
C2C5
has a value of 3.7 Hz, the observed multiplet can be simulated by incorporating four additional J-couplings with
values of approximately 3.9, 7.6, 15.2 and 23 Hz; the 23 Hz coupling can be assigned to
1
J
C2N2
on the basis of a similarly large value for the
corresponding coupling in CMP and AMP; the J-couplings of 7.6 and 15.2 Hz can tentatively be assigned to
1
J
C2N1
and
1
J
C2N3
, respectively, or
the reverse; the smallest of these four J-couplings, i.e. 3.9 Hz, can then be assigned to
3
J
C2N9
, assuming that the four-bond coupling,
4
J
C2N7
,is
undetectably small. (c) Computer simulation of the G C4 resonance multiplet in the 1D (
1
H)-decoupled
13
C spectrum yields two J
CN
-coupling
constants of approximately 20 and 8.5 Hz, respectively. The largest coupling constant can be assigned to
1
J
C4N9
in analogy to that in 59-AMP;
the J-coupling of 8.5 Hz can be unassigned to either one of the couplings
1
J
C4N3
,
2
J
C4N7,
3
J
C4N1
or
3
J
C4N2
. (d) Computer simulation of the H5
resonance multiplet in the fully coupled 1D
1
H spectrum yielded two more not yet assigned values, namely 2.6 and 4.4 Hz, which can be
attributed to
3
J
H5N1
and
3
J
H5N3
, respectively, or the reverse.
Scheme 1.
307S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
S3a(S31bS32bS33b)-C1-C2-S2a(S21bS22bS23b)
S4a(S41bS42bS43b)Hb fragment; Dx
i,a
, Dx
ij,b
? 1.3
(O), 0.4 (C), 0.85 (N) and 10.05 (P) (Scheme 1).
The parameter z
i
is t1or11 depending on the
orientation of the substituent as indicated. The
parameters P1toP6 depend on the number of non-
hydrogen a substituents:
# a substituents P1 P2 P3 P4 P5 P6
2 13.89 10.96 1.02 13.40 14.9 0.24
3 13.22 10.99 0.87 12.46 19.9 0.00
4 13.24 10.91 0.53 12.41 15.5 0.19
overall 13.70 10.73 0.56 12.47 16.9 0.14
From this generalized Karplus equation it follows
that the
3
J
HH
-couplings in the ribose ring are given by:
3
J
HH
? 13:22 cos
2
f 1 0:99 cos f
t
X
4
i ? 1
(0:87 1 2:46 cos
2
(z
i
f t 19:1lDx
i
l))Dx
i
tDJ e14T
Dx
i
? Dx
i, a
(15)
(for J
1929
, J
1920
, J
2939
and J
2039
)
3
J
3949
? 13:24 cos
2
f 1 0:91 cos f
t
X
4
i ? 1
(0:53 1 2:41 cos
2
(z
i
f t 15:5lDx
i
l))Dx
i
t DJ e16T
Dx
i
? Dx
i, a
1 0:19
X
3
j ? 1
Dx
ij, b
(17)
In addition, a correction for the so-called Barfield
transmission effect is required for J
1920
and J
2939
:
DJ
1920
?12:0 cos
2
(P 1 234) 1448 , P , 3248 (18)
DJ
2939
?10:5 cos
2
(P 1 288) 1808 , P , 3608;
08 , P , 188 e19T
The torsion angle f in the above equations can be
related to the pseudorotation angle P and pucker
amplitude J
m
via [62]:
f
1929
? 121:4 t 1:03J
m
cos(P 1 144) (20)
f
1920
? 0:9 t 1:03J
m
cos(P 1 144)
f
2939
? 2:4 t 1:03J
m
cos(P)
f
2039
? 121:9 t 1:03J
m
cos(P)
f
3949
? 124:0 t 1:03J
m
cos(P t 144)
The values of
3
J
1929
to
3
J
3949
are given as a function
of pseudorotation angle P and pucker amplitude J
m
in
Fig. 4(A)–(E). Ribose sugar rings are not rigid but
interconvert rapidly between N- and S-type conforma-
tions. These sugar puckering states are described by
their pseudorotation angles and amplitudes, namely,
P
N
and J
m
N
, for the N-state, and P
S
and J
m
S
, for the
S-state. The relative population can be found via the
fraction S-conformer, pS. The parameters J
m
N
and J
m
S
tend to be fairly constant and range from 32 to
40, while P
N
ranges from 1 108 to 208 and P
S
lies
between 1208 and 1808. The
3
J
HH
-couplings in the
ring are then the weighted average of the
3
J
HH
-cou-
plings in the two conformations:
3
J
av
ab
? (1 1 pS)·
3
J
N
ab
t pS·
3
J
S
ab
(21)
Thus,
3
J
N
ab
and
3
J
S
ab
depend on P and J
m
in their
respective states. Although P
N
, P
S
, J
m
N
and J
m
S
can
be determined in principle from the complete set of
ribose
3
J
HH
-couplings, in practice, these values are
assumed to be known for the least populated state,
and to correspond to the middle values of P and J
m
in the N- or S-puckered state, i.e. P
N
? 10 and J
m
N
?
35, P
S
? 160 and J
m
S
? 35. For the most highly
populated state it is thus possible to derive P and
J
m
. We finally note that a straightforward check
whether the assumption of an equilibrium between
N-type and S-type conformers applies, is to plot
3
J
3949
against
3
J
1929
. These two J-couplings have a
reverse dependence on the fraction of N-type sugar,
which is incompatible with a P value intermediate
between N-type and S-type sugars [62,80]. The most
commonly used program to derive P, J
m
and pS from
3
J
HH
-couplings is PSEUROT, developed by van den
Hoogen et al. [85]. Alternatively, one can add NOE
information to determine the puckering state [62,64].
308 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Fig. 4. (A)–(E) Contour lines of
3
J
HH
-coupling constants (Hz) in the sugar ring as a function of P
m
and f
m
. The coupling constants were
calculated with the aid of the EOS–Karplus equation and corrected for the Barfield transmission effect (see text). (A)
3
J
1929
, (B)
3
J
1920
, (C)
3
J
2939
,
(D)
3
J
2039
and (E)
3
J
3949
.
309S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Conte et al. [64] use a program that incorporates both
3
J
HH
-couplings and NOE information. Finally, as will
be discussed in more detail in Section 5.3, the
3
J
HH
-
couplings can also be derived from TOCSY data. In
the program developed by van Duynhoven et al. [86]
the puckering state, i.e. P, J
m
and pS, are directly
derived from the (H,H) TOCSY cross peak
intensities.
Thanks to the availability of labeled compounds,
3
J
HC
-couplings can be used as additional parameters
to determine the sugar pucker. Ippel et al. [49] have
recently determined parameters for the Karplus rela-
tions between
3
J
H39C19
,
3
J
H19C39
,
3
J
H29C49
and
3
J
H49C29
and the sugar puckering. Since the ribose ring is in
rapid equilibrium between N- and S-conformers,
the
3
J
HC
-couplings are the weighted average of
the
3
J
HC
-couplings in the two conformations:
3
J
ab
? pS·
3
J
S
ab
t (1 1 pS)·
3
J
N
ab
(22)
Here
3
J
S
ab
and
3
J
N
ab
are the
3
J
HC
-coupling values in the
S- and N-puckered state, respectively, i.e.
3
J
S
H39C19
?
6.3 Hz,
3
J
N
H39C19
? 1.4 Hz,
3
J
S
H29C49
? 0.3 Hz,
3
J
N
H29C49
?
4.2 Hz,
3
J
S
H19C39
? 0.3 Hz,
3
J
N
H19C39
? 2.7 Hz,
3
J
S
H49C29
?
1.1 Hz and
3
J
N
H49C29
? 1.0 Hz. Thus, this relation pro-
vides another opportunity to determine the sugar
puckering. Note that
3
J
H39C19
and
3
J
H29C49
differ
strongly between the N- and S-state and can thus be
used quite well to determine pS. The three-bond
couplings,
3
J
H19C39
and
3
J
H49C29
, remain small for
both the S- and N-puckered states and are thus less
useful indicators of the sugar puckering. We finally
note that the
3
J
H39C59
-coupling, in principle, also moni-
tors the puckering, namely via the torsion angle d.
Unfortunately, its value does not differ significantly
between the N- and S-state [49].
A third source of information on the puckering state
of the sugar ring is the
2
J
HC
-couplings. The two-bond
couplings
2
J
H29C19
and
2
J
H39C29
are found to be negative
for S-puckered rings and positive for N-puckered
rings, while the reverse holds for
2
J
H29C39
and
2
J
H39C49
[49]. These two-bond J-couplings are thus good indi-
cators of the sugar ring conformation. However, the
other sugar ring two-bond couplings,
2
J
H19C29
and
3
J
H49C39
, do not change sign and remain negative for
both the N-puckered and the S-puckered state [49].
These signs of the
2
J
HC
-couplings are in complete
accordance with the projection rule proposed by
Bock and Pederson [87]. According to this rule the
sign of the
2
J
HC
-couplings in H–C–
13
C–X1(X2)
systems can simply be predicted from the Newman
projection and the orientation of the electronegative
substituents X1 and X2 relative to the position of the
proton coupled to
13
C; X1 and X2 can either be
oxygen or nitrogen. A trans orientation of this H
relative to X leads to a positive value for
2
J
CH
,
whereas a gauche orientation gives rise to a negative
value for
2
J
CH
. If two X substituents are present on
13
C
the effects are additive. Thus, if both H to X1 and H to
X2 are oriented gauche to each other a negative value
is obtained for
2
J
HC
. On the other hand, a trans orien-
tation of H relative to X1 and a gauche orientation of
H relative to X2 will give rise to compensating effects
and a
2
J
HC
value which is approximately zero. For
example,
2
J
H29C39
is predicted by this rule to be nega-
tive for an N-conformer, since O39 is oriented gauche
with respect to H29, and positive for an S-conformer,
since O39 is oriented trans with respect to H29. This is
exactly what is observed experimentally (see above).
The other
2
J
HC
-couplings in the sugar ring also follow
this projection rule.
Finally, the one-bond couplings
1
J
CH
also convey
sugar pucker information;
1
J
H39C39
has values of
149.6 Hz and 156.8 Hz for N- and S-type sugars,
respectively, while
1
J
H29C29
shows the reverse trend,
namely values of 158.4 Hz and 149.2 Hz for N- and
S-type sugars, respectively. When the sugar ring
occurs as a mixture of N- and S-conformers with inter-
mediate values for pS, intermediate
1
J
H29/39C29/39
values
are also found. We note that the torsion angle
dependencies of the
1
J
HC
-couplings are not very
well understood, in contrast to the case of the
3
J
HC
- and the
2
J
HC
-couplings. Nevertheless these
experimental observations remain useful as indicators
of the sugar pucker.
5.4. Determination of the b torsion angle
The b torsion angle has traditionally been deter-
mined by the
3,2
J
HP
-coupling constants. The Karplus
equation describing the
3
J
H59/50P59
-couplings is
3
J
H59=50P59
? 15:3 cos
2
f 1 6:2 cos f t 1:5 (23)
with f ? b 1 1208 for H59 and f ? b t 1208 for H50.
The most recent parametrization has been used here
[62,88]. The
3
J
H59P59
-and
3
J
H50P59
-couplings describe
the b torsion angle as shown in Fig. 5. As can be
310 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
seen, these two J-coupling constants define the b
torsion angle quite well for the whole range of b
values.
The
3
J
C49P59
-couplings are described by
3
J
C49P59
? 8:0 cos
2
b 1 3:4 cos b t 0:5 (24)
Here again the latest parametrization, as given by
Mooren et al. [88], is used. Fig. 5 also shows
3
J
C49P59
as a function of b. As can be seen, a fairly narrow
range of b torsion angles can be derived from the
additional knowledge of this J-coupling constant.
Whether the flexibility plays a role or not can be
established by the concerted use of the different
J-coupling constants. If the three J-coupling values do
not indicate one value for the torsion angle, it can be
concluded that angular averaging is taking place. The
angular averaging can be either the result of libration
motions within one rotamer domain or rapid inter-
conversion between different rotamers. It may even
be possible under certain circumstances and making
certain assumptions, for example that only one
rotamer fraction is populated, to establish the range
of allowed torsion angles for that rotamer, i.e. the
width of the libration motion. But usually one tries
to derive from the J-couplings the relative rotamer
populations. An estimation of the fraction of the
most common conformer, b
t
, can be obtained from
the equation
f
t
?
[25:5 1 (J
H59P59
t J
H50P59
)]
20:5
(25)
This equation has been derived under the assumption
that b torsion angle values of the b
t
and b
1
con-
formers are 608 and 3008, respectively. The coupling
values J
H59P59
and J
H50P59
of these rotamers are then
known. The value of J
C49P59
can also be used to
calculate this fraction in a similar way:
f
t
?
J
C49P59
1 1:3
9:8
(26)
Finally, the
4
J
H49P59
-coupling can reach values as large
Fig. 5. The
3
J
H59P59
-,
3
J
H50P59
- and
3
J
C49P59
-coupling constants calculated as a function of the torsion angle b on the basis of their Karplus relations
(see text).
311S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
as ,1.5–2.5 Hz if b falls in the trans region and the
torsion angle g in the gauche t region, i.e. when
the molecular fragment H49–C49–C59–O59–P59 has a
close to planar W-shaped conformation; for other con-
formations a much smaller value of the
4
J
H49P59
-coupling
will be found.
5.5. Determination of the ? torsion angle
The
3
J
H39P39
-coupling describes the ? torsion angle
via the Karplus equation
3
J
H39P39
? 15:3 cos
2
(? t 1208) 1 6:2 cos(? t 1208)
t 1:5 e27T
Here the parametrization by Mooren et al. is used
[88]. The torsion angle ? can for steric reasons only
take values between approximately 1708 and 3008,
depending somewhat on the pucker of the sugar ring
[88,89]. The usual value for ? in A-DNA, B-DNA and
A-RNA is trans, but as has been shown by Schroeder
et al. [90], B-DNA can be either in a BI or a BII
conformation, characterized by ? ranging from 160–
2208 and from 260–3008, respectively. These ranges
fall within the sterically allowed region for ? in DNA
for both C29-endo and C39-endo sugars [88]. For
C29-endo these ranges correspond to energy minima
at ? < 1808 and ? < 2888 [91], while for C39-endo
sugar puckers only a small range of ? values centered
around 1958 is energetically favorable. For RNA,
Mooren et al. [88] have shown that ? has a narrower
range, namely 1708 , < ?, < 2808, for C29-endo
sugar puckers than for the usual C39-endo sugar
puckers, 1858 , < ?, < 2808. Also for RNA a
maximum tends to exist around ? < 2108. With these
data in mind it is interesting to consider the Karplus
curve of the
3
J
H39P39
-coupling versus the torsion angle
? as shown in Fig. 6. As can be seen the two
expected regions, 160–2208 and 260–3008, have
quite similar
3
J
H39P39
-coupling constants. Thus, deter-
mination of the
3
J
H39P39
-coupling constant is of limited
value for determining the torsion angle ?.
Fig. 6. The
3
J
C49P39
-,
3
J
C29P39
- and
3
J
H39P39
-coupling constants calculated as a function of the torsion angle ? on the basis of their Karplus relations
(see text).
312 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Fig. 6 also gives
3
J
C49P39
and
3
J
C29P39
as a function of
? as derived from the Karplus equation
3
J
C49=C29P39
? 8:0 cos
2
f 1 3:4 cos f t 0:5 (28)
with f ? ? for C49 and f ? ? 1 1208 for C29. Here
again the latest parametrization as given by Mooren et
al. [88] is used. As can be seen,
3
J
C49P39
and
3
J
C29P39
together define the ? torsion angle quite well for the
trans range as well as for the gauche range. An even
narrower determination can be made when these
J-couplings are used together with the
3
J
H39P39
-
coupling. In addition, there is the
4
J
H29P39
-coupling,
which has a low value when ? is in the trans region,
while its value is < 2.0–3.0 Hz when ? is in the
gauche region. The
4
J
H29P39
-coupling can therefore
further confirm the presence of a gauche rotamer for ?.
As for the torsion angle b, the additional
J-couplings make it also possible to determine the
Fig. 7. The g torsion angle dependence of: (A) The
3
J
H49H59
- and
3
J
H49H50
-coupling constants (calculated according to their Karplus relations (see
text); (B) The
1
H–
1
H distances, d
i
(39;59), d
i
(39;50), d
i
(49;59) and d
i
(49;50).
313S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
rotamer fractions; in the case when only one rotamer
is populated it is, at least in principle, possible to
determine the width of the libration motion.
5.6. Torsion angle g and H59 and H50 stereospecific
assignment
Traditionally, the torsion angle g as well as the H59/
H50 stereospecific assignment have been determined
from a combination of the
3
J
4959/50
-couplings
(Fig. 7(A), [63]), and the distances d
i
(49/39;59/50)
(Fig. 7(B), [62]). With the availability of
13
C labeled
compounds it is now possible to also utilize a number
of heteronuclear
n
J
CH
-couplings for this purpose
[49,92]. Table 4 summarizes all the J-coupling con-
stants and distances involved, and gives their values
for the three main rotamers. As can be seen, a large
number of additional J-couplings has become avail-
able. We will consider below, for each parameter in
turn, its conformational dependence, and then how
they can be used to determine the torsion angle g
and the H59/H50 stereospecific assignment.
The
3
J
4959
-coupling and
3
J
4950
-coupling depend
on the torsion angle g as determined by the general-
ized
3
J
HH
Karplus, given by Eqs. (12) and (13), and as
shown in Fig. 7(A) [63]. In Table 4, these J-couplings
are given for the g
t
, g
t
and g
1
rotamers at the usual
angles of 608, 1808 and 2908, respectively. As can be
read off from this figure and seen in Table 4, when the
torsion angle g is in its usual g
t
domain both the
3
J
4959
-coupling and the
3
J
4950
-coupling are rather
small, about 1.1 and 2.5 Hz, respectively, while for
g
t
the
3
J
4959
-coupling is large (about 10.2 Hz) and
the
3
J
4959
-coupling is small (about 2.7 Hz), while the
reverse holds for the g
1
conformation. Assuming that
the least populated rotamers, gauche(t) and gauche( 1 ),
have middle values for g of 1808 and 2908, it can be
derived using the Karplus equation that the fraction
gauche( t ) can be calculated from [62]
f
g t
?
13:3 1 (
3
J
4959
t
3
J
4950
)
9:7
(29)
The distances d
i
(39;59/50) also depend on the torsion
angle g Fig. 7(B). As can be seen, when the torsion
angle g is in its usual g
t
domain the distance d
i
(39;59)
is large and the distance d
i
(39;59) is small, for g
t
both
the distances are small while for the g
1
conformation
the reverse holds as for g
t
(see Table 4) How pre-
cisely one has to know these distances to determine
the torsion angle g or the fraction g
t
has been dis-
cussed in detail in Section 4 (see also Ref. [62]).
Together with the J-couplings these distances also
provide the H59/H50 stereospecific assignment.
For example, when both the
3
J
4959
-coupling and
3
J
4950
-coupling are small the g torsion angle is
gauche(t), but in this case it is not possible to obtain
the stereospecific assignment from these J-couplings.
The stereospecific assignments of H59 and H50 can
then be obtained from the two distances d
i
(39;59)
and d
i
(39;50), since for g
t
the latter is small while
the former is large (see Table 4).
In addition to the classical parameters discussed
above, a number of heteronuclear J-couplings
have become accessible. Firstly, we consider the
3
J
H59=H50C39
-couplings. Although no well established
Karplus equation exists for the g torsion dependence
of this J-coupling, a qualitative assessment of the tor-
sion angle g is still possible from the available data
(see Ref. [49]). For g
t
, the trans coupling
3
J
H59C39
is
about t5.2 Hz and larger than the gauche coupling,
3
J
H50C39
, which is about t1.2–1.4 Hz. For g
t
, both
Table 4
Overview of J-couplings and distances for the determination of the
torsion angle g and for the stereospecific assignment of the H59 and
H50 resonances
a
g
t
g
t
g
1
3
J
H59C39
t5.2 t1.3 t1.3
3
J
H50C39
t1.3 t1.3 t5.2
2
J
H59C49
14.8 t1.2 14.8
b
2
J
H50C49
t1.0 14.2 14.8
b
2
J
H49C59
t3.5 11.5 11.5
b
1
J
H59C59
151
c
151
c
151
c
1
J
H50C59
141
c
141
c
141
c
3
J
4959
d
t2.5 t2.7 t10.1
3
J
4950
d
t1.1 t10.2 t3.6
d
i
(3959)
d
3.6 2.4 2.7
d
i
(3950)
d
2.5 2.7 3.7
d
i
(4959)
d
2.4 2.4 3.0
d
i
(4950) 2.4 3.0 2.5
a
J-coupling values are given in Hz and distances are in A
?
.
b
The values are estimated on the basis of the known values of
these J-couplings for the g
t
and g
t
rotamers and the Bock
projection rule [86] (see text).
c
There is some uncertainty considering the data in [49] that this
relation holds for other torsion angle values than g
t
.
d
Estimated values at torsion angles g of 608, 3008 and 2908
corresponding to the g
t
, g
t
and g
1
rotamers, respectively.
314 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
H59 and H50 are gauche with respect to C39, so that
both
3
J
H59C39
and
3
J
H50C39
are expected to be small, i.e.
< 1.2–1.4 Hz (see Table 4, [49]). Secondly, the
2
J
H59C49
and
2
J
H50C49
-couplings also monitor the
torsion angle g and can provide stereospecific assign-
ment of the H59 and H50 protons [49,92]. For g
t
, the
coupling
2
J
H59C49
?14.6 to 15.0 Hz, while
2
J
H50C49
is
small positive, t0.3–1.4 Hz; for g
t
,
2
J
H59C49
and
2
J
H50C49
are t1.2 Hz and 14.2 Hz, respectively (see
Table 4, [49,92]). Finally,
2
J
H49C59
can be estimated at
3.5 Hz and 11.5 Hz for the g
t
and g
t
/g
1
states,
respectively (see Table 4, [49,92]). These variations
in
2
J
HC
are in accordance with the projection rule
([87], see also discussion in Section 5.3). For exam-
ple, for g
t
H59 is oriented gauche with respect to O49
giving rise to a negative value for
2
J
H59C39
, while H50 is
trans with respect to O49 so that
2
J
H50C39
is positive.
The easily measurable
1
J
H59/50C
-couplings have been
discussed in Section 5.1. They seem to depend mainly
on the stereospecificity, with the larger value for
1
J
H59C59
(see Table 4, [49]).
These additional heteronuclear J-couplings provide
an alternative means for the determination of the tor-
sion angle g and the H59/H50 stereospecific assign-
ment. Their usage for determining the torsion angle
g may or may not require stereospecific assignment of
the H59 and H50 resonances. For example, the
2
J
H49C59
-
coupling depends only on the torsion angle g, and
does not involve the H59/H50 protons. It is thus ideally
suited for the estimation of the torsion angle g, since it
does not require a knowledge of the stereospecific
assignment of the H59 and H50 resonances. A positive
value immediately ascertains g
t
(see Table 4). On the
other hand, although
3
J
H59/H50C39
and
2
J
H59/H50C49
also
depend on the torsion angle g, their unambiguous
use requires the stereospecific assignment of the
H59 and H50 resonances; the same applies to
3
J
HH
-
couplings,
3
J
H59H49
and
3
J
H50H49
, discussed previously.
Finally, the
1
J
H59C59
- and
1
J
H50C59
-couplings seem not
to depend on the torsion angle g, but they can provide
stereospecific assignment of the H59 and H50 reso-
nances, since
1
J
H59C59
.
1
J
H50C59
[49]. Another aspect
Fig. 8. The
3
J
H19C6/8
- and
3
J
H19C2/4
-coupling constants calculated as a function of the torsion angle x on the basis of their Karplus relations (see
text).
315S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
is that these extra J-couplings make it possible both to
determine the torsion angle g and obtain the stereo-
specific assignment of the H59 and H50 protons, from
various combinations of these J-couplings. In addi-
tion, NOE or distance data may also be available,
which can be used as an extra source of information.
Table 4 can be consulted to assess which particular
combination of J-coupling and/or NOE data is
required. For example, it can be gleaned from Table
4 that for the most common situation, namely that of
g
t
, various different combinations of J-couplings
suffice to both establish the torsion angle and obtain
stereospecific assignment: (1) (
3
J
4959/50
,
2
J
H59/H50C49
),
(2) (
3
J
4959/50
,
3
J
H59/H50C39
), (3) (
2
J
H49C59
,
2
J
H59/H50C49
),
(4) (
2
J
H49C59
,
3
J
H59/H50C39
), (5) (
2
J
H59/H50C49
,
3
J
H59/H50C39
),
(6) (
2
J
H49C59
,
1
J
H59/H50C59
).
5.7. x torsion angle and
3
J
HC
sugar-to-base
The three-bond couplings
3
J
H19C8/6
and
3
J
H19C4/2
convey information about the glycosidic torsion
angle x. Ippel et al. [49] have derived a new
parametrization for the
3
J
H19C8/6
Karplus equation:
3
J
H19C6=8
? 4:5 cos
2
(x1 608) 1 0:6 cos(x 1 608) t 0:1
(30)
and for the
3
J
H19C4/2
Karplus equation they derive
3
J
H19C2=4
? 4:7 cos
2
(x1 608) t 2:3 cos(x 1 608) t 0:1
(31)
As can be seen from Fig. 8, the combined use of
3
J
H19C8=6
and
3
J
H19C4/2
makes it possible to discriminate
between the anti- and the syn-range for the torsion
angle x.The
3
J
H19C8/6
-coupling has a value of about
4.5 Hz in both the x
anti
and the x
syn
domain. On the
other hand, the
3
J
H19C4/2
-coupling has quite different
values in the x
anti
and the x
syn
domains, 2.0 Hz and
6.0 Hz, respectively. Thus, in a qualitative sense,
when
3
J
H19C4/2
,
3
J
H19C8/6
the glycosidic torsion
angle is in the anti domain, while for
3
J
H19C4=2
.
3
J
H19C8/6
the glycosidic torsion angle is in
the syn domain. We note that Davies and co-workers
[82] also derived a correlation between the glycosidic
torsion angle x and the one-bond coupling
1
J
H19C19
.
5.8. Measurement of homo- and heteronuclear
J-coupling constants
We discuss here the various methods that are avail-
able for measuring J-coupling constants in nucleic
acids. A large number of (novel) approaches exist
for the determination of J-couplings, the majority of
which have been tested and applied to the determina-
tion of J-couplings in proteins (for reviews see, for
example, Refs. [93,94]). The application to nucleic
acids has been more limited. Here we discuss the
established and new approaches as they pertain to
their application in the determination of J-coupling
constants in nucleic acids: (1) Determination of
J-couplings from the shape of the signal; (2) Determi-
nation of J-coupling constants with the E.COSY
principle; (3) Determination of J-coupling constants
from signal intensities.
5.8.1. Determination of J-couplings from the shape of
the signal
These methods apply generally well for small
molecules where the line width is smaller than the
J-coupling. The two main factors that complicate
the determination are (i) the complexity of the
multiplet pattern and (ii) the J-coupling to the line
width ratio, J/LW. The higher the complexity of the
multiplet pattern and the smaller the value of J/LW the
more advanced methods are required.
Direct measurement of the J-couplings from either
1D NMR spectra or from 2D COSY in-phase or
anti-phase multiplets works well only when the
J-couplings are larger than or equal to the line width,
otherwise corrections have to be applied. Kim and
Prestegard [95] have proposed a simple and straight-
forward method for correcting for the line width in a
doublet. Unfortunately, in nucleic acids no such
simple doublets occur, except on the H19 resonance
in RNAs which only couples to H29. This
3
J
H19H29
-
coupling is for RNAs the main
3
J
HH
-coupling used
in practice, to determine the sugar puckering state.
The more complex J-coupling patterns generally
need to be simulated in order to obtain quantitative
values for the J-couplings involved. Programs such as,
for example, Sphinx/Linsha [96], can be used to
extract the
3
J
HH
-couplings in a ribose ring by simula-
tion of the multiplet patterns in a (
1
H,
1
H) DQF–
COSY spectrum. More qualitative or less detailed
316 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
but still quantitative knowledge of J-couplings can be
derived from complex multiplets under certain condi-
tions without the aid of simulation. For example, cer-
tain rules have been proposed for the extraction of
sums of
3
J
HH
-couplings from the multiplets in
(
1
H,
1
H) DQF–COSY spectra (see e.g. Ref. [62]).
Furthermore, as pointed out by Blommers et al. [97]
the presence of passive
3
J
HP
-couplings in the (
1
H,
1
H)
DQF–COSY multiplets allows the determination of
3
J
HP
from these spectra as well. Recently, Pikkemaat
and Altona [98] published a set of simulated multiplet
patterns found in (
1
H,
31
P) COSY spectra (see also
ECOSY principle, Section 5.8.2). By visually com-
paring such patterns with experimentally observed
patterns, the allowed ranges of the
3
J
HP
and
3
J
HH
-cou-
plings involved can be determined.
All the aforementioned approaches have the danger
that J-couplings are underestimated or overestimated
depending on whether the splitting is measured in an
in-phase or anti-phase doublet, respectively, since the
line width is not known a priori. Macaya et al. [99]
have published a procedure (CHEOPS) where they
refine the simulated COSY multiplet patterns includ-
ing line width and chemical shift position against the
experimental COSY pattern. Similarly, Conte et al.
[64] have published such procedures and extensively
tested their method and described the accuracy of the
J-couplings obtained. They have in addition included
NOE information (Section 4) to improve the accuracy
of the determined sugar puckering state. Leijon et al.
[100] have also devised a method which allows inter-
active adjustment of the J-couplings while the multi-
plet is displayed. It uses SPHINX/Linsha to simulate
two-dimensional multiplet patterns and includes a line
width estimate.
An alternative approach for extracting J-couplings
from a variety of spectra, but which has not been used
extensively in the field of nucleic acids, is the idea of
J-doubling, originally suggested by McIntyre and
Freeman [101], later extended by Titman and Keeler
[102] and then called the FIDs procedure. In the
original approach a FID (free induction decay) is
obtained by IFT from a slice through the multiplet,
after which it is multiplied by sin pJ
test
t. The sub-
sequently fourier transformed FID will, in the case
of a doublet with a splitting of J Hz, show an anti-
phase signal in which the middle signal cancels when
J
test
? J. For more complex multiplets other minima
will occur, so that these J-couplings can also be
extracted. However, the complexity of the multiplet
increases by this procedure, making it less useful for
nucleic acids. Furthermore, it does require the line
width to be small compared to the J-coupling.
An extension of the above approach is the FIDs
procedure by Titman and Keeler [102]. Here two
spectra are acquired. One is the original spectrum
and the other a reference spectrum which lacks the
J-coupling of interest, and which is obtained via
decoupling, for example. The reference spectrum
can then be compared, via the FIDs procedure, i.e.
multiply the time domain signal by cos pJ
test
t, with
the original spectrum and subtracted from it. A mini-
mum is obtained when J ? J
test
. In the section on
ECOSY this method will be discussed as well, since
it allows the determination of J
HP
and J
CP
from hetero-
nuclear E.COSY experiments. As far as we know it
has not been applied to DQF–COSY spectra to obtain
J
HP
-couplings or to selectively decoupled DQF–
COSY to obtain small J
HH
couplings. The FIDs
approach has been applied to the determination of
n
J
HC
couplings in a HMBC spectrum [102]. The multi-
plet contains both J
HH
-couplings and J
HC
-couplings. It
has dispersive character due to the evolution of the
J
HH
-couplings. The reference spectrum containing the
J
HH
-couplings can be obtained from, for example, a
TOCSY experiment, by taking a 1D slice through a
cross peak. In order to reproduce the dispersive
character, the FID is reproduced from the 1D slice
via IFT, and subsequently this FID is time shifted,
which simulates the evolution of the J
HH
-coupling.
After fourier transformation of the time-shifted FID
a 1D reference spectrum with the dispersive character
is obtained. The J
HC
-coupling is obtained via multi-
plication of the FID by cos pJ
HCtest
t and fourier
transformation. The test spectrum is subtracted
from the original and a minimum is obtained when
J
HCtest
? J
HC
.
Recently, Stonehouse and Keeler [103] have pre-
sented a new method akin to the FIDS procedure,
which is more generally applicable. In this approach
no reference spectrum is required. Instead, the multi-
plet is extracted from the spectrum and a time domain
signal is obtained via inverse fourier transformation.
The time domain signal S(t) is multiplied by cos pJ
test
t
or sin pJ
test
t and the resultant time domain signal
S(t,J
test
) is integrated over the region where S(t) , 0.
317S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
The integral is definite positive and has a minimum
for J
test
? J. Stonehouse and Keeler demonstrated the
procedure on a number of different types of spectra,
e.g. HSQC, HMBC, NOESY and COSY spectra, and
investigated the effect of line width on the accuracy.
The modified FIDs procedure works quite well for
relatively simple multiplets, i.e. up to quartets of
both in-phase and anti-phase nature.
Finally, the Inverse Fourier Transformation (IFT)
method of in-phase multiplets can be applied [104].
Extraction of the multiplet and subsequent IFT gives a
FID which can be zero-filled and provides a well
digitized spectrum after FT. Alternatively, a test FID
with a certain T
2
decay rate can be fitted to this FID,
giving both J-coupling and line width. This method is
quite useful for simple doublets because it can be used
for larger molecules as well as smaller molecules.
Furthermore, only one spectrum is required, i.e. a
NOESY spectrum which has to be recorded anyhow.
5.8.2. Determination of J-coupling constants from
E.COSY cross peak patterns
A large number of methods for the determination of
coupling constants employ E.COSY multiplet
patterns [94]. In E.COSY spectra the multiplet
patterns are simplified with respect to those in a
regular DQF–COSY spectrum. The J-couplings in
such spectra can be directly read off from the fre-
quency shift of two in-phase patterns. A large number
of pulse sequences have been published that utilize the
E.COSY principle.
The essence of the method is best illustrated using
an AMX spin system (see Fig. 9). Both A and M are
coupled to spin X, with J-couplings equal to J
AX
and
J
MX
, respectively. We consider a cross peak between
spins A and M created via some transfer method (e.g.
TOCSY, COSY, NOESY) without influencing the
spin state of X Fig. 9(a). The acquired signal will
then contain two components:
S(t)?M exp(iq
A
t
1
)exp(jq
A
t
2
)cos(pJ
AX
t
1
)cos(pJ
MX
t
2
)
t M exp(iq
A
t
1
)exp(jq
A
t
2
)sin(pJ
AX
t
1
)sin(pJ
MX
t
2
)
e32T
The two-dimensional multiplet can thus be con-
sidered to consist of the sum of two spectra. In the
first spectrum the J
AX
-coupling and the J
MX
-couplings
to spin X are passive, giving rise to in-phase splittings
in both dimensions Fig. 9(b). In the second spectrum
the J
AX
-coupling and the J
MX
-coupling to spin X are
active, giving rise to anti-phase splittings in both
dimensions Fig. 9(c). The sum spectrum shows the
well known E.COSY pattern, from which both the
J
AX
-coupling and the J
MX
-coupling can be read off
Fig. 9(d). The direction of the shift depends on the
sign of the J-couplings.
5.8.2.1. Homonuclear E.COSY. In the homonuclear
E.COSY experiment the effect outlined above is not
obtained by the retention of the spin state of the
passive spin but by suitable combination of several
multiple quantum correlations. To derive the
multiplet structure in an E.COSY spectrum one
proceeds however in more or less the same way.
Consider, as before, the cross peak between spins A
and M, which are part of an AMX spin system with all
three spins mutually J-coupled. Again one has to
distinguish between active and passive J-couplings,
with the former giving rise to anti-phase splittings
and the latter to in-phase splittings. The E.COSY or
P.E.COSY spectrum is the sum of two spectra. In the
first spectrum the cross peak multiplet structure is
derived by taking the J
AM
-coupling as the active one
and J
AX
and J
MX
as the passive J-couplings. In the
second spectrum three J-couplings are active.
The net result is, as before, an E.COSY pattern. The
most straightforward approach to achieve
homonuclear E.COSY patterns is to record a
P.E.COSY spectrum [105]. This is in fact a regular
COSY with a small flip angle for the read pulse. From
this spectrum the dispersive diagonal peaks can be
removed by subtracting a 2D spectrum with zero
Fig. 9. Schematic illustration of the E.COSY principle: (a) the three-
spin system, A, M and X; both A and M are J-coupled to X;
magnetization or coherence is assumed to be transferred from A
to M in an (A,M) 2D correlation spectrum, resulting in an (A,M)
cross peak with splittings given by J
AX
and J
MX
, respectively. The
(A,M) cross peak has an E.COSY multiplet pattern as shown in (d),
which is the sum of a multiplet with all in-phase splittings (b), and a
multiplet with all anti-phase spittings (c).
318 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
flip angle for the read pulse. This spectrum can simply
be obtained arithmetically from a 1D spectrum
by shifting the data points in the time domain to the
left.
P.E.COSY spectra are often recorded for the deter-
mination of J
HH
-couplings in DNA sugars. Indeed,
simplification will be observed for a number of
cross peaks, such as the most easily accessible
H19H29 and H19H20 cross peaks. These cross peaks
will have very distinct patterns depending on whether
the sugar is in an S-puckered or an N-puckered state.
The H39H29,H39H20 and H29H20 cross peaks will
also be simplified. The same applies for the group of
protons involved with H49,H59 and H50. The latter all
strongly overlap, however, which may prevent the
possibility of extracting the J-couplings. For RNA
the H20 is lacking, so that on the H19H29 cross peak
no simplication can be attained. Only the cross peaks
involving the H49,H59 and H50 resonances will be
simplified, but they all strongly overlap.
5.8.2.2. Heteronuclear E.COSY. E.COSY patterns
can be produced in a variety of ways in
heteronuclear spectroscopy. Consider again three
spins A, M and X, and assume that spins A and M
are correlated in a heteronuclear (A,M) correlation
spectrum. Assume further that both A and M are
J-coupled to a spin X whose spin state is not changed
during the experiment. Since the spin state of X
remains unchanged, both in-phase and anti-phase
splittings will show up and an E.COSY pattern will
evolve as shown in Fig. 9. Alternatively, E.COSY
patterns will show up in homonuclear spectra, where
an E.COSY pattern will evolve if both homonuclear
spins A and M are coupled to a heteronuclear spin X.
It is also possible to obtain E.COSY patterns in triple-
resonance spectra, for example, in an HNCA
correlation spectrum [106] of a polypeptide. Here
H
N
coherence is transferred via
15
NtoC
a
, creating a
H
N
C
a
cross peak; H
a
is here the passive J-coupling
partner of H
N
and C
a
. In the t
1
dimension C
a
is split
up by the J
HaCa
-coupling of 150 Hz and in the t
2
dimension the H
N
is split by the J
HNHa
-coupling in
an E.COSY-like fashion. The large J
HaCa
-coupling
makes it possible to determine the small J
HNHa
-
coupling very accurately, i.e. without it being
disturbed by line width.
5.8.2.2.1. Determination of J
HP
- and J
CP
-couplings.
For nucleic acids one could use a NOESY, TOCSY or
COSY experiment to create the (Ha, Hb) cross peak
which shows an E.COSY pattern of J
HaP
-couplings
and J
HbP
-couplings [97]. The major problem is the
extreme overlap in such spectra. This is particularly
true for the (H39,H29) cross peak regions in a NOESY,
COSY or TOCSY spectrum of RNA. For DNA, the
(H39,H29) cross peak lies well off the diagonal in a
less crowded region. In any case, the E.COSY pattern
would here consist of a generally small
4
J
H29P39
-
coupling and a
3
J
H39P39
-coupling which tends to lie
between 4 and 8 Hz. As mentioned before the
3
J
H39P39
-coupling is not very informative with regard
to the ? torsion angle. The b torsion angle is
monitored by
3
J
H59P59
and
3
J
H50P59
, while
4
J
H49P59
has
a significant value ( < 3 Hz) only when the segment
H49–C49–C59–O59–P59 is W-shaped, which occurs
when g is gauche t and b is trans. In the usual trans
conformer for b both
3
J
H59P59
and
3
J
H50P59
are
small, 3–4 Hz, while in both the b
t
and b
1
states
one of them is quite large, up to 20 Hz. Thus,
E.COSY patterns could be quite useful for
determining b. However, the (H59,H50) cross peaks
reside in an extremely crowded region; the same
applies for the (H49,H59/50) cross peaks.
The use of heteronuclei, either by enrichment or via
natural abundance, can resolve a large part of the
overlap. Schmieder et al. [107] have recorded for a
circular DNA a natural abundance (
1
H,
13
C) HSQC
spectrum without
31
P decoupling. In the HSQC
spectrum E.COSY patterns are formed on the
different cross peaks involving the passive
J-couplings with
31
P nucleus, i.e. (J
H29P39
, J
C29P39
),
(J
H39P39
, J
C39P39
), (J
H49P39
, J
C49P39
), (J
H49P59
, J
H49C59
),
(J
H59P59
, J
C59P59
) and (J
H50P59
, J
C59P59
). In addition, the
multiplets contain in-phase splittings in the f
2
dimension due to J
HH
-couplings and in the case of
C49, also in-phase splittings in the f
1
dimension due
to J
CP
-couplings. Analysis of this HSQC spectrum
allowed the determination of a large number of the
J
HP
- and J
CP
-couplings. The sensitivity at natural
abundance is low, putting some limitations on the
approach, for example the J
HH
-couplings could not
be extracted from this spectrum. Van Buuren et al.
[108] have applied high resolution HSQC to a
13
C
labeled small hairpin molecule. Here the same
E.COSY patterns are observed, except for the extra
splitting in the f
1
dimension by the large
1
J
CC
-
319S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
coupling ( < 35 Hz). A much higher sensitivity is
obtained here thanks to the
13
C labeling. With the
aid of simulation, virtually all the above mentioned
J
HP
- and J
CP
-couplings could be extracted from this
spectrum, and virtually all J
HH
-couplings and the J
CC
-
couplings. The cross peaks involving H39 and H29
were rather difficult to interpret. This study showed
that the approach is quite useful but limited to small
systems. As will be discussed later, alternatively
intensity-based methods could be used to obtain
J
HP
- and J
CP
-couplings.
Schwalbe et al. [109] described an approach
which uses both E.COSY patterns as well as the
FIDs procedure (see above) to determine
3
J
HP
- and
3
J
CP
-coupling constants in a
13
C labeled RNA
fragment. They recorded four HSQC spectra, namely,
one with full
31
P decoupling, one without
31
P
decoupling, one with
31
P decoupling in q
1
, and one
with
31
P decoupling in the q
2
dimension [109]. When
no
31
P decoupling is employed the E.COSY patterns
discussed above are obtained. If the line width to J
HP
or J
CP
ratio is such that the J-couplings are not fully
resolved, these J-couplings cannot be directly read off
from the multiplet pattern. One can, as suggested
above, resort to simulation of the multiplet or use
the FIDs procedure to extract the J-coupling of
interest. The latter approach is described by Schwalbe
et al. [109]. In the FIDs procedure a reference spec-
trum is required which does not contain the J-coupling
of interest. This spectrum is for example the
q
1
-decoupled spectrum. By addition of the q
1
-
decoupled spectra shifted by tJ
test
/2 and 1J
test
/2, a
simulated spectrum is obtained. This spectrum is
then subtracted from the
31
P-coupled spectrum.
When J
test
? J
CP
the residual signal after
subtraction is minimal. This procedure provides an
alternative to the use of simulation of cross peak
multiplets.
5.8.2.2.2. Determination of J
HC
-couplings. The q
1
-
filtered (H,H) TOCSY (see e.g. Ref. [49]) can be used
for determining
n
J
HC
at natural abundance as well as
for
13
C labeled compounds. Instead of using a 908 rf
pulse for excitation of the proton magnetization, an
q
1
-half filter is used which creates in-phase
1
H
magnetization of protons bound to a
13
C nucleus and
suppresses magnetization of protons bound to
12
C.
This method has also been used by Ippel et al. [49]
to derive the J
HC
-couplings in small circular
nucleotides and in
13
C labeled NMPs. Hines et al.
[92] have used a 3D HMQC–(H,H)TOCSY,
originally proposed by Wijmenga et al. [110], to
determine J
HC
-couplings in a small
13
C labeled
RNA. The HMQC is used to spread the cross peaks
in the
13
C dimension. In this way, for example, J
H49C59
and J
H59C59
E.COSY patterns evolve, which can easily
be measured thanks to the large J
H59C59
-coupling. The
limitation lies here in the H49 to H59 or H50 TOCSY
transfer, which for g
t
is limited, so that the cross
peak of interest may have very low intensity.
Similarly, in an N-puckered sugar ring the H19 to
H39 transfer is small, so that the E.COSY cross peak
containing J
H19C39
becomes difficult to analyze. The
same applies for
3
J
H39C19
. On the other hand, the
transfer between H29 and H49 is not limited
in an N-puckered sugar ring, so that the
determination of
3
J
H29C49
and
3
J
H49C29
is expected to
work well. This is borne out by the results obtained
by Hines et al. [92].
Interestingly, Glaser et al. [111] have recently sug-
gested a selective TOCSY for the sugar ring. This
method allows selective transfer to occur between,
say, C39 and C19, so that a 2D spectrum with only a
(H19C39) cross peak can be obtained. An E.COSY
pattern will evolve with a large
1
J
H19C19
(150 Hz) and
a small
3
J
C39H19
(2–6 Hz).
5.8.2.2.3. Determination of J
HH
-couplings via
HCCH–E.COSY spectra. The HCCH–E.COSY
experiment has been proposed by Schwalbe et al.
[112] and applied to a
13
C labeled RNA 19-mer to
determine the J
HH
-couplings in the ribose sugar ring.
To generate the E.COSY patterns the 908 rf pulses in
the INEPTs are replaced by 358 rf pulses. The
experiment is essentially employed in 2D format,
but it may also be run as a 3D version to improve
resolution. In the 2D version, for example, C19 is
correlated with H29. The cross peak will then show
an E.COSY type multiplet pattern with J
H19C19
and
J
H19H29
as the passive J-couplings (see Fig. 9). The
large J
H19C19
can effectively be used to resolve the
smaller J
H19H29
-coupling. Zimmer et al. [113]
have applied 2D and 3D versions of this experiment
to a
13
C labeled DNA 10-mer duplex. It turns out that
even with the labeling resonance, overlap prevents the
determination of a full set of J
HH
-coupling constants,
for example only three out of ten J
H39H49
-couplings
could be determined.
320 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
5.8.3. Determination of J-coupling constants from
signal intensities
5.8.3.1. Determination of J
HH
-couplings from
homonuclear (H,H) TOCSY transfer. This method
has been proposed and used by van Duynhoven et
al. [86] to determine the sugar pucker and the g
torsion angle from the cross peak intensities in
TOCSY spectra. The approach has the advantage
over DQF–COSY or P.E.COSY that, due to the
relay effect in TOCSY, free lying cross peaks which
contain important J-coupling information are more
likely to be available. The method allows the
determination of the sugar pucker with good
accuracy for both DNA [12,86] and RNA [77]. The
method not only works well for 2D TOCSY spectra,
but can also be used for 3D TOCSY–NOESY spectra
[75,114] and 13C-edited TOCSY spectra [260].
5.8.3.2. Determination of J-couplings from
heteronuclear experiments. -coupling values have
traditionally been determined from the splittings in
1D spectra or nD spectra using a variety of methods
as discussed above. Recently, Bax et al. introduced a
number of procedures for the determination of homo-
and heteronuclear J-couplings using quantitative
J-correlation (for a review see Ref. [93]). In
these experiments the J-coupling is derived from the
intensity ratio between two resonances. Special care
needs therefore to be taken that both resonances are
identically affected by relaxation and digitization, so
that the ratio only depends on the size of the
active J-coupling. Bax et al. have demonstrated that
the quantitative J-correlation experiments can provide
accurate J-coupling values.
These methods have not been applied to nucleic
acids, except for the HMBC type experiment
[49,115] to determine
3
J
H19C8/6
and
3
J
H19C4/2
. Other
applications of such methods could be of value for
nucleic acids as well, although it does require
modification of the pulse sequences. For example,
for nucleic acids a particularly appealing experiment
Fig. 10. Pulse sequence of the CT–HSQC experiment, adapted from Vuister and Bax [116], for the measurement of J
CP
-couplings. Narrow and
wide pulses denote 908 and 1808 flip angles, respectively. All proton, carbon and phosphor pulses can be of high power. SL denotes a spin-lock
pulse to suppress the residual HDO resonance. Unless indicated otherwise all pulses are applied along the x axis. Minimum phase cycle resulting
in artifact-free spectra is as follows: f
1
? y, 1 y; f
2
? x; f
3
? 2eyT,2e1yT,2e1xT,2ex); f
4
? 8exT,8e1xT; receiver ? 2ex, 1 xT,4e1x,xT,2(x, 1 x).
Quadrature detection in t
1
is obtained by the States–TPPI technique incrementing f
2
. The delay t should be set to 1/4J
HC
< 1.7 ms; the constant
time 2T should for nucleic acids be adjusted to a multiple of 1/J
CC
for optimal refocusing of the ribose carbon–carbon couplings of 35 Hz. The
13
C (as well as optional
31
P) decoupling can be accomplished with a GARP sequence, relatively low rf field strengths can be used, < 2.5 to
4.7 kHz (and < 1 to 2 kHz for
31
P). The correspondence with the parent experiment of Vuister and Bax [116] is indicated in the sequence, by
CyO in parenthesis on the
31
P line, and by the
15
N line in curly brackets.
321S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
is the CT–HSQC, which can be performed either with
or without
31
P decoupling. The ratio of the cross peaks
in the two experiments is directly proportional to
the J
CP
-coupling. The pulse sequence of this
experiment is shown in Fig. 10. This pulse sequence
has been proposed and used by Vuister and Bax [116]
to determine J
CN
- couplings with good results. In a
modified form this type of experiment has been used
to determine e and b torsion angles in a large 44
nucleotide RNA psudoknot [260].
6. Chemical shifts
Chemical shifts are the most easily measured NMR
parameters and carry important structural informa-
tion. Their values depend on the electron densities
around the nuclei, which is influenced by the
surroundings in variety of ways. Owing to the
uncertainties in their dependence on structural
parameters the use of chemical shifts as a tool to
determine three-dimensional characteristics has been
overtaken by J-couplings and NOEs, for which the
relationship with structural parameters is much better
established. Chemical shifts have however
experienced renewed interest as a structural tool in
the field of protein NMR. The large database of
chemical shifts and independently determined protein
structures now available makes it possible to reliably
interpret chemical shifts in terms of structural
Table 5
Overview of chemical shift ranges (ppm) in DNA and RNA
Name Thymidine Uridine Cytosine Adenine Guanine
H19 5–6 5–6 5–6 5–6 5–6
H29 (DNA) 1.7–2.3 1.7–2.3 2.3–2.9 2.3–2.9
H29 (RNA) 4.4–5.0 4.4–5.0 4.4–5.0 4.4–5.0
H20 2.1–2.7 2.1–2.7 2.1–2.70 2.4–3.1 2.4–3.1
H39 4.4–5.0 4.4–5.0 4.4–5.0 4.4–5.2 4.4–5.2
H49 3.8–4.3 3.8–4.3 3.8–4.3 3.8–4.3 3.8–4.3
H59 3.8–4.3 3.8–4.3 3.8–4.3 3.8–4.3 3.8–4.3
H50 3.8–4.3 3.8–4.3 3.8–4.3 3.8–4.3 3.8–4.3
H6/8 6.9–7.9 6.9–7.9 6.9–7.9 7.7–8.5 7.5–8.3
H5 1.0–1.9 5.0–6.0 5.0–6.0
NH (G,T,U) 13–14 13–14 12–13.6
NH2 6.7–7.0 5–6 5–6
(C,G,A) 8.1–8.8 7–8 8–9
C19 (RNA) 87–94 87–94 87–94 87–94 87–94
C19 (DNA) 83–89 83–89 83–89 83–89 83–89
C29 (DNA) 35–38 35–38 35–38 35–38
C29 (RNA) 70–78 70–78 70–78 70–78
C39 70–78 70–78 70–78 70–78 70–78
C49 82–86 82–86 82–86 82–86 82–86
C59 63–68 63–68 63–68 63–68 63–68
C2 154 154 159 152–156 156
C4 169 169 166–168 149–151 152–154
C5 15–20 102–107 94–99 119–121 117–119
C6 137–142 137–144 136–144 157–158 161
C8 137–142 131–138
N1/N3 (imino) 156 156–162 146–149
N2/6/4 (NH2) 94–98 82–84 72–76
N1(Py)/N9(Pu) 144 142–146 150–156 166–172 166–172
N3 210 220–226 167
N1(Pu) 214–216
N7(Pu) 224–232 228–238
Data are taken from Dieckman et al. [225], Hilbers et al. [76], Ippel et al. [49], Nikonowicz et al. [148], Pardi [48], Van de Ven and Hilbers
[133], van Dongen et al. [144], Varani et al. [36] and Wijmenga et al. [56,62,50]. The ranges are applicable for both DNA and RNA, if not
otherwise indicated.
322 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
parameters in a number of ways (see, for example,
Refs. [117–119] for recent reviews). Firstly, it has
become possible to assess the reliability of quantita-
tive chemical shift calculations. For
1
H shifts
semi-empirical relationships have been derived
which quite accurately (rmsd ? 0.25 ppm) predict
experimental shifts ([120–123]). From these studies
it appears that the conformation dependent shifts of
both the H
a
and the NH resonances result mainly from
the magnetic anisotropy of the peptide bond. The
conformation dependent shift resulting from ring cur-
rent effect is limited because of the small number of
aromatic residues present in proteins. Improved and
extensive quantum mechanical calculations show for
heteronuclei such as
13
C quite good correlations with
experimental data [124]. In addition, purely empirical
correlations have been derived such as the chemical
shift index [118]. This has led to the introduction of
chemical shifts as constraints in simulated annealing
[125,126].
All nucleic acid residues are aromatic in nature, in
contrast to amino acid residues in proteins. It is
therefore expected that conformation dependent
1
H
chemical shifts originating from the aromatic rings
will play a major role in nucleic acids. The reliability
of
1
H chemical shift calculations in the field of nucleic
acids has only recently been tested [50]. In this study
the experimentally determined and calculated
chemical shifts were compared for about 20 well
determined DNA structures. The correspondence
between calculated and experimental shifts was
shown to be quite good (rmsd ? 0.17 ppm). In view of
these encouraging results we will discuss
1
Hchemical
shift calculations in nucleic acids in some detail.
Fig. 11. Natural abundance (
1
H,
13
C) HMQC of a DNA four-way junction. The different spectral regions for different carbon proton correlations
are indicated in boxed regions.
323S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
6.1. Chemical shifts: qualitative aspects
Chemical shifts have until recently mainly been
used as an assignment tool and their dispersion has
been used to reduce resonance overlap in multi-
dimensional NMR experiments. Table 5 gives an
overview of the approximate ranges for the
1
H,
13
C
and
15
N shifts in nucleic acids. As can be seen, the H29
to H50 resonances in RNA and the H39 to H50
resonances in DNA strongly overlap. This overlap
can be reduced by the use of
13
C correlated experi-
ments. Indeed, as can be seen from Fig. 11, where a
(
1
H,
13
C) HMQC spectrum is shown of a DNA four-
way junction, the H39 to H50 resonances are separated
by the
13
C frequency resonances of the ribose ring into
three groups consisting of the (H39,C39), the
(H49,C49) and the (H59/50,C59) cross peaks,
respectively. For RNA one also finds in this region
of a (
1
H,
13
C) correlation spectrum (HMQC, or
HSQC) three groups of cross peaks, but now the
groups consist of (H39,C49), (H29/H39,C29/C39) and
(H59/50,C59) cross peaks, respectively. Unfortunately,
within each group the resonances still tend to cluster
quite strongly, making assignment difficult. Strong
overlap is also observed for the other groups of
cross peaks, as can be seen from Fig. 11. The best
dispersed are the (H19,C19) cross peaks. Note that
the Cytosine and Thymine (H6,C6), Adenine
(H8,C8) and Guanine (H8,C8) cross peaks cluster in
separate, only slightly overlapping, groups, which
may help in their assignment. As can be appreciated
from Table 5 several of the
13
C resonances that
have no protons attached cluster in even narrower
spectral ranges than those with a proton attached. In
particular the C4, C2 resonances of Cytosine and
Uridine overlap quite strongly. The C5 resonances
in the two purines A and G fall into two groups,
and show within each group a slightly better
dispersion.
The
15
N resonances of the purines and pyrimidines
also group into a number of regions (see Table 5). The
imino
15
N resonances fall into two regions around
150 ppm, with G N1 resonating around 147 ppm and
U or T N3 resonating around 157 ppm. The T/U N1, C
N1 and A/G N9 can be found in spectral regions cen-
tering around 143, 154 and 168 ppm, respectively.
The NH2
15
N resonances for G, A and C can be
found at 73, 83 and 96 ppm, respectively. The N7
and N3 in purines and the N3 of unprotonated Cyto-
sine reside in the spectral region which ranges from
210 to 236 ppm.
The
31
P resonances cluster quite strongly in a
narrow spectral region of about 2 ppm, ranging from
13to11 ppm. As will be discussed later in more
detail, unusual backbone torsion angles may lead to
a shift out of this region.
6.2. Theory
The chemical shift of nuclear spin k in a molecule C
depends on the electron density distribution surround-
ing it. The electron density and the chemical shift
derived from it could be obtained in principle by ab
initio quantum mechanical calculations. However,
quantum mechanical chemical shift calculation of a
complete biomolecule is completely out of reach even
with present day computers, although recent progress
in quantum mechanical theory and computer hard-
ware has made it possible to perform such calculations
for molecular fragments large enough to reflect the
essential features of the local environment (see for
example Refs. [117,127]). For chemical shift calcula-
tions it is thus operationally expedient to divide a
molecule into a number of fragments, i.e. a fragment
A in which the nucleus of interest resides, the
conformation of which is defined with respect to a
reference conformation, and a number, N, of other
fragments B interacting with A. The calculated
chemical shift contributions can then be divided into
two categories, namely a conformation independent
part, d
intrin
, and a conformation dependent part, d
conf
,
respectively. The former represents the chemical shift
of nucleus k in fragment A in its reference state and in
the absence of other fragments. The latter represents
chemical shift changes of nucleus k with respect to the
reference state. These shift changes can result from:
(a) changes in the local electronic environment, d
k,lcA
,
i.e. changes in fragment A with respect to its reference
state due, for example, to changes in torsion angle,
bond length, bond angle, etc. or (b) changes in the
interaction of nucleus k in A with the other molecular
fragments B
j
, d
kj,B
. For
1
H nuclei the conformation
dependent shift of nucleus k in A does not depend
strongly on the torsion angles in A, i.e. the conforma-
tion dependent shifts can conveniently be attributed to
interactions with other fragments. For
1
H nuclei it is
324 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
therefore expedient to choose as molecular fragments
A, the individual C–H or N–H fragments. As a result,
d
intrin
of the
1
H nuclei in the sugar moiety of nucleic
acids can appropriately be defined as belonging to the
individual C–H fragments. In contrast, for hetero-
nuclei the chemical shift of a particular nucleus k
may be quite strongly affected by torsion angle
changes around this nucleus (see, for example, Ref.
[117]). For instance, for d
intrin
of a
13
C nucleus in a
ribofuranose ring it would be operationally more con-
venient to use as a reference state the S-puckered
conformation of the sugar. The chemical shift, d
k
,of
the resonance of nuclear spin k in molecule C, can
then formally be written as
d
k
? d
k, intrin
t d
k, conf
? d
k, intrin
t d
k, lcA
t
X
N
j ? 1
d
kj, B
(33)
The conformation dependent terms d
kj,B
that result
from interactions with other fragments can each in
turn be divided into a number of distinct
contributions, d
rc
, d
ma
, d
E
and d
CT
, so that
d
k
? d
k, intrin
t d
k, lcA
t
X
N
j ? 1
{d
kj, rc
t d
kj, ma
t d
kj, E
t d
kj, CT
} e34T
Together d
rc
and d
ma
form the chemical shift variation
resulting from magnetic anisotropy effects, with d
rc
the chemical shift from ring current effects produced
by aromatic rings, and d
ma
the chemical shift due to
local magnetic anisotropy effects from, for example,
an asymmetric electron distribution on atom B inter-
acting with nucleus k of A. Analytical expressions
have been derived for the two magnetic anisotropy
terms (to be discussed below). The parameters in the
analytical expressions have derived either from
experimental shift data (e.g. Refs. [120,121]), or
from fitting to quantum mechanically calculated iso-
shielding curves [127]. Note that the quantum
mechanically calculated isoshielding curves reflect
the sum of the two magnetic anisotropy terms, d
rc
and d
ma
, so that the ring current and local magnetic
anisotropy contributions may be difficult to separate
from the isoshielding curves [127]. The analytical
expressions reproduce the quantum mechanically cal-
culated isoshielding curves very accurately, i.e. they
differ on average by not more then 0.07 ppm and at
most 0.2 ppm, this larger difference being found for
the imino protons of Guanine and the H2 protons of
Adenine. The polarization term, d
E
, is the chemical
shift change resulting from polarization by an electric
field of the electron density along the chemical bond
extending from atom l. It can also be described by a
simple analytical expression with the parameters
that have been obtained from comparison with
experimental data (see below). The charge transfer
term, d
CT
, has been introduced by Giessner-Prettre
and Pullman [127] to account for the transfer of
electron density on hydrogen bonding.
For chemical shift calculations the following
further considerations should be taken into account.
As discussed above for shift calculations of
1
H nuclei
the term d
lc,A
can be set to zero, and when non-
exchangeable protons are considered the term d
CT
can also be taken to be zero. The value of the term
d
intrin
is not known a priori, nor is it easily calculated.
However, since d
intrin
is conformation independent, it
is possible to derive its value from a comparison of
d
conf
and d
conf,exp
, the conformation dependent parts of
the calculated and experimental chemical shifts,
respectively. The term d
conf,exp
is given by
d
conf, exp
? d
exp
1d
ref
(35)
Here d
exp
is the experimentally observed shift and d
ref
an experimentally determined reference value, which
in fact is the experimental counterpart of d
intrin
. It thus
represents the chemical shift that would have been
observed in the absence of conformation dependent
shift effects.
Shiftsintroducedasaresultofringcurrentshavebeen
discussed extensively in the literature (for reviews see
Refs. [117,127]). The general form of the semi-
empirical equation that describes these shifts is
d
rc
? iBG(~r) (36)
where i is the ring current intensity factor and B is a
constant which has been adjusted empirically such
that the ring current intensity of benzene is unity. In
this way the ring current intensity factors, i, of the
rings considered are equal to the ring current intensity
of that ring relative to that of benzene. G(~r) is a geo-
metric factor with ~r being the vector connecting the
observed nucleus to the ring that generates the ring
current. The two most popular methods to calculate
325S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
this factor are those developed by Johnson and Bovey
[128] and by Haigh and Mallion [129]. The results
generated by the two approaches are very similar. In
the Johnson–Bovey method adopted by Giessner-
Prettre and Pullman [127], it is assumed that the
ring current shielding arises from two ring current
loops of radius a, situated symmetrically at each
side of and parallel to the plane of the aromatic
molecule. The shielding is calculated at points char-
acterized by the cylindrical coordinates r and z,
expressed in units of the loop radius a. The geometry
factor is then given by
G(r, z) ?
1
a
2
((1 t r)
2
t z
2
1
)
1=2
3 K(k) t
1 1 r
2
1 z
2
1
(1 1 r)
2
t z
2
1
E(k)
t
2
((1 tr)
2
t z
2
t
)
1=2
3 K(k) t
1 1 r
2
1 z
2
1
(1 1 r)
2
t z
2
t
E(k)
e37T
where z
6
? z 6 hzi, with hzi being the distance
between the loop and the plane of the aromatic ring.
K and E are complete elliptic integrals of the first and
second kind. The argument k is given by
k ?
4r
((1 t r)
2
t z
2
)
(38)
Wijmenga et al. [50] used the Johnson–Bovey method
in their calculations and the parameters as derived by
Giessner-Prettre and Pullman [127]. The ring current
loops are then taken to be at heights of 0.5770 A
?
and
0.5660 A
?
above and below the aromatic plane, for
Cytosine/Thymidine and Adenine/Guanine, respec-
tively. The radii of the current loops are 1.3675,
1.3790, 1.3610, 1.1540, 1.3430 and 1.15440 A
?
, for
Cytosine, Thymine, Guanine-6, Guanine-5, Adenine-
6 and Adenine-5, respectively (the numbers 5 and 6
refer to the 5-membered and 6-membered rings of
purines bases). The ring current intensities are
0.2570, 0.1120, 0.3, 0.6530, 0.9 and 0.66, for Cyto-
sine, Thymine, Guanine-6, Guanine-5, Adenine-6 and
Adenine-5, respectively. The empirical parameter B is
adjusted to a value of 2.13 3 10
16
A
?
, so that the ring
current shifts of the protons of benzene equal
11.5 ppm, when hzi ? 0.61 A
?
, and i ? 1.
The chemical shift of a nucleus i of atom A induced
by the local magnetic anisotropy at an atom B is,
expressed in cartesian coordinates, given by [127]:
d
ma
?
1
3r
5
X
ab
(3r
a
r
b
1 r
2
d
ab
)(1:967R
ab
1 5:368Q
ab
)
(39)
Here, r is the distance between atoms A and B, and
a,b ? x, y, z, are the Cartesian coordinates. Further-
more, R
ab
and Q
ab
are the diamagnetic and para-
magnetic parts, respectively, of the ab element of
the magnetic susceptibility tensor of atom B. These
contributions have been calculated and tabulated by
Ribas Prado and Giessner-Prettre [130]. They were
used without modification by Wijmenga et al. [50]
in their calculations.
The effect of an electric field on proton chemical
shifts is usually calculated using the expression
originally derived by Buckingham [131]:
d
E
? AE
k
t BE
2
(40)
Here E
k
is the projection of the electric field
~
E along
the X–H bond at the proton considered. Application
of different computational methods [131] indicated
that A may vary between 2.5 and 3.0 3 10
112
e.s.u.
Wijmenga et al. [50] chose for nucleic acids the same
value (2.9 3 10
112
e.s.u.) as chosen by Giessner-
Prettre and Pullman [127]. The value of B ? 0.74 3
10
118
e.s.u. they took from Marshall and Pople [132].
The electric field at the proton considered, k, can be
derived, using Coulomb’s law, as the vector sum of
the fields emanating from the electric monopoles
(partial charges), q
j
, predicted to be present at the
different atoms j:
~
E ?
X
N
j ? 1
q
j
~r
kj
4p?
0
?r
3
kj
(41)
Here r
kj
is the distance between atoms k and j, ?
0
is the
dielectric constant of a vacuum and ? is the relative
dielectric constant of the medium. For the partial
charges, Wijmenga et al. [50] tried different sets,
but ultimately the partial charge set for the CHARMM
force field and present as a parameter set in XPLOR
was used. Furthermore, in the calculations of the elec-
tric field a distance dependent dielectric constant was
used, ?(r) ? 4r, an approach often used in molecular
mechanics calculations for nucleic acids.
326 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
6.3.
1
H shifts
The accuracy of proton chemical shift calculations
in nucleic acids has been tested by comparing the
calculated and experimentally observed chemical
shifts [50] for a set of approximately 20 well
determined NMR structures (involving DNA helices,
hairpins, a quadruplex and a circular dumbbell). This
resulted in a database of over 2000 chemical shifts.
The chemical shifts of the non-exchangeable protons
were calculated, as described above, according to Eqs.
(34)–(41), with d
lcA
and d
CT
set to zero, and without
adjustment of the parameters for ring current, local
magnetic anisotropy and electric field derived earlier
by Giessner-Prettre and Pullman [127]. Values for d
ref
were obtained by using it as the only adjustable para-
meter in the comparison of the conformation depen-
dent parts of the calculated and experimental observed
shifts, d
conf
and d
conf,exp
, respectively. As can be seen
from Fig. 12(A) a very good correspondence is found,
the overall rmsd is 0.17 ppm Fig. 12(B). As is evident
from Fig. 12(C), (D) and (E), the two main and about
equally important contributors are the ring current and
local magnetic anisotropy, whereas the effect of the
electric field component is surprisingly small.
6.4. Structurally important
1
H shifts
The comparison of calculated and observed shifts
in Ref. [50] revealed the presence of a number of
characteristic chemical shift effects in various DNA
structures, some of which had been observed pre-
viously [133]. The comparison showed the physical
background of these characteristic shifts. The most
important dependencies of the shifts of the different
non-exchangeable protons in a nucleotide on their
surrounding can be summarized via an estimated
chemical shift, d
est
, which is split for each proton
into a number of terms:
d
est
? d
ref
t d
ib
t d
39b
t d
59b
(42)
Here, d
ref
is the reference shift discussed earlier.
The term d
ib
is the chemical shift effect induced by
the own base when the x-angle is 2408, which is the
value usually found in DNA for this torsion angle.
This value was taken from the x-angle dependence
of the calculated chemical shift in a single nucleotide.
The terms d
39b
and d
59b
represent the chemical shift
induced by the 39- and 59-neighboring base, respec-
tively, in a B-helix environment. Their values were
estimated in Ref. [50] from the difference of d
exp
1 d
ref
1 d
ib
from zero for each type of neighbor. Table 6
gives an overview of these structural shifts; in a
double helical environment, the sugar proton shifts
are to a large extent determined by the own base
and sequential effects are small, except for the H19
protons. In particular the shifts of the H29 and H20
protons are separated into two groups, one for purines
and one for pyrimidines. We note in passing that the
H29 proton shift depends strongly on the glycosidic
torsion angle x.For508 , x , 1308 a strong down-
field shift occurs with a maximum of 1.2 ppm at x ?
1008. Such a downfield shift of the H29 resonance is
very strong evidence for a syn x-angle (seen in the
quadruplex DNA structures for example), since ring
current effects from neighboring bases are generally
upfield and small, except for special cases. For H39 to
H59/H50 the shift also solely depends on the own base,
but the effect of the base is smaller and decreases
going from H39 to H59/50.TheH19 proton shift is
affected both by the own base and the 39-neighboring
base. The own base again separates the shifts of the
H19 protons into a purine and a pyrimidine group.
Each group in turn is separated into two groups
depending on whether the 39-neighboring base is a
purine or pyrimidine residue (see Table 6).
The shift of the base protons can depend in
principle on both the 39- and the 59-neighboring
bases. In accordance with the B-helix geometry the
H5 and the methyl protons are found to experience a
rather strong shift effect from the 59-neighboring base.
The H2 proton can show quite large upfield shifts.
These stem from a variety of sources but mostly
from the 39-neighboring base, which again is in
accordance with the B-helix geometry. The H2 proton
shifts are the worst correlated, for a variety of reasons.
It may result from the less well determined position in
the test structures, but there are also strong indications
that the ring current intensity used may be slightly
overestimated, and finally at large shift values (around
1.5–2.0 ppm) the ring current effect may have to be
corrected for so-called dispersion effects [50]. Finally,
we note that, as expected for a B-helix geometry,
the H8 and H6 protons are affected by both the 39-
and 59-neighboring bases (see Table 6).
The above results are not only useful as structural
327S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Fig. 12.
328 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
indicators, but they can also be quite helpful in reso-
nance assignment. The structural shifts described
above, and given in Table 6, make it possible to assign
proton resonances of the sugar protons as well as of
the base protons to the middle one of a triplet of
residues, for example RYR, YYR, etc. where R and
Y stand for purine and pyrimidine, respectively. This
knowledge was sufficient to obtain unambiguous
sequential proton assignments for a hairpin of 16
residues [50].
6.5.
15
N and
13
C shifts in RNA and DNA
The review paper by Giessner-Prettre and Pullman
[127] contains, in addition to proton shifts, a wealth of
information with regard to heteronuclear shifts. At the
time of writing the structural information was rather
limited, which hindered a comparison between
experimental and calculated shifts. Since then a
large body of experimental chemical shifts have
been collected of structurally well characterized
Fig. 12. Correlations between calculated shifts and observed structural shifts for non-exchangeable protons in DNA (Ref. [50]). (A) ‘All
effects’: calculated shifts using the sum of the ring current, magnetic anisotropy and electric field terms; the dashed lines run parallel to the
diagonal at a distance of one standard deviation (0.17 ppm). (B) Distribution of the errors between calculated and observed structural shifts (see
text); the vertical dashed lines indicate the standard deviation. (C) ‘Ring current only’: calculated shift using the ring currents alone. (D)
‘Magnetic anisotropy only’: calculated shifts using magnetic anisotropy alone. (E) ‘Charge only’: calculated shift using electric field term alone
(see text). For the ring current and magnetic anisotropy the parameters given by Ribas Prado and Giessner-Prettre [130] were used without
adjustment; the same applies for the parameters for the electric field term (see text).
Table 6
Structural
1
H chemical shifts (ppm) in DNA
a
d
ref
b
d
ib
d
39b
d
59b
AGCTAGCTAGCTAGCT
H19 5.19 5.25 5.48 5.80 1.25 1.0 0.65 0.45 10.6 10.6 10.3 10.3
H29 2.28 2.28 2.28 2.28 0.3 0.3 10.2 10.2
H20 2.47 2.47 2.47 2.47 0.3 0.2 0.1 0.0
H39 4.72 4.72 4.72 4.72 0.2 0.2 0.0 0.0
H49 4.17 4.17 4.17 4.17 0.1 0.1 0.0 0.0
H59 4.02 4.02 4.02 4.02 0.15 0.15 0.0 0.0
H50 3.98 3.98 3.98 3.98 0.15 0.15 0.0 0.0
H8 8.50 8.11
cccc
H6 7.70 7.64
cccc
H5 6.21 10.8 10.8 10.5 10.5
H5 2.00 10.5 10.5 10.3 10.3
H2
d
8.60 11.5 11.0 10.8 10.8
a
Structural chemical shift effects in DNA [50]. The chemical shift can be estimated using these structural chemical shift effects from, d
est
?
d
ref
t d
ib
t d
39b
t d
59b
, where d
ref
is the reference shift, i.e. the chemical shift in the absence of ring current, magnetic anisotropy and electric
field effect, d
ib
is the shift from the own base, it is calculated assuming that the glycosidic torsion angle is 2408 and the sugar is S-puckered, d
39b
is the experimentally observed shift resulting from the presence of a 39-neighboring base in DNA B-type helices, and d
59b
is the experimentally
observed shift resulting from the presence of a 59-neighboring base in DNA B-type helices. The uncertainty, i.e. expected standard deviation, in
the estimation of d
est
is about 6 0.2 ppm giving the overall correspondence between calculated and experimentally observed shifts.
b
The chemical shift of the H2 protons is mainly affected by the 39-neighboring base, although some effect of the 59-neighboring base can also
be observed.
c
The chemical shifts of the base protons H6 and H8 are influenced by both the 59- and 39-neighboring bases. To estimate their effect the sum of
d
39b
and d
59b
needs to be taken. For the different base sequence combinations one finds: d
exp
of Py-N-Py equals 8.3 ppm (N ? A), 7.9 ppm (N ?
G), 7.55 ppm (N ? C) and 7.44 ppm (N ? T), so that d
39b
t d
59b
equals 10.15 to 0.2 ppm for N ? A, G, C and T; d
exp
of Pu-N-Pu equals
8.05 ppm (N ? A), 7.65 ppm (N ? G), 7.25 ppm (N ? C) and 7.15 ppm (N ? T), so that d
39b
t d
59b
equals 10.45 to 0.5 ppm for N ? A, G, C and
T; d
exp
of Pu-N-Py equals 8.15 ppm (N ? A), 7.75 ppm (N ? G), 7.35 ppm (N ? C) and 7.25 ppm (N ? T), so that d
39b
t d
59b
equals 10.35 to
0.4 ppm for N ? A, G, C and T; d
exp
of Pu-N-Py equals 8.15 ppm (N ? A), 7.8 ppm (N ? G), 7.42 ppm (N ? C) and 7.3 ppm (N ? T), so that
d
39b
t d
59b
equals 10.30 to 0.35 ppm for N ? A, G, C and T.
329S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
molecules, in particular of proteins. For proteins it has
now been found that
13
C shifts can be calculated with
good accuracy, as evidenced by a comparison with the
experimental chemical shifts of known structures
[117]. These studies show that the
13
C shifts are
mainly affected by the local geometry and the electric
field effect which can be quite long range in nature.
These results also put the older chemical shift calcu-
lation data for heteronuclei in nucleic acids as
described by Giessner-Prettre and Pullman into a
new perspective. They may be expected to be more
reliable than originally thought, but various aspects
still have to be tested experimentally; for example,
the dependence of the shifts of the carbon resonances
in the ribose ring on sugar pucker; the dependence of
the shifts of carbon resonances of the base moiety on
electric field, hydrogen bonding and changes in the
glycosidic torsion angle x. Ghose et al. [134] have
calculated the C8 chemical shift as a function of the
glycosidic torsion angle x and found very significant
variations (up to 5 ppm). Apart from these more subtle
effects of local geometry both the
13
C and
15
N base
resonances can shift up to 70 ppm on protonation, for
example, the Adenine N1 and C2 resonances [48].
6.6.
31
P shifts
Gorenstein and coworkers (see, for example, Ref.
[90]) have extensively studied the relationship
between the
31
P chemical shift and backbone torsion
angles ? and z. They observed that for DNA a correla-
tion exists between the ? and z torsion angles. Regular
B-helix DNA can exist in a BI conformation defined
by ? and z in the gauche range and the trans range,
respectively, or in a minor BII conformation defined
by ? and z in the gauche range and the trans range,
respectively. Experimentally, the
31
P shift shows a
linear correlation with changes in ?. Quantum
mechanical chemical shift calculations were per-
formed to demonstrate that the change in
31
P shift is
linearly related to changes in the torsion angle z. From
these data it was inferred that a change in ? affects a
corresponding change in z. Giessner-Prettre and Pull-
man [127] have critically discussed these
31
P changes.
Alternative calculations have demonstrated that
31
P
shifts may just as easily result from bond angle
changes or be due to counterions bounds close to
the phosphate group. Thus, although
31
P shifts are
likely to correspond to torsion angle changes they
may also be the result of other effects. In fact, in hair-
pin loop structures changes in
31
P shift are observed,
but not all
31
P changes can be attributed to changes in
?, z and a (see, for example, Ref. [11]). At present
therefore the consensus is to take the conservative
approach. The torsion angles ?, z and a are assumed
to be in the regular helix ranges, trans, gauche 1 and
gauche 1 , respectively, only when
31
P is not shifted
(see, for example, Refs. [36,135]).
7. Assignment methods
Assignment of resonances is the first essential step
in an NMR study of biomolecules aimed at determi-
ning their three-dimensional structure. The recent
development of
13
C and
15
N labeling has made it
possible to devise heteronuclear through-bond assign-
ment methods for nucleic acids that are so successful
in the field of NMR of proteins. As a result the pos-
sibilities for structural studies by NMR have been
considerably enhanced. We will discuss in detail the
novel through-bond assignment methods for labeled
nucleic acids. In addition, we describe NOE-based
assignment approaches that employ
13
C and
15
N
edited NOESY spectra. For these heteronuclear
methods we pay particular attention to the practical
aspects of setting up the experiments, i.e. optimal
delay settings, etc. We will, however, first briefly
outline the approaches that are used in the case of
unlabeled compounds.
7.1. Assignment without isotopic labeling
For unlabeled compounds the
1
H sequential
resonance assignment is naturally based on a
combination of through-bond
1
Hto
1
H and NOE
connectivities augmented with
1
Hto
31
P connectiv-
ities. Although heteronuclear NMR experiments,
involving
13
Cor
15
N at natural abundance, can also
be performed, they are of limited value because of
their inherent lower sensitivity (see below). Since
the assignment strategy for unlabeled nucleic acids
has been described in detail on various occasions
(see, for example, Wijmenga et al. [62]), we will
here only briefly outline the main steps and concen-
trate on a discussion of the main uncertainties
associated with such an NOE-based assignment.
330 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
The assignment for unlabeled nucleic acids consists
of essentially four steps, as sketched in Fig. 13(A):
1. Assignment of imino and amino resonances via
sequential NOE imino to imino and amino to
imino contacts to establish the base pairing, i.e.
the secondary structure (see Fig. 2 for an overview
of the relevant short distances).
2. Partial assignment of non-exchangeable base and
sugar proton resonances via NOE imino (and
amino) to H2, H6, H8, H5(M) and H19 contacts
(see Fig. 2 for an overview of the short distances).
In A.T and A.U base pairs the NOE cross peak
between imino (T,U) is very intense, because of
the short intra-base pair distance, d(AH2;T/
UNH3) (see Fig. 2), so that A H2 resonances can
easily be identified from the imino resonances. In
A-type helices the H2 protons have
short cross-strand distances to H19 protons, as
shown in Fig. 2, which allows an easy means of
identification of H19 resonances (but see also, for
example, Ref. [136]). Unfortunately, in B-type
helices such short distances do not exist, so that
this route does not really provide a means for
further assignment of non-exchangeable proton
resonances. Generally, NOE cross peaks are
observed between imino resonances to H6, H5
(M) resonances in pyrimidines, which result from
magnetization transfer via spin diffusion through
the amino resonances.
3. The sugar proton spin systems (H19 to H39) and the
Cytosine/Thymine/Uridine proton spin systems
(H6 to H5/M) are identified in (
1
H,
1
H) COSY or
(
1
H,
1
H) TOCSY spectra (via through-bond
coherence transfer). In TOCSY spectra of DNA,
the cross peaks are between Cytosine H6 and H5
resonances, a direct and easy way to identify these
residues; the cross peaks between H6 and methyl
resonances provide the identification of Thymine
residues. In TOCSY spectra of RNA, the H6 to H5
cross peaks of Cytosine and Uracyl reside in the
same spectral region, so that one cannot
distinguish between these types of bases. These
TOCSY cross peaks do however still allow them
to be distinguished from the Guanine and Adenine
bases. Depending on the sugar puckering one may
be able to identify the sugar spin system more or
less completely from (
1
H,
1
H) TOCSY spectra. In
DNA the TOCSY transfer route from H19 up to
H39 is always open independent of the sugar puck-
ering, because of the presence of the H20 proton
(see Section 5). For DNA the sugar ring is usually
in the S-type conformation, which is characterized
by a small
3
J
3949
-coupling, so that TOCSY transfer
can only occur from H19 up to H39. To be able to
identify the other sugar proton resonances the
sugar ring should be N-puckered or consist of a
mixture of N- and S-type puckers, so that transfer
up to H49 can be achieved. Transfer up to H59 and
H50 may even be achieved when in addition the g
torsion angle is not gauche t . In RNA the sugar
pucker is generally N-type, leading to small
3
J
1929
-
couplings, so that TOCSY transfer cannot be
achieved from H19 into the system, rendering the
identification of H29 protons more difficult. On the
other hand, for S-type sugar conformations or
when the sugar puckering state is a mixture of
N- and S-type puckers, transfer from H19 up to H49
can be achieved. This difference in transfer
between N-type and S-type sugars allows an easy
and direct means of identifying the sugar pucker-
ing in RNA. The identification of these spin
systems aids in the subsequent sequential assign-
ment of the non-exchangeable base and sugar
proton resonances via sequential NOE contacts,
i.e. the set of contacts involving H6/H8 and H19
resonances and/or the set involving the H6/8 and
H29/20 resonances.
4. Assigment of
31
P and H39,H49 and H59/50 reso-
nances via (
1
H,
31
P) HETCOR experiments.
Depending on the results obtained in step 3 these
proton resonance assignments constitute either an
extension or confirmation. The
31
PtoH39 and H59/
H50 correlations in HETCOR experiments can in
favorable cases, i.e. no overlap and large enough
J
HP
-couplings, provide sequential contacts in a
through-bond fashion [137,138].
There are two main problems associated with
sequential assignment of proton resonances in
unlabeled nucleic acids. The first is resonance over-
lap. The second is related to the fact that the assign-
ment is based mainly on NOE contacts, which are
inherently ambiguous with regard to, for example,
intra- and internucleotide contacts. In addition,
NOE-based sequential assignment requires assump-
tions with regard to the conformation of structural
331S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
elements of the sequence studied. In contrast, sequen-
tial assignment based on through-bond connectivities
does not suffer from these two drawbacks. It would
seem to be of value, therefore, to critically discuss for
the main assignment steps when and where these
problems show up.
The first problem, resonance overlap, is mostly
encountered in the sequential assignment of the H6/
H8 and sugar resonances (step 3). Particularly strong
overlap is seen for the sugar ring proton resonances
H39 to H50 in DNA and H29 to H50 in RNA. A number
of ways exist to resolve (at least partly) this overlap in
unlabeled compounds. The first possibility is to use
multi-dimensional homonuclear NMR spectroscopy,
such as 3D TOCSY–NOESY, and/or 3D NOESY–
NOESY. A single NOE cross peak is in a 3D
TOCSY–NOESY spectrum spread out in a third
dimension into a number of cross peaks. This set of
cross peaks, corresponding to the H19 to H59/50
protons, results from TOCSY transfer through the
sugar spin system, and serves as a fingerprint of the
ribose system [62,75,77,114]. From such a spectrum
the virtually complete assignment has been made in a
24-residue RNA duplex containing a G.A tandem
base pair [75], and a large number of NOE con-
straints were derived, leading to a high resolution
structure [76,77]. Another example is represented by
a 31-residue intramolecular DNA triple helix, for
which the proton resonance assignment could be
obtained using, for example, 3D TOCSY–NOESY
spectroscopy [12,139]. Gorenstein et al. have used
3D NOESY–TOCSY and 3D NOESY–NOESY spec-
troscopy to derive high resolution structures of DNA
duplex molecules (see, e.g. Ref. [72]), and perform
structural studies of DNA three-way junctions (see,
e.g. Ref. [140]). An alternative approach to resolving
assignment problems associated with resonance
overlap is to build into the sequence structural ele-
ments of known fold and for which the resonances
positions are known, a method which has been
employed by Altona et al. with good success in their
studies of DNA three- and four-way junctions (see,
e.g. Refs. [42–44,141,142]).
The inherent ambiguity of NOE-based assignment
shows up in each of the steps. It is the least prominent
in the first step of the assignment process (Fig. 13(A)),
for a number of reasons. Firstly, only one imino reso-
nance is observed per base pair. In addition, the imino
resonances reside in a separate spectral region with
relatively little overlap. Furthermore, their spectral
positions within this region are well known in regular
Watson–Crick base pairs and for other non-standard
base pairs (Table 5). Consequently, the presence of an
imino proton resonance in a particular chemical shift
region is good evidence for the formation of a base
pair. Sequential assignment in these structural ele-
ments can be achieved with a high level of certainty,
since in canonical A- and B-helices the sequential
contacts are quite well established. Uncertainties can
Fig. 13. Flowcharts for resonance assignment in unlabeled and
labeled nucleic acids. (A) NOE-based assignment in unlabeled
compounds; (B) NOE-based assignment in labeled compounds;
(C) assignment via through-bond coherence transfer in labeled
compounds.
332 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
arise for non-canonical structures such as tetramers,
triple helices, and three- or four-way junctions
[11,16,143]. In these structural elements unusual
base pairing may be found. The imino proton
resonances may then show up at unusual resonance
positions and/or can be broadened. In addition, the
sequential NOE contacts can be quite different from
those found in a canonical helical environment.
In the second assignment step, NOE connectivities
from imino and amino protons to non-exchangeable
protons are used. Here again no uncertainty exists
with regard to assignment if the conformation is a
regular A- or B-type helix, but it is a different matter
in non-canonical environments. In the latter case
uncertainties arise because assumptions must be
made with regard to the conformation and the correct-
ness of the assignment must be based on the internal
consistency of NOE contacts and on a physically
reasonable model. We consider as an example the
H2 protons. In an A.U/T Watson–Crick the most
intense and thus the most reliable is an intra base
pair NOE contact from U/T NH3 to A H2. When
non-standard G.A base pairs may be present, a
number of possible configurations may be assumed
for them [144]. In the G.A anti–anti base pair [144],
the same type of intra base pair NOE contact is
present as in a Watson–Crick A.T/U base pair,
namely from G NH1 to A H2. On the other hand, in
Fig. 13. (Continued).
333S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
a sheared G.A base pair this contact is absent [144]. In
short, for both situations, i.e. for standard Watson–
Crick base pairs and for G.A base pairs, the H2 proton
is an important stepping stone on the route from imino
proton resonance assignment to the assignment of the
non-exchangeable proton resonances. Unfortunately,
as discussed above, the H2 proton often shows a rather
complex and/or limited set of NOE contacts to other
non-exchangeable protons (see Section 4 and Ref.
[136]). It is therefore of limited use for further sequen-
tial NOE-based assignment of non-exchangeable pro-
tons. The Adenine H8 protons have, on the other hand,
an extensive set of intra- and inter-residue (sequential)
NOE contacts. Through-bond H2 to H8 correlations
can fortunately be obtained not only for
13
C labeled,
but also for unlabeled compounds, as was recently
shown in Ref. [144], providing a reliable link between
the H2 and other non-exchangeable protons.
In the third step, sequential and intra-residue NOE
contacts between H6/8 and H19/H29/H29 are used.
Similar arguments to those above apply here with
regard to uncertainties in the assignment. NOE
contacts are well established and reliable in a regular
A- or B-type helix environment, but not for other
conformations; the correctness of the ultimate assign-
ment must be based on internal consistency of the
NOE contacts and a physically reasonable model. For-
tunately, the NOE network for the H8/6/5/M and
sugar ring protons is quite dense and contains a large
number of NOE contacts related to conformation
Fig. 13. (Continued).
334 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
independent distances (see Section 4), providing a
large number of internal checks on the internal con-
sistency of the assignment. Furthermore, as discussed
above, through-bond proton–proton correlations can
be obtained in the pyrimidine bases (H6 to H5 or M),
so that they can be identified and distinguished from
purine bases, via COSY or TOCSY experiments. The
H6 proton resonances of Cytosine have a characteris-
tic doublet structure, whereas H8 resonances have a
singlet structure, so that they can often be
distinguished in NOESY spectra. Through-bond
proton–proton correlations in the sugar ring may
serve as a fingerprint for the ribose spin system, a
feature used to great advantage in 3D TOCSY–
NOESY spectra. Finally, chemical shifts of the base
and ribose sugar protons can help in the assignment,
as they show distinct patterns depending on the type
of sequence (see Section 6). Nevertheless, the only
way to ascertain base to sugar ring contacts without
any ambiguity is via through-bond connectivities.
These are relatively easy to generate in
13
C/
15
N
labeled compounds. They could in principle also be
derived in unlabeled compounds, but their derivation
suffers from the low sensitivity of (
1
H,
13
C/
15
N) HET-
COR experiments at natural abundance.
Through-bond correlations are used in the fourth
assignment step. Thus, these assignments do not suffer
from the uncertainties associated with NOEs. Here, it is
the extreme overlap that often prevents full assignment.
When using an assignment approach based on
NOEs, proof for the correctness of the assignment
has to come from the internal consistency of the
NOE contacts and from agreement with a physically
reasonable model. One might at first glance expect
that in the subsequent structure calculations further
confirmation of the assignment could easily be
obtained. This may not always be the case, however,
since simulated annealing protocols can be quite for-
giving with regard to a few wrong NOEs. This is
important, since structural features derived by means
of NMR studies often depend on a few NOE contacts.
Various examples existing in the literature show that a
few wrong assignments have led to wrong structural
features [16,145–147]. This is a very serious problem.
It is therefore of the utmost importance that correct
assignments are obtained, and that the ultimately
derived set of structures is checked, beyond the
usual means of precision (rmsd’s) and the number of
violations, to see whether the structures are physically
reasonable. It also means that the arguments for the
key resonance assignments should be presented.
As follows from the discussion given above, reso-
nance assignment in unlabeled compounds can be
quite difficult, due to the extensive overlap of the
proton resonances of the sugar ring and the fact that
NOEs are inherently ambiguous with regard to intra-
and internucleotide contacts. Isotope labeling opens
up the way for a suite of through-bond assignment
strategies offering a variety of options for establishing
unambiguous intra- and internucleotide contacts.
Nevertheless, the NOE-based assignment strategy is
extremely powerful and quite a number of detailed
structures have been derived with this method (see,
for example, Refs. [12,68,69,72,76,77]).
7.2. Assignment with isotope labeling
The recent possibility of
13
C and
15
N labeling has
signaled the introduction of alternative and improved
assignment schemes, which aim to resolve the two
main problems associated with resonance assignment
of unlabeled nucleic acids, namely, the resonance
overlap and the ambiguity associated with NOE-
based assignments. In the first scheme shown in
Fig. 13(A), which is a slightly adapted version of
the one proposed by Nikonowicz and Pardi [148],
the assignment process is improved mainly via
reduction of resonance overlap, i.e. 2D NOESY
experiments are replaced by
15
N- or
13
C-edited
NOESY experiments (compare steps I, II and III in
Fig. 13(A) and (B), respectively). In addition, in step
III the through-bond proton–proton correlations are
replaced or augmented by through-bond carbon–
carbon correlations (compare step III in Fig. 13(A)
and (B), respectively). In Fig. 13(C) an assignment
scheme is shown, in which as many as possible of
the NOE-based correlations are replaced by through-
bond correlations. In the following we will first
discuss the elements of the assignment scheme
shown in Fig. 13(B), and subsequently, the elements
of the assignment scheme shown in Fig. 13(C).
Several aspects of the through-bond assignment
methods have recently been reviewed [36,46–48].
We will try to give a complete overview on the one
hand and on the other concentrate on matters of sen-
sitivity and implementation. The latter can now be
335S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
done, because only recently a complete overview of
all the heteronuclear J-couplings in
13
C and
15
N
labeled nucleic acids has appeared [49]. In the light
of these results we will discuss the proposed through-
bond experiments and describe their transfer functions
in order to assess their relative sensitivities.
7.2.1. NOE-based correlation
Traditionally, sequential assignment in unlabeled
nucleic acids has been based on sequential and
intra-residue NOE connectivities, augmented with
intra-sugar ring through-bond
1
Hto
1
H connectivities
and
1
Hto
31
P connectivities (see Fig. 13(A)).
Nikonowicz and Pardi [148] proposed a complete
strategy for sequential resonance assignment in
13
C=
15
N labeled RNAs based mainly on NOE con-
nectivities. The main steps in this scheme are illu-
strated in Fig. 13(B); this scheme shows some
changes compared to the original scheme proposed
by Pardi et al., for example the order of the assign-
ment steps III and I/II is reversed and step IV is
missing. We have adapted the original scheme in
this way to emphasize the similarities with the assign-
ment in unlabeled compounds, and to indicate the
usual flow in the assignment process, which goes
from imino/amino assignment to assignment of non-
exchangeable protons. In the original, the assignment
starts (step III in Fig. 13(B)) with identifying all
protons and carbons belonging to the same sugar
ring through application of a set of 2D and 3D hetero-
nuclear HCCH–COSY NMR experiments [148,149].
Note that the sugar ring systems are identified by 2D
and 3D HCCH experiments instead of (H,H) COSY or
(H,H) TOCSY experiments as used in the scheme for
unlabeled nucleic acids. This has the advantage that
resonance overlap is reduced and, very importantly,
that for the through-bond coherence transfer the large
conformation independent (42 Hz) J
CC
-couplings are
used, instead of the smaller (2–8 Hz) conformation
dependent J
HH
-couplings. Next, the individual sugar
rings are connected to their corresponding bases
through intra-residue proton–proton NOE contacts
observed in a 3D (
1
H,
13
C,
1
H) NOESY–HMQC (see
Fig. 14(A)). Sequential NOE connectivities observed
in this experiment are used to assign each residue in
the nucleotide sequence. The imino/amino protons are
assigned by 2D (
1
H,
15
N) HMQC and 3D (
1
H,
15
N,
1
H)
NOESY–HMQC experiments in H
2
O (step I in
Fig. 14. The NOESY–HMQC sequences and HMQC sequence. The
thin and thick filled bars indicate 908 and 1808 pulses; the filled grey
boxes indicate decoupling. (A) The NOESY–HMQC sequence for
use in D
2
O; (B) the NOESY–HMQC sequence for use in H
2
O, the
water suppression is achieved via (1, 1 1,echo) incorporated into the
HMQC sequence [148]; (C) HMQC sequence for use in H
2
O, with a
(1, 1 1,echo) for water suppression [148]. In the (1, 1 1,echo)
sequence, the carrier should be at the water resonance position,
u
H2O
; d should be set to (4(u
H2O
1 u
obs
))
11
, where u
obs
is the fre-
quency corresponding to the part of the spectrum that is to be
observed, e.g. for imino resonance observation set d in the range
50–70 ms.The delay D is for creating an anti-phase signal, and
should be approximately 1/(2J
HX
); t
m
is the NOESY mixing time.
The minimal phase cycling is in (A): f
0
? (x, 1 x) t (TPPI or
States–TPPI); f
1
?1x; f
2
? x; f
3
? x; f
4
? (x,x, 1 x, 1 x) t
(TPPI or States–TPPI); f
5
? x; receiver ? x, 1 x, 1 x,x; in (B): f
0
? 8(x, 1 x) t (States–TPPI); f
1
?1x t 458; f
2
? 16x; f
3
?
16e1xT; f
4
? 2(2x,2y,2e1xT,2e1yTT; f
5
? 2(2( 1 x),2( 1 y),
2x;2yT; f
6
? 4(x,x, 1 x, 1 x) t (TPPI or States–TPPI); f
7
? x;
receiver ? 4(x, 1 x, 1 x,x); in (C):
1
H: f
1
? 8x; f
2
? 8( 1 x); f
3
?
(2x,2y,2e1xT,2( 1 y)); f
4
? (2( 1 x),2( 1 y),2x,2y);
15
N: f
4
?
4ex, 1 xTt(TPPI or States–TPPI); f
5
? x; receiver ? 4(x, 1 x).
The phase of f
1
in (B) is set to x t 458 to distribute the effect of
radiation damping by the water signal (see Section 7.2.3).
336 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Fig. 13(B); the pulse sequences of the two experi-
ments are given in Fig. 14(B) and (C), respectively).
These experiments also provide connectivities to non-
exchangeable protons (step II in Fig. 13(B)).
7.2.2. Through-bond correlation
For a description of the through-bond correlation
experiments we follow the assignment scheme as
sketched in Fig. 13(C). In order to be able to indicate
the efficiency of the various experiments we will first
discuss the coherence transfer functions used.
7.2.2.1. Coherence transfer functions. The efficiency
of a 2D or 3D NMR experiment can be calculated via
coherence transfer functions. Assume an nD
experiment in which m transfer steps occur of
duration t
m
and interspaced with p evolution
periods. We will denote the coherence transfer
efficiency for each step by T
m
. T
m
is the product of
the transfer efficiency per se, F
m
, and the effect of
relaxation of the relevant coherences, T
2m
. T
m
is
then given by T
m
? F
m
exp( 1 t
m
=T
2m
). The term F
m
can be calculated exactly. For pulse sequences using
INEPT type coherence transfers it is simply a product
of sin and cos terms; for TOCSY transfers analytical
expressions can only be derived for simple two spin
systems, whilst for larger spin systems TOCSY
transfers have to be calculated numerically. The
overall efficiency of the nD NMR experiment, T
nD
,
is the product of all T
m
values, T
nD
? T
1
·
…
·T
m
.
However, in each transfer step, losses, T
mrf
, due to
imperfect settings of, for example, 1808 or 908
pulses and rf inhomogeneity may be incurred. We
collect them together in one term, T
rf
. With this
term included the overall efficiency of an nDNMR
experiment becomes T
nD
? T
rf
·T
1
·
…
·T
m
. The T
rf
term
is difficult to assess exactly, although it can be
reasonably estimated from the properties of a probe.
T
rf
will become smaller as the number of pulses
applied in an nD NMR experiment increases. For
modern probes the rf homogeneity is so good, i.e.
the signal ratio is around 0.8, when comparing a
7208 to a 908 pulse, that the term T
rf
is expected to
lie around 0.8 for most multi-pulse NMR experiments.
With these considerations in mind the efficiency
calculations excluding the effect of the T
rf
term
should still give a reasonable estimate of the actual
efficiencies of an nD NMR experiment.
7.2.2.2. Through-bond amino/imino to non-
exchangeable proton correlation. Until now three
sets of pulse sequences have been proposed for the
correlation of exchangeable and non-exchangeable
protons via through-bond coherence transfer (step II
in Fig. 13(C)). The first set is by Simorre et al.
[150,151], a second set by Sklenar et al. [152], and
a third by Fiala et al. [153]. The proposed sequences
have large overall similarities, but differ in the
parameter settings used.
The first sequence of Simorre et al. [151]
(Fig. 15(A)) seeks to correlate guanosine imino pro-
tons and H8 protons in
13
C/
15
N labeled RNAs via an
HNC–TOCSY–CH experiment. The experiment
starts with imino proton coherence which is frequency
labeled during t
1
and then transferred to
15
N via a
refocused INEPT sequence. The in-phase N1
coherence is subsequently transferred to C6 and C2
via a (C,N) DIPSI cross-polarization sequence of
44 ms duration at a radio frequency field strength (rf
field strength) of 1.9 kHz and with the carriers of
15
N
and
13
C set at 146 and 161 ppm, respectively. The
rather long mixing time is required because of the
relatively small J-couplings from N1 to C6 and
from N1 to C2 ( < 7 Hz, see Fig. 3). Subsequently,
a 37.8 ms long FLOPSY8 (C,C) TOCSY sequence is
employed, with an rf field strength of 5 kHz and the
carrier at 145 ppm, to transfer the in-phase C6 or C2
coherence to C8. Since the C5 and C6 carbons reside
at approximately 120 and 160 ppm, respectively, a
carrier position at 145 ppm corresponds for C5 and
C6 to offsets of 3750 and 2250 kHz. Considering
these settings and the J-couplings in the guanosine
base (see Fig. 3), possible coherence transfer routes
are from C6 directly to C8 or from C6 to C4 to C8, or
from C6 via C5 to C4 and then to C8. In view of the
small J
C2C5
-coupling a transfer route from C2 to C5
and then either via C4 or C6 to C8 is unlikely. The in-
phase C8 coherence is in the last stage transferred to
H8 via a reverse refocused INEPT sequence.
The second two sequences proposed by Simorre et
al. [150], Fig. 15(B) and (C)/(D), are designed to
correlate U NH and C NH
2
with H6 in Uridine and
Cytosine, respectively. In the Uridine experiment NH
proton magnetization is transferred via a refocused
INEPT to in-phase N3 coherence. Next, this
coherence is transferred to C4 via a heteronuclear
(N,C) cross-polarization. This is done in such a way
337S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Accepted
Fig. 15.
338 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Fig. 15. (Continued).
339S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Fig. 15. Pulse schemes for through-bond correlations of amino/imino and non-exchangeable protons. (A) The 2D H(NC)–TOCSY–(C)H
experiment, used to connect imino to H8 protons in guanosines [151]. The indirect detection period t
1
(see below) was partially concatenated
with the first
1
H–
15
N INEPT using the time-shared evolution procedure, which allows a reduction of 5 ms for the longest t
1
value [234]. For
the first t
1
point, ?
1
? ?
2
? 0.5*(dw/2 1 (4p)PW90(
1
H) 1 PW180(
15
N)) and ?3 ? t1. For subsequent t
1
points, ?
1
was incremented by
dw=2 1 t
1
=en t 1T and ?
2
was decremented by t
1
/(n t 1), where dw is the dwell time and n is the total number of complex points in t
1
. The t
1
and t
2
delays were set to 2.5 and 1.25 ms respectively. The magnetization was transferred from
15
Nto
13
C by cross polarization using a 44.9 ms
DIPSI-3 sequence [235–237] at an rf field of 1.9 kHz. A 37.8 ms FLOPSY-8 sequence [238] at an rf strength of 5 kHz was used for the
13
C
homonuclear TOCSY period [239]. Unless otherwise noted, all pulses have phase x. The phase cycle was f
1
? 2(y t 458),2( 1 y t 458);
f
2
? y, 1 y; f
3
? 4eyT, 4e1yT; f
4
? 8exT, 8e1xT and receiver ? x, 2e1xT,x, 1 x,2(x), 1 x. Instead of phase cycling f2, the two 908
15
N pulses
before the
15
N–
13
C cross polarization can be eliminated and the phase of the
15
N DIPSI-3 can be phase cycled (x, 1 x) using the same receiver
phase cycling as before [240]. Eliminating these two 908
15
N pulses should lead to a slight improvement in sensitivity for probes with low
15
N
B1 homogeneity. The
15
N frequency was set to 146 ppm during the
1
H–
15
N and
15
N–
13
C transfers and it was shifted to 195 ppm at point a to
decouple N7 and N9 during the acquisition period. The
13
C frequency was positioned at 151 ppm for the
15
N–
13
C hetero-TOCSY period and
was shifted to 145 ppm at point a for the
13
C-TOCSY period and to 142 ppm at point b. To avoid a complete inversion of the water signal by the
second
1
H908 pulse, the phase of f
1
was shifted by 458 and the
1
H carrier was set to the water frequency. Radiation damping returns the water
magnetization to the t z axis during the
15
N–
13
C cross polarization and
13
C-TOCSY periods. At point b, water flip-back is achieved with a
2.9 ms selective E-BURP pulse [241,242]. During the last INEPT periods, a WATERGATE sequence is applied to suppress the residual water
signal using two 1.55 ms soft square pulses [243]. The phases of the selective E-BURP and soft square pulses were adjusted with a small angle
phase shifter for optimal solvent suppression. During the detection period,
13
C and
15
N GARP1 decoupling was used at rf fields of 1.6 and
1.14 kHz, respectively. All gradients were applied along the z axis, with g
1
? 12 G/cm, g
2
? 24 G/cm and g
3
? 32 G/cm. The gradient times for
g
1
, g
2
and g
3
were 300, 300 and 450 ms, respectively. Each gradient was followed by a recovery time of 200 ms. (B) The uridine optimized 2D
H(NCCC)H experiment employed a refocused INEPT sequence for the
1
H–
15
N transfer [150]. The
1
H carrier is placed at the center of the imino
proton region (13.2 ppm for uridine) at the beginning of the experiment and shifted to the water frequency at point a. A 90 ms DIPSI-3
13
C–
15
N
hetero-TOCSY mixing period was used at an rf field strength of 1.9 kHz, using
13
C and
15
N carrier frequencies of 169 and 160 ppm,
respectively. The
13
C carrier was shifted to 139 ppm at point a. Water magnetization returns to the t z axis by radiation damping during
the
13
C–
15
N hetero-TOCSY period. After this
13
C–
15
N hetero-TOCSY period, the magnetization is transferred from C4 to H6 using a string of
concatenated
13
C–
13
C and
13
C–
1
H INEPT type transfers. The second half of the
13
C–
1
H reverse INEPT transfer has been adapted to include a
WATERGATE solvent peak suppression [243]. In this scheme the proton 908 pulse in the center of the reverse INEPT period is preceded by a
selective water flip-back 908 E-BURP pulse (2.8 ms) at point b [241,244]. In the final refocused INEPT interval, the proton 1808 pulse is flanked
by two square 1.67 ms 908 pulses of phase 1 x, and the carbon 1808 pulse is placed at the interval t
1
? 1/(4*
1
J
CH
) following the proton 908
pulse. The phases of the selective square 908 pulses and the 908 E-BURP pulse are further optimized for maximum solvent peak suppression,
using a small angle phase shifter. The transfer delays were t ? 3.6 ms, t1 ? 2.5 ms and t2 ? 1.25 ms. G
z
denotes the z axis pulsed-field
gradients with the following values: g
1
? 300 ms at 24 G/cm and g
2
? 450 ms at 32 G/cm. A 400 ms recovery time was added after each gradient.
Quadrature detection in the proton t
1
period was obtained with the TPPI–States method [245]. The 16-step phase cycle is f
1
? x, 1 x;
f
2
? y, y, 1 y, 1 y; f
3
? 4exT, 4e1xT, f
4
? 8exT, 8e1xT and receiver ? 2ex, 1 x, 1 x, xT,2(1 x,x,x, 1 x). (C) The cytidine optimized 2D
H(NCCC)H [150], where the
13
C–
15
N refocused INEPT period in the pulse sequence given in (B) is replaced with a
1
H–
15
N hetero-TOCSY
period. A 7.5 ms half DIPSI-3 (R, 1 R supercycle) sequence [237] was used for the
1
H–
15
N hetero-TOCSY transfer at an rf field of 1.9 kHz with
the
15
N carrier set to 96.7 ppm and the
1
H carrier before point a set to 7.6 ppm. Other changes compared to the pulse sequence given in (B) were
that the
13
C–
15
N hetero-TOCSY period was set to 45 ms (
1
J
NC
< 20 Hz for cytidine instead of < 10 Hz for uridine) and an eight-step phase
340 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
that transfer to C2 is minimized, i.e. an rf field
strength of 1.9 kHz is chosen with the
13
C carrier at
169 ppm, approximately, the C4 resonance position
which is about 19 ppm offset from the C2 position
at 150 ppm; the
15
N carrier is set to 160 ppm. The
mixing period is 90 ms. Next, the C4 magnetization
is transferred by two concatenated refocused INEPT
sequences to C6. Here the choice was refocused
INEPT instead of TOCSY because of the large differ-
ence in chemical shift between C5 and C4, C6. Finally,
the C6 coherence is transferred to in-phase H coher-
ence via a reverse INEPT sequence. To correlate the
NH
2
protons with H6 in Cytosine Simorre et al. [150]
propose to use cross-polarization for the transfer from
NH
2
protons to NH
2
nitrogen as well as for the sub-
sequent transfer to C4 (Fig. 15(C) and (D)).
The pulse sequence by Fiala et al. [153] (Fig. 15(E))
aims to correlate exchangeable and non-exchangeable
protons in purines. The approach is conceptually
similar to that described above, except that the
transfer direction is from H8 to the imino proton
instead of the reverse and that an INEPT sequence
instead of a cross-polarization sequence is used in
all hetero C ! N nuclear transfer steps. For the
(C,C) TOCSY from C8 to C6, a mixing time of
55 ms and an rf field strength of 3 kHz were chosen,
and the
13
C carrier was placed at 150 ppm. The selec-
tive 1808 carbon pulse for the INEPT transfer from C6
to N1 was a 1.5 ms REBURP, positioned at 160 ppm
to cover the region from 150 to 170 ppm. As a
result the J
C6C5
-coupling does not influence the C6
to N1 transfer (see below). The
15
N carrier was
positioned at 120 ppm inbetween the N1 and NH2
resonances.
cycle was used with: f
1
? x, 1 x; f
2
? y, y, 1 y, 1 y; f
3
? 4exT, 4e1xT; and receiver ? x, 1 x, 1 x,x. (D) The cytidine optimized pseudo 3D
H(NCC)CH experiment [150], where both the amino proton resonances and the C6 resonances are frequency labeled during the t
1
evolution
periods. The parameters for the cytidine optimized pseudo 3DH(NCC)CH were the same as those for the cytidine optimized 2D H(NCCC)H,
except that the
13
C carrier was positioned at 129 ppm at point a and two 1.03 ms frequency-shifted selective IBURP 1808
13
C pulses [241,246]
were applied at 104 ppm. The first selective pulse is used to decouple C5 from C6 during the time-shared
13
C evolution period and the second
pulse at point b compensates for off-resonance effects caused by the first selective 1808 pulse [242]. The C6 evolution period t
1
is time-shared
with both the 2*t period that refocuses the C5–C6 anti-phase component and the 2*t
2
period that creates the C6–H6 anti-phase component
[234]. The time periods ?
1
, ?
3
and ?
4
were set to 0, 0 and t, respectively, for the initial t
1
interval. For evolution of the C6 chemical shift during
the t
1
period, the increments of these intervals are d?
1
? dw=2, d?
3
?edw=2 1 t=en t 1TT and d?
4
?1t/(n t 1), where dw is the dwell time and n
is the total number of points in the t
1
dimension. The
1
H 1808 pulse at point a is used to invert the water magnetization and the five gradient
pulses subsequently eliminate radiation damping and keep the water magnetization aligned along the 1 z axis until the next
1
H 1808 pulse flips
the water signal back to the t z axis. (E) The 3D (
1
H,
13
C,
15
N) triple resonance pulse sequence HC(CN)H TOCSY used to establish guanine
imino-H8,C8 and adenine amino-H8,H2,C8,C2 correlations in
13
C,
15
N uniformly labeled RNA oligonucleotides [153]. The last six proton
pulses function as a selective 1808 pulse in WATERGATE; their lengths are in the ratio 3:9:19:19:9:3 and were applied with a power of 8.9 kHz.
The DIPSI-3 [237] spin lock was applied at a power of 3.0 kHz. GARP [247] decoupling was applied at a power of 2.2 kHz for
13
C and 930 Hz
for
15
N. All
13
C pulses were applied at 150 ppm except for the selective refocusing C6 pulse, represented by the thick rounded bar, for which a
1.5 ms REBURP [241] was frequency shifted by phase modulation [248] to cover the region between 150 and 170 ppm. The
1
H frequency was
set to 10.5 ppm and the
15
N frequency to 120 ppm. Rectangular gradient pulses of 1.0 ms duration and a strength of 8 G/cm were used. Unless
indicated otherwise, 908 pulses were applied with phase x and 1808 pulses were applied with phase y. The phase cycle was f
1
? x t States–
TPPI; f
2
? 4eyT, 4e1yT; f
3
? x, 1 x t States–TPPI; f
4
? 8exT, 8e1xT; f
5
? x, x, 1 x, 1 x; receiver:x, 1 x, 1 x, x, 2e1x, x, x, 1 x),x, 1 x, 1
x, x. Delays were a ?t
1
t t
1
=2, b ? t
1
=2, c ?t
1
t t
2
=2, d ? t
2
=2; t
1
was set to 1.1 ms, t
2
to 6.5 ms, t
3
to 1.4 ms and t
4
to 2.7 ms. The value of
150 ms for q was found by optimization for the best water suppression. (F) 2D HCCNH TOCSY. (G) 3D HCCNH TOCSY experiments [152].
The INEPT evolution delays are set to match 1/4J, both for H ! C(J < 18–220 Hz) and N ! H(J < 90 Hz) transfers (d ? 1.25 ms and D ?
2.5 ms). In the 2D experiment, the proton carrier is centered at the water frequency. In the 3D experiment, the proton carrier is placed at the
center of an aromatic region at < 7.6 ppm for the t
1
evolution period and switched to the water position for the detection during t
3
in order to
optimize the resolution and the size of the data matrix. DIPSI-3 mixing [237] along the y axis is used for the C-C and C-N TOCSY with an rf
field of 2.9 kHz. The
13
C and
15
N carriers are set to 150 and 153 ppm respectively. Water flip-back WATERGATE is applied during the
15
N–
1
H
INEPT step, using typically a 4 ms EBURP2 908 pulse and 1.3 ms selective 908 pulses flanking the non-selective 1808 pulse. One scrambling
gradient is applied during the
15
N–
1
H INEPT, when the magnetization of interest is converted into HzNz order. Two identical gradients are used
in the WATERGATE sequence. In the 3D experiment the bipolar gradient echo [249] with an amplitude of 0.5 G/cm is used to suppress
radiation damping during t
2
. The phase cycle is: f incremented in order to obtain States–TPPI [245] quadrature detection in t
1
; z ? y, 1 y; w ?
y, y, 1 y, 1 y; q ? x, x, x, x, 1 x, 1 x, 1 x, 1 x; receiver ?t, 1 , 1 , t . The phase of the 908
1
H mixing pulse in the
1
H–
13
C INEPT and the
receiver phase are inverted after four scans. In addition, all phases of the WATERGATE and of the receiver are changed by 1808 after eight
scans for the 16-step cycle. The phase w is incremented according to the States–TPPI protocol to obtain quadrature detection in t
2
of the 3D
experiment, asynchronous GARP decoupling [247] is used to suppress the heteronuclear spin–spin interactions during the acquisition.
341S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
The third set of pulse sequences that has been
published to correlate exchangeable and non-
exchangeable protons is by Sklenar et al. [152]. The
2D and 3D versions of these sequences are shown in
Fig. 15(F) and (G), respectively. They are very similar
to those proposed by Simorre et al. [150,151]. The
main difference is that the transfer here is from H6/
H8 to the imino protons instead of the reverse. In
addition, the authors use the same pulse sequence
and settings for both uridine and guanosines. For
both the C–C transfer and the C–N transfer the rf
field strength used is 2.9 kHz, and the carriers are
set at 150 ppm for
13
C and at 153 ppm for
15
N. The
(C,C) TOCSY mixing period is taken to be 19 ms and
the (C,N) TOCSY mixing period 58 ms. The authors
suggest that with the carrier at 150 ppm the low rf
field will eliminate C5 from the J
CC
-coupling network
both in G and in U. They state that in G the main route
would be C8–C4–C6. However, as follows from the
J-coupling data in Fig. 3, a route directly from C8 to
C6 is also possible. In U the elimination of C5 from the
network renders the (C,C) TOCSY here ineffective
and the coherence remains on C6. The long 70 ms
mixing period in the hetero TOCSY, transfers the
C6 coherence in U to N1H coherence via the route
C6–N1–C2–N3.
To estimate the relative sensitivities of the three
groups of experiments we will compare the different
sequences in terms of their transfer efficiencies, and
discuss in detail the G N1H to H8 transfer. The overall
efficiency of the three groups of experiments can, for
the G N1H(imino) to H8 transfer, be expressed as
Trt ? Tr
INEPT
(H8 ! C8)Tr
CCTOCSY
(C8 ! C6)Tr
cpCN
3 (C6 ! N1)Tr
INEPT
(N1 ! N1H) e43T
(Simorre et al., [150,151])
Trt ? Tr
INEPT
(H8 ! C8)Tr
CCTOCSY
(C8 ! C6)Tr
INEPT
3 (C6 ! N1)Tr
INEPT
(N1 ! N1H) e44T
(Fiala et al., [153])
Trt ? Tr
INEPT
(H8 ! C8)Tr
CCTOCSY
(C8 ! C6)Tr
cpCN
3 (C6 ! N1)Tr
INEPT
(N1 ! N1H) e45T
(Sklenar et al., [152])
Here, Tr
INEPT
(H8 ! C8) and Tr
INEPT
(N1 ! N1H)
represent the efficiency of the transfer of in-phase
proton to in-phase X-nucleus coherence. For the
calculations we assume that Tr
INEPT
(H8 ! C8) and
Tr
INEPT
(N1 ! NH) are 100% efficient, which seems
reasonable in view of the large J-couplings involved
(200 Hz for J
H8C8
and 90 Hz for J
HN
). The term
Tr
CCTOCSY
(C8 ! C6) represents the efficiency of the
(C,C) TOCSY transfer, Tr
cpCN
(C6 ! N1) stands for
the efficiency of the cross-polarization between C6
and N1, and finally, Tr
INEPT
(C6 ! N1) represents
the efficiency of transfer of in-phase C6 coherence
to in-phase N1 coherence via INEPT. Comparison
of the transfer functions shows that all sequences
have the (C,C) TOCSY transfer step, the H8 to C8
INEPT transfer step, and the N1 to N1H INEPT trans-
fer step in common. The differences are found in the
C6 to N1 transfer step. For the sequences proposed by
Simorre et al. and Sklenar et al. the transfer functions
are identical, both using (C,N) cross-polarization for
this transfer step, while Fiala et al. employ INEPT for
this transfer step. We will now consider these steps in
more detail in view of the more detailed knowledge of
the J-couplings in the bases.
Fig. 16. Transfer of coherence during (C,C) TOCSY in guanosine
via numerical simulation using MLEV-17 for three different rf field
strengths, demonstrating the influence of chemical shift offsets on
the efficiency of the coherence transfer. The spin system consists of
C6, C8 and C4, all coupled with 9 Hz J-couplings; the chemical
shift offsets are 1200 Hz for C6, 200 Hz for C4 and 1 1200 Hz for
C8. At mixing time zero all coherence is assumed to be present on
C6. (a) The coherence on C6 as a function of the TOCSY mixing
time (rf field 10 kHz). (b) The coherence transferred from C6 to C8
and to C4 as a function of the TOCSY mixing time (rf field 10 kHz).
(c) As in (b), except rf field 5 kHz. (d) As in (b), except rf field
3 kHz.
342 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Fig. 16 shows the efficiency of C6 ! C8 coherence
via (C,C) TOCSY in Guanosine bases. As can be
seen from Fig. 3, in principle all carbons in the
base are connected via J
CC
-couplings. However, the
J-coupling to C2 is small. Furthermore, the C5 spin
resonates at approximately 100 ppm, so that it is
excluded from the network (but see discussion in
Section 7.2.2.3), when a sufficiently weak rf field is
used. The J
CC
-coupling network consists then of the
C6, C8 and C4 spins, connected with J-couplings of
approximately 9 Hz. The (C,C) TOCSY transfer
through the C6, C8, C4 spin network was simulated
numerically assuming an MLEV-17 pulse sequence
[77,86], for three different rf field strengths (10, 5
and 3 kHz); the offsets of C6, C8 and C4 were taken
to be 1200, 11200 and 200 Hz, respectively (esti-
mated from their usual chemical shift values as
given in Table 5, and assuming a carrier position of
150 ppm). The C6 to C8 transfer efficiency reaches an
optimum of 0.42 at 40 ms (10 kHz rf field), 0.47 at
60 ms (5 kHz), and 0.35 at 90 ms (3 kHz). These
values are obtained when T
2
relaxation is assumed
to be absent. In Table 7 the optimum values for
(C,C) TOCSY transfer are given when T
2
relaxation
is taken into account; one finds for T
2
relaxation times
of 3 s and 0.03 s, optimum transfer efficiencies of 0.42
at 40 ms, 0.11 at 25 ms, respectively, for a 10 kHz rf
field. The simulations also demonstrate that the main
transfer route is directly from C6 to C8, and not only
via the route C8–C4–C6 as suggested by Sklenar et al.
[152]. We note that the offset of the C5 spin, which
has been excluded from our calculations, corresponds
to 14500 Hz. Obviously, at rf field strengths of
10 kHz and 5 kHz, offset effects arising from C5 are
expected to affect the C6 to C8 transfer. On the other
hand, for an rf field strength of 3 kHz one expects that
the C5 spin can be disregarded. Sklenar et al. and
Fiala et al. employed a DIPSI pulse sequence of
3 kHz, while Simorre et al. employed a FLOPSY
sequence of 5 kHz. Thus, particularly in the latter
case, considerable offset effects are expected
involving the C5 spin. To estimate the effect of the
presence of C5 in the spin network in an exact manner
would require an extensive set of numerical simula-
tions, which is beyond the scope of this review. Such
an analysis would be of interest, however, since it
might very well lead to improved sequences and/or
parameter settings for the (C,C) TOCSY transfer.
Qualitatively, it is to be expected that fully excluding
the C5 spin from the network should improve the
Table 7
Transfer efficiencies in HNCCH and HCCNH-type experiments
Efficiency
a
(T
2
3s) t (ms)(T
2
3 s) Efficiency
f
(T
2
30 ms) t
f
(ms)(T
2
30 ms)
Tr
INEPT
(C6 ! N1) 0.20 56 0.034 32
Tr
cpCN
(C6 ! N1)
b1
0.20 56 0.034 32
Tr
cpCN
(C6 ! N1)
b2
0.14 50 0.034 32
Tr
CCTOCSY
(C8 ! C6)
c
0.46 55 0.12 30
Tr
CCTOCSY
(C8 ! C6)
d
0.46 40 0.17 25
Tr
CCTOCSY
(C8 ! C6)
e
0.15 19 0.08 19*
Trt (Fiala et al.) 0.092 90 0.004 62
Trt (Simorre et al.) 0.092 90 0.006 57
Trt (Sklenar et al.) 0.021 69 0.003 51
a
Transfer efficiencies calculated for the time t
m
b1
The optimum transfer with the settings Simorre et al. (experimental mixing time 44.3 ms) (see text).
b2
Optimum transfer Sklenar et al. (experimental mixing time 58 ms) (see text).
c
The transfer was simulated using an MLEV-17 sequence of 5 kHz with the carrier at 150 ppm (see text) (optimum of 0.48 at 60 ms); the
efficiency is given at 55 ms which corresponds to the mixing time used by Fiala et al.
d
The numbers correspond to the efficiency calculated for MLEV-17 sequence using an rf field of 10 kHz with the carrier at 150 ppm (see text)
(optimum of 0.46 at 40 ms); the efficiency is given at a mixing time of 40 ms, which roughly corresponds to the mixing time used by Simorre et
al. (37.8 ms) (see also text).
e
Transfer found for the shorter mixing time of 19 ms used by Sklenar et al.; the rf field is assumed to be 5 kHz (optimum of 0.48 at 60 ms) (see
text).
f
Efficiency at the optimum value for the mixing time, which is the value given in the next column, except for those indicated with a *.
343S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
transfer efficiency. At first sight one would expect that
this could be achieved by just lowering the rf field
strength, but as can be seen from Fig. 16, at 3 kHz
the transfer becomes negatively affected by the off-
sets of C6 and C8. Thus, an improved sequence
should consist of a shaped (C,C) TOCSY
sequence, which only covers the region of C6,
C4, C8 and C2.
The INEPT transfer from C6 to N1 (see the
sequence proposed by Fiala et al. [153]) is given by
T
C6x, C6yN1z
? sin(pJ
N1C6
2t
2
)cos(pJ
C6C8
2t
2
)
cos(pJ
C6C5
2t
2
)cos(pJ
C6C4
2t
2
) e46T
T
N1zC6x, N1y
? sin(pJ
N1C6
2t
2
)cos(pJ
N1C2
2t
2
)
Tr
INEPT
? T
C6x, C6yN1z
·T
N1zC6x, N1y
Fiala et al. employ a selective
13
C pulse, which
covers the region of 150 to 170 ppm, i.e. the spectral
region where C6, C2 and C4 resonate. Consequently,
the terms involving J
C6C5
, cos(pJ
C6C5
2t
2
), and J
C6C8
,
cos(pJ
C6C8
2t
2
), equal 1. The transfer efficiencies,
Tr
INEPT,
in G are shown in Fig. 17, assuming T
2
values
of 3 s and 30 ms, respectively. An optimum of about
20% is reached at t
2
? 14 ms with a selective pulse
on
13
C and T
2
? 3 s; this number drops to 3.4% at
8 ms when T
2
? 30 ms (see also Table 7); without the
selective
13
C pulse, i.e. with J
C6C8
active, these are
14% at t
2
? 12.5 ms, and 3.4% at 8 ms, respectively
(see also Table 7). We note that leaving the passive
J
C6C5
-coupling present would not have a significantly
negative effect on the efficiency.
In the pulse sequences of Simorre et al. and Sklenar
et al., a DIPSI cross-polarization was used to achieve
the C ! N coherence transfer. Simorre et al. use a
weak field of 1.9 kHz and place the
13
C and
15
N
carriers exactly on C6 (161 ppm) and on N1H
(146 ppm), respectively. They use a mixing time of
44 ms. With these settings the C8 spin may be
assumed to be absent from the spin system effective
during the C to N transfer, i.e. it consists of C6, C2,
N1 and C4. The efficiency may then be estimated to
be similar to that of the INEPT transfer with a selec-
tive
13
C 1808 positioned at C6 as used by Fiala et al.,
i.e. optimum transfer of 20% at 56 ms (T
2
? 3 s).
Sklenar et al. used a field of 2.9 kHz, with the carrier
at 150 and 153 ppm for
13
C and
15
N, respectively; the
mixing time for C ! N transfer was optimized for
guanosines to 58 ms. With these settings C8 may
not be excluded from the spin network, and the
cross-polarization transfer then corresponds to an
INEPT transfer where C8 is included in the spin
network, i.e. optimum transfer of 14% at 50 ms (T
2
? 3 s). Since the transfer based on cross-polarization
is analogous to INEPT, we used the values dervied
from INEPT in Table 7.
The transfer efficiencies are summarized in Table 7.
As can be seen, no essential differences exist for the
three sequences when T
2
is 3 s, although the
sequences of Simorre et al. and Fiala et al. are slightly
more efficient, because the C8 spin is excluded from
the network on C to N transfer. For larger systems
(T
2
? 30 ms) the efficiency drops dramatically.
Interestingly, when T
2
is 30 ms, the sequence of
Simorre et al. outperforms the other sequences. This
is a consequence of the faster carbon to carbon trans-
fer, which in turn results from the stronger rf field
strength used by Simorre et al.
7.2.2.3. Through-bond H2–H8 correlation. To obtain
through-bond H2–H8 correlations in
13
C/
15
N labeled
RNAs (Fig. 13(C), step III), two pulse sequences have
been published; one by Marino et al. [154] (Fig. 18(A)),
and one by Legault et al. [155] (Fig. 18(B)). The two
pulse sequences are quite similar and are closely
related to the HCCH–TOCSY experiment originally
Fig. 17. Efficiency of N1x ! C6x coherence transfer via INEPT in
HCCNH experiment (see text); (a) T
2
? 3s,J
C6C8
inactive; (b) T
2
?
30 ms, J
C6C8
inactive; (c) T
2
? 3s,J
C6C8
active; (d) T
2
? 30 ms,
J
C6C8
active. J
CC
-couplings are as given in Fig. 3.
344 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
proposed for proteins [156–158]. In the version of
Legault et al., improved signal-to-noise was
obtained by replacing the final INEPT step by a
sensitivity-enhanced version. Nowadays, versions of
the 2D HCCH–TOCSY and 3D HCCH–TOCSY exist
which have further improved sensitivity; as compared
to the standard sequence these sequences exhibit an
improvement in sensitivity by a factor of 1.4 for the
2D versions and close to a factor of 2 for the 3D
version (see also discussion in Section 7.2.2.5, and
Refs. [158–166]. Thus, it would be advisable, for
optimal sensitivity, to use these enhanced sequences
instead of the ones illustrated in Fig. 18.
In both the published pulse sequences (Fig. 18) H2
Fig. 18. Pulse schemes for HCCH–TOCSY experiments to determine adenine H2 to H8 correlations in isotopically labeled RNAs. The narrow
black rectangles indicate 908 pulses, the wider black rectangles 1808 pulses, the grey filled wide rectangles indicate 1808 composite pulses (see
below), and the narrow unfilled rectangle indicates a trim pulse. All pulses are applied along the x axis, unless indicated otherwise. (A) The
2D HCCH–TOCSY [157,158] experiment [154]. The 16-step phase cycle is as follows: f
1
? y, 1 y; f
2
? 4exT,4eyT,4( 1 x),4( 1 y); f
3
? 8(x),
8( 1 x), f
4
? 2exT,2e1xT; f
5
? 2exT,2eyT,2e1xT,2e1yT; f
6
? 4exT,4e1xT; Acq. ? 2ex, 1 x, 1 x, xT,2e1x, x, x, 1 xT. The 1808 f
3
and 1808 f
4
are of the composite type (90x180y90x) and a 1.7 ms spin lock trim pulse is applied with phase f
6
. The
13
C carrier is positioned in the center of
the aromatic region (150 ppm), and a 100 ms spin lock period using DIPSI-2 with an rf field of 3.5 kHz (covering the range of C2, C4, C6, C8) is
applied along the y axis (C5, which is 30 ppm upfield of the carrier, is not excited). The t, d
1
and d
2
delays were set to 1/4J
CH
, 1.25 ms where J
CH
,200 Hz for aromatic resonances. Quadrature detection is obtained with the TPPI–States method [245] using w
1
? 16(x),16(y). The receiver
reference phase and w
1
are incremented by 1808 for each t
1
increment. GARP decoupling [247] of
13
C and
15
N was applied during the
acquisition, as well as during t
1
for
15
N. (B) The 3D (
1
H,
13
C,
1
H) HCCH–TOCSY experiment [155]. The phase cycle is: f
1
? x, 1 x; f
2
?
2exT, 2e1xT; f
3
? 4eyT, 4e1yT; f
4
? 8exT, 8e1xT; f
5
? 16(x),16( 1 x); and f
r
? f
1
t f
2
t f
3
t f
4
1 p/2. The composite 1808 pulses are of
the type e90x240y90xT. Complex data were collected in t
1
by States–TPPI with FIDs for f
q1
? (x,y) [245] being stored separately and in t
2
with
FIDs for f
q2
? (x, 1 x) [250] being stored separately. The 3D hypercomplex data set was processed to achieve a quadrature detected, phase
sensitive display along the
13
C t
2
dimension with sensitivity enhancement [250]. A FLOPSY-8
13
C–
13
C spin lock [238] was applied for 65.4 ms
(t
m
) using a 2.53 kHz rf field strength. For optimum efficiency in
13
C isotropic mixing, the
13
C carrier was placed midway between the adenine
C2 and C8 resonances. All gradients were square shaped and applied along the z axis: G1 ? 8.0 G/cm for 0.5 ms, G2 ? 6.5 G/cm for 1.0 ms and
G3 ? 4.7 G/cm for 0.7 ms. G1, G2 and G3 were followed by 1.5, 2.0 and 0.7 ms recovery times, respectively. Other relevant acquisition
parameters are: SW
q1
(
1
H) ? 1 kHz, t
1
max ? 27.00 ms, SW
q2
(
13
C) ? 4.2 kHz, t
2
max ? 7.38 ms, SW
q3
(
1
H) ? 4.45 kHz, t
3
? 57.53 ms, d
ch
?
2.0 ms, t
ch
? 2.4 ms, gB
2
(
13
C decouple) ? 1.79 kHz, and gB
3
(
15
N decouple) ? 1.23 kHz with GARP-1 [247].
345S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
coherence is transferred to in-phase C2 coherence
via a refocused INEPT sequence. Subsequently,
the in-phase C2 coherence is transferred further to
in-phase C8 coherence, via the carbon–carbon
J-coupling network, using a (C,C) DIPSI-2 or
FLOPSY-8 sequence. In the last transfer step the
in-phase C8 coherence is transferred into in-phase H8
coherence via a refocused INEPT sequence (Marino
et al., Fig. 18(A)) or via a sensitivity-enhanced
refocused INEPT sequence (Legault et al.,
Fig. 18(B)). The main differences between the two
H2–H8 correlation experiments lie in the type of iso-
tropic mixing sequence used, the settings of the rf field
strength, carrier position, and the length of the
TOCSY mixing time. The question is then, for
which settings is optimal C ! C transfer achieved?
A similar problem is encountered in the HNCCH and
HCCNH experiments, where the coherence transfer is
through the complex carbon network of the guanosine
base (see Section 7). The carbon spins in the Adeno-
sine base also form quite a complex network, but with
somewhat different characteristics compared to the
guanosine base (see Fig. 3). As can be seen from
Fig. 3, C6, C4 and C8 have J-couplings of approxi-
mately 10 Hz; C5 has in turn a large J-coupling to C6
and C4 (about 80 Hz), while C2 has a J-coupling of
12 Hz to C5. C2 may be further J-coupled to C6 or C4,
but these J-couplings must be smaller than 3 Hz [49].
Furthermore, C6, C4, C8 and C2 reside in a relatively
narrow spectral region (162–135 ppm), while C5
resides around 120 ppm (Table 6). The problem is
thus to obtain optimal transfer through this system
of carbon spins from C2 to C8. Marino et al. position
the
13
C carrier at the center of the aromatic region at
150 ppm, apply a DIPSI-2 mixing sequence of 100 ms
duration with an rf field strength of 3.5 kHz; Pardi et
al. use a FLOPSY-8 sequence of 64.5 ms duration and
rf field strength of 2.53 kHz with the carrier midway
between C2 and C8 (at 145 ppm). With these settings
the C2, C4, C6 and C8 are covered by both the
3.5 kHz and 2.53 kHz rf fields. The C5 spins, at
120 ppm, have offsets of 4500 Hz and 3750 Hz
in the settings of Marino et al. and Legault et al.,
respectively. These offsets are larger, but not very
much larger, than the rf field strengths. Consequently,
the C5 spin cannot be neglected, leading to a
coherence transfer, which is governed by a complex
interplay of offset effects with field strength and
type of mixing sequence. Numerical simulations
are required to analyze and determine the ultimate out-
come of the transfer. Such an in-depth analysis of the
coherence transfer would be of interest, since it might
very well turn out that the settings used are not optimal.
Fig. 19. Sections of the HMBC spectrum of a 1.5 mM sample of
59d(CGGCCG-GAIAGAGA-CGGCCG)-39 in 99.98% D
2
O, pH
6.8, at 258C [144]. The spectrum was acquired in about 80 h on a
Varian UNITY plus spectrometer operating at 750 MHz, although it
should be noted that the obtained signal-to-noise ratio allows the
duration of the experiment to be reduced by a factor of two or three.
The proton carrier was placed at the position of the water resonance
and the spectral width in the F2 dimension was set to 7500 Hz. For
the indirect dimension the spectral width was set to 1163 Hz,
centered at 139 ppm. States–TPPI was used for phase-sensitive
detection of the indirectly observed frequency [245]. The spectrum
was recorded with 2048 increments in t
2
and 1024 increments in t
1
.
Suppression of the HDO signal was achieved by presaturation dur-
ing the relaxation delay. During detection,
13
C GARP decoupling
was executed. The upper panel shows the C5–H8 correlations for
adenine and guanine residues. In the middle panel the C4–H8 and
C4–H2 cross peaks, used to correlate the adenine H8 and H2 reso-
nances, are given. The lower panel shows the C6–H2 correlations
for adenine and inosine residues.
346 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
An alternative means of obtaining H2 to H8
correlations is to use an HMBC experiment to obtain
long-range H to C correlations. As can be gleaned
from Fig. 3, H2 has sizable
3
J
HC
-couplings to C4
and C6, and H8 to C5 and C4, so that in an HMBC,
H2 to C4, C5 and C6 cross peaks can be observed,
together with H8 to C5 and C4 cross peaks. The com-
mon J-coupling partner, C4, provides a means of
obtaining through-bond H2 to H8 correlations. As
far as we are aware, this type of experiment has not
been used in labeled RNA, but has been used in DNA
(unlabeled) to obtain H2 to H8 correlations [144].
Fig. 19 shows part of the HMBC spectrum of a
DNA hairpin, which contains a number of G.A base
pairs, demonstrating the utility of the experiment for
obtaining H2 to H8 correlations. The experiment has
the added advantage that it also provides other long-
range correlations. For example, one finds for the
Guanosine bases H8 to C5 and C4 correlations. As
can be seen in Fig. 19, the G C5 resonances have a
spectral position, which differs from that of A G5, and
the H8 to C5 cross peaks can be used to distinguish
Guanosine and Adenosine bases.
7.2.2.4. Through-bond base–sugar correlation. The
NMR experiments that have been proposed for
correlating base and sugar protons (step III,
Fig. 13(C)) are usually given a name that indicates
the magnetization transfer route; for example, HCN
indicates a triple resonance experiment, where H, C
and N are correlated. With these naming conventions,
the NMR experiments proposed for through-bond
base–sugar correlation can be put under the headings
of HCN experiments [167,168], HCNCH experiments
[169,170], HbNb(Hb)CbHb experiments where b
stands for base [168], and HCNH experiments [171].
Fig. 20(A), (B) and (C) show the pulse sequences
proposed by Sklenar et al. [167,169], Fig. 20(D) the
sequence proposed by Tate et al. [171], and Fig. 20(E)
the sequence proposed by Farmer et al. [168,170]. In
the HCN experiment either H19 is correlated with C19
and N9/N1 or H6/H8 is correlated with C8/C6 and N9/
N1 [167,168]. In the HCNCH type of experiment
the HCN sequence is extended into a relay
experiment. The original proton magnetization,
say H19, is transferred to N9/N1 as in the HCN part,
and then relayed from N9/N1 via C6/C8 to H6/H8
[168–170]. Farmer et al. [168] also proposed for
purines an experiment, HbNb(Hb)CbHb, where the
H8 magnetization is transferred directly to N9 and
back to H8, utilizing the J
H8N9
-coupling. In order to
turn it into a 3D experiment the magnetization is
transferred to C8 and back before acquisition. Tate
et al. [171] proposed for purine residues an
experiment which also utilizes the direct H8 to N9
J-coupling. In their proposed HCNH experiment the
H19 magnetization is transferred via C19 to N9 as in
the first part of the HCN or HCNCH experiments, and
then directly via the J
H8N9
-coupling to H8. Sklenar et
al. [172] have proposed that the correlation of sugar
and base protons can also be obtained via
15
N HSQC
spectrum.
It is interesting to compare the various approaches
and consider their sensitivities in view of the detailed
knowledge that now exists of the complex network of
J-couplings in the bases (see Section 5). To gain a
correct insight into the relative performance of the
pulse sequences the magnetization transfer functions
are required. The transfer functions governing the
HCNCH pulse sequence are given by
Pyrimidine:
Tr
H19C19
? sin(pJ
H19C19
2t
1
) (47)
Tr
C19N9
? sin(pJ
C19N9
2t
2
)cos(pJ
C19C29
2t
2
)
sin(pJ
C19H19
2t
1
)
Tr
N9C6
? sin(pJ
C19N9
2t
3
)sin(pJ
N9C6
2t
3
)cos(pJ
N9C2
2t
3
)
Tr
C6H6
? sin(pJ
C19N9
2t
4
)sin(pJ
C6H6
2t
5
)cos(pJ
C6C5
2t
4
)
cos(pJ
C6C2
2t
4
)
Tr
H6C6
? sin(pJ
H6C6
2t
5
)
Purine:
Tr
H19C19
? sin(pJ
H19C19
2t
1
) (48)
Tr
C19N9
? sin(pJ
C19N9
2t
2
)cos(pJ
C19C29
2t
2
)
sin(pJ
C19H19
2t
1
)
Tr
N9C8
? sin(pJ
C19N9
2t
3
)sin(pJ
N9C8
2t
3
)cos(pJ
N9C4
2t
3
)
cos(pJ
N9C5
2t
3
)
347S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Fig. 20.
348 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Fig. 20. Pulse schemes for experiments that correlate the base to the sugar. (A) Non-selective version of the HCN experiment [167] to detect
H19–C19–N9/N1 connectivities; (B) semi-selective version of the HCN experiment [167] to detect the H8–C8–N9 and H6–C6–N1 con-
nectivities; (C) 2D triple resonance HCNCH experiment to determine through-bond H19–H6/H8 connectivities via the route H19–C19–N9/N1–
C6/C8–H6/H8 [169]; (D) 2D triple resonance HCNH experiment to determine through-bond H19 to H8 connectivities via the route H19–C19–
N9–H8 [171]; and (E) 3D triple resonance HCNCH experiment proposed by Farmer et al. [170]. Narrow black rectangles indicate 908 pulses,
wide black rectangles 1808 pulses, grey filled rectangles composite 1808 pulses of the type (90
x
240
y
90
x
), black filled rounded rectangles semi-
selective 1808 pulses (B, C and E), and unfilled rounded rectangles pulsed field gradients along the z axis. All pulses are applied along the x axis
unless otherwise indicated. For (A), d ? 1/4JH19C19 ? 1.6 ms, d
2
? 42 ms, D ? 21 ms. The
13
C and
15
N carriers and sweep widths are set to
cover the range of C19 (88–96 ppm) and N9/N1 (142–176 ppm) chemical shifts. For (B), d,1/4JC8H8,1/4JC6H6,1.25 ms, d
2
? 38–40 ms,
D ? 19–20 ms, 180 sel1 ? 4 ms, REBURP refocusing pulse [241] centered at 140.5 ppm, 180 sel2 ? 2 ms, IBURP2 inversion pulse [241]
centered at 160 ppm. For both experiments, phase cycling is: f
2
? y, 1 y; f
3
? 4exT, 4e1xT; f
4
? 8eyT, 8e1yT; f
5
? x,x, 1 x, 1 x; f
6
?
32exT, 32e1xT; f
7
? 16e1yT, 16eyT; receiver a, 1 a, 1 a, a where a ??x, 1 x, 1 x, x, 2e1x, x, x, 1 x),x, 1 x, 1 x,x]. In addition, f
3
and f
5
phases are cycled to obtain the States–TPPI t
1
and t
2
quadrature detection [245]. The 1 ms gradient pulses (800 ms gradient pulse shaped to a 1%
349S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Tr
C8H8
? sin(pJ
N9C8
2t
4
)cos(pJ
N7C8
2t
4
)cos(pJ
C8C6
2t
4
)
cos(pJ
C8C4
2t
4
)sin(pJ
C8H8
2t
5
)
Tr
H8C8
? sin(pJ
H6C6
2t
5
)
The transfer functions of the HCNH experiment for
Purines, proposed by Tate et al., are given by
Tr
H19C19
? sin(pJ
H19C19
2t
1
) (49)
Tr
C19N9
? sin(pJ
C19N9
2t
2
)cos(pJ
C19C29
2t
2
)
3 sin(pJ
C19H19
2t
1
)
Tr
N9H8
? sin(pJ
C19N9
2t
2
)sin(pJ
N9H8
2t
3
)
3 cos(pJ
N9C4
2t
2
)cos(pJ
N9C8
2t
2
)
Tr
N9H8
? sin(pJ
N9H8
2t
4
)cos(pJ
N7H8
2t
4
)
The transfer functions of the experiment proposed
by Farmer et al. [168], the HbNb(Hb)CbHb
experiment for purine, where Hb is H8, Nb is N9,
and Cb is C8, are given by
Tr
N9H8
vv
? (sin(pJ
H9H8
2t
1
)cos(pJ
N7H8
2t
1
))
2
(50)
Tr
H8C8
vv
? (sin(pJ
H8C8
2t
2
))
2
Here, vv denotes that the transfer efficiency given is
for the out- and back route.
truncated sine envelope and 200 ms for the magnetic field recovery) are used with amplitudes 1 6/3/3/ 1 6/3/3/12/ 1 3/ 1 3 G/cm. The
asynchronous GARP decoupling [247] is used to suppress the heteronuclear spin–spin interactions during t
3
acquisition. For (C), the delay
intervals are set to d ? 1.6 ms, j ? 1.25 ms, D ? 19.6 ms, h ? 18 ms, t ? 19 ms. The 1808 semi-selective pulses (indicated with black round
bars) as used at 500 MHz are (1)
13
C 4 ms REBURP refocusing pulse [241] covering the range of C19 (88–96 ppm), (2)
13
C 4 ms IBURP2
inversion pulse [241] with additional cosine modulation [251,252] for simultaneous inversion of C19 (88–96 ppm) and C6/C8 (136–
144.5 ppm), (3)
13
C 4 ms REBURP refocusing pulse [241] covering the range of C6 and C8 (136–144.5 ppm), and (4)
15
N 2 ms IBURP2
pulse [241] centered at 160 ppm covering the range of N1 and N9 (142–176 ppm). Phase cycling: f
1
? x; f
2
? y, 1 y; f
3
? x,x, 1 x, 1 x; f
4
?
16exT, 16e1xT; f
5
? 4exT, 4e1xT; f
6
? 8exT, 8e1xT; f
7
? 16exT, 16e1xT; receiver ? a, 1 a, 1 a, a, with a ? x, 1 x, 1 x,x. In addition, f
1
is
phase cycled to obtain States–TPPI [245] t
1
quadrature detection. The 1 ms gradient pulses (800 ms gradient pulse shaped to a 1% truncated sine
envelope and 200 ms for the magnetic field recovery) are used with amplitudes 1 6/3/3/ 1 6/3/3/12/ 1 3/ 1 3 G/cm. The asynchronous GARP
decoupling [247] is used to suppress the heteronuclear spin–spin interactions during t
2
acquisition. For (D), the delay intervals are set to t
1
?
1.3 ms, t
2
? 2.6 ms, t
3
? 12.8 ms, t
4
? 16.2 ms, t
5
? 16.0 ms and t
6
? 8.0 ms. All carbon pulses are generated using a simple synthesizer
without frequency switching. The coherence time delays, t
5
and t
6
, were experimentally optimized to compensate for the sensitivity loss by the
relaxation effect. A delay time as short as 16 ms was found to be a good value, although it was substantially shorter than the theoretical optimal
t
5
of 42 ms. The frequency offset for carbon pulses is at the center of the C19 carbons, 84 ppm. The rf field strength for all carbon pulses is
19.2 kHz. For off-resonance selective decoupling a G3–MLEV16 expansion is used [253], where each G3 inversion pulse [254] is phase
modulated to shift its inversion center to t 7.8 kHz [248,255] which is around the center of the base carbon, 146 ppm, excluding the C5 carbon.
This G3–MLEV16 selective decoupling is achieved at a field of 2.9 kHz, and under these conditions the selective decoupling has a bandwidth of
6 1.5 kHz and has little perturbation on the deoxyribose ring carbons. The
1
H pulses are at a field strength of 29.5 kHz, with an offset on the
water resonance. For
1
H decoupling WALTZ16 [256] is used with a 2.7 kHz field strength. All nitrogen pulses are applied at a field strength of
6.6 kHz, with an offset at the midpoint between the N9 and N7 nitrogen resonances, at 180 ppm. During acquisition, GARP [247] is applied
from the nitrogen channel at a field strength of 0.86 kHz and, simultaneously, the same
13
C decoupling as described above is applied from the
carbon channel. The durations and strengths of the gradients are G1 ? (1.0 ms, 8.0 G/cm), G2 ? (4.0 ms, 28.2 G/cm), G3 ? (3.0 ms, 18.3 G/cm)
and G4 ? G5 ? (1.0 ms, 5.0 G/cm). A delay of at least 150 ms is inserted between the gradient pulse and the subsequent application of an rf
pulse to avoid the eddy current effects. All gradients are applied along the z axis and are rectangular. The phase cycle is f
1
? x; f
2
? y, 1 y;
f
3
? x; f
4
? 2exT, 2eyT, 2e1xT, 2e1yT; f
5
? 8exT, 8e1xT; f
6
? x; receiver ? x, 2e1xT, 2exT, 2e1xT, x, 1 x, 2(x),2( 1 x),2(x), 1 x. The quad-
rature detection in t
1
is accomplished by States–TPPI [245] of f
1
. For (E), the phase cycle is f
1
? x, 1 x; f
2
? 2(x),2( 1 x); f
3
? 8(x),8( 1 x);
f
4
? 4exT,4e1xT); f
5
? 16exT,16( 1 x); and f
r
? f
1
t f
2
t f
3
t f
4
t f
5
. Complex data were collected in t
1
by States–TPPI [245] with FIDs
for t
f1
? x,y stored separately. Phase modulated data were collected in t
2
with FIDs for t
f2
? x, 1 x corresponding to N- and P-type signals,
respectively [250], being stored separately. The 3D hypercomplex data set was processed as described by Palmer et al. [250] to achieve
quadrature detected phase-sensitive display along the C19 dimension with sensitivity enhancement. The
13
C carrier was placed at the center of
the C19 resonances, except during the time between points a and b when the
13
C carrier was placed at the C6/C8 resonances. All pulses on lines
labeled with a particular spin group, i.e.
13
C29 or
15
N3 or
13
C6/8, were Gaussian shaped (64 steps, 5j cutoff) and are selective for that spin
group. The subscript FS appended to certain pulse phases denotes a frequency shifted pulse achieved by phase modulation [246]. Typical
settings at 600 MHz are t
1
? 1:48 ms, t
2
? 8:52 ms, t
3
? 20:0 ms, t
4
? 16:40 ms, t
5
? t
1
=4, t
7
? 2:40 ms, t
6
? t
4
1 t
5
1 t
7
, t
8
?t
3
1et
2
=2T,
t
9
?t
2
1et
2
=2T, and t
10
?t
1
tet
2
=2T; t
90
(
1
H) ? 7.5 ms, t
90
(
13
C) ? 13.4 ms, t
180
(
13
C29) ? 896 ms, t
90
(
13
C6) ? t
180
(
13
C6) ? 704 ms, gB
2
? (C19
decouple) ? 1.29 kHz with WALTZ16 [256], t
90
(
15
N) ? 37 ms, sw(H6) ? 650.0 Hz, t
1
max(H6) ? 35.58 ms, sw(C19) ? 600.0 Hz, t
2
max ?
25.00 ms, sw(
1
H) ? 3 kHz, t
3
max(H19) ? 120.0 ms.
350 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
In Fig. 21(A) and (B), the HCNCH transfer
functions are shown for purines and pyrimidines,
respectively, using realistic J-coupling constants.
The presence of the additional passive J-couplings
in non-selective transfers adversely affects the trans-
fer efficiency. Farmer et al. [168] and Sklenar et al.
[167] have therefore introduced selective refocusing
in the various transfer steps. Considering in somewhat
more detail the HCNCH pulse sequence by Sklenar et
al. [167], which is designed to perform well for both
purine and pyrimidine residues, we find selective
refocusing of the C1’s in the C19–N1/N9 step,
which removes the passive coupling with C29.As
can be seen in Fig. 21(A) and (B) the selective
decoupling of C29 is not essential for sensitivity. In
the subsequent step, C19zN9/N1x ! N9/N1y ! N9/
N1xC6/C8z, an IBURP is employed, which
simultaneously inverts C19 and C6/C8, so that C2
Fig. 21. Transfer efficiency in sugar-to-base correlation experiments. The coherence transfers are given as a function of the transfer time t (ms)
(see text). The transfer steps considered are indicated by their name, e.g. T
C8xN9z,C8xH8z
. Transfer curves are shown either in the presence or in the
absence of additional passive J-couplings; the latter are indicated by the letter ‘d’, which is placed close to the corresponding curve. The T
2
relaxation is assumed to be 3 s, unless otherwise indicated. (A) Transfer curves for the different transfer steps in the HCNCH experiment in
purines. (B) Transfer curves for the different transfer steps in the HCNCH experiment in pyrimidines. For uridine and cytosine the ‘decoupled’
transfer curves are identical. (C) Transfer curves of the final transfer steps (see text) in the HCNCH and the HCNH experiments in purines,
indicated as Tr
C19zN9x;N9xC8z
, Tr
C8xN9z;C8xH8z
and Tr
C19zN9x;N9xH8z
, Tr
N9zH8x,H8y
, respectively. For the transfers in the HCNCH experiment only the
‘decoupled’ versions are shown. For the corresponding transfers in the HCNH experiment both the ‘coupled’ and ‘decoupled’ versions are
shown. (D) As in (C), except a T
2
relaxation time of 30 ms is assumed (see text).
351S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
resonating at 176 ppm becomes decoupled from N1,
C4 resonating at 151 ppm and C5 resonating at
120 ppm become decoupled from N9. The selective
decoupling is essential for this transfer step since a
considerable increase in the transfer efficiency is
achieved (Fig. 21(A) and (B)). In purine C4 resonates
quite close to C8 and C6, so that the selective IBURP
on C8/C6 may also touch the C4 spins, and conse-
quently, the J
N9C4
-coupling may remain slightly
effective. Further note that in U the J
N1C2
-coupling
is 19.2 Hz and in C 12 Hz. This transfer step will
have a much lower efficiency in C than in U, when
C2 is not decoupled from N1. In the third step, N9/
N1zC6/C8x ! C8/C6y, a simultaneous REBURP and
IBURP pulse is given on C6/C8 and N9/N1, respec-
tively, which removes the coupling of C8 to C4 and
C6 as well as to N7, and in pyrimidines the couplings
of C6 to C5 and C2. As Fig. 21(A) and (B) show, the
selective decoupling thus employed significantly
increases the transfer magnitude in this transfer step.
It is important to note that the pulse sequence pro-
posed by Sklenar et al. [167] functions for pyrimidines
as well as for purines. The overall efficiency of the
HCNCH will be quite high, since the efficiency is close
to 1.0 for each transfer step, at least when T
2
is large
(the optimal transfer times lie for all transfer steps
around 40 ms (see Fig. 21(A) and (B)).
In this first step of the HCNCH experiment no
adverse effect is felt from the presence of the passive
J
C19C29
-coupling. Sklenar et al. [167] therefore pro-
posed using two HCN 3D experiments to achieve
the H19 to H6/H8 correlation, i.e. an HCN (sugar),
which starts from H19 and transfers via C19 to N9/
N1 and back, and an HCN (base), which starts from
the base proton, H6/8, and transfers via C8/C6 to N1/
N9 and back. In this way the H19–C19–N9/N1 corre-
lation can be non-selective and is efficient for both
purine and pyrimidines. The HCN (base) experiment,
where H8–C8–N9 or H6–C6–N1 are correlated,
employs selective pulses as in the HCNCH experi-
ment. The main advantage of such an approach is
that each of these experiments is more sensitive than
the relayed experiment.
It is also interesting to compare the efficiency of the
HCNCH experiment with the HCNH experiment pro-
posed by Tate et al. [171] and the HbNb(Hb)CbHb
experiment proposed by Farmer et al. [168]. In all of
these experiments the J
H8N9
-coupling is used to
transfer from N9 to H8. Since the HCN step in the
HCNH is the same as in the HCNCH experiment, we
compare in Fig. 21(C) only the efficiencies in the
final transfer steps. As can be seen, the transfer
C19zN9x ! N9y ! N9xC8z, indicated as
Tr
C19zN9x;N9xC8z
, and the transfer C19zN9x ! N9y !
N9xH8z, indicated as Tr
C19zN9x,N9xH8z
, have comparable
efficiencies, with the latter being slightly less
effective. In the HCNCH experiment the refocusing
step C8xN9z ! C8y ! C8xH8z, indicated as
Tr
C8xN9z;C8xH8z
, also has an optimal efficiency close
to 1, while in the HCNH experiment the correspond-
ing refocusing step, N9zH8x ! H8y, indicated as
Tr
N9zH8x,H8y
, has a much lower efficiency due to the
remaining passive J
H8N7
-coupling. If a selective
IBURP on N9 had been used as was done by Farmer
et al. [168] in the HbN9(Hb)CbHb experiment, the
efficiency of this step would again be close to 1 (see
Fig. 21(C)). Thus, the efficiency of the HCNH is not
expected to be better than the efficiency of HCNCH.
The HbNb(Hb)CbHb will have a better efficiency
since the HCN step and the relay step are absent.
To further establish how well the transfer function
calculations reproduce the experimental observations,
we compare the optimal delay settings as suggested by
the authors of the various pulse sequences with the
optimal settings as deduced from the transfer func-
tions. For this the effect of T
2
relaxation needs to be
considered. In Fig. 21(D), we therefore show the same
transfer functions as in Fig. 21(A)–(C), but now
calculated assuming a T
2
of 30 ms. For the C19N9
refocusing step Sklenar et al. and Farmer et al.
[167,168] found 40 ms to be the optimal transfer
time. This value lies inbetween the optimal values,
40 ms and 26 ms, calculated for T
2
? 3 s and 30 ms,
respectively. Tate et al. [171] did not apply C29-
decoupling during this transfer step in the HCNH
experiment, and neither did Sklenar et al. [167] in
the HCN (sugar) experiment. The former used
25 ms for the transfer time, which corresponds to
the first optimum in the transfer function (Fig. 21(A)),
while Sklenar et al. [167] used a value of 42 ms,
which corresponds to the second maximum. Appar-
ently, the DNA studied by Tate et al. [171] has shorter
T
2
values than the RNA studied by Sklenar et al. and
Farmer et al. [167,168] in their experiments. In the
relay step, C19zN9x ! N9y ! C8zN9x, Sklenar et al.
and Farmer et al. [167,168] use values for the transfer
352 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
time of 36 ms and 30–40 ms, respectively. As can be
seen from Fig. 21(A)–(D), these values again lie
inbetween the optimal values for the calculated trans-
fers when T
2
is 3 s and 30 ms, respectively. Tate et al.
[171] used a transfer time of 32 ms for the transfer
C19zN9x ! N9y ! H8zN9x, which is very close to
the optimal value for this transfer when T
2
is 30 ms
Fig. 21(D). For the refocusing of H8zN9x, Tate et al.
[171] used a transfer time of 16 ms. This is again close
to the optimal value for the calculated transfer when
T
2
is 30 ms (Fig. 21(D)). Thus, the calculated transfer
curves explain nicely the experimentally obtained
results. In fact, they could provide an alternative way
to estimate the T
2
governing the systems under study.
Since the calculated transfer functions explain well
the observed experimental results, it is worth con-
sidering the relative merits of each approach in
terms of sensitivity. For this we compare the overall
efficiencies, which are calculated as the product of the
transfer efficiencies of the individual transfer steps
and compiled in Table 8. As can be seen, the overall
transfer efficiency drops dramatically from close to 1,
when T
2
? 3 s, to much smaller values, ranging from
0.006 to 0.2, when T
2
is 30 ms; for high values of T
2
most of the experiments have quite similar efficien-
cies, but for T
2
? 30 ms considerable differences
appear. The HCN experiments, as well as the
HbNb(Hb)CbHb experiment, are then significantly
more efficient than the relay experiments, although
the HCNCH experiment remains efficient enough to
be performed. The advantage of the HCN and
HCNCH experiments is that they have similar
efficiencies for both purine and pyrimidine residues.
The (Hb,Nb) HSQC suggested by Sklenar et al.
[172] has an efficiency similar to that of the
HbNb(Hb)CbHb experiment (when T
2
? 3 s, see
Table 8). For T
2
values of 30 ms the efficiency
remains reasonable for purines, but becomes quite
small for pyrimidines, which is due to the small size
of the J
H6N1
-couplings (see Table 8).
We finally consider the possibility of using the
J
H19C6/8
-coupling and the J
H19C2/4
-coupling for obtain-
ing base–sugar correlation via long-range HSQC or
HMQC experiments. The J
H19C6/8
- and J
H19C2/4
-
couplings depend on the x torsion angle and have
values of 4–5 Hz and 2 Hz, respectively, when the x
torsion angle is in the usual anti range [49]. The
HSQC or HMQC experiments could be expanded
into a 3D experiment, by adding HMQC type correla-
tion to C19, using the large
1
J
H19C19
-coupling, in much
the same way as has been done for the
HbNb(Hb)CbHb experiment. In combination with a
(H8/6,C8/6) HSQC, a through-bond correlation can
then be obtained between H19 and H6/8 resonances.
The experiment has a theoretical efficiency which is
of the order of the efficiency for an H19 to C6/8 cor-
relation in an HCNCH experiment when T
2
is 30 ms
(Table 8).
In summary, from the viewpoint of sensitivity, and
taking the values at T
2
? 30 ms as a yardstick, the two
3D HCN experiments perform best, while the (HbNb)
HSQC and the related HbNb(Hb)CbHb are also quite
sensitive, followed by the HCNCH and the (H19C6/8)
HSQC. In practice, losses due to rf inhomogeneity
Table 8
Overall transfer efficiency
#
in base-to-sugar through-bond correla-
tion
Efficiency
(%; T
2
3s)
Efficiency
(%; T
2
30 ms)
HCNCH
a
82 2.5
HCNH
b
(Tate et al.) 20 1.2
HCNH
c
(Tate et al. optimal) 73 (10) 1.7 (0.2)
HbNb(Hb)CbHb
d
90 (10) 6.3 (0.6)
HCN (sugar)
e
98 10
HCN (base)
e
96 10
(HbNb) HSQC
f
98 (98) 6.3 (0.6)
(H19C8/6) HSQC
g
98 2.3
(H19C4/2) HSQC
h
98 0.6
#
The overall transfer efficiencies are obtained as the product of the
efficiencies of the individual steps. In parentheses are the values for
pyrimidines when different from purine values. Optimal delay set-
tings for (a) HCNCH, T
2
? 3s:t
1
? 1.5 ms, t
2
? 42 ms, t
3
?
40 ms, t
4
? 42 ms, t
5
? 1.2 ms; T
2
? 30 ms: t
1
? 1.5 ms, t
2
?
25 ms, t
3
? 31 ms, t
4
? 26 ms, t
5
? 1.2 ms; (b) HCNH, T
2
? 3s:
t
1
? 1.5 ms, t
2
? 25 ms, t
3
? 50 ms, t
4
? 25 ms, t
5
? 1.2 ms; T
2
?
30 ms: t
1
? 1.5 ms, t
2
? 25 ms, t
3
? 32 ms, t
4
? 16 ms, t
5
?
1.2 ms; (c) HCNH, T
2
? 3s:t
1
? 1.5 ms, t
2
? 48 ms, t
3
? 50 ms
(50 ms), t
4
? 50 ms (50 ms); T
2
? 30 ms: t
1
? 1.5 ms, t
2
? 25 ms,
t
3
? 32 ms (32 ms), t
4
? 26 ms (26 ms); (d) HbCb(Hb)CbHb, T
2
?
3s: t
1
? 50 ms (50 ms), t
2
? 1.2 ms; T
2
? 30 ms: t
1
? 26 ms
(26 ms), t
2
? 1.2 ms; (e) the overall efficiencies of HCN (sugar) and
HCN (base) are the square of T(H19C19)*T(C19N9) and
T(H8C8)*T(N9C8), respectively, with the delays as in HCNCH;
(f) (HbNb) HSQC T
2
? 3s:t
1
? 50 ms (110 ms); T
2
? 30 ms: t
1
? 26 ms (20–30 ms); (g) (H19C8/6) HSQC T
2
? 3s:t
1
? 55 ms; T
2
? 30 ms: t
1
? 26 ms; (h) (H19C4/2) HSQC T
2
? 3s:t
1
? 120 ms;
T
2
? 30 ms: t
1
? 26 ms.
353S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
have to be taken into account. Limiting the number of
pulses will improve sensitivity, although with
improved probes, this aspect has become of lesser
importance. Table 8 should therefore give a fair
representation of the practical performance of the
different base–sugar through-bond correlation
experiments. Finally, we should consider the aspect
of resonance overlap. The best dispersed resonances
are those of H19,C19, H6, H8, C6 and C8; the
resonances of N9 and N1 form two well separated
groups, but within each group overlap is quite
strong. Ideally, one would therefore like to obtain
triple correlations involving the best dispersed
resonances, e.g. H19–C8–C19. Note that the best
Fig. 22. Pulse sequences of the CP CCH–TOCSY (A) [76], constant time CP CCH–TOCSY (B) [76] and the 3D CP HCCH–TOCSY (C) [163]
experiments to determine carbon to carbon connectivities in the ribose sugar ring. Narrow black rectangles indicate 908 pulses, wide black
rectangles 1808 pulses, vertically striped rectangles trim pulses bracketing the DIPSI sequences, and unfilled rounded rectangles pulsed field
gradients along the z axis. All pulses are applied along the x axis unless otherwise indicated. (A, B) Phase cycling is as follows: f
1
? x, 1 x ( t
TPPI(t
1
)); f
2
? x, 1 x. The applied rf fields in the DIPSI correspond typically to a 908 pulse of 28 ms; the cross-polarization period is normally
set to about 6.2 ms ( < 1/J
CH
); the homonuclear (C–C) mixing time can be set to approximately 18 ms in order to obtain sufficient transfer
through the J-coupled
13
C sugar ring spin system; the constant time period T is typically set to approximately 12 ms ( < 1/2J
CC
). (C) All B
0
gradient fields are sine-bell shaped. Phase cycle: f
0
? y; f
1
? x; f
2
? x, 1 x; f
3
? y; f
4
? y; f
5
(receiver) ? x, 1 x. Absorption mode spectra in
the indirect dimensions are obtained by separate measurements of the N- and P-type coherences, which is achieved by simultaneously inverting
the sign of the first gradient (G1) and of phase f
1
for the t
1
dimension and by simultaneously inverting the sign of the last gradient (G4) and of
phase f
4
for the t
2
dimension. The values to be used for delay lengths, gradient amplitudes, gradient durations, and field strengths are discussed
in detail in Wijmenga et al. [163]. In all cases (A,B,C) the heteronuclear and homonuclear cross-polarizations are carried out using synchronous
DIPSI3 sequences. Carbon decoupling during acquisition can be achieved via GARP sequence as indicated.
354 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
dispersion is of course obtained in the triple resonance
experiments.
7.2.2.5. Through-bond sugar correlation. For
correlation through the ribose sugar ring (step III,
Fig. 13(C)), HCCH experiments can be used. HCCH
experiments have been developed in the field of
protein NMR and come in a variety of forms. In all
experiments the transfer from
1
Hto
13
C is achieved
either via a refocused INEPT [157,158] or via a
Fig. 23. (C,C) TOCSY transfer efficiencies in ribose sugar. (A) Transfer from C19 into the ring spin system; (B) transfer from C29 into the ring
spin system; (C) transfer from C39 into the ring spin system. The transfer from C49 is identical to that of C29, and the transfer from C59 is
identical to that of C19. The transfer has been numerically simulated using the J
CC
-coupling constants as given in Section 5, and offsets as given
in Table 6; rf field assumed to be 20 kHz, so that offset effects are minimized.
355S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
cross-polarization sequence [156,164,165]. Since it
has been demonstrated by Zuiderweg and coworkers
[156,164,165] that cross-polarization is more
effective then refocused INEPT, the HCCH
experiments using cross-polarization are expected to
have the higher sensitivity. Traditionally, phase
cycling has been used to suppress artifacts or
unwanted coherences, but nowadays, gradients are
used to select the coherence pathways of interest
and suppress artifacts. Kay et al. [159] were the first
to show how gradient coherence selection can be
combined with enhanced sensitivity, i.e. per transfer
step the enhancement can be as much as a factor
of 2
1/2
. Recently, this enhancement scheme has
been incorporated into HCCH sequences, which
employ refocused INEPT sequences to affect
13
C
to
1
H and
1
Hto
13
C transfers [160–163].
Furthermore, doubly-enhanced 3D HCCH
experiments have been published that give close to a
factor of 2 higher sensitivity than the conventional 3D
HCCH sequence using either refocused INEPT
[161,162] or cross-polarization [163]. An important
feature of these gradient-selective 3D HCCH
experiments is their impressive water signal
suppression. Finally, we note that the
13
C evolution
is often incorporated into a constant time period
[166,173,174] in order to remove the splittings on the
cross peaks stemming from the rather large J
CC
-
couplings in the ribose ring, which are all
approximately 40 Hz (see Section 5). Fig. 22(A)–(C)
Fig. 24. Constant time CCH–TOCSY spectrum, showing the ribose sugar region (C19 to C59). The spectrum is of a
13
C labeled RNA hairpin
with sequence 59pppGGGC-CAAA-GCCU. The assignments are indicated in the H19 region. The spectrum was recorded with a constant time
period of approximately 12 ms (1/2J
CC
); the (C,C) TOCSY (DIPSI-2) mixing time was 18 ms; the cross-polarization was 6.2 ms; the rf field
9 kHz.
356 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
show the pulse sequences for the 2D cpCCH and 3D
cpHCCH experiments either with or without constant
time
13
C evolution.
In order to be able to select the optimal mixing
times for
13
Cto
13
C transfers, we show in
Fig. 23(A)–(C) the (C,C) TOCSY transfer functions
in the ribose sugar ring. As can be seen, in order to
optimally transfer through the sugar ring C49 to C19,a
(C,C) TOCSY mixing time of approximately 18 ms
should be used, while maximal C19 to C59 transfer is
achieved at 25 ms. Single step transfer, for example
from C19 to C29, can be obtained when the mixing
time is taken to be less than 6 to 7 ms. Thus, the
HCCH experiment is ideally suited to establish
which sugar ring carbons and/or protons belong to
the same ribose. Because the C19,C29/C39,C49 and
C59 reside in clearly separate regions they can easily
be distinguished, as shown in Fig. 24. To differentiate
between C29 and C39 one can either use the HCCH
experiment with short mixing times or instead a (C,C)
COSY, which is the approach taken by Nikonowicz
and Pardi [148].
7.2.2.6. Through-bond sequential backbone
assignment. In the protein NMR field, the sequential
resonance assignment along the backbone is now done
via through-bond experiments. For labeled RNA and
DNA a similar approach is possible (step III,
Fig. 13(C)). A number of pulse sequences have been
proposed that use various possible transfer routes to
achieve sequential backbone assignment via through-
bond coherence transfer. As shown in Fig. 25, a
number of routes is available for this purpose. The
first experiment, originally proposed by Pardi et al.
[137] before the advent of
13
C labeling, utilizes
J
H39iP3i
and J
H59/50it1Pi
. With the advent of
13
C
labeling it became possible to achieve backbone
assignment by through-bond coherence transfer via
the generally rather large J
C49iPi
and J
C49it1Pi
in a 3D
HCP experiment [56,76,175,176]. Heus et al. [175]
proposed an assignment scheme that uses J
C49P
to
establish the sequential connectivity, and J
C59P
to
establish the 59–39 directionality. A CCH–TOCSY
experiment is used to establish the C59i to C49i
identity [175]. The HCP pulse sequence is shown in
Fig. 26(A). In this pulse sequence one starts with
generating H49 coherence, which is transferred to
C49, and subsequently is transferred to P, and back.
Alternatively, as shown in the PCH pulse sequence in
Fig. 26(B), one can start on P and transfer the P
coherence via C49 to H49. The sensitivity of these
experiments is quite similar. Finally, the
experiments can be converted into versions with
gradients for coherence selection and with
sensitivity enhancement. Since the H49 resonances
as well as the P resonances tend to overlap quite
strongly, it is better to transfer the magnetization to
more dispersed resonances such as the H19
resonances. It is possible for this purpose to convert
the PCH sequence into a PCCH–TOCSY sequence, in
which the initially generated
31
P coherence is
Fig. 25. Transfer routes for sequentially correlating residues via
through-bond coherence transfer. (A) Transfer routes in HCP/
PCH experiments and PCCH–TOCSY experiments; (B) transfer
routes in HPHCH experiments.
357S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Fig. 26. Pulse sequence of the (A) HCP [76,175] and (B) PCH [76] experiments. Narrow black rectangles indicate 908 pulses, and wide black
rectangles 1808 pulses. All pulses are applied along the x axis unless otherwise indicated. (A) Phase cycling is as follows: f
1
? x, 1 x;
f
2
? y, 1 y; f
3
? x( t TPPI(t
1
)); f
4
? 4(x),4(y); f
5
? x, 1 x; f
6
? x, x, 1 x, 1 x ( t TPPI(t
2
)); f
7
(receiver) ? x, x, 1 x, 1 x, 1 x, 1 x, x, x.
Typical settings are t ? t9 ? 1.5 ms ( < 1/4J
CH
), t
1
max/2 ? T ? 12.5 ms, and T 1 d ? 5 ms (see text). (B) Phase cycling is as follows:
f
1
? xetTPPIet
1
TT; f
2
? y, 1 y; f
3
? y, y, 1 y, 1 yetTPPIet
2
TT; f
4
(receiver) ? x, 1 x, 1 s, x. Typical settings are t ? t9 ? 1.5 ms
e < 1=4J
CH
T, t
1
max/2 ? T ? 12.5 ms, T 1 d ? 5 ms, t
2
max/2 ? T9 ? 12.5 ms, T9 1 d9 ? 5 ms (see text), and D ? 25 ms.
Fig. 27. Pulse sequence of the PCCH–TOCSY experiment [177]. Narrow black rectangles indicate 908 pulses, wide black rectangles 1808
pulses, vertically striped rectangle trim pulse before the DIPSI sequences, and the unfilled rounded rectangle sine-bell shaped pulsed field
gradient along the z axis. All pulses are applied along the x axis unless otherwise indicated. Phase cycling is as follows: f
1
? x,x, 1 x, 1 x (with
TPPI [257]); f
2
? y; f
3
? y, 1 y; f
4
(receiver) ? x, 1 x, 1 x,x. Details concerning settings are given in the legend of Fig. 28.
358 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
transferred to C49, and then via the route C39 ! C29
! C19 relayed to the less overlapping protons,
the H1’s [177]. The pulse sequence for the
PCCH–TOCSY is shown in Fig. 27, and the
experimental 2D spectrum in Fig. 28. As can be
seen, a complete sequential walk can be made. A
similar experiment has been proposed by Marino et
al. [154].
A number of aspects may adversely influence the
backbone assignment via these approaches. Firstly,
the size of the J
C49P
-couplings may decrease from 10
to 11 Hz, the value usually found in a regular helix, to
smaller values. Secondly, the phosphorus resonances
as well as the proton (H49) and carbon (C49) reso-
nances of the ribose tend to overlap quite strongly.
Varani et al. [178] have therefore proposed perform-
ing additional HCP triple resonance experiments,
namely, a HPHCH experiment (Fig. 29). Here,
coherence is transferred from H39 and H59/50 to P
for frequency labeling, then transferred back to proton
coherence, after which it is transferred to C39 and/or
C59 for carbon frequency labeling and finally back to
proton coherence. This approach is very similar to that
employed in the HbNb(Hb)CbHb experiment of
Farmer et al. [168]. The advantage of using such an
additional experiment is that certain cross peaks that
Fig. 28. H19 region of the 600 MHz 2D P(CC)H–TOCSY spectrum of a
13
C labeled RNA hairpin with a CAAA tetranucleotide loop, shown
schematically in the inset, in D
2
Oat308C. The arrows indicate the 59 ! 39 direction of the sequential walk along the sugar–phosphate
backbone. The labels at the cross peaks indicate the H19 assignment, while the
31
P assignment is given along the left side of the spectrum at the
vertical position (
31
P) of the cross peaks. The chemical shifts are referenced relative to TSP for the
1
H and
31
P dimensions; for
31
P the 0 ppm
chemical shift value is obtained by multiplying the
1
H TSP frequency by 0.40480793, which corresponds to calibration relative to inorganic
phosphate. The
13
C labeled RNA CAAA hairpin was prepared as described by Wijmenga et al. [56]. The P(CC)H–TOCSY experiment was
performed on a Bruker AMX(2)600 spectrometer, equipped with a broadband
13
C/
1
H probe. The spectrum was recorded in approximately 12 h
with the acquisition settings: 512 scans for each FID of 1024 points (t
2
), 64 t
1
values. The delay D was set to 25 ms; t, t9 ? 12.5 ms; 1 ms trim
pulse; DIPSI3 mixing time for isotropic mixing equal to 13.0 ms and DIPSI3 mixing time for cross-polarization set to 6.5 ms, both with an rf
field strength 8333.3 Hz, 1.0 s relaxation delay, 1 ms gradient pulse (800 ms gradient pulse shaped to a 1% truncated sine envelope and 200 ms
for the magnetic field recovery) of strength 12 G/cm, low power GARP [247] decoupling (rf strength 625 Hz) of
13
C, spectral width 486 Hz, and
5000 Hz for
31
P and
1
H respectively; carrier positions at 1 2.09, 70 and 4.62 ppm for
31
P,
13
C and
1
H receptively. Typical processing
parameters were zero-filling twice in t
1
and once in t
2
, and 0.4p shifted squared sine window multiplication in t
1
and t
2
. The final data matrix
consisted of 128*1024 data points. Reproduced with permission from Ref. [177].
359S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
may be absent in the HCP/PCH experiment, due to
inadvertently small J
C49P
-couplings, may still show up
in the HPHCH experiment. Finally, Ramachandran et
al. [179] have introduced a simultaneous HCP/HCN
experiment which in principle combines the
sequential backbone assignment via through-bond
coherence transfer and sugar-to-base correlation via
through-bond coherence transfer.
In order to find the optimal settings for the various
experiments the coherence transfer functions have to
be considered:
HCP : (51)
Tr
H49C49
? sin(pJ
H49C49
2t
1
)
Tr
C49iP59i
? sin(pJ
C49iP59i
2t
2
)cos(pJ
C49iP39i t 1
2t
2
)
3 cos(pJ
C49iC59i9
2t
2
)cos(pJ
C49iC39i
2t
2
)
Tr
C49iP39i t 1
? cos(pJ
C49iP59i
2t
2
)sin(pJ
C49iP39i t 1
2t
2
)
3 cos(pJ
C49iC59i9
2t
2
)cos(pJ
C49iC39i
2t
2
)
PCH : (52)
Tr
C49iP59i
? sin(pJ
C49iP59i
2t
2
)cos(pJ
C49i 1 1P59i
2t
2
)
3 cos(pJ
C59iP59i9
2t
2
)cos(pJ
C39i 1 1P59i
2t
2
)
3 cos(pJ
C29i 1 1P59i
2t
2
)
Tr
C49iP59i
? sin(pJ
C49iP59i
2t
3
)cos(pJ
C49iP39i t 1
2t
3
)
3 cos(pJ
C49iC59i9
2t
3
)cos(pJ
C49iC39i
2t
3
)
Tr
C49i 1 1P59i
? cos(pJ
C49iP59i
2t
2
)sin(pJ
C49i 1 1P59i
2t
2
)
3 cos(pJ
C59iP59i
2t
2
)cos(pJ
C39i 1 1P59i
2t
2
)
3 cos(pJ
C29i 1 1P59i
2t
2
)
Tr
C49i 1 1P59i
? sin(pJ
C49i 1 1P59i
2t
3
)
3 cos(pJ
C49i 1 1P59i 1 1
2t
3
)
3 cos(pJ
C49i 1 1C59i 1 1
2t
3
)
3 cos(pJ
C49i 1 1C39i 1 1
2t
3
)
Tr
H49C49
? sin(pJ
H49C49
2t
1
)
HP(H)CH : (53)
Tr
H39iP39i
? sin(pJ
H39iP39i
2t
1
)cos(pJ
H39iH49i
2t
1
)
3 cos(pJ
H39iH29i
2t
1
)
Tr
H39iC39i
? sin(pJ
H39iC39i
2t
2
)
Tr
H59iP59i
? sin(pJ
H59iP59i
2t
1
)cos(pJ
H59iH49i
2t
1
)
3 cos(pJ
H59iH50i
2t
1
)
Tr
H59iC59i
? sin(pJ
H59iC59i
2t
2
)cos(pJ
H59iH50i
2t
2
)
Fig. 29 shows the transfer function of the HCP
experiment. As can be seen, the optimal transfer is
achieved for t
2
equal to 12 ms. It can also be con-
cluded from these curves that decoupling of C49
from C39 and C59 will not greatly improve the transfer
efficiency. The transfers from C39 to P39 and C59 to
P59 are considerably less effective due to the smaller
J
C59P59,C39P39
-couplings involved, but may still be
observed. The efficiency of the transfer from C49 to
P39 and P59 depends on the torsion angles ? and b,
Fig. 29. Efficiency of the HCP experiment for C49 to P59,C49 to
P39,C59 to P59, and C39 to P39 transfer. The transfer efficiency is
calculated according to the equations in the text as a function of the
C ! P transfer time (t); the overall efficiency for C to P transfer
and back is the product of the identical transfer functions; the
J-couplings are assumed to be 11 Hz for
3
J
C49P59/39
, 1 5 Hz for
2
J
C59P59
, and 1 4 Hz for
2
J
C39P39
; T
2
relaxation has not been taken
into account.
360 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
respectively. As discussed previously, the allowed
range of the torsion angle ? is 1708 to 2908. It usually
lies in the range 170–2408, but may in exceptional
cases become g
1
. When 1708 ,?,2408, the transfer
from C49 to P39 remains quite high for all values of b,
as when this torsion angle lies outside its usual trans
range (Fig. 30(A)). When ?.2408 the efficiency
drops, but then transfer from C29 to P39 increases to
detectable values (Fig. 30(C)). Thus, the transfer to
P39 should always be visible, either via the C49 to P39
pathway or via the P29 to P39 pathway. On the other
hand, the transfer from C49 to P59 may drop to very
low values when the torsion angle b lies outside its
usual trans range (Fig. 30(B)). Here, no other carbon
can take over, instead it is either the H59 to P59 trans-
fer, that becomes quite effective (b . < 2408), or the
H50 to P59 transfer that becomes effective (b , <
2408). The latter correlations can be detected in an
HP(H)CH experiment (Fig. 30(D)). Thus, when the
sequential backbone assignment is performed via a
combination of HCP (or PCH) and HP(H)CH experi-
ments the torsion angle dependence of the J-couplings
involved can adversely affect the presence of sequential
correlation.
Another aspect is the relative efficiency of the
various experiments and the effect of T
2
relaxation
on the efficiencies. In Table 9, the optimal transfer
efficiencies are given for the various proposed experi-
ments assuming a T
2
of 3 s and of 30 ms. The most
interesting conclusion that can be drawn is that suffi-
cient transfer will occur even for relatively small T2
values, i.e. for large systems (50 nucleotides).
Whether sequential backbone assignment is possible
for such large systems depends therefore mainly on
the aspect of resonance overlap. Unfortunately, the P
resonances overlap strongly, and the same applies for
the ribose H29,H39,H49 and H59/H50. It is therefore
expected that for larger systems one has to resort to
labeling only specific residues to achieve the assign-
ment (see below).
7.2.3. X-filter techniques
X-filters form a very powerful tool in the field of
NMR spectroscopy of biomolecules. Otting et al.
[180] were the first to introduce so-called X-filters
into the field of NMR of proteins; here X stands for
X-nucleus, which can be
13
Cor
15
N, for example.
These methods can also be used to great advantage
in the field of NMR of
13
C and
15
N labeled nucleic
acids.
X-filters can be incorporated into any pulse
sequence, but we will illustrate their usage in
NOESY spectroscopy and consider the application
given by Van Dongen et al. [143]. They used X-filters
to unambiguously demonstrate the formation of a
DNA·DNAxRNA triple helix, which consisted of a
13
C/
15
N labeled single-stranded RNA nucleotide
sequence, associated with an unlabeled DNA hairpin
with a B-helix stem. Fig. 31(A) shows the NOESY
pulse sequence with an q
2
15
N filter, while (B) shows
the NOESY pulse sequence with an q
1
13
C filter.
There are a variety of possibilities in the application.
Van Dongen et al. [143] used an q
2
15
N filter in
NOESY experiment I and an q
1
13
C filter in
NOESY experiment II. In experiment I, the q
2
15
N
filter can be set in such a way that magnetization of
protons directly bonded to
15
N have either a positive
(Ia) or negative sign (Ib) during acquisition. The
experiment can be set up in such a way that Ia and
Ib are recorded interleaved. When processing the data
the two data sets are either added, giving the NOESY
spectrum ADDI ( ? Ia t Ib), or subtracted giving the
NOESY spectrum SUBI ( ? Ia 1 Ib). Consequently,
in the NOESY spectrum SUBI only those cross peaks
which correspond to protons that are directly bonded
to
15
N during acquisition will be present, while in the
NOESY spectrum ADDI only those cross peaks will
be present of protons not bonded to
15
N. Similarly, for
the q
1
13
C filtered NOESY, cross peaks which stem
from protons that are directly bonded to
13
C in the q
1
dimension are found in the NOESY spectrum SUBII,
while in the NOESY spectrum ADDII, cross peaks of
protons that are not bonded to
13
C in the q
1
dimension
are found. Thus in the imino region of the NOESY
spectrum of the DNA·DNAxRNA triplex molecule the
DNA and RNA NOESY spectra can be separated in
the q
2
dimension. The ADDI spectrum contains only
cross peaks corresponding to NOE contacts to DNA
imino (Fig. 32(A)) and amino protons (Fig. 32(B))
protons, while in a SUBI spectrum only NOE contacts
to RNA imino (Fig. 32(C)) and amino protons (Fig.
32(D)) are found. In the latter imino spectrum (Fig.
32(C)) NOE cross peaks are present at 2 ppm, which
is a spectral region where only H29/20 protons of DNA
reside, thus demonstrating unambiguously that the
RNA is associated with the DNA hairpin.
361S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
The two X-filters were not simultaneously incor-
porated into the NOESY sequence because of
sensitivity limitations, although this would concep-
tually have been more elegant, and intrinsically
more powerful. In that case four FIDs could have
been recorded, either with both filters off (no change
in sign in q
1
and q
2
) (I), or with the q
1
13
C filter on
and the q
2
15
N filter off (negative sign in q
1
and
positive in q
2
), or with the q
1
13
C filter off and the
q
2
15
N filter on (positive in q
1
and negative sign in
q
2
), or with the q
1
13
C filter on and the
15
N filter on
(negative in q
1
and negative in q
2
). From these four
data sets it is possible to create four NOESY spectra,
i.e. a NOESY spectrum which contains only DNA to
DNA NOE contacts (I t II t II t IV), or only RNA
to RNA NOE contacts (I 1 II 1 III t IV), or only
DNA to RNA contacts (I t II 1 III 1 IV), or only
RNA to DNA contacts (I 1 II t II 1 IV). The latter
two would contain all the peaks that directly demon-
strate the RNA to DNA interactions.
The X-filtered NOESY is an extremely powerful
experiment and has a wide range of applications.
Fig. 30. Efficiency for C49 ! P39 (A), C49 ! P59 (B), C29 ! P39 (C), and H59/H50 ! P59 (D) transfer and back as a function of the torsion
angles ? and b. The efficiency is calculated according to the transfer functions given in the text; optimal delay settings are assumed, i.e. 24 ms
for the transfer time in (A) to (C), and 19 ms for the time for the H59/H50 ! P59 transfer (D) (see also Table 9); T
2
relaxation has not been taken
into account.
362 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
For example, it has been used in the study of DNA–
protein interactions (see, e.g. Ref. [181]) or RNA–
protein or peptide complexes [31,33,37,182–184]. It
has been used to derive NOE contacts in a symmetric
ssDNA binding protein [185–187]. It could also be of
great value in deriving NOE contacts in selectively
labeled RNAs or DNAs. For example, if only the A
residues are labeled a NOESY spectrum can be
derived that contains only the intra-residue NOE
contacts and a spectrum that contains only the inter-
residue contacts.
8. Relaxation and dynamics
For relaxation in nucleic acids two mechanisms of
interaction with the environment are important: the
dipole–dipole interaction and the chemical shift
Fig. 30. Continued.
363S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
anisotropy (CSA). In the case of dipole–dipole
interaction between two nuclei of spin
1
/
2
, rotational
motion of the vector connecting the two interacting
nuclei relative to the external magnetic field
modulates the interactions and thus causes transitions
between the energy levels of the spin
1
/
2
nuclei if the
rates of the motions correspond to the nuclear transi-
tion frequencies. Similarly, modulation of the length
of the vector occurring at the right frequencies can
cause transitions. The latter is not important if the
distance between the interacting nuclei is fixed by a
chemical bond. This makes
15
N and
13
C relaxation
attractive for studies of mobility because they relax
primarily by dipole–dipole interaction with directly
bonded protons. The chemical shift of a nucleus
expresses the shielding of the external magnetic
field by its environment. This shielding depends on
the orientation of the chemical structure relative to the
external magnetic field, i.e. it depends on the orientation
of the principal axes of the chemical shift tensor relative
to the external magnetic field. Rotational diffusion of
the nucleic acid and internal mobility can modulate this
effect and thus cause transitions between the two energy
levels of the spin
1
/
2
nucleus if the rates of the motions
correspond to the nuclear transition frequency. Hence,
CSA relaxation depends on whether the chemical
structure in the nucleic acid to be investigated moves
with frequencies corresponding to differences between
energy levels. The distribution of these frequencies is
usually expressed with so-called power spectral density
functions (see, for example, Ref. [188]):
J
AB
(q) ? 2
Z
‘
0
cos(qt)hA(0)B(t)idt
? 2
Z
‘
0
cos(qt)G
AB
(t)dt e54T
The symbols A(t) and B(t) stand for the Hamiltonians
relevant for fluctuating components of the relaxation,
such as dipole–dipole interaction and CSA. The
brackets represent the average over all molecules in
the sample. The ensemble average of the product
function, hA(0)B(t)i, is also called the time correlation
function, G
AB
(t), which for the dipole–dipole and
CSA relaxation is given by:
G
0
(t) ?
1
5
hP
2
{cos[x(0) 1 x(t)]}i (55)
Here, x is the angle of the dipole–dipole vector or the
principal axis of the CSA tensor relative to the exter-
nal field. We call this vector the relaxation vector.
Table 9
Overview transfer efficiencies in HCP type experiments
#
C49PC39/H39PC59/H59PC49PC39/H39PC59/H59P
HCP
a
28 15 8 6 3 1.8
HCP (dec)
b
30 45 30 7 3 3
HP(H)CH
c
–251 (10 1 3) 45 1 (10 1 1.2) –61 (2 1 0.6) 18 1 (2 1 0.4)
PCH
d
26 15(20) 5(13) 6 3 1.3
#
The overall transfer efficiencies, T
txxx
, are given in % and calculated as the product of the transfer functions, T
xxx
, of the separate transfer
steps (eqns (51)–(53)); xxx stands for C49P, C39PorC59P, and represent transfer from Cx to P. It is assumed that the H to C transfer is 100%
effective for C–H systems, whereas for CH
2
spin systems a smaller value may be obtained. Columns two to four give the transfer efficiencies for
T
2
? 3 s, the last three columns give the transfer efficiencies for T
2
? 30 ms; the J-coupling values used are (Hz): J
H49C49
? 145; J
H59/H50C59
? 145;
J
H39C39
? 145; J
C49P39
? 10:8; J
C49P59
? 11; J
C59P59
?15; J
C39P39
?14; J
C49C39
? 43; J
C49C59
? 43; J
C39C29
? 43; J
H59H50
?114.
a
HCP: optimal setting for delay 2t
2
is 24 ms for all transfers for both T
2
? 3 s and T
2
? 30 ms.
b
HCP (dec), here it is assumed that C59 can be decoupled from C49;C49 can be decoupled from C39 and C59;C39 can be decoupled from C49
(C29 still present): optimal setting delay 2t
2
is 24 ms for C49P, 50 ms for C39P and C59P transfers when T
2
? 3 s; for T
2
? 30 ms: all optimal
delays range 16 to 18 ms.
c
optimal delay settings all 18–20 ms; assumed is that the sugar conformer is N-type, and that g is gauche t ; the three numbers represent
efficiencies under three conditions: the first number represents the efficiency when J
H39P
or J
H59/H50P
have maximum value; the second and the
third (in bold) correspond to the limiting values of J
H39P
or J
H59/H50P
usually found in a double helix.
d
T
2
? 3 s: optimal setting for delays, 2t
2
and 2t
3
is about 24 ms for C49P; for C39P and C49P optimal setting is 18–20 ms for the first transfer
step and 45 ms for the refocus step, which are indicated by the number in parentheses; T
2
? 30 ms: first delay at 16–18 ms and second at 24 ms
for C49P, C39P and C59P.
364 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
In order to calculate G
0
(t) a description of motion is
required. We first consider the rotation of internally
rigid molecules. The simplest motion is the isotropic
rotational motion shown by spherical molecules, the
time constant of which can be calculated from the
Stokes–Einstein equation:
t
c
?
4phR
3
3kT
(56)
Here h is the viscosity of the solvent, k is the
Boltzmann constant, T is the absolute temperature,
and R is the hydrodynamic radius. The correlation
function decays exponentially for isotropic rotational
motion with the time constant t
c
:
G(t) ?
1
5
exp( 1 t=t
c
) (57)
so that the corresponding spectral density is given
by:
J(q) ?
2
5
t
c
1 t (qt
c
)
2
(58)
The assumption of isotropic tumbling does not exactly
hold for nucleic acids because the form of these
molecules is not spherical but rather cylindrical. For
a molecule with cylindrical or ellipsoidal shape two
rotation correlation times exist, namely, a time con-
stant for the rotation of the long axis, t
L
, and one for
rotation of the short axis, t
s
. The time constants t
L
and
t
s
can be related to the molecular radius R and the
length L by [189]:
t
L
?
phL
3
18kT
(ln(L=2R) t d
L
)
1 1
(59)
Fig. 31. Pulse schemes of the
15
N(q
2
) filtered NOESY (A) and the
13
C(q
1
) filtered NOESY (B) experiments. Homospoil delays of 20 ms,
which were applied during the relaxation delay and the NOE mixing period, are indicated by blocks marked with HS. Phase cycle for both
experiments: f
1
? 2238; f
2
? 4exT, 4e1xT; f
3
? 4e1xT, 4exT; f
4
? x, 1 x, y, 1 y, 1 x, x, 1 y, y; f
5
?1x, x, 1 y, y, x, 1 x,y, 1 y; f
6
(receiver)
? 2(x),4( 1 x),2(x). Two separate data sets are obtained with w ? x and w ?1x, respectively. To achieve phase-sensitive detection in the
indirect dimension, the phase of the
1
H excitation pulse was alternately x and y, and the procedure was completed during processing by inverting
the signs of every third and fourth data point. This approach allows the H
2
O signal to be kept prior to acquisition for both the x and y phases of
the
1
H excitation pulse in the t z half of the xyz sphere, reducing radiation damping effects. In addition, keeping (
1
/
2
)?2 of the H
2
O signal along
the t z axis prior to acquisition increases the signal intensity of the exchangeable protons [258]. The pulse schemes are adapted from Ref. [143].
365S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
t
s
?
3:842phLR
2
6kT
(1 t d
s
)
1 1
where
d
L
?10:0662 t 0:917(2R=L) 1 0:05(2R=L)
2
(60)
and
d
s
? 1:119 3 10
1 4
t 0:6884(2R=L) 1 0:2019(2R=L)
2
(61)
This equation holds for axial ratios, 2 , L/R , 30.
Other equations that correlate the cylinder length and
radius to the time constants exist, for example the
equation of Perrin and Broersma, but they only
apply for large L/R ratios [190,191].
As was shown by Woessner [192–194] the
correlation function for the vector connecting two
nuclei or the long axis of the chemical shift
tensor will then contain three exponentially decaying
Fig. 32. Parts of the spectrum after summation (A, B) and subtraction (C, D) of the w ? x and w ?1x data sets from the
15
N(q
2
) filtered
NOESY experiment of hybrid DNA·DNAxRNA triple helix, in which the RNA is
13
C and
15
N labeled. The DNA sequence is 59TCTCTC-TTT-
GAGAGA and folds as a hairpin. The RNA has sequence 59CUCUCU and is single stranded at high pH. At low pH the RNA cytosines are
protonated, and the RNA strand binds to the major groove of the DNA hairpin so that the triple helix is formed. The sum spectrum shows NOEs
to NH protons (A), and NH2 protons (B) of the DNA; the difference spectrum shows NOEs to NH protons (C), and NH2 protons of the RNA and
NH2 protons (D) of the RNA. In the spectral regions B and D only the NH2 protons of protonated cytosines are expected to resonate. The
spectral region B does not contain any cross peaks, i.e. no DNA NH2 resonances reside in this spectral region, demonstrating that none of the
DNA residues are protonated. The cross peaks in the boxed regions f in C directly demonstrate the existence of NOE contacts between DNA
H29/20 protons and RNA imino protons, providing direct evidence for the formation of a triple helix. The other boxed peaks are discussed in
detail by Van Dongen et al. [143]. Reproduced with permission from Ref. [143].
366 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
terms:
G
0
(t) ? 0:125(3 cos
2
f 1 1)
2
exp( 1 t=t
1
) t
3
2
cos
2
f
3 sin
2
f exp( 1 t=t
2
) t 0:375 sin
4
f exp( 1 t=t
3
)
e62T
where t
1
? t
L
, t
2
? 6t
L
t
s
/(5t
s
t t
L
), t
3
? 3t
L
t
s
/
(t
s
t 2t
L
). This expression also includes an explicit
dependence on the angle, f, which the dipole–dipole
vector makes with the principal axis of rotation. The
spectral density function will then consist of the fol-
lowing three terms:
J(q) ? 0:25(3 cos
2
f 1 1)
2
J(q, t
1
) t 3 cos
2
f sin
2
(f)J
(q, t
2
) t 0:75 sin
4
(f)J(q, t
3
) e63T
Thus, a vector along the long axis is affected only by
the end-over-end tumbling. On the other hand, for a
vector perpendicular to the long axis the spectral
density function is given by J(q) ? 0.25J(q,t
L
) t
0.75J(qt
3
), so that J(q) approaches 0.25J(q,t
L
)at
large L/R ratios. For sequential contacts the relaxation
vectors are mainly found almost parallel to the long
helix axis. In that case the correlation function G
0
(t)
contains only one relaxation time, t
L
. In a double
helical conformation the H6 to H5 vector lies approxi-
mately perpendicular to the long axis. The correlation
function G
0
(t) then contains two relaxation terms with
time constants t
L
and t
3
, respectively. In case the
vector lies somewhere inbetween these extremes
three relaxation times are present in G
0
(t), namely,
t
L
, t
2
and t
3
. In the particular case that the relaxation
vector is at ‘the magic angle’ G
0
(t) does not contain
the relaxation term with time constant t
L
. Conse-
quently, in this particular case the relaxation is deter-
mined by J(q,t
2
) and J(q,t
3
) and thus has a rather
weak dependence on helix length. For close to sym-
metric molecules all three relaxation times are almost
identical and the orientation of the relaxation vector is
not very dependent on the angle it makes with the
helix axis.
Internal motions will be superimposed on the over-
all motion, and as a result additional correlation times
will show up in the correlation functions. Woessner et
al. [194] have, for example, derived the time
correlation function for internal motion superimposed
on anisotropic overall rotation of an axially symmetric
molecule; in their derivation they assumed that inter-
nal rotation consists of free diffusion on a cone. The
time correlation function can then contain up to nine
relaxation times. Thus, generally speaking, quite com-
plex functions are obtained. The approach is then to
use a particular model for internal motion and to try to
interpret the relaxation parameters in terms of these
models. As pointed out by Lipari and Szabo [65] there
is the danger of overinterpretation of limited data and
the possibility that the resulting physical picture is not
unique. Models cannot be proven; they can only be
eliminated in favorable cases. Lipari and Szabo
[65,66] have addressed this problem by invoking
their model-free approach. They show that the infor-
mation content of NMR relaxation data on internal
motions can be completely described by (1) a general-
ized order parameter S, which is the measure of
spatial restriction of the motion, (2) an effective
correlation time t
e
, which is a measure of the rate
of motion. Lipari and Szabo assume that overall
motion and internal motion are independent. The
time correlation function can then be rigorously
factored into two contributions for isotropic overall
tumbling, namely, one from the overall motion,
G
0
(t), corresponding to G(t) discussed above,
and one contribution representing internal motions,
G
I
(t):.
G(t) ? G
0
(t)·G
I
(t) (64)
As pointed out by Lipari and Szabo, anisotropic over-
all motion cannot rigorously be factored into a product
of contributions due to overall and internal motion
even when it is assumed that these motions are
independent, but numerical evidence shows that the
factoring provides a very good approximation. Thus,
the above equation can also be used for internal
motion superimposed on anisotropic overall motion.
The second step is to describe the internal motion.
Internal motion can rigorously be described by a
sum of exponentially decaying terms:
G
I
(t) ?
X
i
a
i
e
1 t=t
i
(65)
This leads to a large number of parameters in the time
correlation functions that cannot be determined from
experimental NMR relaxation data. Lipari and Szabo
367S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
then try to determine the simplest possible description
for internal motion. From the definition of G
I
(t)it
follows that G
I
(0) ? 0; they define S as G
I
(‘) and
t
e
as the surface under the time correlation function,
i.e.
t
e
?
Z
‘
0
(G
I
(t) 1 S
2
)dt
1 1 S
2
(66)
They then assume that the simplest approximation to
G
I
(t), which is exact at t ? 0 and t ? ‘, has the form:
G
I
(t) ? S
2
t (1 1 S
2
)exp( 1 t=t
e
) (67)
The overall correlation function then becomes, for the
case of isotropic overall motion:
G(t) ?
1
5
S
2
exp( 1 t=t
c
) t
1
5
(1 1 S
2
)exp( 1 t=t) (68)
with
t
1 1
? t
1 1
c
t t
1 1
e
(69)
This leads to a spectral density function of the form
J(q) ?
2
5
S
2
t
c
1 t (qt
c
)
2
t
(1 1 S
2
)t
1 t (qt)
2
(70)
The spectral density function thus contains two fitting
parameters for each vector since the overall correla-
tion time t
c
is expected to be the same for all nuclei in
the molecule.
Thus, Lipari and Szabo invoke no particular model
to describe the internal motion; the approach is there-
fore model-free. S is a model-independent measure of
the spatial restriction of the internal motion. The
effective correlation time is determined by the area
under the correlation function for internal motion. In
that sense it is a model-independent quantity.In
summary, Lipari and Szabo have shown that this
description of internal motion in terms of two para-
meters, t
e
and S
2
, suffices, i.e. such a description
represents the total information content of NMR
relaxation measurements. The subsequent question
is whether these quantities can be used to describe
physical models for internal motion. As pointed out
by Lipari and Szabo, the quantities t
e
and S
2
cannot be
uniquely interpreted, and their values are, in principle,
consistent with an infinite number of physical pictures
of motion [66]. To be more specific, it is not possible
to relate t
e
to some physical aspects of the internal
motion in a model-independent way; the connection
between t
e
and the microscopic diffusion constants or
rates can only be established in the context of a model;
moreover, the value of t
e
depends on both the micro-
scopic time constants and the spatial nature of the
motion. In the special case of models with only the
rate of diffusion as an adjustable parameter, one could
derive from t
e
a physically significant value for it. But
for the usual, physically more relevant models which
have more than one adjustable rate parameter for
internal motion, t
e
can be reproduced in an infinite
number of ways. On the other hand, the quantity S
2
can be used to disprove certain models. It is therefore
of interest to consider S for a number of models.
In the special case that the internal motion is repre-
sented by free diffusion of the relaxation vector on a
cone, we have
S ? P
2
[cos(v)] (71)
Here v is the angle between the symmetry axis of the
cone and the relaxation vector connecting the nuclei.
Thus, it is possible for S to vanish at the ‘magic angle’.
Such a model would be a good representation of the
motion of a methyl group. In that case the angle v is
known a priori and S
2
, 0.11, if the free diffusion
model applies. For free diffusion of the vector in a
cone, S is given by
S
cone
?
1
2
cos v
0
(1 t cos v
0
) (72)
Here v
0
represents the maximum angle that the relaxa-
tion vector can rotate away from the symmetry axis of
the cone. As pointed out by Lipari and Szabo, any type
of internal motion can be modeled as diffusion in a
cone, irrespective of whether it makes sense
physically. Restricted diffusion on a cone is given by
S
2
? [P
2
(cos v)]
2
t
3 sin
2
v sin
2
g
g
2
3 (cos
2
v t
1
4
sin
2
b cos
2
g) e73T
Here v is the half angle of the cone, and the angular
range is given by 6 g. This model is more complex
then free diffusion in a cone and is multi-interpretable.
Interestingly, Lipari and Szabo derived that for the case
368 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
of free diffusion in a cone superimposed on anisotropic
motion, the order parameter is exactly given by
S
2
? S
2
cone
[P
2
(cos v
MD
)]
2
(74)
where S
cone
is given as above and v
MD
represents the
angle the vector makes with the long axis of the
cylindrical molecule. It is important to note here
that for the derivation of this equation for S, Lipari
and Szabo have taken for the internal correlation func-
tion all relaxation times, as well as those resulting
from rotation about the long axis, except for the one
representing the tumbling of the long axis. This is of
interest for nucleic acids, since it demonstrates that
S ? 0 for a relaxation vector moving on a cone, which
has its symmetry axis at the magic angle with respect
to the long axis; thus, when tumbling of the long axis
does not contribute to the relaxation.
It has been found for some residues in proteins that
the experimental parameters cannot be fitted to the
simple analytical expression (Eq. (67)) and additional
degrees of freedom must be introduced i.e. an additional
order parameter, S
f
, and time constant, t
f
. The analytical
model introduced to take care of the inconsistencies
employs the following internal correlation function:
G
I
(t) ? S
2
t (1 1 S
2
f
)e
1 t=t
f
t (S
2
f
1 S
2
)e
1 t=t
s
(75)
For anisotropic overall motion with internal motion
superimposed, the overall time correlation function,
G(t), follows from Eq. (64) with G
0
(t) and G
I
(t) given
by Eqs. (62) and (67), respectively,
G(t) ?
1
2
S
2
[0:25(3 cos
2
f 1 1)
2
exp( 1 t=t
1
)
t 13 cos
2
f sin
2
f exp( 1 t=t
2
) t 0:75 sin
4
f
3 exp( 1 t=t
3
)] t
1
2
(1 1 S
2
)[0:25(3 cos
2
f 1 1)
2
3 exp( 1 t=t) t 13 cos
2
f sin
2
f exp( 1 t=t
i2
)
t 0:75 sin
4
f exp( 1 t=t
i3
)] e76T
with
t
1 1
? t
1 1
L
t t
1 1
e
(77)
t
1 1
i2
? t
1 1
2
t t
1 1
e
t
1 1
i3
? t
1 1
3
t t
1 1
e
This leads to a spectral density function of the form
J(q) ? S
2
[0:25(3 cos
2
f 1 1)
2
J(q, t
L
) t 13 cos
2
(f)sin
2
3 (f)J(q, t
2
) t 0:75 sin
4
(f)J(q, t
3
)]
t (1 1 S
2
)[0:25(3 cos
2
f 1 1)
2
J(q, t)
t 13 cos
2
(f)sin
2
(f)J(q, t
i2
) t 0:75 sin
4
(f)
3 J(q, t
i3
)] e78T
For anisotropic overall motion the complete time
correlation function contains six time constants,
even in its simplest form. In addition, the quantity S
is present as a parameter. It is therefore of interest to
consider some examples. If the ratio L/R is close to 2
(spherical) all three time constants for overall motion
are quite similar. It is then appropriate to describe the
overall motion by one exponential term decaying with
an effective average relaxation time, t
av
. This average
value will now depend on the orientation of the
relaxation vector with respect to the cylinder axis,
and contrary to spherical molecules is not the same
for all nuclei. The internal motion can be included by
multiplying G
0
(t)byG
I
(t) and a simple description
similar to that for isotropic tumbling is obtained.
The order parameter is then representative of the
spatial restriction for internal motion. When the
ratio L/R q 1, the overall motion consists of a long
relaxation time t
L
and two much shorter relaxation
times. Again it seems appropriate to use the simplest
description, and let the time correlation function con-
sist of a long correlation time, t
c
, representing the
rotation of the long axis (t
c
? t
L
), and a shorter cor-
relation time, t
e
, which represents the time constants
resulting from rotation about the long axis as well as
the time for internal motion. The description then
becomes again a two parameter description, as for
isotropic motion. The order parameter S
2
is in this
case given by Lipari and Szabo (Eq. (74)). The quan-
tity S
2
now depends on the angle, v
MD
, the relaxation
vector makes with the long axis, and on S
2
cone
repre-
senting the spatial restriction of internal motion. Thus,
S
2
is now not directly representative of the spatial
restriction of internal motion; t
c
( ? t
L
) should be
the same for all nuclei as in the case of isotropic over-
all motion. For intermediate R/L ratios a description in
terms of an overall time constant, t
c
, a time constant
369S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
for fast motions, t
e
, and an order parameter S
2
should
suffice. However, in contrast to the case of isotropic
motion, these parameters will now have a more
complex interpretation and the orientation of the relaxa-
tion vector relative to the long axis has to be taken into
account. In fact, Lipari and Szabo proposed using, for the
case of anisotropic overall motion, two exponentially
decaying terms to describe the overall motion:
G
0
(t) ? A exp( 1 t=t
L
) t (1 1 A)exp( 1 t=t
s
) (79)
where t
L
is the time constant for end-over-end
tumbling of the long axis rotation, and t
s
represents
the shorter time constant related to overall tumbling.
Internal motion is then taken into account by
multiplying G
0
(t)byG
I
(t):
G(t) ? S
2
[A exp( 1 t=t
L
) t (1 1 A)exp( 1 t=t
s
)]
t (1 1 S
2
)[A exp( 1 t=t) t (1 1 A)exp( 1 t=t
es
)]
e80T
with
t
1 1
? t
1 1
L
t t
1 1
e
t
1 1
es
? t
1 1
s
t t
1 1
e
This ‘extended’ Lipari and Szabo approach now
represents a five parameter description, A, S
2
, t
L
, t
s
and t
e
. It has the advantage over the two parameter
description that t
L
is an interpretable quantity that
should be the same for all nuclei, and S
2
is the para-
meter that represents the spatial restriction for internal
motion. On the other hand the number of parameters
has increased. The quantity A is related to the orienta-
tion of the relaxation vector with respect to the long
axis; in its simplest interpretation A ? P
2
(cosv
MD
)
2
.
The amplitude of the term containing t
L
, which is
usually interpreted as the order parameter, equals
S
2
A ? S
2
P
2
(cosv
MD
)
2
. Note further that inclusion of
t
s
in the description will lead to a better reproduction
of the value of t
L
. This description has been used to
interpret nucleic acid relaxation data (see below).
Lipari and Szabo have investigated the range of
validity of their description. The general trend is to
predict values of S
2
that are too large and values of t
e
that are too small; the effect being stronger for smaller
values of S
2
and larger values of t
e
. For internal
motions slow on the NMR time-scale, i.e. (qt
e
)
2
. 1,
they predict for values of S
2
. 0.5 an error of 15% or
less, and for 0.3 , S
2
, 0.5 an error of the order of
30%. For (qt
e
)
2
, 1 the error is smaller. Secondly, as
pointed out by Lipari and Szabo, if there is an overall
motion common to all parts of the molecule there
cannot be a component in the correlation function
that decays more slowly then the overall motion. Con-
sequently, internal motions on a time-scale slower then
the overall tumbling are not detectable. Thus, the longest
time constant is always that of the overall tumbling.
As pointed out, relaxation depends on whether the
chemical structure to be investigated moves with fre-
quencies corresponding to differences between energy
levels. This distribution of frequencies is usually
expressed with so-called power spectral density
functions. For example, the commonly measured
relaxation parameters T
1
, T
2
, and the NOE effect
depend on spectral density functions at five frequen-
cies. In addition, the relaxation of longitudinal spin
order, anti-phase coherences, and multiple quantum
coherences can be measured. With the advent of
proton-detected heteronuclear experiments, sensitive
2D experiments have been developed for the
measurement of (heteronuclear) relaxation parameters
(see, for example, Ref. [196] for a summary). Below we
summarize the relations for the relaxation rates, R, of the
most relevant types of coherences and spin order:
R
IN
(I
z
! N
z
) ? d( 1 J(q
I
1 q
N
) t 6J(q
I
t q
N
)) (81)
R
IN
(N
z
! 2I
z
N
z
) ? KJ(q
N
)
R
IN
(N
x
! 2I
z
N
x
) ? K(
2
3
J(0) t
1
2
J(q
N
))
R
N
(N
z
) ? d[J(q
I
1 q
N
) t 3J(q
N
) t 6J(q
I
t q
N
)]
t c[J(q
N
)] e82T
R
N
(N
x
) ?
d
2
[4J(0) t J(q
I
1 q
N
) t 6J(q
I
) t 3J(q
N
)
t 6J(q
I
t q
N
)] t c[
2
3
J(0) t
1
2
J(q
N
)] t R
N
ex
R
I
(I
z
) ? d[J(q
I
1 q
N
) t 3J(q
I
) t 6J(q
I
t q
N
)] t r
L
R
I
(I
x
) ?
d
2
[4J(0) t J(q
I
1 q
N
) t 6J(q
N
) t 3J(q
I
)
t 6J(q
I
t q
N
)] t r
T
t R
I
ex
370 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
R
IN
(2I
z
N
x
) ?
d
2
(4J(0) t J(q
I
1 q
N
) t 3J(q
N
)
t 6J(q
I
t q
N
)) t c(
2
3
J(0) t
1
2
J(q
N
))
t r
L
t R
N
ex
e83T
R
IN
(2I
x
N
z
) ?
d
2
(4J(0) t J(q
I
1 q
N
) t 3J(q
I
)
t 6J(q
I
t q
N
)) t c(J(q
N
)) t r
T
t R
I
ex
R
IN
(2I
z
N
z
) ? d(3J(q
N
) t 3J(q
I
)) t c(J(q
N
)) t r
L
R
IN
(2I
1
N
11
) ? R
IN
(2I
11
N
1
) ?
d
2
(2J(q
I
1q
N
)t3J(q
N
)
t 3J(q
I
)) t c(
2
3
J(0) t
1
2
J(q
N
)) t r
T
t R
NI
ex
R
IN
(2I
1
N
1
) ? R
IN
(2I
1 1
N
1 1
) ?
d
2
(3J(q
I
) t 3J(q
N
)
t 12J(q
I
tq
N
)) t c(
2
3
J(0) t
1
2
J(q
N
))
t r
T
t R
NI
ex
R
IN
(2I
x
N
x
) ?
d
2
(J(q
I
1 q
N
) t 3J(q
N
) t 3J(q
I
)
t 6J(q
I
t q
N
)) t c(
2
3
J(0) t
1
2
J(q
N
))
tr
T
t R
NI
ex
R
ex
stands here for the exchange contribution to the
relaxation of transverse coherences. The longitudinal
and cross relaxation rates in a NOESY experiment, r
j
L
and j
j
L
, and the longitudinal and cross relaxation rates
in a ROESY experiment, r
j
T
and j
j
T
, are given by:
r
j
L
?
X
N
i ? 1
q[J
ji
(0) t 3J
ji
(q) t 6J
ji
(2q)] (84)
r
j
T
?
X
N
i ? 1
q[
5
2
J
ji
(0) t
9
2
J
ji
(q) t 3J
ji
(2q)]
j
j
L
? q[6J
ji
(2q) 1 J
ji
(0)]
j
j
T
? q[2J
ji
(0) t 3J
ji
(q)]
In Eqs. (82) and (83), d is the constant for
heteronuclear dipolar relaxation:
d ?
(
h
2p
)
2
g
2
I
g
2
N
4r
6
IN
(85)
The constant c for relaxation by chemical shift
anisotropy, D (ppm), is given by
c ?
q
2
N
D
2
N
3
(86)
The constant K is given by
K ?
(
h
2p
)g
I
g
N
q
N
D
N
hP
2
(cos(F)i
r
3
IN
(87)
and the constant for homonuclear dipolar interaction
is
q ?
(
h
2p
)
2
g
4
I
4r
6
ij
(88)
The terms g
N
and q
N
stand for the gyromagnetic ratio
and the Larmor frequency of spin N; r
IN
is the distance
between spin I and N, h is Planck’s constant.
Thus, NMR relaxation parameters are expressed in
spectral density functions. In order to analyze these
data one can proceed in two ways: (1) the spectral
density function at the sampled frequencies can be
derived exactly from a number of different relaxation
parameters; this procedure, called spectral density
mapping, is proposed by Peng and Wagner
[197,198]. The commonly measured relaxation para-
meters T
1
, T
2
and NOE depend on spectral density
functions at five frequencies. Such data cannot
therefore be interpreted without a known form of
the spectral density function, or, in other words, an
analytical form for the spectral density functions that
depends on not more than three parameters. In the
concept of spectral density mapping the approach
has been abandoned. Instead, six experiments are
selected or designed for measuring relaxation proper-
ties. The spectral density functions at the five frequen-
cies can then be calculated from a linear combination
of relaxation rates [197,198]. Thus, values of the
spectral density function are obtained at a number of
frequencies without reference to any model. Subse-
quently, these spectral density data can be analyzed
in terms of the model-free approach of Lipari and
371S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
Szabo or in terms of specific physical models. (2) The
second approach is to analyze the relaxation rates
directly in terms of either the Lipari and Szabo
model-free approach or in terms of explicit physical
models.
The approach of spectral density mapping further
clarifies the question central to the approach of Lipari
and Szabo about the information content of NMR
relaxation data. The best result that can be obtained
from NMR relaxation measurements is a partial map-
ping of the spectral density function. The question can
then be rephrased as: What is the information content
of a spectral density function mapped at a number of
frequencies? Since the spectral density function is
simply the fourier transform of the time correlation
function, what is the information content of a time
correlation function? From various tests (see, for
example, Refs. [199–201]) on time correlation func-
tions containing a number ( . 3) of exponentially
decaying terms with random errors of 0.1–0.2%, it
follows that at most three exponential terms can be
derived, i.e. increasing the number of exponentially
decaying terms does not statistically improve the fit
[202]. A fit in terms of two exponentially decaying
terms can show residuals of significant magnitude,
and increasing the number of exponential terms to
three leads to a statistically significant better fit, but
a further increase in this number does not. Thus, any
(multi-)exponentially decaying time correlation func-
tion can be described with high accuracy by at most
three time constants and two amplitude parameters
(assuming that the decay function is normalized to
say 1.0). Consequently, any spectral density function
can, with high accuracy, be described by the sum of
three Lorentzians, i.e. by three time constants and two
amplitude parameters. This is exactly the ‘extended’
Lipari and Szabo description, i.e. isotropic overall
motion with two decaying functions for internal
motions (Eqs. (57), (64) and (75)) or anisotropic over-
all motion described by two decaying terms and one
term for internal motion (Eqs. (79) and (80)). A sec-
ond aspect of the tests on (multi-)exponential decay
curves is that a three-exponential fit gives correct
values for the time constant and amplitude of the
slowest decaying exponential. The accuracy is better
the larger the amplitude. The time constants and
amplitudes of the other exponential terms generally
lie inbetween the actual values and cannot be related
to specific processes. Only in the case when the actual
time correlation function contains two exponential
terms does the shorter time represent the actual time
constant. This very nicely compares with the conclu-
sions of Lipari and Szabo mentioned above, namely,
that the order parameters can reliably be determined
(the accuracy increases with the magnitude of the
order parameter) and interpreted; the time constant
for overall motion can be determined, but the time
constant for internal motion cannot, except for special
cases where only one adjustable parameter is present.
This leads to the conclusion that NMR relaxation data
cannot provide time constants for internal motions
except under special circumstances (see below).
NMR relaxation data does provide a measure for the
time constant of overall motion and can reliably pro-
vide a measure for the generalized order parameter,
which is in fact the scaling of the contribution of over-
all motions. The order parameter can be interpreted as
a measure for the spatial restriction of internal
motions (but see above for the special case of aniso-
tropic overall motion). This also means that any
physical model for internal motion which contains
as adjustable parameters two time constants and two
amplitude parameters, can always be made to fit NMR
relaxation data. Only in the special case that the
physical model for internal motion contains as
adjustable parameters one time constant and one
amplitude parameter can one establish whether a
particular physical model applies or not.
Recently, two papers have appeared that address
two problems: firstly, how to derive anisotropic
motion from NMR relaxation data, and secondly,
how to detect and determine conformational exchange
processes. In the first paper by Tjandra et al. [203] the
NMR relaxation data of HIV protease is studied. They
obtain
15
N relaxation data (T
1
, T
2
and NOE). NMR
relaxation data have a limited information content.
Nevertheless, from such data one can determine, for
both isotropic as well as anisotropic motion, reason-
ably accurately the overall or end-over-end tumbling
time for each relaxation vector individually. In the
case of anisotropic motion additional time constants
will show up in the spectral density functions, and
reliable determination of, for example, the other
time constant related to overall motion, seems at
first glance impossible. However, there is an impor-
tant restraint on these parameters. They are the same
372 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
for all residues in a molecule. Consequently, one can
use the rather complex spectral density functions
which contain the terms resulting from anisotropic
tumbling and internal motion. In the analysis of
Tjandra et al. T
1
/T
2
ratios are considered, since
these are the least affected by internal motion;
residues with internal motion are excluded from the
analysis. In order to identify residues with internal
motion two separate cases can be distinguished.
Firstly, changes in J(q
N
) resulting from internal
motions that are faster than overall rotational diffu-
sion but slower than ,100 ps can significantly alter
the NOE values. Residues with a low NOE value are
excluded. Secondly, residues with conformational
exchange on a microsecond to millisecond time-
scale experience additional line broadening, com-
monly described by an exchange term R
ex
. In this
study a residue n is excluded when the following
criterion applies:
(hT
2
i 1 T
2, n
)=hT
2
i 1 (hT
1
i 1 T
1, n
)=hT
1
i . 1:5 3 SD
(89)
Here, the average is taken over residues that have not
been excluded because of a low NOE, and SD
represents the standard deviation of these data points.
Residues that exhibit slow conformational exchange
will have a considerably different T
2
but not T
1
, thus
failing to satisfy this equation. After this selection
enough residues are left to determine the rotational
diffusion tensor. The HIV protease has cylindrical
symmetry. Consequently, its diffusion tensor contains
two independent components, and two angles are
required to attach the diffusion to the molecular
frame, i.e. four independent parameters. The results
show that these parameters can be determined well.
The next step in the analysis is to seek to determine
parameters for internal motion. Previously when an
isotropic motional model was used a number of resi-
dues were identified as showing conformational
exchange. However, after the inclusion of anisotropic
motion such residues were no longer identified. Given
the values for the rotational diffusion tensor it is
straightforward to determine S
2
and t
e
. This generally
gives a good agreement between the values at
360 MHz and at 600 MHz, except for a small but
significant systematic difference in S
2
and t
e
. Tjandra
et al. [203] now take the ratio T
360
1
=T
600
1
and the ratio
T
360
2
=T
600
2
. The systematically lower values of S
2
at
360 Mhz compared to those at 600 Mhz are reflected
in the T
360
1
=T
600
1
and T
360
2
=T
600
2
ratios, which are higher
than expected for a rigid rotor. Tjandra et al. [203]
propose that the discrepancy results from two separate
causes: first, internal motions are not negligible when
calculating the diffusion tensor from T
1
/T
2
ratios, and
second, the chemical shift anisotropy (CSA) of
15
N is,
on average, about 10 ppm larger than the commonly
used value of 160 ppm.
A second paper, by Phan et al. [204], describes a
different approach to the analysis of NMR relaxation
data. Again
15
N relaxation data is used (T
1
, T
2
and
NOE), but here measurements are performed at
three different field strengths. The analysis shows
that the overall dimensions of the protein can be
approximated by a cylinder with an axial ratio D
par
/
D
per
of 1.9. The data can be fitted satisfactorily to a
Lipari–Szabo model, taking anisotropy into account.
A method is demonstrated for determining the
exchange contribution. Instead of the equation used
by Tjandra et al. [203] the exchange contribution is
determined from 1/T
1
( ? R
1
) and 1/T
2
( ? R
2
) using
the parameter (R
2
1 R
1
/2):
{R
2
1
R
1
2
} ?
c
2
t d
2
3
J(0) t
d
2
2
J(q
H
) t R
ex
(90)
where c and d represent as before the constants for
CSA and dipolar contribution to the relaxation. The
term (R
2
1 R
1
/2) is subsequently plotted against the
field strength. The first term on the right-hand side is
field dependent, i.e. c‘B
0
, as is the term involving R
ex
.
The term involving J(q
H
) also depends on the field but
may be neglected due to its smallness. One then
obtains (with c
1
? c/B
0
and A ? R
ex
/B
0
2
):
{R
2
1
R
1
2
} < (
c
2
1
3
J(0) t A)B
2
0
t
d
2
3
J(0) (91)
From a plot of (R
2
1 R
1
/2) against B
0
2
one can deduce
from the intercept I
0
? d
2
J(0)/3 the value of J(0) and
from the slope, m, the exchange constant A:
A ? m 1
c
2
1
I
0
d
2
(92)
Thus, any spin which has a slope m . (c
1
2
I
0
/d
2
) will
have an exchange contribution AB
0
2
. Ultimately six
residues were identified as having an exchange con-
tribution. However, the condition A ? 0 for the other
373S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
residues was also not strictly adhered to. For 45 resi-
dues (56%) a straight line with a positive slope could
be fitted to these data. Practically, the analysis was
conducted in detail for 31 residues for which the
slopes differed by less than 40%, on going from
50.6 MHz to 60.8 MHz and on going from 50.6 to
76.0 MHz.
It interesting to note here that Phan et al. [204] do
not observe the systematic deviation due to CSA as
found by Tjandra et al. [203] in their analysis. In the
analysis of Phan et al. [204] this deviation should
show up as a slope m being systematically larger
than c
1
2
I
0
/d
2
) by about 7% (10 ppm/160 ppm). Such
a difference is not detectable in the method used by
Phan et al. [204] because of the rather large variations
in the slopes observed ( , 40% for the 31 residues
analyzed in detail). This demonstrates the difficulties
in the analysis and in determining how errors may
propagate.
Relaxation studies on nucleic acids are not new and
pioneering work has been done on larger DNA frag-
ments (100 bps) over the last 20 years [205–208].
Studies on short synthetic oligonucleotide sequences
have also been performed, but have been limited by
the lack of spectral dispersion and the necessity to
work at natural isotopic abundance. Only recently
have
13
C labeled RNAs become available, opening
up the way for more detailed relaxation studies of
well-defined sequences. The amount of detailed
experimental relaxation data is thus limited (for recent
reviews see Refs. [18,195], and Ref. [209], which
describes solid state NMR relaxation studies). We
will briefly discuss here some specific aspects.
Boxer and co-workers have performed detailed
1
H
and
31
P relaxation studies on a series of oligonucleo-
tides varying in length from 6 to 20 base pairs
[210,211]. The overall tumbling time derived from
NMR data gave a lower value then derived from
dynamic light scattering (DLS). Agreement could be
made by assuming an order parameter S
2
of 0.8.
Relaxation studies involving
13
C of short synthetic
oligonucleotides are scarce. Williamson and Boxer
[210] studied a hairpin molecule with a loop thymi-
dine C6
13
C-enriched to measure the loop mobility.
The apparent correlation times for the thymidine C6
in the loop were shorter than those for the cytosine
H6–H5 cross-relaxation indicating mobility in the
loop. This was confirmed by the rather low order
parameter for the C6–H6 vectors (between 0.6 and
0.7). In a multinuclear NMR study on a DNA octamer
it was found that for both the C–H as well as the H–H
vectors in the ribose ring the order parameters were
smallest at the termini and largest at the center; the
bases showed the least evidence of mobility [212].
Borer et al. [213] have reached the same conclusion
based on
13
C NMR relaxation studies at three different
field strengths. Order parameters calculated from long
molecular dynamics simulations also show this same
trend [214]. A detailed
13
C relaxation study has
recently also been performed by Gaudin et al. [215]
on a DNA dodecamer, which has the central three
thymines
13
C-labeled at the C19-position. The relaxa-
tion rates R(Cz), R(Hz), R(Hz ! Cz), R(2HzCz),
R(2HzCxy) related to the H19–C19 vector were
measured. Instead of deriving spectral density
functions and/or order parameters and relaxation
time constants from these data, the rates were directly
compared with rates calculated for two models of
internal motion. A good fit with experimental data
was reached for a two-state jump model between
x-syn and x-anti conformations with P(syn)/P(anti) ?
9/91, or for a restricted rotation model with Dx ? 288,
and an internal diffusion coefficient of 30 3 10
7
s
11
.
King et al. [18] present relaxation data for
13
C-
enriched DTAR RNA which has a trinucleotide
bulge and a six-base loop. Here it is also observed
that the ribose nuclei have smaller order parameters
then the base nuclei. Furthermore, the order
parameters of the C19 and C6/C8 sites vary widely
(0.44 to 1.0) with a reasonable correlation between
secondary structure and mobility. Interestingly, the
loop residues have relatively high order parameters
indicating a rather well structured loop.
In the analysis of the relaxation data discussed
above both the Lipari and Szabo model-free approach
and more complex physical models have been used. It
is interesting to note that according to the analysis of
Lipari and Szabo the information content of NMR
relaxation measurements is such that apart from the
overall time constant, two parameters are sufficient to
describe the relaxation data. We later argued, using
simulation of exponentially decaying time correlation
functions, that in the case of highly precise data two
extra parameters may be added, i.e. a description in
terms of an overall time constant, two smaller time
constants and two amplitude parameters may be
374 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
possible (this is the extended Lipari and Szabo
description). Hence, any physical model for internal
motion superimposed on overall motion that contains
as adjustable parameters three time constants and two
amplitude parameters can always be made to fit NMR
relaxation data. Only in the special case that the
physical model contains a smaller number of adjust-
able parameters, for example, for internal motion
contains one time constant and one amplitude para-
meter, can one establish whether a particular physi-
cal model applies or not. The latter condition is
generally physically quite unrealistic for nucleic
acids, where often much more complex motions
occur, each of which are not well known. One is
forced to conclude that for the interpretation of
NMR relaxation data one should not go beyond
determining and interpreting the order parameter.
The generalized order parameter S
2
has for nucleic
acids (when anisotropic motion occurs) a more
complex interpretation than for proteins (assuming
isotropic overall motion); it may not even directly
reflect internal motion. For example, the C6–H6
vector lies in a helical region almost perpendicular
to the long axis, but in bulge regions this may not
be the case. This affects both the effective overall
relaxation time and the order parameter. In particular
when the vector lies at the ‘magic angle’ with respect
to the helix axis, shorter time constants govern the
relaxation even in the absence of internal motion.
The complexity which results from the presence of
anisotropic motion in nucleic acids represents both a
challenge and an opportunity. In particular, the
dependence of the relaxation data, i.e. S
2
, on the
orientation of the dipolar relaxation vector with
respect to the overall axis, enables the use of this
type of data in structure determination. The important
characteristic of such data is that it represents global
structural knowledge, which is in contrast to NOE and
J-coupling data, which is strictly local.
9. Calculation of structures
The different methods that have been used over the
years to determine structures from NMR data, have
nowadays more or less converged into one consensus
approach, although the area is very active and new
methods are regularly developed. This present
consensus on how NMR derived structures should
be obtained and how the data should be presented is
reflected in a set of IUPAC recommendations [216].
These recommendations form a very useful overview
and provide a checklist when performing structure
calculations. The consensus approach currently
adhered to is to calculate the final set of structures
using a simulated annealing (SA) protocol, with
X-plor [78] being one of the most popular programs
used for this purpose. This is also the method we have
mainly used in our calculations. The SA method
should, however, not be used as a black box, since it
does not allow one to screen with complete certainty
the full conformational space consistent with the
experimental data. It is thus important to ensure firstly
that methods are used that search conformation space
as much as possible, and secondly, that the ultimately
derived structures are validated. In the following we
will discuss a structure calculation protocol and
aspects concerning the validation of the derived
structure or set of structures.
The consensus structure calculation protocol used
by us [12,77,139,217] consists of the following steps.
The procedure starts out with generating extended
random structures, in order to ensure that confor-
mation space is searched as fully as possible. Two
options can be chosen to derive the initial structures
from these extended structures: (1) A distance geo-
metry type of approach can be applied; such methods
are less computationally demanding and consequently
allow the derivation of a set with a large number of
initial structures (up to 500) [12,217]. (2) Starting
from extended randomized structures, an SA folding
protocol is used to calculate the initial structures
[77,139]. (3) The best structures, i.e. the converged
structures, can subsequently be subjected to rounds of
refinement using an SA refinement protocol.
The SA folding protocol we use [77] has simi-
larities to the protocols as described by Santa Lucia
and Turner [218] and Varani et al. [36]. It was applied
to a two-strand RNA duplex with sequence
59GGGCUGAAGCCU39, in which a tandem G.A
mismatched base pair was formed (see below). The
molecule is folded from the extended structures via a
number of successive SA runs. In the first and second
round the global fold is determined by adding succes-
sively more and tighter distance and torsion angle
constraints. A high level of convergence was obtained
375S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
(75 out of 80). In the final round, i.e. the SA refine-
ment protocol, all constraints are used and the upper
and lower bounds set to their final values [77]. Fig. 33
shows an overlay of the final structures. The high
number of constraints per residue ( . 30) leads to a
defined structure (local superposition of the central
four residues yields and rmsd of 0.6 A
?
). The global
rmsd gives a value of 1.25 A
?
. Due to the linear
extended nature of helices and the local nature of
NMR constraints, the NMR derived structures are
well defined locally, but not globally (see Fig. 33).
When presenting nucleic acid structures one should
therefore show a number of locally superimposed
structures (see Fig. 33), instead of one set of struc-
tures. We have observed that the most difficult step in
the SA protocol is to obtain the global fold. Applying
the SA protocol to an intra-molecular triple helix via
SA from extended chains leads to very low levels of
convergence [139]. Recently, Stein et al. have pro-
posed a method called torsion angle dynamics [219].
This method is less computationally demanding, since
the chemical structure is maintained during the
dynamics. We have applied this method and observed
a considerably improved convergence rate (around
50%), when starting from random extended structures
[139].
In the final stage the derived structures have to be
analyzed and validated. The usual approach is to pre-
sent an overview of the structure calculation statistics
(see, e.g. Refs. [12,77,139,217]). This overview
should contain a description of the constraints, i.e.
NOE or distance constraints, dihedral, hydrogen
bond constraints, etc. The NOE or distance constraints
should be divided, according to the type of constraint,
into intra-residue and inter-residue constraints. In
view of the discussion concerning distances and
their effect on the accuracy of the derived structure
in Section 4, it seems advisable to further divide these
two categories, for example, according to the division
made in Table 1. For example, constraints involving
imino and amino protons indicate base pairing. The
usual approach is to base the structure calculation on
the experimental data, i.e. the derived structure or set
of structures should follow from the experimental data
and not be biased by uncertain force field effects.
Therefore, the constraint set should contain the
experimental constraints and the holonomic con-
straints, i.e. bond, bond angles, dihedrals, impropers,
and hard-sphere van der Waals constraints. The
experimental constraints (distance and torsion angle)
are usually given as upper and lower bounds, with a
quadratic increase in penalty outside the allowed
range. The distance between upper and lower bound
is the estimated error in the constraint value. To check
whether the experimental data is consistent with the
derived structure, the overview should therefore give
the number of violations of experimental constraints
(distance, dihedral angle, etc.), and/or the energies,
divided according to the categories bond, bond angles,
dihedral, restraint dihedrals, impropers, van der Waals
(hard sphere), and restrained distances (plus other
categories when applied). The energies provide an
Fig. 33. Final set of ten best structures of UGAA (see text): (a) structures were fitted on the whole molecule; (b) structures were fitted on the
UGAA part of the molecule; (c) structures were fitted on the bottom stem of the molecule.
376 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
indication of whether the structure is strained (to an
unusual extent) by the experimental data. The rmsd
of the set of structures that is ultimately derived
therefore reflects the uncertainty in the derived struc-
ture, as it results from the uncertainty in, or lack of,
the experimental data.
We note that the precision of the derived structures,
as reflected in the rmsd values of the family of struc-
tures, is determined by three main aspects (see also
discussion in Sections 4.1 and 4.4): (1) the precision
of the derived distance constraints, or for that matter
of the torsion angle constraints, is not the major con-
tributor to highly precise structures, but see Section
4.4 for a detailed overview of the type of constraint
lists that can be used and the consequences for the
precision of the family of structures; (2) the major
contributor to highly precise structures is the spread
and the number of experimental distance constraints;
the spread should be as even as possible throughout
the chemical structure, and the number should be at
least 10 per residue, but highly precise structures
require significantly larger numbers (see Section 4.4
for a detailed description); (3) torsion angle con-
straints are not the major contributor to highly precise
structures; only for already well defined structures
will their inclusion lead to significant further improve-
ment in precision, as suggested by the systematic
study performed by Allain and Varani [79].
An approach to further test the reliability of the
structure calculations is the method of cross validation
suggested by Brunger et al. [220]. Cross validation is a
statistical method which involves randomly removing
about 10% of the constraints from the total list of
experimental constraints, and subsequently deriving
the structures based on these different subsets of con-
straint data. The spread in the derived structures then
gives an indication of the reliability of the derived
structure or set of structures. However, this approach
can present a problem for structures derived from
NMR data, because important structural features
often depend heavily on a few characteristic NOEs.
If this is the case, cross validation is not a viable
method, since removing these characteristic NOE
data would lead to obviously wrong structures, or to
a set of structures with too much spread.
Because important structural features often depend
on a few characteristic NOE contacts, and the preci-
sion depends heavily on the number of NOE-based
constraints, it is very important to make use of all
possible NOEs and at the same time to carefully
check the constraints against the experimental data.
A very valuable approach in this respect is back-
calculation of the NOE data. This should be done
during each of the various rounds of structure refine-
ment to ensure that all the cross peaks have been
correctly interpreted; this includes a check on whether
short distances in the calculated structures are indeed
absent in the NOE spectra, otherwise lower bounds
should be introduced.
A second type of check concerns the geometry of
the derived structures. For proteins various programs
exist that check the geometry of the derived struc-
tures. No such programs are in use for nucleic acids.
This is unfortunate, but understandable, since the
structure of nucleic acids is determined by a much
greater number of torsion angles; the nucleic acid
backbone is determined by six variable torsion angles,
while for proteins this number is only two. Neverthe-
less, certain restrictions apply to these torsion angles,
i.e. the ? torsion angle is sterically restricted to
approximately 150–2808, and the d torsion angle,
related to the sugar ring puckering, lies between
about 808 and 1458. The other torsion angles have
usual ranges for A- and B-type helices, but may in
other cases be found in any of their three possible
rotamer states. For a complete overview of the
sterically allowed ranges we refer to Ref. [88], and
Sections 4 and 5 of this review. Another very
important aspect of correct geometry concerns the
stereospecificity; the most salient here is the correct
placement of H20 and H29 in DNA, of H29 and OH20
in RNA, and of H59 and H50 in both DNA and RNA.
We have found during the course of our study on the
proton chemical shift calculations (see Section 6 and
Ref. [50]), that in quite a number of deposited PDB
files the correct stereospecific geometry is violated.
The H59 and H50 protons seem to be placed almost
at random, i.e. a high percentage have the wrong geo-
metry, but some H29 and H20 protons, and sometimes
in RNA the H29, were placed in the wrong position.
The latter errors are of the utmost relevance, because
these protons provide essential experimental distance
constraints in the structure calculations. It is therefore
important that calculated structures are checked
against correct geometry. A program which checks
correct stereospecificity is available from the authors
377S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
on request [50]. Similar findings were recently
published by Schultze and Feigon [221].
Finally, we note that the application of distance
constraints in restrained molecular dynamics or in
SA protocols is strictly speaking only allowed when
the structure under study is rigid. When internal
dynamics are present, the SA protocols, as discussed,
cannot be applied. One alternative is to use time
averaged constraints [222,223]. However, in the
molecular dynamic runs only fast internal motions
tend to be visible. These are in turn often related to
libration motions, i.e. motions within a certain
rotamer domain. For this type of motion the distances
fluctuate around a middle value, so that one can use
one average distance, and averaging would not be
required. For situations such as the sugar ring flips
between N- and S-states, and torsion angle flips
between different rotamer states, distances can be
small in one case and large in the other. In order to
account for these differences in distances, averaging
techniques have to be applied. Torsion angle flips and/
or sugar ring flips do not occur often enough in the
usual molecular dynamic runs, so that averaging is
insufficient and the approach of time averaged con-
straints is not a good one. One either has to extend the
dynamics period or apply ensemble averaging. Both
approaches present enormous computational problems.
Further developments in this area are awaited.
10. Prospects for larger systems
In the field of protein NMR, three-dimensional
structure determination can nowadays be successfully
performed on proteins with a molecular weight up to
30 kDa. Most NMR studies of nucleic acids, including
those using labeled RNA nucleotides, involve systems
of limited size, up to 40 nucleotides, which corre-
sponds to a considerably smaller molecular weight
of up to 13 kDa.
13
C and
15
N labeling of RNA is
still a fairly recent development. It is therefore not
surprising that the initial studies have been limited
to smaller systems. Although performed on smaller
systems, these studies do provide good indications
as to what the bottlenecks are for larger systems. It
seems that the two main aspects that need to be con-
sidered are the T
2
relaxation times (or line widths),
and the resonance overlap.
The factor that mainly determines the T
2
relaxation
rates is the tumbling time of the molecule studied (see
Section 8). We have calculated the tumbling time as a
function of the number of base pairs for a number of
situations that are likely to be encountered when
studying nucleic acids, namely, for a spherical mol-
ecule (radius r), and for cylindrical molecules, which
have either the usual helix radius or double the radius
(Fig. 33). Since the helix radius is about 11 A
?
, short
nucleotide sequences (up to 10 base pairs), such as
encountered in hairpins studied by NMR, will have
a more or less spherical shape. Longer sequences will
tend to fold into helices of cylindrical shape. For
sequences containing more than 30 to 50 base pairs
in a row, a turn may occur, leading to two helices
folded side by side (see, for example, the X-ray
Fig. 34. The rotation times of differently shaped nucleic acid mole-
cules as a function of the number of nucleotides. A cylindrically
shaped and a randomly coiled molecule are considered. The rotation
time for the end-over-end tumbling of a cylinder, t
L
, and for rota-
tion about the long cylinder axis, t
s
, are calculated according to Eq.
(59) in Section 8 (kT ? 4.0 3 10
121
J; h ? 0.001 poise). For the
calculations it is assumed that the nucleotides form base pairs which
fold as an A-type double helix; the length of the cylinder, L, then
equals the number of nucleotides times 1.35. The cylinder radius is
taken to be 11 A
?
. The rotation times, t
Ld
and t
sd
, represent end-
over-end tumbling and rotation about the long axis, respectively, of
a cylinder of length L/2 and radius 22 A
?
. These rotation times cor-
respond to two helices stacked side by side. For the calculation of a
randomly coiled molecule, t
rc
is taken to be proportional to L
1.5
and
scaled in such a way that it equals t
L
at a length corresponding to 17
base pairs. The rotation times t
cyt
and t
cytd
correspond to t
3
of Eq.
(62) in Section 8, i.e. t
cyt
? 3t
L
t
s
/(t
s
t 2t
L
) and t
cytd
? 3t
Ld
t
sd
/
et
sd
t 2t
Ld
T. They represent the shorter rotation time of a relaxation
vector perpendicular to the helix axis.
378 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
structure of part of the Tetrahymena group I intron
[4,5]). The time constant for tumbling of a spherical
molecule, t
sphere
, increases steeply with the third
power of the diameter, D, with D taken to be equal
to the number of base pairs (nb) multiplied by 2.7 A
?
,
so that t
sphere
~ (nb)
3
~ D
3
. Similarly, the time con-
stant for end-over-end tumbling of a cylinder, t
L
,
increases with the third power of the length, L, and
thus with the number of base pairs, i.e. t
L
~ (nb)
3
~
L
3
. On the other hand, the time constant for tumbling
about the long axis is smaller and increases only lin-
early with length, t
L
~ (nb) ~ L. Included in Fig. 34
are also the tumbling times, t
Ld
and t
sd
, for the case
where the helix has twice the usual diameter. As
expected t
Ld
is considerably smaller than t
L
, and t
sd
is larger then t
s
. We also show t
cyt
?
1
/
2
(t
L
t t
s
), and
t
cytd
?
1
/
2
(t
Ld
t t
sd
). These values should correspond
closely to the effective rotation times governing H5
relaxation [195]. We find that for a system of 20 base
pairs (40 nucleotides, 12 kDa) the tumbling time, t
cyt
,
is of the order of 5 ns, while for side by side helices
each 20 base pairs in length (80 nucleotides; 24 kDa),
the tumbling time, t
cytd
, is about 13 ns. Taking t
L
and
t
Ld
as yardsticks we find 12 ns (12 kDa) and 26 ns
(24 kDa), respectively. The former values compare
quite closely with the tumbling times of proteins,
where 3.4 ns is estimated for an 8 kDa protein (eglin
c [198,224], and 10.2 ns for a protein of 22kDa [203].
Thus, the tumbling times for nucleic acids are 1.0 to
2.0 times larger than for proteins of the same
molecular weight.
In Fig. 35(A)–(D), estimates are given of transverse
relaxation rates as a function of the number of nucleo-
tides for four different situations. In Fig. 35(A) non-
exchangeable protons directly bonded to
13
C are
considered. The figure shows the transverse rate
resulting from dipolar interaction between
13
C and
1
H. Thus, the curves represent the relaxation rates of
in-phase Cx coherence with the
13
C nucleus attached
to a proton. In the inset the relaxation rates are shown
that ensue for HzCx and Hx coherences. Fig. 35(B)
shows transverse relaxation rates of non-exchange-
able protons not attached to a
13
C nucleus (or directly
bonded to
12
C), and in Fig. 35(C) the effect on the
transverse relaxation rates of deuteration of surround-
ing protons is examined. Finally, in Fig. 35(D) the
transverse relaxation of imino protons, directly
bound to
15
N, is shown with and without deuteration
of surrounding protons. The effect of deuteration is
simulated by removing short distances from the pro-
ton relaxation pathways (see legend). Internal motion,
which reduces the relaxation rates by a factor of 0.8, is
assumed to be present in some cases.
In Section 7 we presented the transfer functions for
a variety of through-bond NMR experiments
assuming two different T
2
relaxation times. Setting a
somewhat arbitrary limit for breakdown of these
experiments at transfer efficiencies ranging from 0.1
to 1%, we arrive at limiting T
2
relaxation rates or line
widths of the order of 20–30 Hz, indicated in
Fig. 35(A)–(D) by the horizontal broken lines. The
way in which through-bond experiments are affected
by transverse relaxation depends on the type of
coherences present during the longest transfer steps.
We consider the experiments in that order. The first
set of through-bond experiments encompasses HCN,
HCNCH, PCH and HCP. These experiments all
contain long transfer steps of the type, Cx ! CyNz
or Cx ! CyPz, i.e. the transfer steps involve transfer
of in-phase transverse coherence of a
13
C nucleus
directly attached to a proton. The sensitivity of these
experiments will therefore be determined by the
relaxation rates of these in-phase transverse
13
C
coherences, the rates of which are completely deter-
mined by the dipolar interaction of the
13
C nucleus
with its directly bonded proton and can be read off
from Fig. 35(A). As can be seen from Fig. 35(A),
these through-bond experiments will start to fail at
rather short helix lengths of around 25 bp’s corre-
sponding to sequences of around 50 nucleotides
(15 kDa). The range of these experiments can be
extended tremendously by means of selective deuter-
ation. The relaxation rate of in-phase transverse
coherences of
13
C not bonded to a proton is a factor
of over a 100 smaller than for
13
C bonded to a proton.
This means that
13
C !
15
Nor
13
C !
31
P transfers can
remain effective in specifically deuterated compounds
(possibly up to 200 nucleotides). For example, the
HCCP–TOCSY experiment on a deuterated RNA,
except at the H19 position, will remain a sensitive
experiment even for long sequences. The second set
of through-bond experiments consists of HCNCH,
HPCH, long-range (H,N) HSQC, and long-range
(H,C) HSQC. They all contain as the main transfer
step either
15
Nz !
1
HxNz,
31
Pz !
1
HxPz,or
13
Cz !
1
HxCz, where the
1
H is attached to a
13
C nucleus. Here
379S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
one can take the relaxation rates of Hx as a measure to
derive the sensitivity of these transfers. As shown in
the inset of Fig. 35(A) these experiments will start to
fail for even shorter lengths (20–40 nucleotides). The
life span of such experiments can be improved by
removing the
13
C nucleus from the
1
H relaxation path-
way. When the
13
C labeling is absent the proton
relaxation rates are as shown in Fig. 35(B). As can
be seen, the experiments can be performed up to 50 to
70 nucleotides. This range can be extended further, up
to 90 to 140 nucleotides, by deuteration of the
surrounding protons, as shown in Fig. 35(C). Thus,
deuteration and removal of
13
C labeling can extend
the range of these experiments to sequences which are
over 100 nucleotides long. A third set of through-bond
experiments involves those aimed at assignment
of exchanging protons. Here, the main transfer step
is
15
Nx !
13
CzNx, with the
15
N nucleus directly
bonded to a proton and no proton directly attached
to the
13
C nucleus. In this case one has to consider
the relaxation of in-phase transverse
15
N coherences,
shown in the inset of Fig. 35(D). The experiment will
only start to fail for much longer sequences (around
100 to 150 nucleotides). In summary, the range of the
through-bond experiments can be extended to
sequences of over a 100 nucleotides by judiciously
applied labeling techniques. In other word, selective
deuteration in combination with selective
13
C labeling
can make most through-bond experiments, albeit in
modified form, sensitive enough even for sequences
with more than 100 nucleotides. Finally, we also need
to consider, apart from the through-bond experiment,
Fig. 35.
380 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
the multi-dimensional
13
C and/or
15
N edited NOE
experiments, since these form an important part of
the assignment and the structural characterization.
Multi-dimensional
13
C and/or
15
N edited NOE experi-
ments contain only short transfer steps (3 ms), and are
thus expected to start to fail at line widths greater than,
say, around 100 Hz, which corresponds to a limit of 100
nucleotides (Fig. 35(A)). NOE information can also
be obtained from samples that are partially deuterated
instead of
13
C labeled. In this case the upper size limit
is difficult to give, but will be above 100 nucleotides,
as follows from Fig. 35(A) and (B). These lengths are
all considerably beyond those achieved in practice
nowadays. It can be concluded from these data that
T
2
relaxation times are not the limiting factor for
extending the size of RNAs or DNAs studied today.
The other factor that has to be considered is the
resonance overlap. As pointed out before, the intro-
duction of
13
C and
15
N labeling does considerably
reduce overlap. A prime example is that in RNA the
H29 to H59/50 resonances overlap completely, while
the (C29,C39), C49 and C59 resonances reside in sepa-
rate regions. Unfortunately, these
13
C spectral regions
are narrow and within these regions rather strong
overlap may occur. It seems that in particular the
C29 resonances reside in a region of about 1 to
2 ppm. The C49 resonances do spread out if they are
in a non-helical environment, but in a helical environ-
ment the spread is also approximately 2 ppm. The H19
and C19 resonances do have a reasonable dispersion.
The imino
15
N resonances have limited dispersion,
and the
31
P resonances, in a helix, are notorious for
their small dispersion. The size limit is thus mainly
determined by resonance overlap even in the era of
13
C and
15
N labeling. Today, the size limit lies around
40 nucleotides, but even for this size residue-type-
specific labeling is required (see, e.g. Ref. [225]). In
the case of uniform labeling, a rough estimate of the
size limit is 30 nucleotides. Using this value as a
yardstick it is possible to hypothesize on the prospects
for larger systems, and on the amount of selective
labeling required by the following reasoning. Assume
a sequence of length 120, which contains 30 A resi-
dues, 30 G residues, 30 C residues and 30 U residues.
With 30 nucleotides as the limit for assignment using
uniform labeling, it follows that assignment should be
possible, when only the 30 A residues are labeled. The
same applies in the case of residue-type-specific label-
ing of the G, C and U residues. Thus, with residue-
type-specific labeling and applying through-bond
Fig. 35. Transverse relaxation rates versus the number of nucleotides involving non-exchanging protons (A, B and C) and imino protons (D).
The rotation times for end-over-end tumbling and tumbling about the helix axis have been calculated using eqns (59) and (60) (Section 8). The
helix is assumed to be of the A-type. When the number of nucleotides is less than or equal to 50, the length of the helix is taken to be equal to half
the number of nucleotides times 2.7 A
?
, and the helix radius is assumed to be 11 A
?
. When the number of nucleotides is greater than or equal to
100, the length of the helix is taken to be equal to one-quarter the number of nucleotides times 2.7 A
?
, and the helix radius is assumed to be 22 A
?
.
It is thus assumed that when the number of nucleotides exceeds 100, two side-by-side helices are formed (this is to represent for example the
case of a group I intron RNA). Between 50 and 100 nucleotides the values for the relaxation rates are linearly interpolated from the values at 50
and 100 nucleotides. Calculated are the transverse relaxation rates of a proton directly bonded to a
13
C nucleus (A), a
12
C nucleus (B and C), and
a
15
N nucleus (D). The relaxation is assumed to result from dipolar interactions alone, either with the directly bonded nucleus (the bond length is
assumed to be 1.095 A
?
) or with other protons (see eqns (81)–(88), Section 8). Note that the hetero nucleus relaxes by the directly bonded proton
with the same rate as the proton is relaxed by the directly bonded hetero nucleus. Two situations are always considered, namely the relaxation
vector lies parallel to the helix axis or perpendicular to it. The spectral density function is then calculated according to Eq. (63); in the former
case the density function is given by J(q,t
L
) and in the latter by 0.75J(q,t
L
) t 0.25J(q,t
3
). The horizontal dotted lines indicate relaxation rates
of 20 and 30 Hz, respectively. These values can be considered roughly the level above which triple resonance experiments start to become too
insensitive. The two vertical arrows indicate where these levels cross the expected relaxation rates, and should thus indicate roughly the
maximum lengths at which triple resonance experiments can be performed. (A) The effect on the proton transverse relaxation rate due to dipolar
interaction with a directly bonded
13
C nucleus. The inset shows the sum of the rates due to dipolar interaction with the directly bonded nucleus
and due to dipolar interaction with other protons at 2.2, 2.4 and 3.0 A
?
, respectively; internal motion is taken to be present, scaling the transverse
relaxation rates by 0.8 ( ? S
2
). (B) Transverse proton relaxation rate resulting from dipolar interactions with protons at 2.2, 2.4 and 3.0 A
?
. (C)
The effect of deuteration; transverse proton relaxation rate resulting from dipolar interaction with one proton at 2.4 A
?
. (D) Transverse imino
relaxation rate resulting from dipolar interaction with the directly bonded
15
N, and dipolar interaction with protons at 3.0, 3.0, 2.7 and 2.7 A
?
(broken lines); in addition, 10 Hz exchange broadening is assumed (drawn lines). The broken lines represent the effect of deuteration of non-
exchanging protons; the rate is calculated as above but now with protons at 3.0, 3.0 and 2.7 A
?
. The inset gives the transverse rate resulting from
dipolar interaction with the directly bonded nucleus alone. This represents the relaxation rates of in-phase
15
N relaxation. Internal mobility is
taken to be present scaling the rate by a factor of 0.8 ( ? S
2
).
381S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
assignment techniques, one arrives at a size limit of
120 nucleotides. It remains to be seen whether this
limit is unrealistically large.
The above considerations show that, for nucleic
acids, it is of penultimate importance to establish
labeling methods that reduce the overlap in the
spectra. The above-mentioned approach of residue-
type-specific labeling is a straightforward extension
of existing and practiced methods which employ
enzymatic means to achieve the labeling. A further
extension of the size limit could possibly be achieved
when using different site-specific mutant forms of the
studied sequence. Finally, site-specific labeling is an
ideal approach, but for RNAs is extremely expensive,
even more so for larger systems. Obviously, present
labeling methods have to be extended to be able to
successfully study larger systems ( . 120) by NMR.
A final aspect that plays a significant role is the
development of improved structure calculation
methods. As pointed out in Section 6, the first steps
have only just been taken towards quantitative utiliza-
tion of chemical shifts [50,226]. Furthermore, the use
of ambiguously assigned NOEs in simulated anneal-
ing has been demonstrated for protein NMR by Nilges
[227], but has as yet not been tried for nucleic acids.
This method is particularly interesting, since Nilges
has shown that only a very small number of assigned
NOEs is required to derive the correct three-
dimensional structure. Furthermore, Smith et al.
[228] have recently considered how minimal a dataset
is required for NMR studies of large proteins (over
40 kDa), to still be able to derive a three-dimensional
structure, albeit of low resolution. Application of
these methods in the field of nucleic acids will
certainly help to further extend the size limit.
In conclusion, it seems that with the introduction of
specific deuteration and partial
13
C labeling techni-
ques, the limit of the systems that can be studied
lies well beyond 100 nucleotides. Residue-type-
specific labeling can extend the range to about 120
nucleotides. Beyond that, site-specific-labeling
techniques are required. It is interesting to consider
in this respect the group I domain, for which the struc-
ture has recently been determined by means of X-ray
diffraction. The total length of this sequence is 160
nucleotides. The structure of the group I intron
domain shows two side-by-side helices of approxi-
mately 60 base pairs each. This corresponds to a
molecule with a rotational correlation time as
described earlier in Fig. 35, for which many
through-bond assignment techniques can still be
applied, when labeling techniques are judicially
applied. It is also interesting to note that the model
of the structure of the complete group I intron does not
have a size much larger than this.
11. Conclusions
The last five years have seen very exciting innova-
tions in the field of NMR of nucleic acids, spawned by
the development of RNA labeling techniques. It is to
be expected that this field will further mature in the
near future. A number of possible developments,
which have been hinted at in this review, are expected
to occur in the near future. Rather simple extensions
of the present labeling methods for RNA will allow
larger systems to be tackled. It is also expected that
DNA labeling will become a viable technique, so that
parallel developments can occur in this area.
Relaxation studies are expected to be performed and
to provide answers to questions concerning the
conformational exchange processes, which are so
important for the functionality of RNAs. Furthermore,
the spin relaxation depends on the orientation of its
relaxation vector with respect to the long axis of the
cylindrical shape of nucleic acid molecules. This
dependence may possibly be used as a structural
parameter to determine the relative orientation of
molecular segments (see Section 8). An exciting
development is to make use of the slight alignment
via orientation along the magnetic field which shows
up in high magnetic fields for biomolecules that have
an anisotropic diamagnetic susceptibility tensor
[229,230]. This alignment influences the magnitude
of the
1
J
NH
-and
1
J
CH
-couplings in a field-dependent
manner. When measurable, this also provides global
structural information [229,230]. Furthermore, the
first results indicate that
1
H chemical shifts, which
mainly result from magnetic anisotropy effects of
the aromatic rings, can be calculated reliably. Since
nucleic acids are essentially aromatic in nature it is to
be expected that chemical shift based structural con-
straints will contribute considerably to better, i.e.
more detailed, structures. Finally, computational
methods based on floating assignments are expected
382 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
to be introduced and further improve the NMR
structure calculations.
Acknowledgements
The authors wish to thank Drs J.H. Ippel and J.
Schleucher for helpful comments and critical reading
of the manuscript. S.W. wishes to thank his former
colleagues at the Nijmegen laboratory for the exciting
scientific environment they provided; Dr H.A. Heus
introduced him to isotope labeled RNA, and made
him more aware of the many biological functions of
RNA. Furthermore, the authors want to express their
appreciation to Professor C.W. Hilbers for his support
and advice, and the many scientific discussions con-
cerning NMR and its application to structural studies
of nucleic acids.
References
[1] R.F. Gesteland, J.F. Atkins, The RNA World, Cold Spring
Harbor Laboratory Press, 1997.
[2] H.W. Pley, K.M. Flaherty, D.B. McKay, Nature 372 (1994)
68.
[3] W.G. Scott, J.T. Finch, A. Klug, Cell 81 (1995) 991.
[4] J.H. Cate, A.R. Gooding, E. Podell, K. Zhou, B.L. Golden,
A.A. Szewczak, C.E. Kundrot, T.R. Cech, J.A. Doudna,
Science 273 (1996) 1696.
[5] J.H. Cate, A.R. Gooding, E. Podell, K. Zhou, B.L. Golden,
C.E. Kundrot, T.R. Cech, J.A. Doudna, Science 273 (1996)
1678.
[6] C. Oubridge, N. Ito, P.R. Evans, C.H. Teo, K. Nagai, Nature
372 (1994) 432.
[7] K. Vaiegard, J.B. Murray, P.G. Stockley, J. Stonehouse, L.
Liljas, Nature 371 (1994) 623.
[8] C. Kang, X. Zhang, R. Ratliff, R. Moyzis, A. Rich, Nature
356 (1992) 126.
[9] S.R. Holbrook, C. Cheong, I. Tinoco Jr., Nature 353 (1991)
579.
[10] R. Chattopadhyaya, K. Grzeskowiak, R.E. Dickerson, J. Mol.
Biol. 211 (1990) 189.
[11] C.W. Hilbers, H.A. Heus, M.J.P. Van Dongen, S.S.
Wijmenga, Nucleic Acids and Molecular Biology, Springer,
Berlin, 1994, p. 56.
[12] M.J.P. Van Dongen, S.S. Wijmenga, G.A. Van der Marel,
J.H. Van Boom, C.W. Hilbers, J. Mol. Biol. 263 (1996) 715.
[13] F.W. Smith, J. Feigon, Nature 356 (1992) 164.
[14] A. Kettani, R.A. Kumar, D.J. Patel, J. Mol. Biol. 254 (1995)
638.
[15] J. Feigon, K.M. Koshlap, F.W. Smith, Methods Enzymol. 261
(1995) 225.
[16] K. Gehring, J.L. Leroy, M. Gueron, Nature 363 (1993) 561.
[17] J.R. Williamson, Nature 382 (1996) 112.
[18] G.C. King, J.W. Harper, Z. Xi, Methods Enzymol. 261 (1995)
436.
[19] E.V. Puglisi, J.D. Puglisi, J.R. Williamson, U.L. Raj-
Bhandary, Proc. Natl. Acad. Sci. USA 91 (1994) 11467.
[20] F. Aboul-Ela, J. Karn, G. Varani, Nucleic Acids Res. 24
(1996) 3974.
[21] G. Varani, Ann. Rev. Biophys. Biomol. Struct. 24 (1995)
379.
[22] F.H. Allain, G. Varani, J. Mol. Biol. 250 (1995) 333.
[23] F.H. Allain, G. Varani, Nucleic Acids Res. 23 (1995) 341.
[24] S.R. Mirmira, I. Tinoco Jr., Biochemistry 35 (1996) 7664.
[25] Z.P. Cai, I. Tinoco Jr., Biochemistry 35 (1996) 6026.
[26] H.S. Kang, J.V. Hines, I. Tinoco Jr., J. Mol. Biol. 259 (1996)
135.
[27] K.J. Luebke, I. Tinoco Jr., Biochemistry 35 (1996) 11677.
[28] S.R. Lynch, J.G. Pelton, I. Tinoco Jr., Magn. Reson. Chem.
34 (1996) S11.
[29] S.R. Mirmira, I. Tinoco Jr., Biochemistry 35 (1996) 7675.
[30] J. Nowakowski, I. Tinoco Jr., Biochemistry 35 (1996) 2577.
[31] D. Fourmy, M.I. Recht, S.C. Blanchard, J.D. Puglisi, Science
274 (1996) 1367.
[32] J.L. Battiste, H.Y. Mao, N.S. Rao, R.Y. Tan, D.R.
Muhandiram, L.E. Kay, A.D. Frankel, J.R. Williamson,
Science 273 (1996) 1547.
[33] A.S. Brodsky, J.R. Williamson, J. Mol. Biol. 267 (1997) 624.
[34] F. Aboul-Ela, J. Karn, G. Varani, J. Mol. Biol. 253 (1995)
313.
[35] X.Y. Chen, H.S. Kang, L.X. Shen, M. Chamorro, H.E.
Varmus, I. Tinoco Jr., J. Mol. Biol. 260 (1996) 479.
[36] G. Varani, F. Aboul-Ela, F.H. Allain, Progress in Nuclear
Magnetic Resonance Spectroscopy 29 (1996) 51.
[37] F.H.T. Allain, C.C. Gubser, P.W.A. Howe, K. Nagai, D.
Neuhaus, G. Varani, Nature 380 (1996) 646.
[38] C.C. Gubser, G. Varani, Biochemistry 35 (1996) 2253.
[39] D.P. Zimmer, D.M. Crothers, Proc. Natl. Acad. Sci. USA 92
(1995) 3091.
[40] D.J. Aberhard, personal communication.
[41] D.M.J. Lilley, DNA–Protein Structural Interactions, Oxford
University Press, Oxford, 1995.
[42] C. Altona, J.A. Pikkemaat, F.J.J. Overmars, Curr. Opin.
Struct. Biol. 6 (1996) 305.
[43] J.A. Pikkemaat, H. Van den Elst, J.H. Van Boom, C. Altona,
Biochemistry 33 (1994) 14896.
[44] F.J.J. Overmars, J.A. Pikkemaat, H. Van den Elst, J.H. Van
Boom, C. Altona, J. Mol. Biol. 255 (1996) 702.
[45] R.T. Batey, J.L. Battiste, J.R. Williamson, Methods Enzymol.
261 (1995) 300.
[46] K.B. Hall, Methods Enzymol. 261 (1995) 542.
[47] J.D. Puglisi, J.R. Wyatt, Methods Enzymol. 261 (1995) 323.
[48] A. Pardi, Methods Enzymol. 261 (1995) 350.
[49] J.H. Ippel, S.S. Wijmenga, R. de Jong, H.A. Heus, C.W.
Hilbers, E. de Vroom, G.A. Van der Marel, J.H. Van
Boom, Magn. Reson. Chem. 34 (1996) S156.
[50] S.S. Wijmenga, M. Kruithof, C.W. Hilbers, Analysis of
1
H
chemical shifts in DNA. Assessment of the reliability of
1
H
383S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
chemical calculations for use in structure refinement, J. Bio-
mol. NMR 10 (1997) 337.
[51] J.F. Milligan, D.R. Groebe, G.W. Witherell, O.C. Uhlenbeck,
Nucleic Acids Res. 15 (1987) 8783.
[52] J.F. Milligan, O.C. Uhlenbeck, in: J.E. Dahlberg, J. Abelson
(Eds.), Methods in Enzymology, Academic Press, New York
180 (1989) 51.
[53] R.T. Batey, N. Cloutier, H.Y. Mao, J.R. Williamson, Nucleic
Acids Res. 24 (1996) 4836.
[54] T.J. Tolbert, J.R. Williamson, J. Am. Chem. Soc. 118 (1996)
7929.
[55] R.T. Batey, M. Inada, E. Kujawinski, J.D. Puglisi, J.R. Wil-
liamson, Nucleic Acids Res. 20 (1992) 4515.
[56] S.S. Wijmenga, H.A. Heus, F.J. Van De Ven, C.W. Hilbers,
in: C.I. Stassinopoulou (Ed.), NMR of Biological Macro-
molecules, Springer Verlag, Berlin, 1994, p. 304.
[57] E.P. Nikonowicz, M. Michnicka, K. Kalurachchi, E. DeJong,
Nucleic Acids Res. 25 (1997) 1390.
[58] A. Ono, T. Makita, S.I. Tate, E. Kawashima, Y. Ishido, M.
Kainosho, Magn. Reson. Chem. 34 (1996) S40.
[59] B.L. Gaffney, P.P. Kung, C. Wang, R.A. Jones, J. Am. Chem.
Soc. 117 (1995) 12281.
[60] M.J. Michnicka, J.W. Harper, G.C. King, Biochemistry 32
(1993) 395.
[61] IUPAC–IUB Joint Commission Biochemical Nomenclature,
Eur. J. Biochem. 131 (1983) 9.
[62] S.S. Wijmenga, M.M. Mooren, C.W. Hilbers, in: G.C.K.
Roberts (Ed.), NMR of Macromolecules, A Practical
Approach, Oxford University Press, New York, 1993, p. 217.
[63] M.J. Blommers, F.J. Van De Ven, G.A. Van der Marel,
J.H. Van Boom, C.W. Hilbers, Eur. J. Biochem. 201 (1991)
33.
[64] M.R. Conte, C.J. Bauer, A.N. Lane, J. Biomol. NMR 7 (1996)
190.
[65] G. Liparo, A. Szabo, J. Am. Chem. Soc. 104 (1982) 4546.
[66] G. Liparo, A. Szabo, J. Am. Chem. Soc. 104 (1982) 4559.
[67] B.A. Borgias, T.L. James, J. Magn. Reson. 87 (1990) 475.
[68] U. Schmitz, T.L. James, Methods Enzymol. 261 (1995) 3.
[69] N.B. Ulyanov, T.L. James, Methods Enzymol. 261 (1995) 90.
[70] C.B. Post, R.P. Meadows, D.G. Gorenstein, J. Am. Chem.
Soc. 112 (1990) 6796.
[71] K. Kaluarachchi, R.P. Meadows, D.G. Gorenstein, Biochem-
istry 30 (1991) 8785.
[72] B.A. Luxon, D.G. Gorenstein, Methods Enzymol. 261 (1995)
45.
[73] R. Boelens, T.M.G. Koning, G.A. Van der Marel, J.H. Van
Boom, R. Kaptein, J. Magn. Reson. 82 (1989) 290.
[74] F.J.M. Van De Ven, M.J.J. Blommers, R.E. Schouten, C.W.
Hilbers, J. Magn. Reson. 94 (1991) 140.
[75] S.S. Wijmenga, H.A. Heus, B. Werten, G.A. Van der Marel,
J.H. Van Boom, C.W. Hilbers, J. Magn. Reson. B. 103 (1994)
134.
[76] C.W. Hilbers, S.S. Wijmenga, H. Hoppe, H.A. Heus, in: O.
Jardetzky, J.F. Lefevre (Eds.), Dynamics and the Problem of
Recognition in Biological Macromolecules, Plenum Press,
New York, 1996, p. 193.
[77] H.A. Heus, S.S. Wijmenga, H. Hoppe, C.W. Hilbers, The
detailed structure of tandem G.A mismatched base pair motifs
in RNA duplexes is context dependent, J. Mol. Biol. 271
(1997) 147.
[78] A.T. Brunger, XPLOR manual [computer program], 1995.
[79] F.H.T. Allain, G. Varani, J. Mol. Biol. 267 (1997) 338.
[80] F.J. Van De Ven, C.W. Hilbers, Eur. J. Biochem. 178 (1988) 1.
[81] C.W. Hilbers, M.J.J. Blommers, F.J.M. Van De Ven, J.H. Van
Boom, G.A. Van der Marel, Nucleosides and Nucleotides 10
(1991) 61.
[82] D.B. Davies, M. MacCoss, S.S. Danyluk, J. Chem. Soc.
Chem. Comm. 99 (1984) 536.
[83] C.A. Haasnoot, F.A.A.M. de Leeuw, C. Altona, Tetrahedron
36 (1980) 2783.
[84] J. Van Wijk, B.D. Huckriede, J.H. Ippel, C. Altona, Methods
Enzymol. 211 (1992) 286.
[85] Y.T. van den Hoogen, C.M. Hilgersom, D. Brozda, K. Lesiak,
P.F. Torrence, C. Altona, Eur. J. Biochem. 182 (1989) 629.
[86] J.P. van Duynhoven, J. Goudriaan, C.W. Hilbers, S.S.
Wijmenga, J. Am. Chem. Soc. 114 (1992) 10055.
[87] K. Bock, C. Pederson, Acta Chim. Scand., Ser. B 34 (1997)
354.
[88] M.M. Mooren, S.S. Wijmenga, G.A. Van der Marel, J.H. Van
Boom, C.W. Hilbers, Nucleic Acids Res. 22 (1994) 2658.
[89] M.M. Mooren, On nucleic acid structure analysis by NMR
University of Nijmegen, Netherlands, 1994.
[90] S.A. Schroeder, V. Roongta, J.M. Fu, C.R. Jones, D.G.
Gorenstein, Biochemistry 28 (1989) 8292.
[91] D.A.Pearlman,S.H. Kim, J. Biomol. Struct. Dyn. 4 (1986) 49.
[92] J.V. Hines, S.M. Landry, G. Varani, I. Tinoco Jr., J. Am.
Chem. Soc. 116 (1994) 5823.
[93] A. Bax, G.W. Vuister, S. Grzesiek, F. Delaglio, A.C. Wang,
R. Tschudin, G. Zhu, Methods Enzymol. 239 (1994) 97.
[94] M. Eberstadt, G. Gemmecker, D.F. Mierke, H. Kessler,
Angewandte Chemie – International Edition English 34
(1995) 1671.
[95] Y. Kim, J.H. Prestegard, J. Magn. Reson. 84 (1989) 9.
[96] H. Widmer, K. Wu¨thrich, J. Magn. Reson. 74 (1987) 316.
[97] M.J. Blommers, D. Nanz, O. Zerbe, J. Biomol. NMR 4 (1994)
595.
[98] J.A. Pikkemaat, C. Altona, Magn. Reson. Chem. 34 (1996)
S33.
[99] R.F. Macaya, P. Schultze, J. Feigon, J. Am. Chem. Soc. 114
(1991) 781.
[100] M. Leijon, J. Zdunek, H. Fritzsche, H. Sklenar, A. Graslund,
Eur. J. Biochem. 234 (1995) 832.
[101] R. McIntyre, R. Freeman, J. Magn. Reson. 96 (1992) 425.
[102] J.J. Titman, J. Keeler, J. Magn. Reson. 89 (1990) 640.
[103] J. Stonehouse, J. Keeler, J. Magn. Reson. Series A 112 (1995)
43.
[104] T. Szyperski, P. Guntert, G. Otting, K. Wuthrich, J. Magn.
Reson. 99 (1992) 552.
[105] L. Mueller, J. Magn. Reson. 72 (1987) 191.
[106] S. Seip, J. Balbach, H. Kessler, J. Magn. Reson. Series B 104
(1994) 172.
[107] P. Schmieder, J.H. Ippel, H. Van den Elst, G.A. Van der
Marel, J.H. Van Boom, C. Altona, H. Kessler, Nucleic
Acids Res. 20 (1992) 4747.
384 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
[108] B.N.M. van Buuren, S.S. Wijmenga, H.A. Heus, S. Mronga,
C.W. Hilbers, Determination of all backbone J-couplings in a
uniformly
13
C labeled RNA hairpin, 1997 (unpublished).
[109] H. Schwalbe, W. Samtag, J.W. Engels, W. Bermel, C.
Griesinger, J. Biomol. NMR 3 (1993) 479.
[110] S.S. Wijmenga, K. Hallenga, C.W. Hilbers, J. Magn. Reson.
84 (1989) 634.
[111] S.J. Glaser, H. Schwalbe, J.P. Marino, C. Griesinger, J. Magn.
Reson. Series B 112 (1996) 160.
[112] H. Schwalbe, J.P. Marino, G.C. King, R. Wechselberger, W.
Bermel, C. Griesinger, J. Biomol. NMR 4 (1994) 631.
[113] D.P. Zimmer, J.P. Marino, C. Griesinger, Magn. Reson.
Chem. 34 (1996) S177.
[114] M.M. Mooren, C.W. Hilbers, S.S. Wijmenga, J. Magn.
Reson. 94 (1991) 101.
[115] G. Zhu, D. Live, A. Bax, J. Am. Chem. Soc. 116 (1994)
8370.
[116] G.W. Vuister, A. Bax, J. Magn. Reson. 98 (1992) 428.
[117] L. Szilagyi, Progress in Nuclear Magnetic Resonance Spec-
troscopy 27 (1995) 325.
[118] D.S. Wishart, B.D. Sykes, Methods Enzymol. 239 (1994)
363.
[119] D.A. Case, H.J. Dyson, P.E. Wright, Methods Enzymol. 239
(1994) 392.
[120] M.P. Williamson, T. Asakura, J. Magn. Reson. Series B 101
(1993) 63.
[121] M.P. Williamson, J. Kikuchi, T. Asakura, J. Mol. Biol. 247
(1995) 541.
[122] K. O
¨
sapaya, D.A. Case, J. Am. Chem. Soc. 113 (1991) 9436.
[123] K. O
¨
sapaya, Y. Theriault, P.E. Wright, D.A. Case, J. Mol.
Biol. 244 (1994) 183.
[124] E. Oldfield, J. Biomol. NMR 5 (1995) 217.
[125] J. Kuszewski, A.M. Gronenborn, G.M. Clore, J. Magn.
Reson. Series B 107 (1995) 293.
[126] J. Kuszewski, J. Qin, A.M. Gronenborn, G.M. Clore, J. Magn.
Reson. Series B 106 (1995) 92.
[127] C. Giessner-Prettre, B. Pullman, Q. Rev. Biophys. 20 (1987) 113.
[128] C.E. Johnson, F.A. Bovey, J. Chem. Phys. 29 (1958) 1012.
[129] C.W. Haigh, R.B. Mallion, Progress in Nuclear Magnetic
Resonance Spectroscopy 13 (1980) 303.
[130] R. Ribas Prado, C. Giessner-Prettre, J. Molec. Structure 76
(1981) 81.
[131] A.D. Buckingham, Can. J. Chem. 38 (1960) 300.
[132] P. Marshall, M. Pople, Molec. Phys. 1 (1958) 199.
[133] F.J. Van De Ven, C.W. Hilbers, Nucleic Acids Res. 16 (1988)
5713.
[134] R. Ghose, J.P. Marino, K.B. Wiberg, J.H. Prestegard, J. Am.
Chem. Soc. 116 (1994) 8827.
[135] P. Legault, A. Pardi, J. Magn. Reson. Series B 103 (1993) 82.
[136] H.A. Heus, A. Pardi, J. Am. Chem. Soc. 113 (1991) 4360.
[137] A. Pardi, R. Walker, S. Rapoport, G. Wider, K. Wuthrich, J.
Am. Chem. Soc. 105 (1983) 1652.
[138] G.W. Kellog, B.I. Schweitzer, J. Biomol. NMR 3 (1993)
577.
[139] M.J.P. Van Dongen, J.F. Doreleijers, J.H. Van Boom, G.A.
Van der Marel, C.W. Hilbers, S.S. Wijmenga, 1997 (in
preparation).
[140] N.B. Leontis, M.E. Piotto, M.T. Hills, A. Malhotra, I.V.
Ouporov, J.M. Nussbaum, D.G. Gorenstein, Methods Enzy-
mol. 261 (1995) 183.
[141] C. Altona, J. Mol. Biol. 263 (1996) 568.
[142] J.A. Pikkemaat, F.J.J. Overmars, C.M. Dreef-Tromp, H. Van
den Elst, J.H. Van Boom, C. Altona, J. Mol. Biol. 262 (1996)
349.
[143] M.J.P. Van Dongen, H.A. Heus, S.S. Wijmenga, G.A. Van
der Marel, J.H. Van Boom, C.W. Hilbers, Biochemistry 35
(1996) 1733.
[144] M.J.P. Van Dongen, S.S. Wijmenga, R. Eritja, F. Azor?′n,
C.W. Hilbers, J. Biomol. NMR 8 (1996) 207.
[145] G.M. Clore, J.G. Omichinski, K. Sakaguchi, N. Zambrano, H.
Sakamoto, E. Appella, A.M. Gronenborn, Science 265 (1994)
386.
[146] G.M. Clore, J.G. Omichinski, K. Sakaguchi, N. Zambrano, H.
Sakamoto, E. Appella, A.M. Gronenborn, Science 267 (1995)
1515.
[147] G. Varani, C. Cheong, I. Tinoco Jr., Biochemistry 30 (1991)
3280.
[148] E.P. Nikonowicz, A. Pardi, J. Mol. Biol. 232 (1993) 1141.
[149] A. Pardi, E.P. Nikonowicz, J. Am. Chem. Soc. 114 (1992)
9202.
[150] J.P. Simorre, G.R. Zimmermann, A. Pardi, B.T.Farmer II, , L.
Mueller, J. Biomol. NMR 6 (1995) 427.
[151] J.P. Simorre, G.R. Zimmermann, L. Mueller, A. Pardi, J.
Biomol. NMR 7 (1996) 153.
[152] V. Sklenar, T. Dieckmann, S.E. Butcher, J. Feigon, J. Biomol.
NMR 7 (1996) 83.
[153] R. Fiala, F. Jiang, D.J. Patel, J. Am. Chem. Soc. 118 (1996)
689.
[154] J.P. Marino, J.H. Prestegard, D.M. Crothers, J. Am. Chem.
Soc. 116 (1994) 2205.
[155] P. Legault, B.T.Farmer II, , L. Mueller, A. Pardi, J. Am.
Chem. Soc. 116 (1994) 2203.
[156] E.R. Zuiderweg, J. Magn. Reson. 89 (1990) 533.
[157] S.W. Fesik, H.L. Eaton, E.T. Olejniczak, E.R. Zuiderweg,
L.P. McIntosh, F.W. Dahlquist, J. Am. Chem. Soc. 112
(1990) 886.
[158] A. Bax, G.M. Clore, A. Gronenborn, J. Magn. Reson. 88
(1990) 425.
[159] L.E. Kay, P. Keifer, T. Saarinen, J. Am. Chem. Soc. 114
(1992) 10663.
[160] J. Schleucher, M. Schwendinger, M. Sattler, P. Schmidt, O.
Schedletzky, S.J. Glaser, O.W. Sorensen, C. Griesinger, J.
Biomol. NMR 4 (1994) 301.
[161] M. Sattler, M. Schwendinger, J. Schleucher, C. Griesinger, J.
Biomol. NMR 6 (1995) 11.
[162] M. Sattler, P. Schmidt, J. Schleucher, O. Schedletzky, S.J. Glaser,
C. Griesinger, Journal of Magnetic Resonance B108 (1995) 235.
[163] S.S. Wijmenga, E. Steensma, C.P.M. van Mierlo, J. Magn.
Reson. 124 (1997) 459.
[164] A. Majumdar, H. Wang, R.C. Morshauer, E.R. Zuiderweg, J.
Biomol. NMR 3 (1993) 387.
[165] H. Wang, E.R. Zuiderweg, J. Biomol. NMR 5 (1995) 207.
[166] F.J.M. Van De Ven, M.E.P. Philippens, J. Magn. Reson. 97
(1992) 202.
385S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
[167] V. Sklenar, R.D. Peterson, M.R. Rejante, J. Feigon, J.
Biomol. NMR 3 (1993) 721.
[168] B.T.Farmer II, , L. Mueller, E.P. Nikonowicz, A. Pardi, J.
Biomol. NMR 4 (1994) 129.
[169] V. Sklenar, R.D. Peterson, M.R. Rejante, E. Wang, J. Feigon,
J. Am. Chem. Soc. 115 (1993) 12181.
[170] B.T. Farmer, L. Mueller, E.P. Nikonowicz, A. Pardi, J. Am.
Chem. Soc. 115 (1993) 11040.
[171] S. Tate, A. Ono, M. Kainosho, J. Am. Chem. Soc. 116 (1994)
5977.
[172] V. Sklenar, R.D. Peterson, M.R. Rejante, J. Feigon, J. Bio-
mol. NMR 4 (1994) 117.
[173] L.E. Kay, M. Ikura, A. Bax, J. Magn. Reson. 91 (1991) 84.
[174] L.E. Kay, M. Ikura, R. Tschudin, A. Bax, J. Magn. Reson. 89
(1990) 496.
[175] H.A. Heus, S.S. Wijmenga, F.J. Van De Ven, C.W. Hilbers, J.
Am. Chem. Soc. 116 (1994) 4983.
[176] J.P. Marino, J.H. Prestegard, D.M. Crothers, J. Am. Chem.
Soc. 116 (1994) 2205.
[177] S.S. Wijmenga, H.A. Heus, H.A. Leeuw, H. Hoppe, M. van
der Graaf, C.W. Hilbers, J. Biomol. NMR 5 (1995) 82.
[178] G. Varani, F. Aboul-Ela, F. Allain, C.C. Gubser, J. Biomol.
NMR 5 (1995) 315.
[179] R. Ramachandran, C. Sich, M. Grune, V. Soskic, L.R. Brown,
J. Biomol. NMR 7 (1996) 251.
[180] G. Otting, H. Senn, G. Wagner, K. Wuthrich, J. Magn. Reson.
70 (1986) 500.
[181] R. Kaptein, R. Boelens, V.P. Chuprina, J.A.C. Rullmann, M.
Slijper, Methods Enzymol. 261 (1995) 513.
[182] J.L. Battiste, R. Tan, A.D. Frankel, J.R. Williamson, J.
Biomol. NMR 6 (1995) 375.
[183] J.R. Williamson, J.L. Battiste, H. Mao, A.D. Frankel, Nucleic
Acids Symp. Ser. (1995) 46.
[184] J.D. Puglisi, L. Chen, A.D. Frankel, J.R. Williamson, Proc.
Natl. Acad. Sci. USA 90 (1993) 3680.
[185] P.J. Folkers, M. Nilges, R.H. Folmer, R.N. Konings, C.W.
Hilbers, J. Mol. Biol. 236 (1994) 229.
[186] J.J. Prompers, R.H. Folmer, M. Nilges, P.J. Folkers, R.N.
Konings, C.W. Hilbers, Eur. J. Biochem. 232 (1995) 506.
[187] R.H. Folmer, M. Nilges, R.N. Konings, C.W. Hilbers, EMBO
J. 14 (1995) 4132.
[188] F.J. Van de Ven, Multidimensional NMR in Liquids; Basic
Principles and Experimental Methods, VCH Publishers, New
York, 1995.
[189] M.M. Tirado, G.J. de la Torre, J. Chem. Phys. 73 (1980) 1986.
[190] S. Broersma, J. Chem. Phys. 63 (1959) 423.
[191] F. Perrin, J. Phys. Radium 5 (1934) 497.
[192] D.E. Woessner, J. Chem. Phys. 37 (1962) 647.
[193] D.E. Woessner, J. Chem. Phys. 42 (1964) 1855.
[194] D.E. Woessner, B.S. Snowden, G.H. Meyer, J. Chem. Phys.
50 (1968) 710.
[195] A.N. Lane, Methods Enzymol. 261 (1995) 413.
[196] G. Wagner, Curr. Opin. Struct. Biol. 3 (1993) 748.
[197] J. Peng, G. Wagner, J. Magn. Reson. 98 (1992) 308.
[198] J. Peng, G. Wagner, Biochemistry 32 (1992) 8571.
[199] C. Houssier, C.T.O. Konski, in: S. Krause (Ed.), Molecular
Optics, Plenum Press, New York, 1980, p. 309.
[200] S.S. Wijmenga, Electric birefringe experiments on aqueous
polyelectrolyte salt solutions. Influence of concentration and
flexibility, Thesis, University of Leiden, 1984.
[201] S.S. Wijmenga, F. van der Touw, M. Mandel, in: B. Sedlacek
(Ed.), Physical Optics of Dynamic Phenomena and Processes
in Macromolecular Systems, Walter de Gruyter and Co,
Berlin, 1985, p. 87.
[202] P.R. Bevington, Data Reduction and Error Analysis for the
Physical Sciences. McGraw-Hill, New York, 1969.
[203] N. Tjandra, P. Wingfield, S. Stahl, A. Bax, J. Biomol. NMR 8
(1996) 273.
[204] I. Phan, J. Boyd, I.D. Campbell, J. Biomol. NMR 8 (1996)
369.
[205] R.A. Komoroski, A. Allerhand, Proc. Natl. Acad. Sci. USA
69 (1972) 1804.
[206] R.A. Komoroski, A. Allerhand, Biochemistry 13 (1974) 369.
[207] M.E. Hogan, O. Jardetzky, Biochemistry 19 (1980) 3460.
[208] J. Keepers, T. James, J. Am. Chem. Soc. 104 (1982) 929.
[209] B.H. Robinson, G.P. Drobny, Methods Enzymol. 261 (1995)
451.
[210] J.R. Williamson, S.G. Boxer, Biochemistry 28 (1989) 2819.
[211] W. Eimer, J.R. Williamson, S.G. Boxer, R. Pecora, Biochem-
istry 29 (1990) 799.
[212] A.N. Lane, Carbohydr. Res. 221 (1991) 123.
[213] P.N. Borer, S.R. LaPlante, A. Kumar, N. Zanatta, A. Martin,
A. Hakkinen, G.C. Levy, Biochemistry 33 (1994) 2441.
[214] M. Slijper, R. Boelens, A.L. Davis, R.N.H. Konings, G.A.
Van der Marel, J.H. Van Boom, R. Kaptein, Biochemistry
36 (1997) 249.
[215] F. Gaudin, F. Paquet, L. Chanteloup, J.M. Beau, T.T. Nguyen,
G. Lancelot, J. Biomol. NMR 5 (1995) 49.
[216] J.L. Markley, A. Bax, Y. Arata, C.W. Hilbers, R. Kaptein,
B.D. Sykes, K. Wuthrich, IUPAC recommendations, draft,
1997.
[217] M.J.P. Van Dongen, M.M.W. Mooren, E.F.A. Willems, G.A.
Van der Marel, J.H. Van Boom, S.S. Wijmenga, C.W.
Hilbers, Nucleic Acids Res. 25 (1997) 1537.
[218] J.Santa Lucia Jr., , D.H. Turner, Biochemistry 32 (1993)
12612.
[219] E.G. Stein, L.M. Rice, A.T. Brunger, J. Magn. Reson. 124
(1997) 154.
[220] A.T. Brunger, G.M. Clore, A.M. Gronenborn, R. Saffrich, M.
Nilges, Science 261 (1993) 328.
[221] P. Schultze, J. Feigon, Nature 387 (1997) 668.
[222] A.E. Torda, R. Scheek, W.F. van Gunsteren, J. Mol. Biol. 214
(1990) 223.
[223] A.E. Torda, R.M. Brunne, T. Huber, H. Kessler, W.F. van
Gunsteren, J. Biomol. NMR 3 (1993) 55.
[224] G. Wagner, J. Biomol. NMR 3 (1993) 375.
[225] T. Dieckmann, J. Feigon, J. Biomol. NMR 9 (1997) 259.
[226] J. Nowakowski, J.L. Miller, P.A. Kollman, I. Tinoco Jr., J.
Am. Chem. Soc. 118 (1996) 12812.
[227] M. Nilges, J. Mol. Biol. 245 (1995) 645.
[228] B.O. Smith, Y. Ito, A. Raine, S. Teichmann, L. Ben-Tovin, D.
Nietlispach, R.W. Broadhurst, T. Terada, M. Kelly, H.
Oschkinat, T. Shibata, S. Yokoyama, E.D. Laue, J. Biomol.
NMR 8 (1996) 360.
386 S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387
[229] N. Tjandra, A. Bax, J. Magn. Reson. 124 (1997) 512.
[230] N. Tjandra, S. Grzesiek, A. Bax, J. Am. Chem. Soc. 118
(1996) 6264.
[231] P. Bu¨chner, W. Maurer, H. Ru¨terjans, J. Magn. Reson. 29
(1978) 45.
[232] C.D. Poulter, C.L. Livinnston, Tetrahedron Lett. 13 (1979) 755.
[233] R.L. Lipnick, J.P. Fissekis, J. Org. Chem. 44 (1979) 1627.
[234] T.M. Logan, E.T. Olejniczak, R.X. Xu, S.W. Fesik, J.
Biomol. NMR 3 (1993) 225.
[235] R.D. Bertrand, W.B. Moniz, A.N. Garroway, G.C. Chingas, J.
Am. Chem. Soc. 100 (1978) 5227.
[236] L. Mueller, R.R. Ernst, Molec. Phys. 38 (1979) 963.
[237] A.J. Shaka, C.J. Lee, A. Pines, J. Magn. Reson. 77 (1988)
274.
[238] A. Mohebbi, A.J. Shaka, J. Chem. Phys. Lett. 178 (1991) 374.
[239] L. Braunschweiler, R.R. Ernst, J. Magn. Reson. 53 (1983)
521.
[240] A. Majumdar, E.R. Zuiderweg, J. Magn. Reson. Series A 113
(1995) 19.
[241] H. Geen, R. Freeman, J. Magn. Reson. 93 (1991) 93.
[242] S. Grzesiek, A. Bax, J. Am. Chem. Soc. 114 (1992) 6291.
[243] M. Piotto, V. Saudek, V. Sklenar, J. Biomol. NMR 2 (1992)
661.
[244] S. Grzesiek, A. Bax, J. Am. Chem. Soc. 115 (1993) 12593.
[245] D. Marion, M. Ikura, R. Tschudin, A. Bax, J. Magn. Reson.
85 (1989) 393.
[246] S.L. Patt, J. Magn. Reson. 96 (1992) 94.
[247] A.J. Shaka, P. Barker, R.J. Freeman, J. Magn. Reson. 64 (1985) 547.
[248] M.H. Levitt, Progress in Nuclear Magnetic Resonance Spec-
troscopy 18 (1986) 61.
[249] V. Sklenar, J. Magn. Reson. Series A 114 (1995) 132.
[250] A.G. Palmer, J. Cavanagh, P.E. Wright, M. Rance, J. Magn.
Reson. 93 (1991) 151.
[251] B.L. Tomlinson, H.D.W. Hill, J. Chem. Phys. 59 (1973) 1775.
[252] R. Konrat, I. Burghardt, G. Bodenhausen, J. Am. Chem. Soc.
113 (1991) 9135.
[253] U. Eggenberger, P. Schmidt, S.J. Sattler, S.J. Glaser, C.
Griesinger, J. Magn. Reson. 100 (1992) 604.
[254] L. Emsley, G. Bodenhausen, J. Chem. Phys. Lett. 165 (1990)
469.
[255] J. Boyd, N.J. Soffe, J. Magn. Reson. 85 (1989) 406.
[256] A.J. Shaka, J. Keeler, T.A. Frenkiel, R.J. Freeman, J. Magn.
Reson. 52 (1983) 335.
[257] D. Marion, K. Wuthrich, Biochem. Biophys. Res. Commun.
113 (1983) 967.
[258] G. Kontaxis, J. Stonehouse, E.D. Laue, J. Keeler, J. Magn.
Reson. Series A 111 (1994) 70.
[259] D.B. Davies, P. Rajana, H. Sadikot, J. Chem. Soc., Perkin
Trans. 2 (1985) 279.
[260] M.H. Kolk, M. van der Graaf, S.S. Wijmenga, C.W.A. Pleij,
H.A. Heus, C.W. Hilbers, Science 280 (1998) 434.
387S.S. Wijmenga, B.N.M. van Buuren/Progress in Nuclear Magnetic Resonance Spectroscopy 32 (1998) 287–387