Solution structure of the pseudo-5′ splice site of a retroviral splicing suppressor

  1. JAVIER CABELLO-VILLEGAS1,
  2. KEITH E. GILES2,
  3. ANA MARIA SOTO3,
  4. PING YU1,
  5. ANNIE MOUGIN4,
  6. KAREN L. BEEMON2, and
  7. YUN-XING WANG1
  1. 1Protein-Nucleic Acid Interaction Section, Structural Biophysics Laboratory, CCR, NCI-Frederick, NIH, Frederick, Maryland 21702, USA
  2. 2Department of Biology and
  3. 3Department of Chemistry, Johns Hopkins University, Baltimore, Maryland 21218, USA
  4. 4Laboratoire de Maturation des ARN et Enzymologie Moléculaire, UMR 7567 CNRS-UHP Nancy I, Vandoeuvre-Les-Nancy 54506, France

Abstract

Control of Rous sarcoma virus RNA splicing depends in part on the interaction of U1 and U11 snRNPs with an intronic RNA element called the negative regulator of splicing (NRS). A 23mer RNA hairpin (NRS23) of the NRS directly binds U1 and U11 snRNPs. Mutations that disrupt base-pairing between the loop of NRS23 and U1 snRNA abolish its negative control of splicing. We have determined the solution structure of NRS23 using NOEs, torsion angles, and residual dipolar couplings that were extracted from multidimensional heteronuclear NMR spectra. Our structure showed that the 6-bp stem of NRS23 adopts a nearly A-form duplex conformation. The loop, which consists of 11 residues according to secondary structure probing, was in a closed conformation. U913, the first residue in the loop, was bulged out or dynamic, and loop residues G914–C923, G915–U922, and U916–A921 were base-paired. The remaining UUGU tetraloop sequence did not adopt a stable structure and appears flexible in solution. This tetraloop differs from the well-known classes of tetraloops (GNRA, CUYG, UNCG) in terms of its stability, structure, and function. Deletion of the bulged U913, which is not complementary to U1 snRNA, increased the melting temperature of the RNA hairpin. This hyperstable hairpin exhibited a significant decrease in binding to U1 snRNP. Thus, the structure of the NRS RNA, as well as its sequence, is important for interaction with U1 snRNP and for splicing suppression.

Keywords

INTRODUCTION

The spliceosome is a large and highly dynamic RNA–protein machine that is assembled in a stepwise manner (for review, see Burge et al. 1999). An early step in splicing is the recognition of the 5′ splice site, which involves binding of U1 snRNP (Mount et al. 1983; Bindereif and Green 1987; Michaud and Reed 1991) through sequence-specific RNA–RNA and protein–RNA interactions with the splice site (Du and Rosbash 2002; Lund and Kjems 2002). However, the binding of U1 snRNP to a pre-mRNA is not sufficient to define a 5′ splice site (Eperon et al. 1993; Gontarek et al. 1993; Hastings et al. 2001). According to the exon definition model, both splice sites of an internal exon must be recognized together by interactions that occur across the exon (Robberson et al. 1990). In addition to this major type of intron in which U2 snRNP binds the branch site, a minor class of introns features U11 snRNP binding to the 5′ splice site and U12 snRNP binding to the branch-point sequence (Kolossova and Padgett 1997; Burge et al. 1999). RNA structure has previously been shown to play a role in splice site selection because the placement of an exon within a hairpin loop promotes its skipping (Solnick 1985). Similarly, placement of a splice site within a helical stem can prevent its recognition (Jacquenet et al. 2001; Honig et al. 2002).

Rous sarcoma virus (RSV) is a simple avian retrovirus that generates both unspliced and two singly spliced mRNAs from a single primary transcript through alternative splicing (for review, see Rabson and Graves 1997). The maintenance of the unspliced RNA is aided by a cis-acting element, spanning RSV nucleotides 703–930, termed the negative regulator of splicing (NRS; Arrigo and Beemon 1988; McNally et al. 1991; McNally and Beemon 1992). The NRS is a bipartite element that suppresses the use of the upstream viral 5′ splice site. It has a 5′ region that can function as a splicing enhancer in heterologous contexts and has been shown to bind SR proteins and hnRNP H (McNally and McNally 1998; Fogel and McNally 2000). The 3′ region of the NRS resembles overlapping major and minor 5′ splice site sequences and is capable of binding both U1 and U11 snRNPs (Gontarek et al. 1993; Hibbert et al. 1999; McNally and McNally 1999). Mutations in the U1 snRNP-binding region result in increased viral src mRNA splicing (O’Sullivan et al. 2002). When U1 snRNP binds the NRS, it competes with the authentic viral 5′ splice site for interaction with the downstream 3′ splice sites and suppresses splicing by forming a nonproductive splicing complex (Cook and McNally 1999; Hibbert et al. 1999; O’Sullivan et al. 2002). In contrast, when U11 snRNP binds the NRS, it is hypothesized to block binding of U1 snRNP to the overlapping sequence and to allow viral splicing to occur (Hibbert et al. 1999; McNally and McNally 1999). In this way, an appropriate balance between unspliced and spliced viral RNAs is maintained. Although the viral NRS sequence is not spliced, a single U-to-A substitution at non-consensus position 913 (−2), 917 (+3), or 918 (+4) converts it into an efficient splice site (Paca et al. 2001).

To aid in determining why the NRS sequence functions as a splicing suppressor rather than as a splice site, we determined the structure of NRS23, a 23-nt RNA that interacts with U1 and U11 snRNPs. Chemical and enzymatic probing of the full-length NRS RNA secondary structure showed that NRS23 RNA formed a hairpin. The 3D structure results, obtained by NMR, indicate that NRS23 is a stem–loop structure, containing 9 bp, a nonstructured UUGU tetraloop, and a bulged or dynamic U at position -2 of the major 5′ splice-site-like sequence. Five of the nucleotides that are poised to base-pair with U1 snRNA are involved in intramolecular base-pairing in the absence of U1. Thus, the NRS23 helix must be at least partially unwound before U1 snRNA can base-pair with it. The bulged U913 appears to prevent NRS23 from forming a continuous A-form helix. We deleted U913 from NRS23 and observed a dramatic increase in the stability of the remaining RNA structure, coupled with a decrease in the affinity of U1 snRNP for the NRS23 RNA. This result suggests that helix destabilization is necessary for binding of U1 snRNP to the NRS. We conclude that intramolecular interactions within NRS23 must be destabilized before intermolecular interactions with U1 can be formed. The partial occlusion of the U1 binding site by intramolecular base-pairing may regulate RSV RNA splicing.

RESULTS

Secondary structure determination reveals NRS23 forms a hairpin structure

Nuclease probing with RNases T2 and V1 and chemical modification with lead was used to determine the secondary structure of the full-length NRS RNA (data not shown). When experiments were carried out in the presence of 20 mM Mg+2, a hairpin structure was clearly observed at the 3′ end of the NRS, containing nucleotides 907 to 929 (Fig. 1A). This hairpin had a 6-bp stem and an 11-nt loop and was termed NRS23. Centered within this hairpin, a 15-residue internal sequence resembles overlapping major and minor (U2- and U12-dependent) 5′ splice site sequences and is capable of binding either U1 or U11 snRNP (Gontarek et al. 1993; Hibbert et al. 1999; McNally and McNally 1999). This region has a 5/8 identity to the consensus sequence for a major 5′ splice site (shown in yellow in Fig. 1A; Mount et al. 1983). Further, U1 snRNA has been shown, by NRS mutation and compensatory U1 mutation experiments (Hibbert et al. 1999), to interact with residues within this loop, as shown in Figure 1B. U1 snRNA nucleotides 1–9 are complementary to the NRS loop sequence, with the exception of mismatches at nucleotides 5 and 6. The loop and the flanking stem also contain the minor 5′ splice site consensus sequence (GUAUCCUU; Kolossova and Padgett 1997; Burge et al. 1999), and U11 snRNA nucleotides 4 through 18 are complementary to sequences in the loop and flanking it (Fig. 1C). A portion of these same snRNA sequences bind to authentic 5′ splice sites (Mount et al. 1983; Kolossova and Padgett 1997; Burge et al. 1999).

NMR analysis and solution structure of NRS23

We have determined solution structures for the NRS 23 RNA, using NOEs, torsion angles, hydrogen bond restraints, the DELPHIC database (Clore and Kuszewski 2003), and residual dipolar couplings (RDCs; Fig. 2; Table 1). The NRS23 solution structures comprise a stem, consisting of G907–G912 on one side and C924–C929 on the complementary side, as predicted by secondary structure analysis (Fig. 1A). The stem is followed by a bulged U913 and three in-loop base pairs, G914–C923, G915–U922, and U916–A921, and a less defined UUGU tetraloop. The U916–A921 base pair is unstable. The average RMSD of heavy atoms of the stem (residues 907–915/922–929, excluding the bulged U913) of the 15 lowest-energy structures relative to those of the minimized average structure is ~0.61 Å (Table 1). In the average minimized structure, the bulge caused an ~13° tilt between bases 912 and 914 in an otherwise continuous A-form helix (Walsh et al. 2004). This tilt was ~27° when RDCs were not included in the refinement. Based on this 3D structure, a new secondary structure scheme, with some detected NOE connectivity, is drawn in Figure 3 (for clarity, we omitted the first 3 bp in the stem from this figure). The experimental data to support the new base-pairing in the 11-residue loop are discussed in detail later.

NRS23 contains a bulged or dynamic U913 that disrupts an otherwise continuous helix from the stem to the loop (Fig. 2). This bulge was predicted by the program Mfold (Mathews et al. 1999; Zuker 2003). Point mutations at this position affect NRS activity (Paca et al. 2001). The absence of sequential NOEs and of imino resonance for U913 indicates that this residue is either dynamic or bulged out (Fig. 2B). The NOEs, COSY cross-peaks, and 13C chemical shifts indicate that ribose puckers flanking residues 912 to 914 have some C2′ endo character. The bulged U913 position is not well defined in the final ensemble of structures. The back-calculated RDCs, based on the non-RDC refined structure, have a large RMSD compared with the experimental data, possibly because of the flexibility of the bulge. The residues around U913 stack and form part of a helix that has an ~15° bend in the helical axis around the bulge in the RDC-refined structure. The bulged U913 appears to be readily available for intermolecular base-pairing with G17 of U11 snRNA (Hall and Padgett 1994). The presence of this bulge may contribute to the instability of the following 3 bp (914–916) proximal to the tetraloop. The HNN-COSY cross-peaks (Fig. 4) and the imino resonance peaks associated with these 3 bp are weak, and the U922 imino resonance is not detected. This indicates an increased solvent exchange in this region and, presumably, destabilization of the base-pairing within the loop.

Like the bulge, the 917–920 UUGU tetraloop (Fig. 5) is more dynamic than the base-paired residues. Although it is partially structured by stacking of U917 on U916 in some of the lowest-energy structures, it is not a compact structure. There is no particular intramolecular interaction that would hinder the loop from forming intermolecular interactions. Although the interresidue NOEs suggest that U916 and U917 stack, the use of the RDC restraints prevents them from stacking in some of the convergent structures. Furthermore, U917 has a C2′ endo character, indicating that it does not stack in a regular A-form helix. Its position variability suggests a conflict between the NOEs, indicating the stacking, and the ribose C2′ endo pucker torsion angles. The two sets of conformations for U917 seen in the structures may not be present in a stable form. The preliminary measurements of 13C8–1H8 of G919 show an attenuated T and an elevated T1 compared with those in the stem region, indicating that G919 is in anisotropic slow motion compared with residues in the stem. Although a conclusive assessment of the dynamics of the tetraloop will have to come from complete relaxation measurements using site-specific labeling to remove the complication due to 13C–13C homonuclear cross-relaxation, the line widths, signal intensities, and few interresidue NOEs detected suggest this region is dynamic compared with the stem of the molecule. As a result, the tetraloop has poor RMSDs between the measured RDCs and those calculated on the basis of the structures, even when one of two alignment tensors in the two-alignment tensor approach is optimized for the latter residues.

Stability of NRS23 and U913-deleted NRS23 determined by UV melting

The bulged U913 of NRS23 disrupts the helix and is predicted to destabilize the structure of NRS23. This prediction was tested by thermal denaturation experiments, monitored by UV spectroscopy. NRS23 RNA was compared with a U913 deletion mutant of NRS23 (NRS23Δ913). Melting curves for each hairpin were measured at 260 nm in a thermoelectrically controlled spectrophotometer with a buffer containing 5 mM sodium cacodylate (pH 6.4) and 50 mM NaCl. The temperature was scanned at a heating rate of 0.5°C/min and the unfolding of each hairpin was found to be reversible under these conditions. Transition temperature (Tm) and van’t Hoff enthalpy (ΔH) values were determined from the first derivatives using procedures described earlier (Laing and Draper 1994). Tm and ΔH values were also obtained by fitting the melting curves to a two-state model as described earlier (Marky and Breslauer 1987). The error in the Tm determination is <0.5 °C from the experimental variation that we measured in triplicate melts.

The Tm for NRS23 was determined to be 59.2°C (Fig. 6). Deletion of residue U913 from NRS23 increased the melting temperature ~14 ± 0.5°C and the ΔH ~12 kcal/mole to 69 kcal/mole (Fig. 6). The estimated difference in ΔG between the wild-type NRS sequence and the U913-deleted NRS23 is +3.8 kcal/mole. Thus, the melting data confirmed the prediction that the bulged U913 residue destabilizes the stem–loop structure of the wild-type NRS23. Its deletion generates a more stable stem–loop structure without changing any of the existing base pairs in NRS23. A structure determination of the U913-deleted NRS23 RNA is currently under way.

Interaction of NRS RNA with U1 snRNA

Residues 913–920 of the NRS bear a 5/8 identity to a consensus 5′ splice site (Fig. 1A; Hibbert et al. 1999; McNally and McNally 1999). The three nonconsensus residues are the U’s at residues 913, 917, and 918 (positions -2, +3, and +4, respectively). Each of these U’s is poised to interact with a U or Ψ in U1 snRNA (Fig. 1B). It has previously been shown that these interactions are critical for NRS-mediated splicing suppression. Mutation of any of these U’s to A abolishes splicing suppression and converts the NRS into a functional 5′ splice site (Paca et al. 2001). Conversely, mutating any of these U’s to a C abolishes splicing suppression but does not convert the NRS into a functional 5′ splice site (Paca et al. 2001). We have titrated the NRS23 RNA with the 5′ 10-residue sequence AUAC Ψ Ψ ACCU (U1–5′) of U1 snRNA and monitored the interaction between the two RNAs using HSQC spectra. Our preliminary results support the notion that U1–5′ RNA interacts with the loop of NRS23, whereas the base pairs in the stem remain largely unperturbed (data not shown).

The three U’s exist in interesting locations in the NRS23 3D structure. U913 is bulged from the helix and U917 and U918 are part of the tetraloop (Figs. 2, 5). Because the 3 bp within the loop must be unwound before U1 can base-pair, the presence of the bulged residue may promote U1 binding by partially destabilizing the otherwise continuous helix. To address this issue, we tested U1 snRNP binding to four NRS RNA fragments spanning nucleotides 902–939. The wild-type NRS sequence and Δ913 were compared with a mutant that splices well (UU917/918AA) and another mutant that neither splices nor suppresses splicing (UU917/918CC). The RNAs were covalently linked to agarose beads via their 3′ ends and used in an affinity selection assay in HeLa cell nuclear extract, carried out in the absence of ATP and Mg+2 (Caputi et al. 1999). The affinity-selected RNA was harvested and Northern analysis performed to measure U1 snRNP binding (Fig. 7A). As a control, an aliquot of each affinity-selected sample was analyzed by SDS-PAGE and Coomassie staining to demonstrate that the nonspecific binding of proteins was equivalent between the samples (data not shown).

Affinities for U1 snRNP binding were calculated by quantitating the relative amount of 32P-labeled U1 antisense riboprobe binding to the selected RNAs in each case. The background binding to the beads was subtracted and the average values normalized to that of the wild-type NRS in each trial. The results of this normalization are shown in Figure 7B. The Δ913 mutant, which stabilized the helix (Fig. 6), reduced binding of U1 snRNP to 33% of the wild-type level (Fig. 7B). The binding to this single-base deletion mutant was similar to that of the nonfunctional UU917/918CC mutant (30%). In contrast, the UU917/918AA mutant, which is capable of splicing, bound slightly more U1 snRNP (136%) than the wild-type NRS. This experiment shows that mutations affecting the structure of the RNA, as well as its sequence complementarity to U1 snRNA, can affect binding of U1 snRNP.

DISCUSSION

We have analyzed the solution structure of a 23-nt RNA hairpin, NRS23, that interacts with U1 and U11 snRNPs to control alternative splicing and generate the proper balance of spliced and unspliced retroviral RNAs. Secondary structure analysis showed that NRS23 contained a 6-bp stem and an 11-residue loop. Disruption of the stem by mutagenesis caused a loss of splicing regulation in vivo; this was restored by compensatory mutations that restored the stem structure (O’Sullivan 2001). Residues that base-pair with U1 snRNA were contained within the loop. Determination of the 3D structure of NRS23 by NMR spectroscopy revealed a 6-bp stem, a bulged U, 3 bp across the loop, and a dynamic UUGU tetraloop. The structure of NRS23 provides insight into the regulation of alternative splicing in this system.

The stability of the NRS23 helix is likely to be central to RSV regulation of alternative splicing. The 31P spectrum of the NRS23 indicates no U-turn type sharp twist for the α or ξ torsion angles. The measured backbone torsion angles do not deviate greatly from an A-form type helix, with the exception of the ribose pucker for the bulge, its flanking residues, and the tetraloop. The loop residues appear to exist in a closed conformation in solution, and this loop must be unwound before U1 snRNA can base-pair with it. The bulged U913 serves to destabilize the helix and likely allows this unwinding to be more energetically favorable. Our results show that NRS23Δ913 is more stable than NRS23 (Fig. 6), suggesting that deletion of the bulged U at position 913 results in the formation of better stacking interactions in the helical stem of the hairpin. This is consistent with the larger ΔH values of NRS23Δ913, although the large errors associated with van’t Hoff enthalpies make it difficult to further analyze the enthalpic effects of this deletion. The formation of a helix with better stacking interactions results in a decrease in the conformational freedom of NRS23Δ913, consistent with the observed increase in ΔS of unfolding. The ΔG37 penalty of 3.4 kcal/mole for the incorporation of a single bulge in the stem of NRS23 is in excellent agreement with the average bulge destabilization of 3.6 kcal/mole reported earlier (Znosko et al. 2002).

The hyperstable Δ913 mutant of NRS23 had a decreased ability to bind U1 snRNP (Fig. 7). This is interesting in light of the fact that the wild-type NRS and the Δ913 mutant share the same Watson-Crick base pairs with U1 snRNA. The regulatory problem facing RSV is to maintain pools of both spliced and unspliced RNA. Therefore, the NRS must not suppress splicing of every full-length RNA. The presence of a moderately destabilized helix appears to allow for an optimal affinity of U1 snRNP for the loop. The mutant NRS sequence that splices has U/A substitutions within the tetraloop, suggesting that the structure determined for NRS23 may closely resemble that of a 5′ splice site. The U913 bulge may also facilitate the interaction between the NRS and U11 snRNP (Gontarek et al. 1993).

Like the bulge, the UUGU tetraloop has more dynamic characteristics than the base-paired residues (Figs. 2, 5). This less structured tetraloop is in a sharp contrast to the well-known GNRA, CUYG, and UNCG classes of tetraloops, which serve as ultrastable structural elements and nucleation sites for RNA folding (Jucker et al. 1996; Molinaro and Tinoco 1995). The UUGU tetraloop in the NRS, on the other hand, is poised to interact with the U1 and U11 snRNPs as required for its regulatory function. It is worth noting that a UUGU tetraloop is also found in hairpins near the 5′ terminus of the spleen necrosis virus RNA (Roberts and Boris-Lawrie 2003). This tetraloop, together with U-rich bulges, is speculated to be a binding site for host factors that regulate the translation of the viral RNA. In conclusion, it appears that UUGU tetraloop is distinguished from the well-known class of tetraloops in terms of structure, stability, and function, being a dynamic structure that interacts with other RNAs.

MATERIALS AND METHODS RNA

samples

DNAse I, pyruvate kinase, myokinase, guanylate kinase, P1 nuclease, and unlabeled 5′ nucleoside triphosphates (NTPs) were purchased from Sigma Chemical. T7 RNA polymerase was prepared as described (Davanloo et al. 1984). In some instances, 13C/15N-enriched NTPs were purchased from Spectra Isotopes (CIL) and MarTech. The 99%{13C} glucose and 98%{15N}-labeled ammonium sulfate were purchased from CIL. RNAse-free pf1 phage was purchased from Asla (Riga).

Unlabeled RNA was purchased from Dharmacon or synthesized by in vitro transcription (Batey et al. 1992; Niknonowicz et al. 1992). 13C/15N or 15N isotopic enrichment RNAs were synthesized by in vitro transcription using labeled NTPs that were either purchased or prepared from ribosomal RNA as described (Batey et al. 1992; Niknonowicz et al. 1992). Transcribed RNA molecules were purified using preparative gel electrophoresis in 10% (w/v) poly-acrylamide/7M urea followed by electroelution on a Schleicher and Schuell elution chamber. The RNA was then precipitated and spin filtered once with 2 mL 1M NaCl and 50 mM NaPi (pH 6.5) and three times with 2 mL of the NMR buffer of 25 mM NaCl, 10 mM NaPi (pH 6.5), and 0.05 mM EDTA. The identities of the samples were confirmed by molecular weight through MALDI-TOF mass spectrometry. The samples were taken to a volume of 350 μL, lyophilized, and resuspended in 99.96% D2O or 95% H2O–10% D2O for experiments detecting nonexchangeable or exchangeable protons, respectively. The samples consisted of 100–200 ODs (~1.3–2.6 mM) of unlabeled material, 130 ODs (~1.7 mM) of 15N-labeled material, and 115–130 A260 ODs (~1.5–1.7 mM) of 13C/15N-labeled RNA. The pf1 phage was taken into the NMR buffer by spin filtering on 50-kDa cutoff Vivaspin concentrators and mixed with RNA in the same buffer to a final phage concentration of ~21 mg/mL. The unlabeled U1 snRNA fragment of sequence AUAΨΨACCU was purchased from Dharmacon and resuspended in deionized H2O to a concentration of 4.24 mM. An NRS23 sample in the NMR buffer at a concentration of ~0.35 mM was titrated with this molecule, to a molar ratio of 0.5 in the NMR tube.

RNAs used for affinity selection were in vitro transcribed using two DNA oligonucleotides as the template. The sense strand of the T7 promoter region, 5′-taatacgactcactataggg-3′, was hybridized with one of four oligonucleotides, each of which contain the an-tisense strand of the T7 promoter, followed by the nucleotides 902–939 of the NRS. The following list of oligonucleotides has the T7 antisense strand underlined:

wild-type NRS, 5′-acccccgccagggaaggatacaaaccactccccacataccctata gtgagtcgtatta-3′; Δ 913 mutant, 5′- acccccgccagggaaggatacaaaccctccccacataccctatagt gagtcgtatta-3′; AA mutant, 5′-acccccgccagggaaggatacttaccactccccacataccctatagtga gtcgtatta-3′; CC mutant, 5′-acccccgccagggaaggatacggaccactccccacataccctatagtga gtcgtatta-3′.

For the affinity selection, total RNA from the in vitro transcription reaction was loaded onto an 8-M urea, 20% polyacrylamide gel and gel purified via UV shadowing. The gel slices were eluted overnight at 4°C in RNA elution buffer: 0.5 M ammonium acetate, 0.1% SDS, 1 mM EDTA.

Secondary structure determination

Nuclease probing with RNases T2 and V1 was as described by Mougin et al. (2002), except that it was performed on 3′-end labeled RNA. Lead cleavages were performed in 50 mM HEPES-KOH buffer (pH 7.5), 50 mM K acetate, and 20 mM Mg acetate, in the presence of 4 or 20 mM Pb(OAc)2 for 5 min at 20°C. Reactions were stopped by addition of EDTA (250 mM final). Direct analysis of the fragments produced was made after migration on a 15% acrylamide-urea gel.

Affinity selection assay

Affinity selection was carried out essentially as described in Caputi et al. (1999) with the following modifications. One nanomole of target RNA was covalently linked to the beads. After the affinity selection protocol was complete, the beads were resuspended in proteinase K digestion buffer (10 mM Tris at pH 7.2, 0.5 mM EDTA, 0.5% SDS) and 50 μg of proteinase K was added for 30 min at 37°C. Next, the beads were boiled and the RNA was extracted with phenol:chloroform:isoamyl alcohol and precipitated with ethanol. Northern analysis was carried out as previously described (Gontarek et al. 1993).

UV melting

The thermal denaturation of NRS23 and NRS23ΔU913 was followed by UV spectroscopy. Absorbance versus temperature profiles (melting curves) for each hairpin were measured at 260 nm with a thermoelectrically controlled Cary 400 Bio spectrophotometer (Varian Inc.). All experiments were conducted in 5 mM sodium cacodylate (pH 6.4) and 50 mM NaCl. The temperature was scanned at a heating rate of 0.5°C/min, and the unfolding of each hairpin was found to be reversible under these conditions. Tm and Δ H values were determined from the first derivatives using procedures described earlier (Laing and Draper 1994). Tm and ΔH values were also obtained by fitting the melting curves to a two-state model as described earlier (Marky and Breslauer 1987). The error in the Tm determination is <0.5°C from the experimental variation that we measured in triplicate melts. However, it is worth noticing that systematic deviations in the fits, because of the presence of small deviations from a two-state behavior (especially evident at low temperatures) may increase the error in the determination of the TM. Nevertheless, from the values obtained from the two different fittings, the errors in the Tms do not exceed 0.5°C. Free energies at 37°C (ΔG37) were calculated according to Δ G37 = ΔH (1 - 310.15 /Tm). Entropy values (ΔS) were calculated from the Gibbs equation ΔG = ΔH - TΔS.

NMR spectroscopy, chemical shift assignments, and structural constraints

Homonuclear 15N or 15N/13C heteronuclear NMR spectra were recorded in Varian INOVA spectrometers, operating at 500-, 600-, and 800-MHz proton frequencies and equipped with triple resonance probe heads and z-axis pulse field gradients. The 500-MHz spectrometer is equipped with a Varian cryoprobe. Spectra involving 13C and 31P channels were recorded on a Bruker DRX 500 spectrometer equipped with a triple resonance HCP probe and z-axis pulse field gradients. Spectra involving H2O exchangeable resonances were collected at 15°C and those involving nonexchangeable protons at 25°C. The H2O signal was suppressed using Z gradient pulses and WATERGATE or gradient double-echo sequences. In the case of imino proton HMQC spectra, it was suppressed using 1331 read pulses optimized for a maximum signal at 12.5 ppm. 1H resonances were referenced to DSS and 13C and 15N were referenced to H2O internally; 31P was referenced to 31P signal of PO4−3. Broadband decoupling during acquisition was done using Wurst80 with γB2 = 12.79 ppm*γCB0 for 13C and GARP with γB2 = 21.31 ppm*γNB0 for 15N. The States-TPPI method was used for quadrature detection. Detection in indirectly detected dimensions was normally delayed by a half dwell time.

Spectra were processed in SGI workstations with the nmrPipe software package (Delaglio et al. 1995) and analyzed with nmr-Draw and nmrView (Johnson and Blevins 1994). The spectra were typically linear, predicted to double the size of the data in the indirectly detected dimensions and apodized with a sinebell function and zero filled to twice the number of points. The spectra used for RDC determination were zero-filled in the dimensions used to measure splittings to a digital resolution of 1.56–3.3 Hz/ point for 1H and 2 Hz/point for 15N; dimensions in these spectra were apodized using a gaussian function with 30-Hz line broadening with the exception of the 15N dimension, which was apodized with 20-Hz line broadening.

The assignment procedure was based on standard protocols (Nikonowicz and Pardi 1993; Marino et al. 1997; Lukavsky and Puglisi 2001). All of the nonexchangeable base 1H and 13C resonances were assigned. A sequential walk along the RNA was done on the basis of the intra- and interresidue H1′-H6/H8 NOEs, using 3D 13C-edited NOESY-HSQC experiments (Nikonowicz and Pardi 1993) together with 2D-NOESY spectra, all with 300-msec mixing times. The exchangeable resonance assignments were also obtained through an NOE sequential walk performed on a 2D NOESY (200-msec mixing time) and by their 15N chemical shifts in 2D 15N/1H HMQC (Mueller 1979) and HSQC (Kay et al. 1992) spectra. 31P resonances were assigned using a 3D single-quantum coherence HCP experiment (Marino et al. 1995). Watson-Crick base pairs were identified through the downfield shift of the NH and NH2 resonances, and by G-imino to C-amino and U-imino to A H2 1H-1H cross-peaks in 2D NOESY spectra. The base pairs were confirmed through the use of 2hJNN HNN-COSY experiments optimized for the G H1-C N3 and U N3-A H2 correlations (Dingley and Grzesiek 1998; Hennig and Williamson 2000).

NOE distance constraints were derived from 2D NOESY (200-and 300-msec mixing time) and 3D 13C-edited NOESY-HSQC (300-msec mixing time) spectra (Nikonowicz and Pardi 1993). For the exchangeable resonances, NOE cross-peaks were identified in a 200-msec NOESY. NOE cross-peak intensities were classified as very strong, strong, medium, weak, and very weak using the intensities of base H5–H6 (2.4 Å) and of ribose H1′–H2′ (2.8–3 Å) as references and allotted semiquantitative distance constraints of 3, 4, 5, 6, and 7 Å, respectively (Varani et al. 1996). Cross-peaks observed only in the 300-msec mixing time 2D NOESY were classified as extremely weak. No lower bounds for the NOE constraints were used. The distance constraints for the exchangeable protons did not include the very strong category, except for the proton pairs H2-imino and imino-amino of A-U and G-C base pairs, respectively. No intraresidue ribose-to-ribose NOE constraints were used. The hydrogen bond distance restraints for the base pairs were set as 2.9 ± 0.3 Å between donor and acceptor atoms and 2.02 ± 0.2 Å between acceptor and hydrogen atoms. The hydrogen bond constraints were introduced together with the NOE distance constraints.

Torsion angle constraints were estimated using Karplus equations from 3J coupling constant measurements. These were obtained from spin echo difference 13C CT-HMQC (Hu et al. 1999; Szypersky et al. 1999) and 13C CT-HSQC (Legault et al. 1995) in 13C-labeled samples as well as 1H-1H DQF-COSY (Rinkel and Altona 1987) and 31P-1H HETCOR in unlabeled samples. The backbone torsion angles of loop residues 917–920 were left unconstrained because of evidence of increased dynamics in that region with the exception of the ribose pucker of 917–918. More details are presented in Supplementary Material.

The RDCs were taken as the difference between the corresponding measurements of anisotropic and isotropic media conditions. Riboses 1H-13C couplings were measured from peak splittings in the directly detected 1H dimension on the ribose 1H and 13C spectral region on a 3D HCCH-COSY-relay IPAP (Vallurapalli and Moore 2002). Base 1H-13C couplings were measured from 2D 13C-1H TROSY upfield and downfield 1H cross-peaks with the downfield 13C component (Weigelt 1998). 13C-13C one-bond RDCs were measured from 1H-13C HSQC experiments as described by McCallum and Pardi (2003), except that all measurements were done in the 13C dimension with the appropriate selective 13C decoupling. Imino couplings were measured in the 15N dimension on a 2D 15N-1H HSQC (Hansen et al. 2000). The initial axial component of the alignment tensor, Da, was estimated from the largest absolute value of the measured set of RDCs (Fig. 8A). After determining the initial rhombicity from the histogram of normalized RDCs (Clore et al. 1998), R and Da were optimized by a grid search using the Xplor refinement protocol described following in the absence of the DELPHIC force field and excluding the RDC measurements for residues 917–920 in the loop. Da was determined as -14.5 Hz (ribose CH) and R as 0.4. The deuterium quadrupole splitting was used to correct Da for different pf1 phage concentrations. Da was set for the NH, the aromatic CH, the ribose CC, and the pyrimidines C4C5 and C5C6 as (γPγQrCH3)/(γCγHrPQ3) being rCH the interatomic distance of the CH in the ribose (McCallum and Pardi 2003). The force constants for the RDC constraints were scaled relative to that for NH as (γNγHrPQ3) / (γPγQrNH3). The RDC constraints were given a range with no energy penalty of ±0.6 Hz.

Structure calculations and refinement

The initial NRS structure was calculated using a reported protocol (Cabello-Villegas et al. 2002). The constraints included 423 distance restraints, among which 377 are NOE constraints, 46 hydrogen bond constraints, and 63 torsion angle restraints. The 10 lowest-energy structures have no NOE and torsion angle violations >0.5 Å and 5°, respectively. The minimized average of the convergent structures was used as an initial structure for a refinement with RDCs (Schwieters et al. 2003). The final ensemble of structures was calculated using a single alignment tensor and RDCs of the stem region. The DELPHIC potential for relative positions of close bases and for torsion angles in nucleic acids was turned on for the residues, excluding the budged U913 and the loop residues (917–920) in the last refinement procedure using the same protocol (Clore and Kuszewski 2003; Schwieters et al. 2003). We also performed a refinement with dual alignment tensors. In this case, the calculation used two alignment tensors simultaneously, one for the upper loop residues 916 to 921 and the other one for the rest of the molecule. The values for Da and R for the second tensor were determined by a grid search to be -12.5 Hz and 0.4, and the values for the latter were those used in the single-tensor calculations. Compared with the calculation with the single alignment tensor, the calculation with two-alignment tensors yields number of NOE and torsion angle violations in the upper loop region. This is because this region may be dynamic and nonstructured, and the interpretation of the RDCs is problematic. Therefore, the final ensemble of structures was calculated without RDCs from the upper loop region. The Pearson’s correlation (Bax et al. 2001) between the normalized calculated and experimental values for the stem structure is 0.83 (Fig. 8B).

Chemical shift data for NRS23 RNA have been deposited in BioMagResBank (http://www.bmrb.wisc.edu), entry code 1534. Coordinates for the NRS23 structures calculated with RDCs and DELPHIC database have been deposited in Protein Data Bank (http://www.rcsb.org), access code xxx.

TABLE 1.

Restraints and statistics of NRS23 NMR structure


FIGURE 1.

RSV NRS23 RNA is complementary to both U1 and U11 snRNAs. (A) The NRS23 secondary structure determined by nuclease probing and chemical modification. The pseudo 5′ splice site is indicated by a large arrow. The sequence highlighted in yellow is identical to the 5′ splice site consensus. Results of the chemical and enzymatic probing are indicated as follows: RNase V1 or T2 cleavages are indicated by small arrows linked to squares or circles, respectively, and lead cleavages by triangles. Colors indicate cleavage yields (green and orange for weak and medium). (B) NRS23 is shown interacting with complementary sequences from U1 snRNA (residues 1 to 10). (C) NRS23 can also base-pair with U11 snRNA (residues 4 to 18).


FIGURE 2.

Solution structures of NRS23. The solid model drawing of the NRS23 RNA structure (A) and the stereo plot of the 15 lowest-energy structures calculated with RDCs (B). The structures are aligned with the stem residues. The structures refined with RDCs are more extended than those refined without RDCs. The tilt of the bases flanking U913 is ~13°. Color code in A: purple, stem region before the U913 bulge; blue, 3 bp in loop; light red, the UUGU tetraloop. For clarity, the bulged U913 is drawn as a space-filling model.


FIGURE 3.

Schematic representation of NOE data collected on NRS23 (only residues 910–926 are shown). Thin lines represent interresidue NOEs. The ribose pucker is represented by dark filling for C2′ endo, gray for mixed 2C2′/C3′ endo, and open for C3′ endo. An extremely weak NOE is represented by a broken line.


FIGURE 4.

HNN-COSY spectrum of NRS23 showing the A H2 to U N3 region (A). 15N-1H HMQC optimized for long-range correlations showing the H5/N1 region of U (B). The horizontal line connects the interresidue correlation between A921 H2 and U916 N3 (circled) and the intrabase correlation between U916 H5 and N3. The H8 to N9 and G N3 correlations are seen in the HNN-COSY; some N9-H1′ and N1-H5/H1′ correlations are seen in the HMQC.


FIGURE 5.

The loop residues 914–923 from the minimized average structure of NRS23. This structure was calculated from the 15 lowest-energy NRS23 structures. Note that the top of the loop is closed by 3 bp. The UUGU tetraloop adopts no particular structure and presumably is ready for intermolecular interaction.


FIGURE 6.

Melting profiles show that NRS23 is less stable than NRS23ΔU913. Melting curves (A) and their first derivatives (B) of NRS23 and the NRS23ΔU913 mutant, respectively. Dark blue and red, experimental; pink and light blue, fit curves. The thermodynamic data are listed in the embedded table. The melting temperature has an error of ±0.5°C.


FIGURE 7.

Binding of U1 snRNP to the NRS requires destabilizing the double-stranded helix. (A) Northern analysis of U1 snRNA affinity selected from HeLa nuclear extract, with wild-type (WT) NRS RNA, the UU917/918AA mutant that is capable of splicing (AA), the U913 deletion mutant, and the nonfunctional UU917/918CC mutant (CC). (B) Relative U1 snRNP affinities of the wild-type NRS and the mutants. The background binding to the beads was subtracted, and each trial was normalized to the WT levels. The average of three experiments is presented.


FIGURE 8.

Histogram of the RDC distribution (A) and calculated versus experimental RDCs for NRS23 (Pearson’s plot, B). The one-bond C-C and C-H RDCs are normalized to the NH couplings. Pearson’s plot of experimental RDC vs. the values calculated after best fitting the alignment tensor to the lowest-energy structure refined without RDC constraints. The Pearson’s coefficient is ~0.83. The RDCs of the tetraloop residues were excluded because of their dynamic behavior.


Acknowledgments

We acknowledge Dr. G. Marius Clore for discussion about the DELPHIC database, Dennis A. Torchia for use of a Bruker 500-MHz spectrometer equipped with an HCP triple resonance probe, David E. Draper for use of a Cary 400 Bio spectrophotometer, R. Andrew Byrd and Christiane Branlant for encouragement, and Jason LeBlanc for review of the manuscript. This work was supported in part by National Institutes of Health (NIH) grant RO1CA48745 to K.L.B. K.E.G. was supported by NIH predoctoral training grant T32GM07231.

The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 USC section 1734 solely to indicate this fact.

Footnotes

REFERENCES