- Research article
- Open Access
Recognition of essential purines by the U1A protein
BMC Biochemistry volume 8, Article number: 22 (2007)
The RNA recognition motif (RRM) is one of the largest families of RNA binding domains. The RRM is modulated so that individual proteins containing RRMs can specifically recognize RNA targets with diverse sequences and structures. Understanding the principles governing this specificity will be important for the rational modification and design of RRM-RNA complexes.
In this paper we have investigated the origins of specificity of the N terminal RRM of the U1A protein for stem loop 2 (SL2) of U1 snRNA by substituting modified bases for essential purines in SL2 RNA. In one series of modified bases, hydrogen bond donors and acceptors were replaced by aliphatic groups to probe the importance of these functional groups to binding. In a second series of modified bases, hydrogen bond donors and acceptors were incorrectly placed on the purine bases to analyze the origins of discrimination between cognate and non-cognate RNA. The results of these experiments show that three different approaches are used by the U1A protein to gain specificity for purines. Specificity for the first base in the loop, A1, is based primarily on discrimination against RNA containing the incorrect base, specificity for the fourth base in the loop, G4, is based largely on recognition of the donors and acceptors of G4, while specificity for the sixth base in the loop, A6, results from a combination of direct recognition of the base and discrimination against incorrectly placed functional groups.
These investigations identify different roles that hydrogen bond donors and acceptors on bases in both cognate and non-cognate RNA play in the specific recognition of RNA by the U1A protein. Taken together with investigations of other RNA-RRM complexes, the results contribute to a general understanding of the origins of RNA-RRM specificity and highlight, in particular, the contribution of steric and electrostatic repulsion to binding specificity.
The RRM is one of the most common RNA-binding domains [1–3] and is found in proteins that participate in all steps of gene expression and RNA processing [4, 5]. The RRM is approximately 100 amino acids and forms a general single-stranded RNA binding scaffold comprised of a four-stranded anti-parallel β-sheet flanked by two α-helices . RRMs bind RNAs of different sequences and in many different structural contexts. In general, RRMs make limited contacts with the sugar-phosphate backbone compared to other RNA-binding proteins and large cooperative networks of hydrogen bonds are formed with the nucleobases. Although individual structures of RRM-RNA complexes have been solved [7, 8], it remains unclear how this domain forms a general RNA binding scaffold, while individual proteins containing RRMs achieve high specificity for particular RNA sequences.
Extensive biophysical and biochemical investigations have made U1A a paradigm of RRM-RNA recognition [9–22]. U1A is a component of the U1 snRNP and also regulates polyadenylation of U1A pre-mRNA [23–25]. The N-terminal RRM of U1A binds with high affinity to stem loop 2 (SL2) in U1 snRNA and an internal loop target site of nearly identical sequence in the U1A pre-mRNA [9, 26]. The structure of the N-terminal RRM of U1A bound to SL2 RNA is shown in Figure 1. U1A contacts the AUUGCAC sequence of the loop and the closing CG base pair. Most hydrogen bond donors and acceptors of these bases are involved in hydrogen bonds with protein main chain or side chain functional groups in loop 1, loop 3, β1, β4, and loop 6 of U1A. In addition, conserved aromatic amino acid side chains in β1 and β3 stack with RNA bases C5 and A6. Cooperative networks of interactions important for complex formation involving amino acids in loops 1, 3, and 6 and β1, β3, and β4 have been identified experimentally and have been suggested computationally [13, 15, 18, 22, 27–31].
We are using the U1A-RNA complex as a model system to understand how RRMs are modulated for specific recognition of RNA target sites [15, 18, 19, 21, 22]. The U1A protein is highly specific for the AUUGCAC loop sequence. The AUUGCAC sequence was selected for binding to U1A with 87–100% conservation at each site in a variety of loop sizes or in a linear sequence . The U1A protein is particularly sensitive to base substitutions for the purines in the RNA target site. Mutation of A1 to G resulted in a large destabilization of the complex of 5.0 kcal/mol (Table 1) [9, 28]. Mutation of the purines G4 or A6 to A or G, respectively, resulted in a 50,000–100,000-fold increase in the KD of the complex and a 6.4–6.8 kcal/mol destabilization of the complex (Tables 2 and 3) [18, 28]. Mutations of pyrimidines in the target sequence were less destabilizing, 1.6–4 kcal/mol .
We have focused on investigating the specificity of U1A for the purines A1, G4, and A6 in the target SL2 RNA, because U1A is finely tuned to recognize the correct purine bases at these positions. The data we report here suggest that for A1, discrimination against non-cognate RNA is a significant contributor to specificity in the absence of substantial direct contacts between U1A and the base. In contrast, hydrogen bond donors and acceptors on G4 are essential contributors to binding and cannot be substituted with aliphatic groups. Recognition of A6 involves both positive contributions to binding by A6 hydrogen bond donors and acceptors and the destabilization of complexes with incorrectly placed functional groups. Thus, the relative contributions to specificity of hydrogen bond donors and acceptors on cognate and non-cognate RNA bases vary for recognition of A1, G4, and A6 in SL2 RNA by the U1A protein.
Strategy for Probing Specificity
To investigate the specificity requirements of the U1A protein, we have measured the affinity of the U1A protein for SL2 RNA target sites containing modified purine bases. In one series of modifications, individual hydrogen bond donors or acceptors were eliminated or substituted with aliphatic groups to probe the energetic contributions of these functional groups to binding. In a second series of modifications, the purines were substituted with alternative hydrogen bond donors or acceptors to investigate the ability of the U1A protein to discriminate against incorrectly placed functional groups. It should be noted that these experiments do not identify the molecular origins of changes in binding affinity observed when altering a hydrogen bond donor or acceptor in the complex. The effects of these base modifications on binding are likely to be complex because they may alter the complex interface, change the structure and dynamics of both the free RNA and the complex, alter cooperative networks of interactions involved in binding, or change solvation effects. The experiments reported here probe the importance of selected functional groups to the specificity of binding, and this importance may arise by altering any or all of these contributions to binding affinity.
Recognition of A1
A1 stacks between U2 and the CG base pair that closes the loop in the U1A-SL2 RNA complex (Figure 2A, 2B) . A1 forms only one hydrogen bond with the U1A protein, which is between N1 and the side chain of Arg52. The side chain of Arg52 also forms hydrogen bonds with the G that forms the closing GC base pair of SL2 RNA. Substitution of Arg52 with Gln abolishes binding [11, 32]. The KD's of complexes formed with SL2 RNAs containing modifications at the A1 position are reported in Table 1. Representative gel mobility shift analyses and binding curves are shown in Figure 3. Despite the small number of hydrogen bonds between A1 and the U1A protein, the substitution of G for A1 resulted in a 5 kcal/mol destabilization of the complex. This is similar to the destabilization of the complex observed previously for the substitution of A1 with C [9, 28]. The substitution of G for A1 may introduce a base pair between G and C10 on the 3'-end of the loop, thus reducing the loop to 8 nucleotides. This additional base pair would be expected to destabilize the complex because complex formation would require dissociation of the additional base pair. However, the double mutant A1G C10A SL2 RNA, which cannot form an analogous additional base pair, bound with similar affinity (KD = 1.3 ( ± 0.6) × 10-6 M) as A1G SL2 RNA to U1A. The C10A substitution alone had little effect on binding affinity (KD = 6 ( ± 1) × 10-10 M). Thus, these experiments suggest that specific recognition of A1 does not require an extensive hydrogen bond network with the base.
To probe the role of base functional groups without introducing new functional groups A1 was substituted with purine, in which the 6-NH2 group is replaced with hydrogen, and with c1A in which N1 is replaced with C-H. Both substitutions resulted in very small destabilizations of the complex (0.5–0.7 kcal/mol) even though a hydrogen bond is formed between N1 and the side chain of Arg52 in the X-ray structure. In contrast, base substitutions that altered the pattern of hydrogen bond donors and acceptors, such as A1I, or added hydrogen bond donors or acceptors, such as A1DAP or A1-2AP, resulted in a much larger destabilization of the complex of 2.4–2.9 kcal/mol. Within the wild type structure the 2AP and DAP substitutions would introduce unfavorable steric interactions between the side chain of Leu49 and the 2-NH2 group, while the substitution of inosine for A1 would introduce an electrostatic repulsion between N1-H and the side chain of Arg52. These unfavorable interactions may contribute to the destabilization of the complexes containing A1-2AP and A1-DAP, which could contribute to alternate structures being formed in these complexes. These data suggest that the poor affinity of A1G SL2 RNA for U1A is due to the altered pattern of hydrogen bond donors and acceptors on G compared to A, rather than a loss of specific hydrogen bonding interactions. Thus, at this position, specificity is a consequence primarily of the destabilization of complexes containing non-cognate bases.
Recognition of G4
G4 is packed between U3 and a number of amino acids in the U1A protein (Figure 2C) . G4 forms the largest number of hydrogen bonds with the U1A protein of any base in the AUUGCAC sequence. Hydrogen bonds are formed between N7 and the amide side chain of Asn15, O6 and the main chain amide of Asn16, O6 and the main chain carbonyl of Leu17 (water-mediated), 2-NH2 and O2 of U2, 2-NH2 and the side chain of Glu19, and N1-H and the side chain of Glu19. Substitution of A for G4 resulted in a 6.4 kcal/mol loss of binding affinity. This value is comparable to that reported previously .
The binding affinities of U1A for SL2 RNAs containing a series of base analogs substituted for G4 are reported in Table 2. Representative gel mobility shift analyses and binding curves are shown in Figure 3. The elimination of 2-NH2 (G4I) or the substitution of N7 with C-H (G4c7G) resulted in destabilizations of the complex of 1.5 and 3.3 kcal/mol, respectively. The greater loss in binding free energy upon substitution of G4 with c7G than with I suggests that N7 is a more important contributor to binding than is the 2-NH2 group even though two hydrogen bonds are observed between the 2-NH2 group and the U1A protein in the X-ray structure, compared to one hydrogen bond between N7 and the U1A protein. \These results are consistent with previously performed protein substitutions. The substitution of Asn15, which forms a hydrogen bond between the amide side chain and N7 of G4, with Val abolished binding . In contrast, the substitution of Glu19, which forms a hydrogen bond between the side chain and 2-NH2, with Ser destabilized the complex, but did not abolish binding .
The substitution of either 2AP or DAP for G4 results in a larger destabilization of the complex, 5.8 and 5.4 kcal/mol, respectively, than was observed for either the G4I or G4c7G substitutions, perhaps because these substitutions alter functionality at both the 1 and 6 positions of the purine ring. Because of the weak binding affinity of U1A for these RNA sequences full binding curves were not obtained leading to more uncertainty in the reported dissociation constants compared to those for other RNA sequences. However, the large destabilization of the complex with the G4-2AP substitution suggests that the 6-O or N1-H groups are essential for binding to U1A. In the structure of the wild type complex G4DAP and G4-2AP substitutions would introduce an electrostatic repulsion between the lone pair of N1 and the side chain of Glu19. However, based on the low binding affinity of U1A for SL2 RNA containing these substitutions, it is likely that the structure of these complexes are altered from the wild type structure. The significant destabilization of the complex observed upon substitution of hydrogen bond donors and acceptors on G4 with aliphatic groups and the substitution of G4 with 2AP suggest that the specificity of the U1A protein for G4 is dependent in large part on direct and indirect contributions of the G4 functional groups to complex stability.
Recognition of A6
The interactions between A6 and U1A in the X-ray cocrystal structure are shown in Figure 2D. A6 stacks between Phe56 and C7. The substitution of non-aromatic amino acids for Phe56 results in a large destabilization of the complex [15, 21, 34]. N1 forms a hydrogen bond with the side chain of Ser91, the 6-NH2 forms a hydrogen bond with the main chain carbonyl of Thr89, and N7 forms a water-mediated hydrogen bond with the main chain amide of Thr89. We previously showed that the substitution of A with any other base results in a large destabilization of the complex of 6.3–6.7 kcal/mol, while the elimination of individual hydrogen bond donors or acceptors resulted in a 0.8–1.9 kcal/mol destabilization of the complex . The substitution of Ser91 with Ala resulted in a similar destabilization of the complex as resulted from the substitution of N1 of A6 with a C-H group . In addition, we observed energetic coupling between Phe56 and hydrogen bond donors and acceptors on A6 . Thus, the hydrogen bond donors and acceptors on A6 play direct and indirect roles in stabilizing the complex.
The exchange of hydrogen bond donors and acceptors (for example, A6I) or the addition of a hydrogen bond donor (A6DAP) resulted in a much larger destabilization of the complex than the elimination of individual hydrogen bond donors and acceptors (Table 3, Figure 3). Within the context of the wild type structure, the introduction of the 2-NH2 would introduce unfavorable steric interactions with the side chain of Leu44, an amino acid previously suggested to be important for specific recognition of SL2 RNA , and the substitution of inosine for A6 would result in unfavorable steric interactions between N1-H and the side chain of Ser91 and an unfavorable electrostatic interaction between O6 and the main chain carbonyl of Asp90. These unfavorable interactions may destabilize the complex, which could contribute to alternative complex structures being formed upon incorporation of these modified bases. Together, the results from the experiments eliminating hydrogen bond donors and acceptors from A6 reported previously  and those exchanging hydrogen bond donor and acceptors reported here suggest that specific recognition of A6 involves both direct recognition of the base and discrimination against incorrectly placed functional groups.
The large destabilizations of the complexes formed with U1A upon incorporation of many of the base analogs described here suggests considerable variation in free RNA or complex structure as a result of these base substitutions. The RNA loop is dynamic when free, making it difficult to characterize the effect of the base substitutions for A1, G4, and A6 on RNA structure. Because the base analogs contain modified hydrogen bond donor and acceptor patterns that are similar in polarity and stacking ability to G and A, it is likely that the primary effect of the base analogs is to alter the structure of the complex. However, we were concerned that the A1G SL2 RNA and perhaps the A1I SL2 RNA could form an additional base pair, thus stabilizing SL2 RNA and reducing the size of the loop.
To investigate the effect of base substitutions on SL2 RNA stability, temperature dependent melting analyses were performed. The Tm's of the SL2 RNAs not previously reported are listed in Table 4. A limitation of this method is that it primarily probes the structure and stability of the helical portion of SL2 RNA, rather than the loop. Thus, not surprisingly, the Tm's of SL2 RNAs containing base substitutions were similar to or within error of that of the wild type RNA. The measured Tm's varied between 57°C and 62°C, with no correlation observed between the Tm and the binding affinities reported in Tables 1, 2, 3. As expected, the Tm of the A1G SL2 RNA (64.2 ± 0.5°C) was 2°C higher than the range observed for the other RNA's, including A1I SL2 RNA (61 ± 2°C). The Tm of the SL2 RNA containing A1G/C10A, which eliminates the possibility of an additional GC base pair, was measured to be within the range of the other SL2 RNAs (61.6 ± 0.9). Because the stabilities of the complexes containing A1G and A1G/C10A SL2 RNAs are within experimental error and the C10A substitution does not itself alter the stability of the complex, the additional stability of the A1G SL2 RNA does not appear to be a large contributor to the destabilization of the complex by this substitution.
The substitution of the three essential purines in SL2 RNA with bases in which hydrogen bond donors and acceptors are replaced with aliphatic groups and with bases in which hydrogen bond donors and acceptors are placed incorrectly on the base has enabled us to compare the positive contributions of correct functional groups with the negative contributions of incorrectly placed functional groups to the binding specificity of the U1A protein. These comparisons have suggested that specific recognition of essential purines by the U1A protein varies from primarily discrimination against non-cognate bases for A1 to direct contributions of the base functional groups for G4. The base modifications introduced in these experiments are likely to not only eliminate and introduce individual interactions that either destabilize or stabilize the complex, but to affect other interactions in the complex that are energetically coupled with the modified base. Cooperative networks of interactions involving both protein and RNA residues have been identified experimentally and suggested computationally in the U1A system [13, 18, 22, 27–31]. These studies have focused more on cooperative networks in the free and bound proteins than on networks involving RNA. However, experiments have suggested energetic coupling between A6 and Phe56 and between loop 3 of U1A, Tyr13, Gln54 and the RNA [13, 18], and an analysis of collective atomic fluctuations in MD simulations of the U1A-SL2 RNA complex suggest large networks of interactions involving both protein and RNA that have not yet been explored experimentally [22, 30]. Thus, the energetic roles of individual hydrogen bond donors and acceptors that we have identified to be important for the discrimination by U1A between cognate and non-cognate RNA sequences are likely to be complex.
Previously, Shamoo and coworkers investigated the recognition of purines by two linked RRMs from hnRNPA, called UP1 . UP1 is a significantly different RRM than U1A and a comparison of U1A and UP1 should be valuable for developing principles of RRM-RNA specificity. Shamoo and coworkers suggested three generalizations of RNA recognition by RRMs based on experiments with UP1. First, interactions with main chain amides might provide greater base discrimination than interactions with side chains. Although this idea has not yet been fully evaluated in the U1A-RNA complex, binding studies that have been performed with RNAs containing modified bases have suggested that interactions with main chain amides are not necessarily more important than those with side chains in the U1A-RNA complex. For example, N7 of G4 forms a hydrogen bond with the side chain of Asn15. As reported in Table 2, substitution of N7 with C-H results in a significant destabilization of the complex. In contrast, the substitution of A6 with purine, which eliminates the 6-NH2 group that forms a hydrogen bond with the main chain carbonyl of Thr89 destabilizes the complex comparably as the substitution of N1, which forms a hydrogen bond with the side chain of Ser91, with C-H [15, 19]. Second, hydrogen bonds to bases involving charged amino acids are more energetically important than those involving neutral amino acid functional groups. U1A does not have a large number of hydrogen bonds with charged residues, so these are not required for tight binding. Third, steric repulsion is a key discriminatory tool for gaining sequence specificity. In fact, for the U1A-RNA complex, steric and electrostatic repulsion due to incorrectly placed functional groups can be more important than direct base recognition in controlling specificity. Thus, specific recognition of RNA target sites by UP1 and U1A are similarly guided by discrimination against non-cognate RNA.
In conclusion, these investigations show three different approaches used by the U1A protein to specifically recognize essential purines in the SL2 RNA target site and underscore the ability of steric and electrostatic repulsion to be important for specificity even in the absence of a direct hydrogen bond network with the base. The contributions of negative recognition to specificity have been shown to be important in RNA recognition by other proteins, for example by the MS2 coat protein and tRNA synthetases, and also in DNA-protein recognition [37–45]. Thus, the data from the U1A-SL2 RNA and UP1-RNA complexes extend this generalization to the RRM-RNA complexes. Together, these results form part of a growing body of data that shows the importance of steric and electrostatic discrimination against incorrectly placed functional groups on non-cognate bases in governing the specificity of RNA-protein complexes. These general principles describing the origins of specificity of protein-RNA complexes will be invaluable in understanding and controlling complex formation.
Protein Expression and Purification
An expression vector for the N-terminal RRM of U1A (amino acids 1–102) was obtained from Nagai . The wild type protein was expressed and purified as described previously . The molecular weight was confirmed by ESI mass spectrometry and the concentration was determined by amino acid analysis.
RNA Synthesis and Purification
RNA containing c7G was synthesized at the KECK facility at Yale Medical School. Other oligonucleotides were purchased from Dharmacon and IDT. The RNA and chimeric oligomers were purified using denaturing gel electrophoresis. Concentrations were determined by UV at 260 nm. All oligonucleotides were characterized by MALDI mass spectrometry.
Equilibrium Binding Assays
Protein-RNA equilibrium dissociation constants were measured by gel mobility shift assays. Reaction mixtures containing 25 pM 32P-labeled RNA and protein in 10 mM Tris-HCl (pH 7.4), 250 mM NaCl, 1 mM EDTA, 0.5% Triton X-100 and 1 mg/mL tRNA in a total volume of 14 μL were equilibrated for at least 45 min. After addition of glycerol to a final concentration of 5%, the reactions were separated on an 8% polyacrylamide gel in a buffer containing 100 mM Tris-borate (pH 8.3), 1 mM EDTA, and 0.1% Triton X-100 for 35 min at 350 V. The temperature of the gel was maintained at 25°C by a circulating water bath. Gels were analyzed on a Molecular Dynamics Storm Phosphorimager. Fraction RNA bound versus protein concentration was plotted and curves were fit to the equation: fraction bound = 1/(1+KD/[P]), where [P] is the total protein concentration. In gel mobility shift assays that did not saturate at approximately 100% bound, this equation was modified to: fraction bound = A/(1+KD/[P]), where A was allowed to vary between 0.7 and 1. This modification was necessary to estimate the KD's of some of the least stable complexes. Representative gel mobility shift assays and plots illustrating fraction RNA bound as a function of U1A concentration are shown in Figure 3. The KD's obtained from these experiments are listed in Tables 1, 2, 3. The errors listed in the tables are the standard deviations of the results of at least three independent binding experiments and thus, represent the reproducibility of the experimental data.
RNA Melting Experiments
UV melting curves were performed on a Shimadzu UV-2401PC spectrophotometer using 2–10 μM RNA samples in 100 mM NaCl, 0.5 mM EDTA, and 10 mM sodium phosphate at pH 7. The samples were heated from 10 to 95 °C with a heating rate of 1 °C/min, while monitoring absorbance at 280 nm. The melting curves were fit using the program Meltwin 3.5.
Birney E, Kumar S, Krainer AR: Analysis of the RNA-recognition motif and RS and RGG domains: conservation in metazoan pre-mRNA splicing factors. Nucleic Acids Res. 1993, 21: 5803-5816. 10.1093/nar/21.25.5803.
Lorkovic ZJ, Barta A: Genome analysis: RNA recognition motif (RRM) and K homology (KH) domain RNA-binding proteins from the flowering plant Arabidopsis thaliana. Nucleic Acids Res. 2002, 30: 623-635. 10.1093/nar/30.3.623.
Rubin GM, Yandell MD, Wortman JR, Miklos GLG, Nelson CR, Hariharan IK, Fortini ME, Li PW, Apweiler R, Fleischmann W, Cherry JM, Henikoff S, Skupski MP, Misra S, Ashburner M, Birney E, Boguski MS, Brody T, Brokstein P, Celniker SE, Chervitz SA, Coates D, Cravchik A, Gabrielian A, Galle RF, Gelbart WM, George RA, Goldstein LSB, Gong F, P.Guan, Harris NL, Hay BA, Hoskins RA, Li J, Li Z, Hynes RO, Jones SJM, Kuehl PM, Lemaitre B, Litleton JT, Morrison DK, Mungall C, O'Farrell PH, Pickeral OK, Shue C, Vosshall LB, Zhang J, Zhao Q, Zheng XH, Zhong F, W.Zhong, Gibbs R, Venter JC, Adams MD, Lewis S: Comparitive Genomics of the Eukaryotes. Science. 2000, 287: 2204-2215. 10.1126/science.287.5461.2204.
Krecic AM, Swanson MS: hnRNP complexes: composition, structure and function. Curr Opin Cell Biol. 1999, 11: 363-371. 10.1016/S0955-0674(99)80051-9.
Varani G, Nagai K: RNA recognition by RNP proteins during RNA processing. Annu Rev Biophys Biomol Struct. 1998, 27: 407-445. 10.1146/annurev.biophys.27.1.407.
Burd CG, Dreyfuss G: Conserved structures and diversity of functions of RNA-binding proteins. Science. 1994, 265: 615-621. 10.1126/science.8036511.
Sickmier EA, Frato KE, Shen H, Paranawithana SR, Green MR, Kielkopf CL: Structural Basis for Polypyrimidine Tract Recognition by the Essential Pre-mRNA Splicing Factor U2AF65. Mol Cell. 2006, 23: 49-59. 10.1016/j.molcel.2006.05.025.
Maris C, Dominguez C, Allain FHT: The RNA recognition motif, a plastic RNA-binding platform to regulate post-transcriptional gene expression. FEBS J. 2005, 272: 2118-2131. 10.1111/j.1742-4658.2005.04653.x.
Hall KB: Interaction of RNA hairpins with the human U1A N-terminal RNA binding domain. Biochemistry. 1994, 33: 10076-10088. 10.1021/bi00199a035.
Hall KB, Stump WT: Interaction of N-terminal domain of U1A protein with an RNA stem/loop. Nucleic Acids Res. 1992, 20: 4283-4290. 10.1093/nar/20.16.4283.
Jessen TH, Oubridge C, Teo CH, Pritchard C, Nagai K: Identification of molecular contacts between the U1A small nuclear ribonucleoprotein and U1 RNA. EMBO J. 1991, 10: 3447-3456.
Katsamba PS, Bayramyan M, Haworth IS, Myszka DG, Laird-Offringa IA: Complex role of the b2-b3 Loop in the Interaction of U1A with U1 Hairpin II RNA. J Biol Chem. 2002, 277: 33267-33274. 10.1074/jbc.M200304200.
Kranz JK, Hall KB: RNA recognition by the human U1A protein is mediated by a network of local cooperative interactions that create the optimal binding surface. J Mol Biol. 1999, 285: 215-231. 10.1006/jmbi.1998.2296.
Law MJ, Chambers EJ, Katsamba PS, Haworth IS, Laird-Offinga IA: Kinetic analysis of the role of the tyrosine 13, phenylalanine 56 and glutamine 54 network in the U1A/U1 hairpin II interaction. Nucleic Acids Res. 2005, 33: 2917-2928. 10.1093/nar/gki602.
Nolan SJ, Shiels JC, Tuite JB, Cecere KL, Baranger AM: Recognition of an essential adenine at a protein-RNA interface: comparison of the contributions of hydrogen bonds and a stacking interaction. J Am Chem Soc. 1999, 121: 8951-8952. 10.1021/ja991617n.
Oubridge C, Ito N, Evans PR, Teo CH, Nagai K: Crystal structure at 1.92 Å resolution of the RNA-binding domain of the U1A spliceosomal protein complexed with an RNA hairpin. Nature. 1994, 372: 432-438. 10.1038/372432a0.
Rimmele ME, Belasco JG: Target discrimination by RNA-binding proteins: Role of the ancillary protein U2A' and a critical leucine residue in differentiating the RNA-binding specificity of spliceosomal proteins U1A and U2B". RNA. 1998, 4: 1386-1396. 10.1017/S1355838298981171.
Shiels JC, Tuite JB, Nolan SJ, Baranger AM: Investigation of a conserved stacking interaction in target site recognition by the U1A protein. Nucleic Acids Res. 2002, 30: 550-558. 10.1093/nar/30.2.550.
Tuite JB, Shiels JC, Baranger AM: Substitution of an Essential Adenine in the U1A-RNA Complex with a Non-polar Isostere. Nucleic Acids Res. 2002, 30: 5269-5275. 10.1093/nar/gkf636.
Varani L, Gunderson SI, Mattaj IW, Kay LE, Neuhaus D, Varani G: The NMR structure of the 38 kDa U1A protein-PIE RNA complex reveals the basis of cooperativity in regulation of polyadenylation by human U1A protein. Nature Struct Biol. 2000, 7: 329-335. 10.1038/74101.
Zhao Y, Baranger AM: Design of an Adenosine Analog that Selectively Improves the Affinity of a Mutant U1A Protein for RNA. J Am Chem Soc. 2003, 125: 2480-2488. 10.1021/ja021267w.
Kormos BL, Baranger AM, Beveridge DL: Do Collective Atomic Fluctuations Account for Cooperative Effects? Molecular Dynamics Studies of the U1A-RNA Complex. J Am Chem Soc. 2006, 128: 8992-8993. 10.1021/ja0606071.
Boelens WC, Jansen EJR, van Venrooij WJ, Stripecke R, Mattaj IW, Gunderson SI: The human U1 snRNP-specific U1A protein inhibits polyadenylation of its own pre-mRNA. Cell. 1993, 72: 881-892. 10.1016/0092-8674(93)90577-D.
Green MR: Biochemical mechanisms of constitutive and regulated pre-mRNA splicing. Annu Rev Cell Biol. 1991, 7: 559-599. 10.1146/annurev.cb.07.110191.003015.
Stark H, Dube P, Lührmann R, Kastner B: Arrangement of RNA and proteins in the spliceosomal U1 small nuclear ribonucleoprotein particle. Nature. 2001, 409: 539-542. 10.1038/35054102.
Tsai DE, Harper DS, Keene JD: U1-snRNP-A selects a ten nucleotide consensus sequence from a degenerate RNA pool presented in various structural contexts. Nucl Acids Res. 1991, 19: 4931-4936. 10.1093/nar/19.18.4931.
Showalter SA, Hall KB: A Functional Role for Correlated Motion in the N-terminal RNA-binding Domain of Human U1A Protein. J Mol Biol. 2002, 322: 533-542. 10.1016/S0022-2836(02)00804-5.
Showalter SA, Hall KB: Altering the RNA-binding Mode of the U1A RBD1 Protein. J Mol Biol. 2004, 335: 465-480. 10.1016/j.jmb.2003.10.055.
Showalter SA, Hall KB: Correlated Motions in the U1 snRNA Stem/Loop 2: U1A RBD1 Complex. Biophys J. 2005, 89: 2046-2058. 10.1529/biophysj.104.058032.
Kormos BL, Baranger AM, Beveridge DL: A Study of Collective Atomic Fluctuations and Cooperativity in a U1A-RNA Complex Based on Molecular Dynamics Simulations. J Struct Biol. 2007, 157: 500-513. 10.1016/j.jsb.2006.10.022.
Kranz JK, Hall KB: RNA binding mediates the local cooperativity between the b-sheet and the C-terminal tail of the human U1A RBD1 protein. J Mol Biol. 1998, 275: 465-481. 10.1006/jmbi.1997.1441.
Nagai K, Oubridge C, Jessen TH, J.Li, Evans PR: Crystal structure of the RNA-binding domain of the U1 small nuclear ribonucleoprotein A. Nature. 1990, 348: 515-520. 10.1038/348515a0.
Boelens W, Scherly D, Jansen EJR, Kolen K, Mattaj IW, van Venrooij WJ: Analysis of in vitro binding of U1-A protein mutants to U1 snRNA. Nucleic Acids Res. 1991, 19: 4611-4618. 10.1093/nar/19.17.4611.
Katsamba PS, Myszka DG, Laird-Offringa IA: Two functionally distinct steps mediate high affinity binding of U1A protein to U1 hairpin II RNA. J Biol Chem. 2001, 276: 21476-21481. 10.1074/jbc.M101624200.
Scherly D, Boelens W, Dathan NA, van Venrooij WJ, Mattaj IW: Major determinants of the specificity of interaction between small nuclear ribonucleoprotein U1A and U2B" and their cognate RNAs. Nature. 1990, 345: 502-506. 10.1038/345502a0.
Myers JC, Shamoo Y: Human UP1 as a Model for Understanding Purine Recognition in the Family of Protein Containing the RNA Recognition Motif (RRM). J Mol BIol. 2004, 342: 743-756. 10.1016/j.jmb.2004.07.029.
Dertinger D, Dale T, Uhlenbeck OC: Modifying the Specificity of an RNA Backbone Contact. J Mol Biol. 2001, 314: 649-654. 10.1006/jmbi.2001.5132.
Horn WT, Tars K, Grahn E, Helgstrand C, Baron AJ, Lago H, Adams CJ, Peabody DS, Phillips SEV, Stonehouse NJ, Liljas L, Stockley PG: Structural Basis of RNA Binding Discrimination between Bacteriophages QB and MS2. Structure. 2006, 14: 487-495. 10.1016/j.str.2005.12.006.
Kiesewetter S, Ott G, Sprinzl M: The role of modified purine 64 in initiator/elongator discrimination of tRNAiMet from yeast and wheat germ. Nucleic Acids Res. 1990, 18: 4677-4682. 10.1093/nar/18.16.4677.
Perret V, Garcia A, Grosjean H, Ebel JP, Flroentz C, Giegé R: Relaxation of a transfer RNA specificity by removal of modified nucleotides. Nature. 1990, 344: 783-789. 10.1038/344787a0.
Tardif KD, Horowitz J: Functional group recognition at the aminoacylation and editing sites of E. coli valyl-tRNA synthetase. RNA. 2004, 10: 493-503. 10.1261/rna.5166704.
Lesser DR, Kurpiewski MR, Jen-Jacobson L: The Energetic Basis of Specificity in the Eco RI Endonuclease-DNA Interaction. Science. 1990, 250: 776-786. 10.1126/science.2237428.
von Hippel PH, Berg OG: On the specificity of DNA-protein interactions. Proc Natl Acad Sci. 1986, 83: 1608-1612. 10.1073/pnas.83.6.1608.
Härd T, Lundbäck T: Thermodynamics of sequence-specific protein-DNA interactions. Biophys Chem. 1996, 62: 121-139. 10.1016/S0301-4622(96)02197-7.
Morozova N, Allers J, Myers J, Shamoo Y: Protein-RNA interactions: exploring binding patterns with three-dimensional superposition analysis of high resolution structures. Bioinformatics. 2006, 22: 2746-2752. 10.1093/bioinformatics/btl470.
We are grateful to Prof. K. Nagai for the expression vector for U1A. Acknowledgement is made to the donors of the Petroleum Research Fund, administered by the American Chemical Society, for partial support of this work. Funding was also provided by the NIH (GM-056857).
YB carried out the all of the experiments presented here and participated in the design of the study. AMB conceived of the study, participated in its design and coordination, and drafted the manuscript. Both authors have read and approved the final manuscript.
Authors’ original submitted files for images
About this article
Cite this article
Benitex, Y., Baranger, A.M. Recognition of essential purines by the U1A protein. BMC Biochem 8, 22 (2007). https://doi.org/10.1186/1471-2091-8-22
- Hydrogen Bond Donor
- Cooperative Network
- Base Analog
- Amide Side Chain
- Main Chain Carbonyl