Thermophile-specific proteins: the gene product of aq_1292 from Aquifex aeolicus is an NTPase
© Klinger et al; licensee BioMed Central Ltd. 2003
Received: 13 June 2003
Accepted: 23 September 2003
Published: 23 September 2003
To identify thermophile-specific proteins, we performed phylogenetic patterns searches of 66 completely sequenced microbial genomes. This analysis revealed a cluster of orthologous groups (COG1618) which contains a protein from every thermophile and no sequence from 52 out of 53 mesophilic genomes. Thus, COG1618 proteins belong to the group of thermophile-specific proteins (THEPs) and therefore we here designate COG1618 proteins as THEP1s. Since no THEP1 had been analyzed biochemically thus far, we characterized the gene product of aq_1292 which is THEP1 from the hyperthermophilic bacterium Aquifex aeolicus (aaTHEP1).
aaTHEP1 was cloned in E. coli, expressed and purified to homogeneity. At a temperature optimum between 70 and 80°C, aaTHEP1 shows enzymatic activity in hydrolyzing ATP to ADP + Pi with kcat = 5 × 10-3 s-1 and Km = 5.5 × 10-6 M. In addition, the enzyme exhibits GTPase activity (kcat = 9 × 10-3 s-1 and Km= 45 × 10-6 M). aaTHEP1 is inhibited competitively by CTP, UTP, dATP, dGTP, dCTP, and dTTP. As shown by gel filtration, aaTHEP1 in its purified state appears as a monomer. The enzyme is resistant to limited proteolysis suggesting that it consists of a single domain. Although THEP1s are annotated as "predicted nucleotide kinases" we could not confirm such an activity experimentally.
Since aaTHEP1 is the first member of COG1618 that is characterized biochemically and functional information about one member of a COG may be transferred to the entire COG, we conclude that COG1618 proteins are a family of thermophilic NTPases.
The comparison of protein coding sequences from complete genomes led to a classification, based on sequence similarities, that assigns proteins to clusters of orthologous groups (COGs) [1, 2]. Phylogenetic patterns search  is a tool to retrieve COGs that contain protein sequences that match a certain predefined pattern of organisms. Recently, Forterre  used this tool to identify reverse gyrase as the only hyperthermophile-specific protein. Another survey identified 30 COGs enriched in hyperthermophilic procaryotes . In contrast, we here extended a similar search with respect to a group consisting of both thermophiles and hyperthermophiles i. e. the currently 3 organisms with optimal growth temperatures between 55 and 80°C were included. This search retrieved COG1618 as one of the high ranking clusters containing a protein from every of the 13 thermophilic and, by including only one representative, the fewest sequences from mesophilic genomes of the current COG database. Thus, amongst others, the COG1618 proteins belong to a group of thermophile-specific proteins (THEPs) and in this report we designate them as THEP1s.
Orthologs typically have the same function, allowing the transfer of functional information from one member of a COG to the entire COG. To the best of our knowledge, no function could be established for any of the THEP1s thus far. On the other hand, Cort et al. predicted THEP1s being ATPases . This paper confirms their hypothesis experimentally by characterizing the gene product of aq_1292 from Aquifex aeolicus (aaTHEP1) as a first example representative for this protein family. Aquifex aeolicus is a hyperthermophilic bacterium growing optimally at 95°C .
Identification of COG1618
COGs containing the THEPs with the highest thermophile-specificity.
Number of mesophilic genomes in COG
Number of thermophilic genomes in COG
Sequence analysis of THEP1s
Purification of recombinant aaTHEP1
Functional activity of aaTHEP1
Inhibition of ATP-hydrolysis by other nucleotides
Possible homologous non-covalent interactions
Secondary structure analysis
Our approach is based on the assumption that proteins abundant in thermophilic and rare in mesophilic genomes are the most attractive targets for further biochemical investigations to understand the physiology specific for thermophiles in more detail. Consequently, our first goal was to identify such candidates via bioinformatic methods and as the most suitable protein THEP1 was selected for this study. The high ranking thermophile-specificity of THEP1 among procaryotes may be explained by an essential physiological role in thermophiles that is of no functional relevance for almost all mesophilic microorganisms. As an alternative explanation, a function also present in mesophilic organisms could be carried out by a protein that was not able to adapt to higher temperatures and in the course of convergent evolution, THEP1 could have taken over this particular function. Methanosarcina acetivorans str. C2A is the only mesophilic organism containing THEP1 (MA3402). Since the genome of M. acetivorans reveals extensive metabolic and physiological diversity and there are thermophilic Methanosarcinae , one may take into consideration the possibility that M. acetivorans facultatively could be thermophilic. In addition to phylogenetic patterns search we also performed BLAST with the aaTHEP1 sequence. However, although no significant homologies to further sequences from mesophilic unicellular organisms could be detected, we discovered homologies to 6 multicellular eucaryotes. Consequently, THEP1s could belong to a class of proteins that are conserved in Archeae and Eukarya (with losses) and have been passed to thermophilic bacteria by lateral gene transfer.
The present data show that aaTHEP1 catalyzes ATP and GTP hydrolysis in vitro as predicted by Cort et al. . In contrast, the annotated nucleotide kinase activity could not be confirmed experimentally.
As expected, the observed turnover rates are too low to represent a physiological in vivo situation where the free energy of ATP hydrolysis is coupled to energy consuming tasks. In vitro, similar turnover rates of purified NTPase are published in the literature: 1.1 × 10-2 sec-1 for PilT from A. aeolicus , 1.2 × 10-2 sec-1 for TadA for Actinobacillus actinomycetemcomitans , 3 × 10-3 sec-1 for TrwD from Escherichia coli , and 6.9 × 10-4 sec-1 for PilQ from Escherichia coli . In vivo, additional proteins might be needed to activate aaTHEP1 by protein-protein interactions and take up the free energy released by NTP-hydrolysis e. g. for motion, active transport or another energy consuming cellular function. Alternatively, aaTHEP1 could catalyze an NTP driven thermodynamically unfavourable anabolic reaction of a yet undiscovered cosubstrate or play a role in cellular regulation.
ATP-hydrolysis could be inhibited by all other 7 nucleosidetriphosphates under investigation in a competitive manner which may be interpreted that in addition to GTP the other nucleotides are also substrates for isolated aaTHEP1. Possibly the enzyme lost a certain in vivo substrate specificity upon isolation. Since GDP inhibits ATP-hydrolysis whereas GMP, AMP and UMP do not, we propose that aaTHEP1 recognizes the β- and γ-phosphates rather than the nucleoside moieties.
A low activity of a recombinant protein may also be explained by trace amounts of E. coli enzymes still present after purification. For the measured NTPase activity of aaTHEP1 we exclude this possibility because we do not expect an E. coli enzyme exhibiting the same temperature dependence as shown in figure 6. There are even more indications that aaTHEP1 is a thermostable protein. The elution during gel filtration at a position corresponding to a lower relative molecular weight than the calculated 20,555 Da for a monomer and the resistance to limited proteolysis suggest that aaTHEP1 is a highly compact folded one-domain protein, a well known feature for thermophilic proteins. No cooperative behaviour in the kinetic experiments is also expected for a monomeric enzyme.
Although this study clearly defines a biochemical in vitro activity of aaTHEP1, there are many possible in vivo functions and bioinformatic analysis allows to make some predictions. Based on the genomic context, gene functions can be predicted by searching for the conservation of operons and gene orders because genes found in gene strings, particularly in multiple genomes, can be legitimately assumed to be functionally linked . For THEP1, we indeed detected 4 genomes where the THEP1-gene immediately is followed on the same strand by a COG1867 protein (N2, N2-dimethylguanosine tRNA methyltransferase). Furthermore, COG1867 also belongs to the group of THEPs indicating a functional link (table 1).
Functional predictions of high-scoring COGs.
Predicted nucleotide kinase
Predicted phosphoglycerate mutase, AP superfamily
Uncharacterized conserved protein
Uncharacterized conserved protein
Archaeal fructose 1,6-bisphosphatase
Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, delta subunit
Predicted prefoldin, molecular chaperone implicated in de novo protein folding
Peptide chain release factor 1 (eRF1)
N2, N2-dimethylguanosine tRNA methyltransferase
Archaeal DNA-binding protein
Uncharacterized protein conserved in archaea
Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB)
Uncharacterized protein conserved in archaea
Predicted transcriptional regulators
Uncharacterized protein conserved in archaea
Uncharacterized conserved protein related to C-terminal domain of eukaryotic chaperone, SACSIN
Inhibition constants of ATP hydrolysis by nucleoside-3-phosphates.
57 ± 20
6 ± 2
5 ± 2
3 ± 0.2
18 ± 2
9 ± 3
6 ± 2
Cloning, expression and purification of aaTHEP1 from A. aeolicus
Genomic DNA from A. aeolicus was kindly provided by Dr. R. Huber, Regensburg, Germany. aq_1292 was amplified by PCR using the primers 5'-CACCATGAAAATCATCATAACCGGTGA-3' and 5'-TTACCGCTCAAGAAGTGAGAGAAT-3'. The PCR fragment was inserted into the pre-linearized plasmid pET101/D-TOPO (Invitrogen). For propagation and maintenance, the E. coli strain TOP10 (Invitrogen) was used. The correct sequence of the insert as well as its orientation were verified by sequence analysis using the reverse T7-primer 5'-TAGTTATTGCTCAGCGGTGG-3'.
To express aq_1292, E. coli BL21 Star™ (DE3) was used. According to the instructions of the manufacturer, freshly transformed cells were grown by transferring the entire transformation mixture to 10 ml Luria-Bertani broth (LB) medium containing 50 μg/ml carbenicillin and 1% glucose. After growing for 4 h at 37°C, the preculture was added to 40 ml of fresh medium, grown for additional 16 h and then 30 ml were used to inoculate a 2 l main culture. At A600 = 0.7, protein expression was induced by 1 mM isopropyl β-D-thiogalactopyranoside (IPTG, Roth). Cells were harvested at A600 = 1.3, yielding approximately 4 g after centrifugation at 6,000 × g for 10 min. The pellet was stored at -20°C before use.
Buffer A for protein purification was 50 mM Tris/HCl, 25 mM MgCl2, 5 mM KCl, 1 mM DL-dithiothreitol (DTT), 0.1 mM ethylenediaminetetraacetic acid (EDTA), pH 7.0. To disrupt the cell walls, 20 ml of buffer A containing 10 instead of 50 mM Tris, 5 mM sodium deoxycholate, 40,000 U/ml lysozyme, and 50 μg/ml DNaseI were added to 1 g of thawed cells. Lysis was performed by stirring the suspension at 20°C for 1 h and the cell debris were removed by centrifugation at 20,000 × g for 30 min. To precipitate the bulk of E. coli proteins, the supernatant was heated to 75°C for 10 min and immediately chilled on ice. Denatured proteins were removed by centrifugation at 20,000 × g and 4°C for 30 min. To remove nucleic acids and further purify recombinant aaTHEP1, the supernatant was loaded at 0.5 ml/min onto a 1 ml HiTrap™ SP HP cation exchange column (Amersham Biosciences AB) equilibrated with buffer A. The column was washed with buffer A containing 150 mM KCl and aaTHEP1 was eluted at 500 mM KCl. To the collected peak a 4-fold volume of buffer A containing 2.438 M (NH4)2SO4 (62.5 % saturation) instead of KCl was added. After centrifugation at 20,000 × g for 30 min, the resulting protein solution was applied to a 1 ml HiTrap™ (low sub) Phenyl Sepharose™ FF (Amersham Biosciences AB) equilibrated in buffer A containing 1.85 M (NH4)2SO4 (50 % saturation) at 0.5 ml/min. The column was washed at 40 % and eluted at 20 % (NH4)2SO4 saturation. Finally, (NH4)2SO4 was removed from aaTHEP1 by using a NAP™ 5 column (Amersham Biosciences AB) equilibrated in buffer A. The purified protein could be stored in buffer A for several weeks at -20°C without significant loss in activity. Protein concentrations were determined by densitometry  of SDS-Gels  using the Sigma bovine serum albumin protein micro standard. Gels were analyzed via an imaging densitometer (Bio-Rad GS-700) and the Molecular Analyst software.
Steady state kinetics
To measure ATP or GTP hydrolysis, the release of [γ-32P] from [γ-32P]ATP or [γ-32P]GTP was determined. Assays were performed using purified aaTHEP1 in 25 μl buffer A. Aliquots of the reaction mixture were stopped after different times of incubation by adding 25 μl of 40 % formic acid and the mixture was separated by thin layer chromatography on PEI-cellulose in 0.5 M potassium phosphate pH 3.9. Quantification was performed by scanning exposed and developed Hyperfilm™ – βmax films (Amersham Biosciences AB) using an imaging densitometer (Bio-Rad GS-700) and the Molecular Analyst software. Each catalytic activity was determined by the average of end point measurements divided by time of at least three different times of incubation under steady-state conditions. Blank values determined in the absence of aaTHEP1 were subtracted from each data point. Unless stated otherwise, the experiments were performed at 70°C in a Biometra T3 thermocycler. Competition experiments were measured at a constant concentration of 5 μM ATP. Nonlinear regressions to determine kcat- and Km-values were performed with the GraphPad Prism™ software using the Michaelis-Menten equation. Ki-values were determined by fitting the data points to
V = (Vmax × [ATP])/([ATP] + Km × (1 + [Inhibitor]/Ki)).
Nucleotide kinase activities were assayed at 5 and 1000 μM ATP by adding 5 as well as 1000 μM of GDP, UMP, AMP, and GMP respectively.
To determine if purified aaTHEP1 is in mono- or oligomeric form, a 1 × 50 cm column filled with superose 6 prep grade (Amersham Biosciences AB) was calibrated with cytochrom C, carbonic anhydrase monomer and dimer  and bovine serum albumin monomer and dimer  as molecular weight standards. Proteins were run at 0.2 ml/min in buffer A containing 150 mM KCl. Approximately 80 μg of aaTHEP1 or marker protein in 0.5 ml were applied to the column. To dissociate possible cold-induced unspecific aggregates, aaTHEP1 was heated for 10 min at 75°C prior to gel filtration.
In order to probe the domain structure of aaTHEP1, limited proteolysis by trypsin and endoproteinase Glu-C was performed. The reactions were carried out in buffer A containing 4 μg/ml protease and 40 μg/ml protein substrate. After incubation at 20°C for 1 h, samples were analyzed by SDS-PAGE. The β-subunit of tryptophan synthase from E. coli was prepared as described earlier .
Far UV circular dichroism spectra were recorded using a J-810 CD spectralpolarimeter (Jasco) at 30°C. 2.5 μg of purified aaTHEP1 was measured in 200 μl of buffer A in an 1 mm quartz cuvette. Helix- and beta-sheet contents were calculated using the Spectrum Analyzer software (Jasco).
COGs were analyzed by extended phylogenetic patterns search (EPPS) [23, 24] using the march 5, 2003 release of the COG database. Multiple sequence alignments were performed at EMBL Outstation  using CLUSTALW 1.81  and the data was visualized using BOXSHADE v3.21 . BLAST searches were performed at NCBI .
We thank Andreas Savelsberg for his help in performing the kinetic measurements and Reiner Würdinger for assistance during the CD-measurements at Jasco GmbH Germany. We greatly appreciate discussions with Marina V. Rodnina on all aspects of the manuscript.
- Tatusov RL, Koonin EV, Lipman DJ: A genomic perspective on protein families. Science. 1997, 278: 631-637. 10.1126/science.278.5338.631.View ArticlePubMedGoogle Scholar
- Tatusov RL, Natale DA, Garkavtsev IV, Tatusova TA, Shankavaram UT, Rao BS, Kiryutin B, Galperin MY, Fedorova ND, Koonin EV: The COG database: new developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res. 2001, 29: 22-28. 10.1093/nar/29.1.22.PubMed CentralView ArticlePubMedGoogle Scholar
- Phylogenetic patterns search. [http://www.ncbi.nlm.nih.gov/COG/phylox.html]
- Forterre P: A hot story from comparative genomics: reverse gyrase is the only hyperthermophile-specific protein. Trends Genet. 2002, 18: 236-237. 10.1016/S0168-9525(02)02650-1.View ArticlePubMedGoogle Scholar
- Makarova KS, Wolf YI, Koonin EV: Potential genomic determinants of hyperthermophily. Trends Genet. 2003, 19: 172-176. 10.1016/S0168-9525(03)00047-7.View ArticlePubMedGoogle Scholar
- Cort JR, Yee A, Edwards AM, Arrowsmith CH, Kennedy MA: Structure-based functional classification of hypothetical protein MTH538 from Methanobacterium thermoautotrophicum. J Mol Biol. 2000, 302: 189-203. 10.1006/jmbi.2000.4052.View ArticlePubMedGoogle Scholar
- Deckert G, Warren PV, Gaasterland T, Young WG, Lenox AL, Graham DE, Overbeek R, Snead MA, Keller M, Aujay M, Huber R, Feldman RA, Short JM, Olsen GJ, Swanson RV: The complete genome of the hyperthermophilic bacterium Aquifex aeolicus. Nature. 1998, 392: 353-358. 10.1038/32831.View ArticlePubMedGoogle Scholar
- Leipe DD, Wolf YI, Koonin EV, Aravind L: Classification and evolution of P-loop GTPases and related ATPases. J Mol Biol. 2002, 317: 41-72. 10.1006/jmbi.2001.5378.View ArticlePubMedGoogle Scholar
- Ridder IS, Dijkstra BW: Identification of the Mg2+-binding site in the P-type ATPase and phosphatase members of the HAD (haloacid dehalogenase) superfamily by structural similarity to the response regulator protein CheY. Biochem J. 1999, 339 ( Pt 2): 223-226. 10.1042/0264-6021:3390223.View ArticleGoogle Scholar
- Kaufmann M, Schwarz T, Jaenicke R, Schnackerz KD, Meyer HE, Bartholmes P: Limited proteolysis of the beta 2-dimer of tryptophan synthase yields an enzymatically active derivative that binds alpha-subunits. Biochemistry. 1991, 30: 4173-4179.View ArticlePubMedGoogle Scholar
- Hogberg-Raibaud A, Goldberg ME: Preparation and characterization of a modified form of beta2 subunit of Escherichia coli tryptophan synthetase suitable for investigating protein folding. Proc Natl Acad Sci U S A. 1977, 74: 442-446.PubMed CentralView ArticlePubMedGoogle Scholar
- Galagan JE, Nusbaum C, Roy A, Endrizzi MG, Macdonald P, FitzHugh W, Calvo S, Engels R, Smirnov S, Atnoor D, Brown A, Allen N, Naylor J, Stange-Thomann N, DeArellano K, Johnson R, Linton L, McEwan P, McKernan K, Talamas J, Tirrell A, Ye W, Zimmer A, Barber RD, Cann I, Graham DE, Grahame DA, Guss AM, Hedderich R, Ingram-Smith C, Kuettner HC, Krzycki JA, Leigh JA, Li W, Liu J, Mukhopadhyay B, Reeve JN, Smith K, Springer TA, Umayam LA, White O, White RH, Conway de Macario E, Ferry JG, Jarrell KF, Jing H, Macario AJ, Paulsen I, Pritchett M, Sowers KR, Swanson RV, Zinder SH, Lander E, Metcalf WW, Birren B: The genome of M. acetivorans reveals extensive metabolic and physiological diversity. Genome Res. 2002, 12: 532-542. 10.1101/gr.223902.PubMed CentralView ArticlePubMedGoogle Scholar
- Herdendorf TJ, McCaslin DR, Forest KT: Aquifex aeolicus PilT, homologue of a surface motility protein, is a thermostable oligomeric NTPase. J Bacteriol. 2002, 184: 6465-6471. 10.1128/JB.184.23.6465-6471.2002.PubMed CentralView ArticlePubMedGoogle Scholar
- Bhattacharjee MK, Kachlany SC, Fine DH, Figurski DH: Nonspecific adherence and fibril biogenesis by Actinobacillus actinomycetemcomitans: TadA protein is an ATPase. J Bacteriol. 2001, 183: 5927-5936. 10.1128/JB.183.20.5927-5936.2001.PubMed CentralView ArticlePubMedGoogle Scholar
- Rivas S, Bolland S, Cabezon E, Goni FM, de la Cruz F: TrwD, a protein encoded by the IncW plasmid R388, displays an ATP hydrolase activity essential for bacterial conjugation. J Biol Chem. 1997, 272: 25583-25590. 10.1074/jbc.272.41.25583.View ArticlePubMedGoogle Scholar
- Sakai D, Horiuchi T, Komano T: Atpase activity and multimer formation of Pilq protein are required for thin pilus biogenesis in plasmid R64. J Biol Chem. 2001, 276: 17968-17975. 10.1074/jbc.M010652200.View ArticlePubMedGoogle Scholar
- Wolf YI, Rogozin IB, Kondrashov AS, Koonin EV: Genome alignment, evolution of prokaryotic genome organization, and prediction of gene function using genomic context. Genome Res. 2001, 11: 356-372. 10.1101/gr.GR-1619R.View ArticlePubMedGoogle Scholar
- Wadsworth C: A simple stained protein assay (SPA) for rapid total protein determination applied to spinal fluids. Scand J Immunol. 1976, 5: 1209-1213.View ArticlePubMedGoogle Scholar
- Laemmli UK: Cleavage of structural proteins during the assembly of the head of bacteriophage T4. Nature. 1970, 227: 680-685.View ArticlePubMedGoogle Scholar
- Cleland JL, Wang DI: Transient association of the first intermediate during the refolding of bovine carbonic anhydrase B. Biotechnol Prog. 1992, 8: 97-103.View ArticlePubMedGoogle Scholar
- Levi V, Gonzalez Flecha FL: Reversible fast-dimerization of bovine serum albumin detected by fluorescence resonance energy transfer. Biochim Biophys Acta. 2002, 1599: 141-148. 10.1016/S1570-9639(02)00414-4.View ArticlePubMedGoogle Scholar
- Bartholmes P, Kirschner K, Gschwind HP: Cooperative and noncooperative binding of pyridoxal 5'-phosphate to tryptophan synthase from Escherichia coli. Biochemistry. 1976, 15: 4712-4717.View ArticlePubMedGoogle Scholar
- Reichard K, Kaufmann M: EPPS: mining the COG database by an extended phylogenetic patterns search. Bioinformatics. 2003, 19: 784-785. 10.1093/bioinformatics/btg089.View ArticlePubMedGoogle Scholar
- Universität Witten/Herdecke: EPPS. [http://www.uni-wh.de/epps]
- ClustalW. [http://www.ebi.ac.uk/clustalw/]
- Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680.PubMed CentralView ArticlePubMedGoogle Scholar
- BoxShade Server. [http://www.ch.embnet.org/software/BOX_form.html]
- NCBI BLAST Home Page. [http://www.ncbi.nlm.nih.gov/BLAST/]
This article is published under license to BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.