|
|
||||||||
1 Information Génomique et Structurale, CNRS UPR2589, Case 934, 163 Avenue de Luminy, 13288 Marseille cedex 09, France
2 Unité des rickettsies, IFR 48, CNRS UMR 6020, Faculté de Médecine, Université de la Méditerranée, 27 Boulevard Jean Moulin, 13385 Marseille cedex 05, France
Correspondence
Pierre-Edouard Fournier
Pierre-Edouard.Fournier{at}medecine.univ-mrs.fr
Determination of the DNA G+C content of prokaryotic genomes using traditional methods is time-consuming and results may vary from laboratory to laboratory, depending on the technique used. We explored the possibility of extrapolating the genomic DNA G+C content of prokaryotes from gene sequences. For this, 127 universally conserved genes were studied from 50 prokaryotic genomes in the Clusters of Orthologous Groups database. Of these, 57 genes were present as a single copy in the genomes of 157 different prokaryote species available in GenBank. There was a strong correlation [coefficient of determination (r2) >95 %] between the DNA G+C contents of 20 genes and their corresponding genomes. For each of the 157 prokaryotic genomes studied, the DNA G+C content of the 20 genes was used to determine a calculated genome DNA G+C content (CGC) and this value was compared with the real genome DNA G+C content (RGC). In order to select the most suitable gene for the determination of CGC values, we compared the r2 and median mol% difference between CGC and RGC as well as the sensitivity of each gene to provide CGC values for prokaryotic genomes that differ by less than 5 mol% from their RGC. The highly conserved ftsY gene (median size 1144 nucleotides), a vertically inherited member of the GTPase superfamily, showed the highest r2 value of 0.98, the smallest median mol% difference between CGC and RGC of 1.06 and a sensitivity of 100 %. Using ftsY DNA G+C content values, the CGC values of 100 genomes not included in the calculation of r2 differed by less than 5 mol% from their RGC values. These data suggest that the genomic DNA G+C content of prokaryotes may be estimated easily and reliably from the ftsY gene sequence.
Published online ahead of print on 31 December 2005 as DOI 10.1099/ijs.0.63903-0.
Tables detailing the prokaryotic species for which complete genome sequences are available in GenBank, the differences between RGC and CGC values obtained in the study for 100 prokaryotic genomes and a list of genes conserved in prokaryotic genomes in the COG database are available as supplementary data in IJSEM Online.
This article has been cited by other articles:
![]() |
T. Adekambi, T. M. Shinnick, D. Raoult, and M. Drancourt Complete rpoB gene sequencing as a suitable supplement to DNA-DNA hybridization for bacterial species and genus delineation Int J Syst Evol Microbiol, August 1, 2008; 58(8): 1807 - 1814. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Mediannikov, K. Matsumoto, I. Samoylenko, M. Drancourt, V. Roux, E. Rydkina, B. Davoust, I. Tarasevich, P. Brouqui, and P.-E. Fournier Rickettsia raoultii sp. nov., a spotted fever group rickettsia associated with Dermacentor ticks in Europe and Russia Int J Syst Evol Microbiol, July 1, 2008; 58(7): 1635 - 1639. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Christensen, P. Kuhnert, H.-J. Busse, W. C. Frederiksen, and M. Bisgaard Proposed minimal standards for the description of genera, species and subspecies of the Pasteurellaceae Int J Syst Evol Microbiol, January 1, 2007; 57(1): 166 - 178. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| INT J SYST EVOL MICROBIOL | MICROBIOLOGY | J GEN VIROL |
| J MED MICROBIOL | ALL SGM JOURNALS | |