|
List of sequenced eukaryotic genomes
|
This list of sequenced eukaryotic genomes contains all the eukaryotes known to have publicly available complete nuclear and organelle genome sequences that have been assembled, annotated and published; draft genomes are not included, nor are organelle only sequences.
DNA was first sequenced in 1977. The first free-living organism to have its genome completely sequenced was the bacterium Haemophilus influenzae, in 1995. In 1996 Saccharomyces cerevisiae (baker's yeast) was the first eukaryote genome sequence to be released and in 1998 the first genome sequence for a multicellular eukaryote, Caenorhabditis elegans, was released.
Protists
Chromista
The Chromista are a group of protists that contains the algal phyla Heterokontophyta, Haptophyta and Cryptophyta. Members of this group are mostly studied for evolutionary interest.
Alveolata
Alveolata are a group of protists which includes the Ciliophora, Apicomplexa and Dinoflagellata. Members of this group are of particular interest to science as the cause of serious human and livestock diseases.
Excavata
Excavata is a group of related free living and symbiotic protists; it includes the Metamonada, Loukozoa, Euglenozoa and Percolozoa. They are researched for their role in human disease.
Amoebozoa
Amoebozoa are a group of motile amoeboid protists, members of this group move or feed by means of temporary projections, called pseudopods. The best known member of this group is the slime mold which has been studied for centuries; other members include the Archamoebae, Tubulinea and Flabellinea. Some Amoeboza cause disease.
| Organism |
Type |
Relevance |
Genome size |
Number of genes predicted |
Organization |
Year of completion |
Dictyostelium discoideum
Strain:AX4 |
Slime mold |
Model organism |
34 Mb |
12,500[15] |
Consortium from University of Cologne, Baylor College of Medicine and the Sanger Centre |
2005[15] |
Entamoeba histolytica
HM1:IMSS |
Parasitic protozoan |
Human pathogen (amoebic dysentery) |
23.8 Mb |
9,938[16] |
TIGR, Sanger Institute and the London School of Hygiene and Tropical Medicine |
2005[16] |
Plants
| Organism |
Type |
Relevance |
Genome size |
Number of genes predicted |
Organization |
Year of completion |
Arabidopsis thaliana
Ecotype:Columbia |
Wild mustard |
Model plant |
120 Mb |
25,498[17] |
Arabidopsis Genome Initiative[18] |
2000[17] |
Cyanidioschyzon merolae
Strain:10D |
Red alga |
Simple eukaryote |
16.5 Mb |
5,331[19] |
University of Tokyo, Rikkyo University, Saitama University and Kumamoto University |
2004[19] |
Oryza sativa
ssp indica |
Rice |
Crop and model organism |
420 Mb |
32-50,000[20] |
Beijing Genomics Institute, Zhejiang University and the Chinese Academy of Sciences |
2002[20] |
Oryza sativa
ssp japonica |
Rice |
Crop and model organism |
466 Mb |
46,022-55,615[21] |
Syngenta and Myriad Genetics |
2002[21] |
| Ostreococcus tauri |
Green alga |
Simple eukaryote |
12.6 Mb |
|
Laboratoire Arago |
2006[22] |
| Physcomitrella patens |
Bryophyte |
Model organism
early diverging land plant
|
500 Mb |
39,458[23] |
US Department of Energy Office of Science Joint Genome Institute |
2008[23] |
| Populus trichocarpa |
Balsam poplar or Black Cottonwood |
Carbon sequestration, model tree, commercial use (timber), and comparison to A. thaliana |
550 Mb |
45,555[24] |
The International Poplar Genome Consortium |
2006[24] |
| Vitis vinifera |
Grapevine PN40024 |
Fruit crop |
490 Mb[25] |
30,434[25] |
The French-Italian Public Consortium for Grapevine Genome Characterization |
2007[25] |
Fungi
| Organism |
Type |
Relevance |
Genome size |
Number of genes predicted |
Organization |
Year of completion |
Ashbya gossypii
Strain:ATCC 10895 |
Fungus |
Plant pathogen |
9.2 Mb |
4,718[26] |
SyngentaAG and University of Basel |
2004[26] |
Aspergillus fumigatus
Strain:Af293 |
Fungus |
Human pathogen |
29.4 Mb |
9,926[27] |
Sanger Institute, University of Manchester, TIGR, Institut Pasteur, Nagasaki University, University of Salamanca and OpGen |
2005[27] |
Aspergillus nidulans
Strain:FGSC A4 |
Fungus |
Model organism |
30 Mb |
9,500[28] |
|
2005[28] |
Aspergillus niger
Strain:CBS 513.88 |
Fungus |
Biotechnology - fermentation |
33.9 Mb |
14,165[29] |
|
2007[29] |
Aspergillus oryzae
Strain:RIB40 |
Fungus |
Used to ferment soy |
37 Mb |
12,074[30] |
National Institute of Technology and Evaluation |
2005[30] |
Candida glabrata
Strain:CBS138 |
Fungus |
Human pathogen |
12.3 Mb |
5,283[31] |
Génolevures Consortium [32] |
2004[31] |
Cryptococcus (Filobasidiella) neoformans
JEC21 |
Fungus |
Human pathogen |
20 Mb |
6,500[33] |
TIGR and Stanford University |
2005[33] |
Debaryomyces hansenii
Strain:CBS767 |
Yeast |
Cheese ripening |
12.2 Mb |
6,906[31] |
Génolevures Consortium |
2004[31] |
| Encephalitozoon cuniculi |
Microsporidium |
Human pathogen |
2.9 Mb |
1,997[34] |
Genoscope and Université Blaise Pascal |
2001[34] |
Kluyveromyces lactis
Strain:CLIB210 |
Yeast |
|
10-12 Mb |
5,329[31] |
Génolevures Consortium |
2004[31] |
| Magnaporthe grisea |
Fungus |
Plant pathogen |
37.8 Mb |
11,109[35] |
|
2005[35] |
| Neurospora crassa |
Fungus |
Model eukaryote |
40 Mb |
10,082[28] |
Broad Institute, Oregon Health and Science University, University of Kentucky, and the University of Kansas |
2003[28] |
Saccharomyces cerevisiae
Strain:S288C |
Baker's yeast |
Model eukaryote |
12.1 Mb |
6,294[36] |
International Collaboration for the Yeast Genome Sequencing[37] |
1996[36] |
Schizosaccharomyces pombe
Strain:972h |
Yeast |
Model eukaryote |
14 Mb |
4,824[38] |
Sanger Institute and Cold Spring Harbor Laboratory |
2002[38] |
Yarrowia lipolytica
Strain:CLIB99 |
Yeast |
Industrial uses |
20 Mb |
6,703[31] |
Génolevures Consortium |
2004[31] |
Mammals
| Organism |
Type |
Shotgun Coverage |
Genome size |
Number of genes predicted |
Organization |
Year of completion |
| Bos taurus |
Cow |
6* |
3.0 Gb[39] |
|
Cattle Genome Sequencing International Consortium |
| Canis lupus familiaris |
Dog |
7.6* |
2.4 Gb[40] |
19,300[40] |
Broad Institute and Agencourt Bioscience |
2005[40] |
| Cavia porcellus |
Guinea Pig |
2* |
3.4 Gb |
|
The Genome Sequencing Platform, The Genome Assembly Team[39] |
| Dasypus novemcinctus |
Nine-banded Armadillo |
2* [41] |
3.0 Gb |
|
Broad Institute[39] |
| Echinops telfairi |
Hedgehog-Tenrec |
2* [41] |
|
|
Broad Institute |
| Equus caballus |
Horse |
6.8* |
2.1 Gb [39] |
|
Broad Institute et al.[39] |
2007 [42] |
| Erinaceus europaeus |
Western European Hedgehog |
2* [41] |
|
|
Broad Institute |
| Felis catus |
Cat |
2* |
3 Gb |
20,285 |
The Genome Sequencing Platform, The Genome Assembly Team[39] |
2007[43] |
| Homo sapiens |
Human |
|
3.2 Gb [44] |
25,000[44] |
Human Genome Project Consortium and Celera Genomics |
Draft 2001[45][46]
Complete 2006[47] |
| Loxodonta africana |
African Elephant |
2* [41] |
3 Gb |
|
Broad Institute |
| Macaca mulatta |
Rhesus Macaque |
6* |
|
|
Macaque Genome Sequencing Consortium[39] |
| Microcebus murinus |
Gray Mouse Lemur |
2* [41] |
|
|
The Genome Sequencing Platform, The Genome Assembly Team[39] |
| Monodelphis domestica |
Gray Short-tailed Opossum |
|
3.475 Gb
(only 10% in Genbank)[48] |
18 - 20,000
(protein coding) |
Broad Institute et al. |
2007[49] |
| Mus musculus |
Mouse |
|
2.5 Gb |
24,174[50] |
International Collaboration for the Mouse Genome Sequencing[51] |
2002[50] |
| Myotis lucifugus |
Little Brown Bat |
2* [39] |
|
|
Broad Institute |
| Ochotona princeps |
American Pika |
2* [41] |
|
|
Broad Institute |
| Ornithorhynchus anatinus [48] |
Platypus |
6* [39] |
|
|
Washington University |
| Oryctolagus cuniculus |
Rabbit |
2* [41] |
2.5 Gb |
|
Broad Institute et al. [39] |
| Otolemur garnettii |
Small-eared Galago, or Bushbaby |
2* [41] |
|
|
Broad Institute |
| Pan troglodytes |
Chimpanzee |
6* [39] |
3.1 Gb |
|
Chimpanzee Sequencing and Analysis Consortium |
2005[52] |
| Pongo pygmaeus |
Orangutan |
|
3.0 Gb |
|
Institute for Molecular Biotechnology [39] |
| Rattus norvegicus |
Rat |
1.8* or better |
2.8 Gb [39] |
21,166[53] |
Rat Genome Sequencing Project Consortium |
2004[53] |
| Sorex araneus |
European Shrew |
2* [41] |
3.0 Gb [39] |
|
The Genome Sequencing Platform, The Genome Assembly Team[39] |
| Spermophilus tridecemlineatus |
Thirteen-lined Ground Squirrel |
2* |
|
|
The Genome Sequencing Platform, The Genome Assembly Team[39] |
| Tupaia belangeri |
Northern Tree Shrew |
2* |
|
|
Broad Institute[39] |
Other Animals
| Organism |
Type |
Relevance |
Genome size |
Number of genes predicted |
Organization |
Year of completion |
Anopheles gambiae
Strain: PEST |
Mosquito |
Vector of malaria |
27.8 Mb |
13,683[54] |
Celera Genomics and Genoscope |
2002[54] |
| Apis mellifera |
Honey bee |
|
1.8 Gb |
10,157[55] |
The Honeybee Genome Sequencing Consortium |
2006[55] |
Bombyx mori
Strain:p50T |
Moth (domestic silk worm) |
Silk production |
530 Mb |
|
University of Tokyo and National Institute of Agrobiological Sciences |
2004[56] |
| Caenorhabditis briggsae |
Nematode worm |
For comparison with C. elegans |
104 Mb |
19,500[57] |
Washington University, Sanger Institute and Cold Spring Harbor Laboratory |
2003[57] |
Caenorhabditis elegans
Strain:Bristol N2 |
Nematode worm |
Model animal |
97 Mb |
19,000[58] |
Washington University and the Sanger Institute |
1998[58] |
| Ciona intestinalis |
Tunicate |
Simple chordate |
116.7 Mb |
16,000[59] |
Joint Genome Institute |
2003[59] |
| Ciona savignyi |
Tunicate |
|
174 Mb |
|
Broad Institute |
2007[60] |
| Drosophila melanogaster |
Fruit fly |
Model animal |
165 Mb |
13,600[61] |
Celera, UC Berkeley, Baylor College of Medicine, European DGP |
2000[61] |
| Gallus gallus |
Chicken |
|
1 Gb |
20-23,000[62] |
International Chicken Genome Sequencing Consortium |
2004[62] |
| Strongylocentrotus purpuratus |
Sea urchin |
Model eukaryote |
814 Mb |
23,300[63] |
Sea Urchin Genome Sequencing Consortium |
2006[63] |
| Takifugu rubripes |
Puffer fish |
Vertebrate with small genome |
390 Mb |
22-29,000[64] |
International Fugu Genome Consortium[65] |
2002[66] |
| Tetraodon nigroviridis |
Puffer fish |
Vertebrate with compact genome |
340 Mb[67] |
22,400[67] |
Genoscope and the Broad Institute |
2004[67] |
See also
References
- ^ a b Douglas S, Zauner S, Fraunholz M, et al (April 2001). "The highly reduced genome of an enslaved algal nucleus". Nature 410 (6832): 1091–6. doi:10.1038/35074092. PMID 11323671.
- ^ a b Armbrust EV, Berges JA, Bowler C, et al (October 2004). "The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and metabolism". Science (journal) 306 (5693): 79–86. doi:10.1126/science.1101156. PMID 15459382.
- ^ a b Xu P, Widmer G, Wang Y, et al (October 2004). "The genome of Cryptosporidium hominis". Nature 431 (7012): 1107–12. doi:10.1038/nature02977. PMID 15510150.
- ^ a b Abrahamsen MS, Templeton TJ, Enomoto S, et al (April 2004). "Complete genome sequence of the apicomplexan, Cryptosporidium parvum". Science (journal) 304 (5669): 441–5. doi:10.1126/science.1094786. PMID 15044751.
- ^ a b Aury JM, Jaillon O, Duret L, et al (November 2006). "Global trends of whole-genome duplications revealed by the ciliate Paramecium tetraurelia". Nature 444 (7116): 171–8. doi:10.1038/nature05230. PMID 17086204.
- ^ a b Gardner MJ, Hall N, Fung E, et al (October 2002). "Genome sequence of the human malaria parasite Plasmodium falciparum". Nature 419 (6906): 498–511. doi:10.1038/nature01097. PMID 12368864.
- ^ a b Carlton JM, Angiuoli SV, Suh BB, et al (October 2002). "Genome sequence and comparative analysis of the model rodent malaria parasite Plasmodium yoelii yoelii". Nature 419 (6906): 512–9. doi:10.1038/nature01099. PMID 12368865.
- ^ a b Gardner MJ, Bishop R, Shah T, et al (July 2005). "Genome sequence of Theileria parva, a bovine pathogen that transforms lymphocytes". Science (journal) 309 (5731): 134–7. doi:10.1126/science.1110439. PMID 15994558.
- ^ Pain A, Renauld H, Berriman M, et al (July 2005). "Genome of the host-cell transforming parasite Theileria annulata compared with T. parva". Science (journal) 309 (5731): 131–3. doi:10.1126/science.1110418. PMID 15994557.
- ^ a b Eisen JA, Coyne RS, Wu M, et al (September 2006). "Macronuclear genome sequence of the ciliate Tetrahymena thermophila, a model eukaryote". PLoS Biol. 4 (9): e286. doi:10.1371/journal.pbio.0040286. PMID 16933976.
- ^ a b Ivens AC, Peacock CS, Worthey EA, et al (July 2005). "The genome of the kinetoplastid parasite, Leishmania major". Science (journal) 309 (5733): 436–42. doi:10.1126/science.1112680. PMID 16020728.
- ^ a b Carlton JM, Hirt RP, Silva JC, et al (January 2007). "Draft genome sequence of the sexually transmitted pathogen Trichomonas vaginalis". Science (journal) 315 (5809): 207–12. doi:10.1126/science.1132894. PMID 17218520.
- ^ a b Berriman M, Ghedin E, Hertz-Fowler C, et al (July 2005). "The genome of the African trypanosome Trypanosoma brucei". Science (journal) 309 (5733): 416–22. doi:10.1126/science.1112642. PMID 16020726.
- ^ a b El-Sayed NM, Myler PJ, Bartholomeu DC, et al (July 2005). "The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas disease". Science (journal) 309 (5733): 409–15. doi:10.1126/science.1112631. PMID 16020725.
- ^ a b Eichinger L, Pachebat JA, Glöckner G, et al (May 2005). "The genome of the social amoeba Dictyostelium discoideum". Nature 435 (7038): 43–57. doi:10.1038/nature03481. PMID 15875012.
- ^ a b Loftus B, Anderson I, Davies R, et al (February 2005). "The genome of the protist parasite Entamoeba histolytica". Nature 433 (7028): 865–8. doi:10.1038/nature03291. PMID 15729342.
- ^ a b The Arabidopsis Genome Initiative, (December 2000). "Analysis of the genome sequence of the flowering plant Arabidopsis thaliana". Nature 408 (6814): 796–815. doi:10.1038/35048692. PMID 11130711.
- ^ Arabidopsis Genome Initiative
- ^ a b Matsuzaki M, Misumi O, Shin-I T, et al (April 2004). "Genome sequence of the ultrasmall unicellular red alga Cyanidioschyzon merolae 10D". Nature 428 (6983): 653–7. doi:10.1038/nature02398. PMID 15071595.
- ^ a b Goff SA, Ricke D, Lan TH, et al (April 2002). "A draft sequence of the rice genome (Oryza sativa L. ssp. japonica)". Science (journal) 296 (5565): 92–100. doi:10.1126/science.1068275. PMID 11935018.
- ^ a b Yu J, Hu S, Wang J, et al (April 2002). "A draft sequence of the rice genome (Oryza sativa L. ssp. indica)". Science (journal) 296 (5565): 79–92. doi:10.1126/science.1068037. PMID 11935017.
- ^ Derelle E, Ferraz C, Rombauts S, et al (August 2006). "Genome analysis of the smallest free-living eukaryote Ostreococcus tauri unveils many unique features". Proc. Natl. Acad. Sci. U.S.A. 103 (31): 11647–52. doi:10.1073/pnas.0604795103. PMID 16868079.
- ^ a b Rensing SA, Lang D, Zimmer AD, et al (January 2008). "The Physcomitrella genome reveals evolutionary insights into the conquest of land by plants". Science (journal) 319 (5859): 64–9. doi:10.1126/science.1150646. PMID 18079367.
- ^ a b Tuskan GA, Difazio S, Jansson S, et al (September 2006). "The genome of black cottonwood, Populus trichocarpa (Torr. & Gray)". Science (journal) 313 (5793): 1596–604. doi:10.1126/science.1128691. PMID 16973872.
- ^ a b c Jaillon O, Aury JM, Noel B, et al (September 2007). "The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla". Nature 449 (7161): 463–7. doi:10.1038/nature06148. PMID 17721507.
- ^ a b Dietrich FS, Voegeli S, Brachat S, et al (April 2004). "The Ashbya gossypii genome as a tool for mapping the ancient Saccharomyces cerevisiae genome". Science (journal) 304 (5668): 304–7. doi:10.1126/science.1095781. PMID 15001715.
- ^ a b Nierman WC, Pain A, Anderson MJ, et al (December 2005). "Genomic sequence of the pathogenic and allergenic filamentous fungus Aspergillus fumigatus". Nature 438 (7071): 1151–6. doi:10.1038/nature04332. PMID 16372009.
- ^ a b c d Galagan JE, Calvo SE, Cuomo C, et al (December 2005). "Sequencing of Aspergillus nidulans and comparative analysis with A. fumigatus and A. oryzae". Nature 438 (7071): 1105–15. doi:10.1038/nature04341. PMID 16372000.
- ^ a b Pel HJ, de Winde JH, Archer DB, et al (February 2007). "Genome sequencing and analysis of the versatile cell factory Aspergillus niger CBS 513.88". Nat. Biotechnol. 25 (2): 221–31. doi:10.1038/nbt1282. PMID 17259976.
- ^ a b Machida M, Asai K, Sano M, et al (December 2005). "Genome sequencing and analysis of Aspergillus oryzae". Nature 438 (7071): 1157–61. doi:10.1038/nature04300. PMID 16372010.
- ^ a b c d e f g h Dujon B, Sherman D, Fischer G, et al (July 2004). "Genome evolution in yeasts". Nature 430 (6995): 35–44. doi:10.1038/nature02579. PMID 15229592.
- ^ About Génolevures
- ^ a b Loftus BJ, Fung E, Roncaglia P, et al (February 2005). "The genome of the basidiomycetous yeast and human pathogen Cryptococcus neoformans". Science (journal) 307 (5713): 1321–4. doi:10.1126/science.1103773. PMID 15653466.
- ^ a b Katinka MD, Duprat S, Cornillot E, et al (November 2001). "Genome sequence and gene compaction of the eukaryote parasite Encephalitozoon cuniculi". Nature 414 (6862): 450–3. doi:10.1038/35106579. PMID 11719806.
- ^ a b Dean RA, Talbot NJ, Ebbole DJ, et al (April 2005). "The genome sequence of the rice blast fungus Magnaporthe grisea". Nature 434 (7036): 980–6. doi:10.1038/nature03449. PMID 15846337.
- ^ a b Goffeau A, Barrell BG, Bussey H, et al (October 1996). "Life with 6000 genes". Science (journal) 274 (5287): 546, 563–7. PMID 8849441.
- ^ International Collaboration for the Yeast Genome Sequencing
- ^ a b Wood V, Gwilliam R, Rajandream MA, et al (February 2002). "The genome sequence of Schizosaccharomyces pombe". Nature 415 (6874): 871–80. doi:10.1038/nature724. PMID 11859360.
- ^ a b c d e f g h i j k l m n o p q r PubMed Home
- ^ a b c Lindblad-Toh K, Wade CM, Mikkelsen TS, et al (December 2005). "Genome sequence, comparative analysis and haplotype structure of the domestic dog". Nature 438 (7069): 803–19. doi:10.1038/nature04338. PMID 16341006.
- ^ a b c d e f g h i Mammalian Genome Project - Broad
- ^ Horse Genome Assembled, February 7, 2007 News Release - National Institutes of Health (NIH)
- ^ Pontius JU, Mullikin JC, Smith DR, et al (November 2007). "Initial sequence and comparative analysis of the cat genome". Genome Res. 17 (11): 1675–89. doi:10.1101/gr.6380007. PMID 17975172.
- ^ a b Human Genome Sequencing Consortium, International (October 2004). "Finishing the euchromatic sequence of the human genome". Nature 431 (7011): 931–45. doi:10.1038/nature03001. PMID 15496913.
- ^ McPherson JD, Marra M, Hillier L, et al (February 2001). "A physical map of the human genome". Nature 409 (6822): 934–41. doi:10.1038/35057157. PMID 11237014.
- ^ Venter JC, Adams MD, Myers EW, et al (February 2001). "The sequence of the human genome". Science (journal) 291 (5507): 1304–51. doi:10.1126/science.1058040. PMID 11181995.
- ^ Gregory SG, Barlow KF, McLay KE, et al (May 2006). "The DNA sequence and biological annotation of human chromosome 1". Nature 441 (7091): 315–21. doi:10.1038/nature04727. PMID 16710414.
- ^ a b Whole Genome Shotgun sequencing project list
- ^ Mikkelsen TS, Wakefield MJ, Aken B, et al (May 2007). "Genome of the marsupial Monodelphis domestica reveals innovation in non-coding sequences". Nature 447 (7141): 167–77. doi:10.1038/nature05805. PMID 17495919.
- ^ a b Waterston RH, Lindblad-Toh K, Birney E, et al (December 2002). "Initial sequencing and comparative analysis of the mouse genome". Nature 420 (6915): 520–62. doi:10.1038/nature01262. PMID 12466850.
- ^ International Collaboration for the Mouse Genome Sequencing
- ^ Chimpanzee Sequencing and Analysis Consortium. (September 2005). "Initial sequence of the chimpanzee genome and comparison with the human genome". Nature 437 (7055): 69–87. doi:10.1038/nature04072. PMID 16136131.
- ^ a b Gibbs RA, Weinstock GM, Metzker ML, et al (April 2004). "Genome sequence of the Brown Norway rat yields insights into mammalian evolution". Nature 428 (6982): 493–521. doi:10.1038/nature02426. PMID 15057822.
- ^ a b Holt RA, Subramanian GM, Halpern A, et al (October 2002). "The genome sequence of the malaria mosquito Anopheles gambiae". Science (journal) 298 (5591): 129–49. doi:10.1126/science.1076181. PMID 12364791. H
- ^ a b Honeybee Genome Sequencing Consortium. (October 2006). "Insights into social insects from the genome of the honeybee Apis mellifera". Nature 443 (7114): 931–49. PMID 17073008.
- ^ Mita K, Kasahara M, Sasaki S, et al (February 2004). "
|