SPD3.k51.ForMGRAST
by undefined undefined, undefined .
Visibility private NCBI Project ID - ID 7b76dc45d26d676d343836313733332e33 GOLD ID - ENA Project ID ENA Library ID Static Link private metagenomes cannot be linked PubMed ID - Sample - Library -
The data set SPD3.k51.ForMGRAST was uploaded on 2019-09-30 at 14:19:02 and contains 67,077 sequences totaling 10,342,990 basepairs with an average length of 154 bps.
Of the sequences tested, 10,773 sequences (16.06%) failed to pass the QC pipeline. Of those, dereplication identified 6,655 sequences as artificial duplicate reads.
Of the sequences that passed QC, 707 sequences (2%) contain ribosomal RNA genes, 12,083 sequences (27.68%) contain predicted proteins with known functions, and 30,868 sequences (70.70%) contain predicted proteins with unknown function.
Data shown here is displayed as a quick way to assess the quality and contents of the data set. We note that the submitting authors may have performed their own analysis. The
analysis page provides the best way to perform in-depth analyses of this data set.
The data on this page represents the automated analysis generated by the MG-RAST automated processing pipeline. Details on the processing of this data set are in the
processing receipt .
failed QC - 10,773 (16.06%) unknown - 12,646 (18.85%) predicted feature - 43,658 (65.09%) unknown protein - 30,868 (70.70%) annotated protein - 12,083 (27.68%) ribosomal RNA - 707 (1.62%) Upload: bp Count 10,342,990 bp Upload: Sequences Count 67,077 Upload: Mean Sequence Length 154 ± 95 bp Upload: Mean GC percent 42 ± 7 % Artificial Duplicate Reads: Sequence Count 6,655 Post QC: bp Count 8,387,440 bp Post QC: Sequences Count 56,304 Post QC: Mean Sequence Length 149 ± 62 bp Post QC: Mean GC percent 42 ± 7 % Processed: Predicted Protein Features 38,817 Processed: Predicted rRNA Features 608 Alignment: Identified Protein Features 8,748 Alignment: Identified rRNA Features 464 Annotation: Identified Functional Categories undefined
Investigation Type WGS Study Name SPlab-run431-SPD3-asm Latitude and Longitude Country and/or Sea, Location Collection Date Environment (Biome) Environment (Feature) Environment (Material) Environmental Package Sequencing Method other More Metadata click for full table
The kmer rank abundance graph plots the kmer coverage as a function of abundance rank, with the most abundant sequences at the left.
undefined sequence size k-mer coverage 0 101 102 103 0 101 102 103 104 105 106 107
The graph below displays the number of hits in the different databases listed. These include protein databases, protein databases with functional hierarchy information, and ribosomal RNA databases. The bars representing annotated reads are colored by e-value range. Different databases have different numbers of hits but can also have different types of annotation data.
583 1,437 3,157 1,823 794 506 1,253 2,651 1,445 612 635 1,560 3,647 2,277 962 369 785 1,739 984 378 507 1,253 2,960 1,771 750 623 1,524 3,455 2,161 887 159 397 810 426 166 501 1,221 2,566 1,398 552 413 940 2,245 1,315 518 254 564 1,212 676 242 411 915 2,138 1,221 468 36 9 10 1 0 336 66 19 3 0 174 29 17 4 0 1 16 42 48 18 35 4 1 0 0 0 1,000 2,000 3,000 4,000 5,000 6,000 7,000 8,000 9,000 10,000 RefSeq IMG TrEMBL Subsystems KEGG GenBank SwissProt PATRIC eggNOG KO GO COG RDP LSU SSU NOG e^-30 & less e^-20 to e^-30 e^-10 to e^-20 e^-5 to e^-10 e^-3 to e^-5
The pie charts below illustrate the distribution of functional categories for COGs, KOs, NOGs, and Subsystems at the highest level supported by these functional hierarchies. Each slice indicates the percentage of reads with predicted protein functions annotated to the category for the given source.
METABOLISM - 2,213 (37.58%) INFORMATION STORAGE AND PROCESSING - 1,490 (25.30%) CELLULAR PROCESSES AND SIGNALING - 1,134 (19.26%) POORLY CHARACTERIZED - 1,052 (17.86%) POORLY CHARACTERIZED - 83 (62.41%) METABOLISM - 23 (17.29%) INFORMATION STORAGE AND PROCESSING - 15 (11.28%) CELLULAR PROCESSES AND SIGNALING - 12 (9.02%) Metabolism - 2,425 (58.31%) Genetic Information Processing - 1,018 (24.48%) Environmental Information Processing - 497 (11.95%) Cellular Processes - 123 (2.96%) Human Diseases - 79 (1.90%) Organismal Systems - 17 (0.41%) Clustering-based subsystems - 1,962 (15.93%) Carbohydrates - 1,409 (11.44%) Miscellaneous - 1,099 (8.92%) Protein Metabolism - 1,014 (8.23%) Amino Acids and Derivatives - 931 (7.56%) RNA Metabolism - 928 (7.53%) DNA Metabolism - 703 (5.71%) Cofactors, Vitamins, Prosthetic Groups, Pigments - 659 (5.35%) Nucleosides and Nucleotides - 566 (4.60%) Cell Wall and Capsule - 562 (4.56%) Fatty Acids, Lipids, and Isoprenoids - 335 (2.72%) Virulence, Disease and Defense - 282 (2.29%) Membrane Transport - 260 (2.11%) Cell Division and Cell Cycle - 242 (1.96%) Regulation and Cell signaling - 236 (1.92%) Stress Response - 232 (1.88%) Phages, Prophages, Transposable elements, Plasmids - 198 (1.61%) Respiration - 168 (1.36%) Phosphorus Metabolism - 147 (1.19%) Iron acquisition and metabolism - 88 (0.71%) Motility and Chemotaxis - 59 (0.48%) Metabolism of Aromatic Compounds - 56 (0.45%) Sulfur Metabolism - 47 (0.38%) Dormancy and Sporulation - 33 (0.27%) Photosynthesis - 29 (0.24%) Potassium metabolism - 28 (0.23%) Nitrogen Metabolism - 23 (0.19%) Secondary Metabolism - 21 (0.17%) The charts below represent the distribution of taxa using a
contigLCA algorithm finding a single consensus taxonomic entity for all features on each individual sequence.
Bacteria - 6,554 (63.64%) Eukaryota - 3,724 (36.16%) Viruses - 18 (0.17%) Archaea - 1 (0.01%) unclassified sequences - 1 (0.01%) Firmicutes - 4,593 (47.55%) Streptophyta - 3,442 (35.63%) Proteobacteria - 1,141 (11.81%) Actinobacteria - 136 (1.41%) Deinococcus-Thermus - 136 (1.41%) Bacteroidetes - 42 (0.43%) Arthropoda - 40 (0.41%) Chordata - 25 (0.26%) Basidiomycota - 20 (0.21%) unclassified (derived from Viruses) - 18 (0.19%) Acidobacteria - 9 (0.09%) Tenericutes - 8 (0.08%) Ascomycota - 7 (0.07%) Fusobacteria - 6 (0.06%) Cyanobacteria - 4 (0.04%) Spirochaetes - 4 (0.04%) Chlorobi - 4 (0.04%) Aquificae - 4 (0.04%) Verrucomicrobia - 3 (0.03%) unclassified (derived from Eukaryota) - 3 (0.03%) Platyhelminthes - 2 (0.02%) Chloroflexi - 2 (0.02%) Synergistetes - 1 (0.01%) Gemmatimonadetes - 1 (0.01%) Bacillariophyta - 1 (0.01%) Planctomycetes - 1 (0.01%) Euryarchaeota - 1 (0.01%) Cnidaria - 1 (0.01%) unclassified (derived from Bacteria) - 1 (0.01%) Annelida - 1 (0.01%) Apicomplexa - 1 (0.01%) Chrysiogenetes - 1 (0.01%) unclassified (derived from unclassified sequences) - 1 (0.01%) Bacilli - 4,312 (53.34%) unclassified (derived from Streptophyta) - 1,843 (22.80%) Gammaproteobacteria - 507 (6.27%) Alphaproteobacteria - 415 (5.13%) Liliopsida - 342 (4.23%) Actinobacteria (class) - 136 (1.68%) Deinococci - 136 (1.68%) Betaproteobacteria - 100 (1.24%) Clostridia - 71 (0.88%) Insecta - 38 (0.47%) Mammalia - 22 (0.27%) unclassified (derived from Viruses) - 18 (0.22%) Exobasidiomycetes - 17 (0.21%) Bacteroidia - 12 (0.15%) Negativicutes - 11 (0.14%) Deltaproteobacteria - 9 (0.11%) Mollicutes - 8 (0.10%) Flavobacteria - 8 (0.10%) Fusobacteria (class) - 6 (0.07%) Sphingobacteria - 6 (0.07%) Erysipelotrichi - 5 (0.06%) unclassified (derived from Cyanobacteria) - 4 (0.05%) Chlorobia - 4 (0.05%) Spirochaetes (class) - 4 (0.05%) Aquificae (class) - 4 (0.05%) Sordariomycetes - 4 (0.05%) Acidobacteria (class) - 4 (0.05%) Bryopsida - 4 (0.05%) Solibacteres - 3 (0.04%) Cestoda - 2 (0.02%) Cytophagia - 2 (0.02%) Coniferopsida - 2 (0.02%) Chloroflexi (class) - 2 (0.02%) unclassified (derived from Eukaryota) - 2 (0.02%) unclassified (derived from Verrucomicrobia) - 2 (0.02%) Halobacteria - 1 (0.01%) Malacostraca - 1 (0.01%) Leotiomycetes - 1 (0.01%) Saccharomycetes - 1 (0.01%) Epsilonproteobacteria - 1 (0.01%) Bacillariophyceae - 1 (0.01%) Gemmatimonadetes (class) - 1 (0.01%) Actinopterygii - 1 (0.01%) Planctomycetacia - 1 (0.01%) Hydrozoa - 1 (0.01%) Synergistia - 1 (0.01%) unclassified (derived from Acidobacteria) - 1 (0.01%) Verrucomicrobiae - 1 (0.01%) Chrysiogenetes (class) - 1 (0.01%) unclassified (derived from Bacteria) - 1 (0.01%) Foraminifera - 1 (0.01%) Polychaeta - 1 (0.01%) Aconoidasida - 1 (0.01%) unclassified (derived from unclassified sequences) - 1 (0.01%) Lactobacillales - 3,734 (56.08%) Vitales - 528 (7.93%) Poales - 296 (4.45%) Pseudomonadales - 254 (3.81%) Rhodospirillales - 246 (3.69%) Bacillales - 210 (3.15%) Enterobacteriales - 176 (2.64%) Brassicales - 152 (2.28%) Thermales - 133 (2.00%) Actinomycetales - 125 (1.88%) Malpighiales - 102 (1.53%) Sphingomonadales - 79 (1.19%) Clostridiales - 68 (1.02%) Fabales - 55 (0.83%) Burkholderiales - 47 (0.71%) Asparagales - 31 (0.47%) Hymenoptera - 30 (0.45%) Caryophyllales - 27 (0.41%) Primates - 22 (0.33%) Neisseriales - 20 (0.30%) Lamiales - 19 (0.29%) Asterales - 18 (0.27%) Malasseziales - 17 (0.26%) Solanales - 15 (0.23%) Rhizobiales - 14 (0.21%) Bacteroidales - 12 (0.18%) Caudovirales - 12 (0.18%) Rhodocyclales - 11 (0.17%) Cucurbitales - 11 (0.17%) Selenomonadales - 11 (0.17%) Ranunculales - 10 (0.15%) Pasteurellales - 8 (0.12%) Fusobacteriales - 6 (0.09%) Rosales - 6 (0.09%) unclassified (derived from Viruses) - 6 (0.09%) Flavobacteriales - 6 (0.09%) Bifidobacteriales - 6 (0.09%) Sphingobacteriales - 6 (0.09%) Vibrionales - 5 (0.08%) Alteromonadales - 5 (0.08%) Erysipelotrichales - 5 (0.08%) Chlorobiales - 4 (0.06%) Acidobacteriales - 4 (0.06%) Hypocreales - 4 (0.06%) Chroococcales - 4 (0.06%) Spirochaetales - 4 (0.06%) Aquificales - 4 (0.06%) Rhodobacterales - 4 (0.06%) Acholeplasmatales - 4 (0.06%) Funariales - 4 (0.06%) Caulobacterales - 3 (0.05%) Solibacterales - 3 (0.05%) Diptera - 3 (0.05%) Alismatales - 2 (0.03%) unclassified (derived from Gammaproteobacteria) - 2 (0.03%) Entomoplasmatales - 2 (0.03%) Myxococcales - 2 (0.03%) Desulfuromonadales - 2 (0.03%) Mycoplasmatales - 2 (0.03%) Methylacidiphilales - 2 (0.03%) Legionellales - 2 (0.03%) Deinococcales - 2 (0.03%) Cyclophyllidea - 2 (0.03%) Malvales - 2 (0.03%) Chloroflexales - 2 (0.03%) Xanthomonadales - 2 (0.03%) unclassified (derived from Flavobacteria) - 2 (0.03%) Cytophagales - 2 (0.03%) Coniferales - 2 (0.03%) Chromatiales - 2 (0.03%) Saccharomycetales - 1 (0.02%) Gallionellales - 1 (0.02%) Naviculales - 1 (0.02%) Oceanospirillales - 1 (0.02%) Tetraodontiformes - 1 (0.02%) Methylophilales - 1 (0.02%) Hydroida - 1 (0.02%) Thermoanaerobacterales - 1 (0.02%) Thiotrichales - 1 (0.02%) Nitrosomonadales - 1 (0.02%) Parvularculales - 1 (0.02%) Bdellovibrionales - 1 (0.02%) Desulfobacterales - 1 (0.02%) Synergistales - 1 (0.02%) Halobacteriales - 1 (0.02%) unclassified (derived from Acidobacteria) - 1 (0.02%) Celastrales - 1 (0.02%) Campylobacterales - 1 (0.02%) Ericales - 1 (0.02%) Verrucomicrobiales - 1 (0.02%) Gentianales - 1 (0.02%) Chrysiogenales - 1 (0.02%) unclassified (derived from Bacteria) - 1 (0.02%) Phyllodocida - 1 (0.02%) Haemosporida - 1 (0.02%) Coleoptera - 1 (0.02%) Sapindales - 1 (0.02%) Gemmatimonadales - 1 (0.02%) Planctomycetales - 1 (0.02%) Rotaliida - 1 (0.02%) Kinetoplastida - 1 (0.02%) Peronosporales - 1 (0.02%) Decapoda - 1 (0.02%) Helotiales - 1 (0.02%) unclassified (derived from unclassified sequences) - 1 (0.02%) Lactobacillaceae - 2,495 (44.88%) Vitaceae - 528 (9.50%) Poaceae - 292 (5.25%) Acetobacteraceae - 239 (4.30%) Moraxellaceae - 233 (4.19%) Enterobacteriaceae - 176 (3.17%) Brassicaceae - 151 (2.72%) Thermaceae - 133 (2.39%) Leuconostocaceae - 113 (2.03%) Bacillaceae - 111 (2.00%) Streptococcaceae - 80 (1.44%) Propionibacteriaceae - 66 (1.19%) Sphingomonadaceae - 57 (1.03%) Salicaceae - 57 (1.03%) Fabaceae - 55 (0.99%) Enterococcaceae - 50 (0.90%) Staphylococcaceae - 45 (0.81%) Euphorbiaceae - 40 (0.72%) Comamonadaceae - 31 (0.56%) Asparagaceae - 30 (0.54%) Corynebacteriaceae - 28 (0.50%) Clostridiales Family XI. Incertae Sedis - 26 (0.47%) Apidae - 25 (0.45%) Listeriaceae - 24 (0.43%) Amaranthaceae - 24 (0.43%) Clostridiaceae - 21 (0.38%) Hominidae - 21 (0.38%) Neisseriaceae - 20 (0.36%) Malasseziaceae - 17 (0.31%) Asteraceae - 17 (0.31%) Pseudomonadaceae - 16 (0.29%) Solanaceae - 13 (0.23%) Cucurbitaceae - 11 (0.20%) Veillonellaceae - 11 (0.20%) Rhodocyclaceae - 11 (0.20%) Burkholderiaceae - 10 (0.18%) Pasteurellaceae - 8 (0.14%) Orobanchaceae - 8 (0.14%) Siphoviridae - 7 (0.13%) Carnobacteriaceae - 7 (0.13%) Papaveraceae - 6 (0.11%) unclassified (derived from Viruses) - 6 (0.11%) Plantaginaceae - 6 (0.11%) Flavobacteriaceae - 6 (0.11%) Bifidobacteriaceae - 6 (0.11%) Fusobacteriaceae - 6 (0.11%) Rosaceae - 5 (0.09%) Prevotellaceae - 5 (0.09%) Erysipelotrichaceae - 5 (0.09%) Vibrionaceae - 5 (0.09%) Aerococcaceae - 5 (0.09%) Eubacteriaceae - 4 (0.07%) Shewanellaceae - 4 (0.07%) Aquificaceae - 4 (0.07%) Myoviridae - 4 (0.07%) Micrococcaceae - 4 (0.07%) Sphingobacteriaceae - 4 (0.07%) Chlorobiaceae - 4 (0.07%) Acidobacteriaceae - 4 (0.07%) unclassified (derived from Chroococcales) - 4 (0.07%) Paenibacillaceae - 4 (0.07%) Spirochaetaceae - 4 (0.07%) Acholeplasmataceae - 4 (0.07%) Rhodobacteraceae - 4 (0.07%) Funariaceae - 4 (0.07%) Lachnospiraceae - 4 (0.07%) Clavicipitaceae - 3 (0.05%) Methylobacteriaceae - 3 (0.05%) Caryophyllaceae - 3 (0.05%) Peptococcaceae - 3 (0.05%) Caulobacteraceae - 3 (0.05%) Ranunculaceae - 3 (0.05%) Mycobacteriaceae - 3 (0.05%) Bradyrhizobiaceae - 3 (0.05%) Solibacteraceae - 3 (0.05%) Ruminococcaceae - 2 (0.04%) unclassified (derived from Gammaproteobacteria) - 2 (0.04%) Deinococcaceae - 2 (0.04%) Xanthomonadaceae - 2 (0.04%) Chloroflexaceae - 2 (0.04%) Culicidae - 2 (0.04%) Methylacidiphilaceae - 2 (0.04%) unclassified (derived from Clostridiales) - 2 (0.04%) unclassified (derived from Flavobacteria) - 2 (0.04%) Formicidae - 2 (0.04%) Bacteroidaceae - 2 (0.04%) Ectothiorhodospiraceae - 2 (0.04%) Nocardiaceae - 2 (0.04%) Mycoplasmataceae - 2 (0.04%) Geobacteraceae - 2 (0.04%) Convolvulaceae - 2 (0.04%) Pinaceae - 2 (0.04%) Araceae - 2 (0.04%) Spiroplasmataceae - 2 (0.04%) Gordoniaceae - 2 (0.04%) Desulfobacteraceae - 1 (0.02%) Streptomycetaceae - 1 (0.02%) Brucellaceae - 1 (0.02%) Synergistaceae - 1 (0.02%) Geodermatophilaceae - 1 (0.02%) Methylocystaceae - 1 (0.02%) Phyllobacteriaceae - 1 (0.02%) Verrucomicrobiaceae - 1 (0.02%) Rhizobiaceae - 1 (0.02%) unclassified (derived from Bacteria) - 1 (0.02%) Nereididae - 1 (0.02%) unclassified (derived from Haemosporida) - 1 (0.02%) Scarabaeidae - 1 (0.02%) Chrysiogenaceae - 1 (0.02%) Drosophilidae - 1 (0.02%) unclassified (derived from Acidobacteria) - 1 (0.02%) Erythrobacteraceae - 1 (0.02%) Sclerotiniaceae - 1 (0.02%) Metschnikowiaceae - 1 (0.02%) Microbacteriaceae - 1 (0.02%) Phaeodactylaceae - 1 (0.02%) Alcaligenaceae - 1 (0.02%) Tetraodontidae - 1 (0.02%) Porphyromonadaceae - 1 (0.02%) Hydridae - 1 (0.02%) Anoplocephalidae - 1 (0.02%) Taeniidae - 1 (0.02%) Actinomycetaceae - 1 (0.02%) Gallionellaceae - 1 (0.02%) Methylophilaceae - 1 (0.02%) Cytophagaceae - 1 (0.02%) Orchidaceae - 1 (0.02%) Nitrosomonadaceae - 1 (0.02%) Actinosynnemataceae - 1 (0.02%) Campanulaceae - 1 (0.02%) Bacteriovoracaceae - 1 (0.02%) Brevibacteriaceae - 1 (0.02%) Thermoanaerobacteraceae - 1 (0.02%) Celastraceae - 1 (0.02%) Myxococcaceae - 1 (0.02%) Helicobacteraceae - 1 (0.02%) Rubiaceae - 1 (0.02%) Alteromonadaceae - 1 (0.02%) unclassified (derived from Bacillales) - 1 (0.02%) Rhodothermaceae - 1 (0.02%) Halobacteriaceae - 1 (0.02%) Cistaceae - 1 (0.02%) Malvaceae - 1 (0.02%) Coxiellaceae - 1 (0.02%) Legionellaceae - 1 (0.02%) Oceanospirillaceae - 1 (0.02%) Urticaceae - 1 (0.02%) Rutaceae - 1 (0.02%) Gemmatimonadaceae - 1 (0.02%) Planctomycetaceae - 1 (0.02%) unclassified (derived from Sphingobacteriales) - 1 (0.02%) Nonionidae - 1 (0.02%) Trypanosomatidae - 1 (0.02%) unclassified (derived from Peronosporales) - 1 (0.02%) Piscirickettsiaceae - 1 (0.02%) Podoviridae - 1 (0.02%) Parvularculaceae - 1 (0.02%) Beijerinckiaceae - 1 (0.02%) unclassified (derived from unclassified sequences) - 1 (0.02%) Lactobacillus - 1,879 (41.52%) Vitis - 528 (11.67%) Acinetobacter - 222 (4.91%) Oryza - 160 (3.54%) Arabidopsis - 127 (2.81%) Meiothermus - 112 (2.48%) Gluconobacter - 81 (1.79%) Pediococcus - 70 (1.55%) Streptococcus - 66 (1.46%) Propionibacterium - 66 (1.46%) Zea - 61 (1.35%) Populus - 57 (1.26%) Leuconostoc - 55 (1.22%) Enterococcus - 50 (1.10%) Staphylococcus - 44 (0.97%) Bacillus - 36 (0.80%) Ricinus - 33 (0.73%) Geobacillus - 30 (0.66%) Asparagus - 30 (0.66%) Corynebacterium - 28 (0.62%) Medicago - 26 (0.57%) Apis - 25 (0.55%) Beta - 24 (0.53%) Listeria - 24 (0.53%) Novosphingobium - 22 (0.49%) Weissella - 21 (0.46%) Oenococcus - 20 (0.44%) Clostridium - 20 (0.44%) Gluconacetobacter - 17 (0.38%) Malassezia - 17 (0.38%) Homo - 17 (0.38%) Pseudomonas - 16 (0.35%) Sphingomonas - 14 (0.31%) Erwinia - 14 (0.31%) Delftia - 13 (0.29%) Cynara - 13 (0.29%) Neisseria - 12 (0.27%) Brassica - 11 (0.24%) Cucumis - 11 (0.24%) Solanum - 11 (0.24%) Lactococcus - 10 (0.22%) Pisum - 10 (0.22%) Finegoldia - 9 (0.20%) Arsenophonus - 9 (0.20%) Anaerococcus - 9 (0.20%) Acetobacter - 8 (0.18%) Burkholderia - 8 (0.18%) Thermus - 7 (0.15%) Euphorbia - 7 (0.15%) unclassified (derived from Siphoviridae) - 7 (0.15%) Glycine - 7 (0.15%) Vicia - 6 (0.13%) Bifidobacterium - 6 (0.13%) Anoxybacillus - 6 (0.13%) Veillonella - 6 (0.13%) Peptoniphilus - 6 (0.13%) unclassified (derived from Viruses) - 6 (0.13%) Sorghum - 6 (0.13%) Antirrhinum - 6 (0.13%) Papaver - 5 (0.11%) Granulicatella - 5 (0.11%) Prevotella - 5 (0.11%) Serratia - 4 (0.09%) Vibrio - 4 (0.09%) Shewanella - 4 (0.09%) Eubacterium - 4 (0.09%) Physcomitrella - 4 (0.09%) Aerococcus - 4 (0.09%) Pantoea - 4 (0.09%) Fusobacterium - 4 (0.09%) Borrelia - 3 (0.07%) Thauera - 3 (0.07%) Metarhizium - 3 (0.07%) Acidobacterium - 3 (0.07%) Candidatus Solibacter - 3 (0.07%) Catonella - 3 (0.07%) Photorhabdus - 3 (0.07%) unclassified (derived from Myoviridae) - 3 (0.07%) Methylobacterium - 3 (0.07%) Paenibacillus - 3 (0.07%) Silene - 3 (0.07%) Lysinibacillus - 3 (0.07%) Cicer - 3 (0.07%) Pedobacter - 3 (0.07%) Enhydrobacter - 3 (0.07%) Klebsiella - 3 (0.07%) Prunus - 3 (0.07%) Mycobacterium - 3 (0.07%) Mitsuokella - 2 (0.04%) Verminephrobacter - 2 (0.04%) Geobacter - 2 (0.04%) Providencia - 2 (0.04%) Turicibacter - 2 (0.04%) Azoarcus - 2 (0.04%) Lemna - 2 (0.04%) unclassified (derived from Clostridiales) - 2 (0.04%) Desulfotomaculum - 2 (0.04%) Yersinia - 2 (0.04%) Haemophilus - 2 (0.04%) Pasteurella - 2 (0.04%) Carnobacterium - 2 (0.04%) Variovorax - 2 (0.04%) Sphingopyxis - 2 (0.04%) Psychrobacter - 2 (0.04%) Caulobacter - 2 (0.04%) Escherichia - 2 (0.04%) Deinococcus - 2 (0.04%) Cyanothece - 2 (0.04%) Chloroflexus - 2 (0.04%) Chloroherpeton - 2 (0.04%) Phelipanche - 2 (0.04%) unclassified (derived from Gammaproteobacteria) - 2 (0.04%) unclassified (derived from Flavobacteria) - 2 (0.04%) Dechloromonas - 2 (0.04%) Bacteroides - 2 (0.04%) Candidatus Phytoplasma - 2 (0.04%) Malus - 2 (0.04%) Spiroplasma - 2 (0.04%) Mycoplasma - 2 (0.04%) Rothia - 2 (0.04%) Methylacidiphilum - 2 (0.04%) Gordonia - 2 (0.04%) Lutiella - 1 (0.02%) Polaromonas - 1 (0.02%) Chryseobacterium - 1 (0.02%) Flavobacterium - 1 (0.02%) Gallionella - 1 (0.02%) Methylovorus - 1 (0.02%) Kordia - 1 (0.02%) Polaribacter - 1 (0.02%) Simonsiella - 1 (0.02%) Nitrosomonas - 1 (0.02%) Aromatoleum - 1 (0.02%) Zunongwangia - 1 (0.02%) Rathayibacter - 1 (0.02%) Rhodothermus - 1 (0.02%) Bacteriovorax - 1 (0.02%) Desulfatibacillum - 1 (0.02%) Arthrobacter - 1 (0.02%) Anaeromyxobacter - 1 (0.02%) Helicobacter - 1 (0.02%) Alteromonas - 1 (0.02%) Sphingobacterium - 1 (0.02%) Alkalilimnicola - 1 (0.02%) Thioalkalivibrio - 1 (0.02%) Chitinophaga - 1 (0.02%) Citrobacter - 1 (0.02%) Enterobacter - 1 (0.02%) Alkaliphilus - 1 (0.02%) Chlorobium - 1 (0.02%) Micrococcus - 1 (0.02%) Actinomyces - 1 (0.02%) Pectobacterium - 1 (0.02%) Desulfurispirillum - 1 (0.02%) Proteus - 1 (0.02%) Actinosynnema - 1 (0.02%) Rahnella - 1 (0.02%) Microcystis - 1 (0.02%) Sodalis - 1 (0.02%) Coprococcus - 1 (0.02%) Rickettsiella - 1 (0.02%) Legionella - 1 (0.02%) Neptuniibacter - 1 (0.02%) Rhodococcus - 1 (0.02%) Faecalibacterium - 1 (0.02%) Ruminococcus - 1 (0.02%) Brevibacterium - 1 (0.02%) Carboxydothermus - 1 (0.02%) Coprobacillus - 1 (0.02%) Methylophaga - 1 (0.02%) Photobacterium - 1 (0.02%) Solobacterium - 1 (0.02%) Xanthomonas - 1 (0.02%) Streptomyces - 1 (0.02%) unclassified (derived from Erysipelotrichaceae) - 1 (0.02%) Aminomonas - 1 (0.02%) Haloterrigena - 1 (0.02%) Selenomonas - 1 (0.02%) Hydrogenobacter - 1 (0.02%) Verrucomicrobium - 1 (0.02%) Thermocrinis - 1 (0.02%) unclassified (derived from Bacteria) - 1 (0.02%) Platynereis - 1 (0.02%) Plasmodium - 1 (0.02%) Rhyparus - 1 (0.02%) Aedes - 1 (0.02%) Anopheles - 1 (0.02%) Drosophila - 1 (0.02%) Ilyobacter - 1 (0.02%) Sclerotinia - 1 (0.02%) Clavispora - 1 (0.02%) Streptobacillus - 1 (0.02%) Phaeodactylum - 1 (0.02%) Gemmatimonas - 1 (0.02%) Tetraodon - 1 (0.02%) Rhodopirellula - 1 (0.02%) Pongo - 1 (0.02%) Hydra - 1 (0.02%) Moniezia - 1 (0.02%) Echinococcus - 1 (0.02%) Geodermatophilus - 1 (0.02%) Picea - 1 (0.02%) Pinus - 1 (0.02%) Phenylobacterium - 1 (0.02%) Parvularcula - 1 (0.02%) Phalaenopsis - 1 (0.02%) Beijerinckia - 1 (0.02%) Phyllostachys - 1 (0.02%) Bradyrhizobium - 1 (0.02%) Triticum - 1 (0.02%) Rhodopseudomonas - 1 (0.02%) Ochrobactrum - 1 (0.02%) Helianthus - 1 (0.02%) Lactuca - 1 (0.02%) Trachelium - 1 (0.02%) Oceanobacillus - 1 (0.02%) Methylosinus - 1 (0.02%) Raphanus - 1 (0.02%) Parvibaculum - 1 (0.02%) Agrobacterium - 1 (0.02%) Oceanicola - 1 (0.02%) Roseobacter - 1 (0.02%) Porphyromonas - 1 (0.02%) Brevibacillus - 1 (0.02%) Phaseolus - 1 (0.02%) Candidatus Koribacter - 1 (0.02%) Trifolium - 1 (0.02%) Granulibacter - 1 (0.02%) Coffea - 1 (0.02%) Erythrobacter - 1 (0.02%) Citromicrobium - 1 (0.02%) Macrococcus - 1 (0.02%) Sphingobium - 1 (0.02%) Microscilla - 1 (0.02%) Helianthemum - 1 (0.02%) Gossypium - 1 (0.02%) Gemella - 1 (0.02%) Coptis - 1 (0.02%) Zymomonas - 1 (0.02%) Capnocytophaga - 1 (0.02%) Boehmeria - 1 (0.02%) Citrus - 1 (0.02%) Convolvulus - 1 (0.02%) Ipomoea - 1 (0.02%) Capsicum - 1 (0.02%) Lautropia - 1 (0.02%) Ralstonia - 1 (0.02%) Nonionella - 1 (0.02%) Leishmania - 1 (0.02%) Phytophthora - 1 (0.02%) T4-like viruses - 1 (0.02%) Acidovorax - 1 (0.02%) T7-like viruses - 1 (0.02%) Curvibacter - 1 (0.02%) Eremococcus - 1 (0.02%) unclassified (derived from unclassified sequences) - 1 (0.02%) The plots below show the taxonomic abundances ordered from the most abundant to least abundant. Only the top 50 most abundant are shown. The y-axis plots the abundances of annotations on a log scale.
The rank abundance chart is a tool for visually representing taxonomic richness and evenness.
taxon abundance Bacteria: 6554 Eukaryota: 3724 Viruses: 18 Archaea: 1 unclassified sequences: 1 0 101 102 103 104 Bacteria Eukaryota Viruses Archaea unclassified sequences
taxon abundance Firmicutes: 4593 Streptophyta: 3442 Proteobacteria: 1141 Actinobacteria: 136 Deinococcus-Thermus: 136 Bacteroidetes: 42 Arthropoda: 40 Chordata: 25 Basidiomycota: 20 unclassified (derived from Viruses): 18 Acidobacteria: 9 Tenericutes: 8 Ascomycota: 7 Fusobacteria: 6 Cyanobacteria: 4 Spirochaetes: 4 Chlorobi: 4 Aquificae: 4 Verrucomicrobia: 3 unclassified (derived from Eukaryota): 3 Platyhelminthes: 2 Chloroflexi: 2 Synergistetes: 1 Gemmatimonadetes: 1 Bacillariophyta: 1 Planctomycetes: 1 Euryarchaeota: 1 Cnidaria: 1 unclassified (derived from Bacteria): 1 Annelida: 1 Apicomplexa: 1 Chrysiogenetes: 1 unclassified (derived from unclassified sequences): 1 0 101 102 103 104 Firmicutes Streptophyta Proteobacteria Actinobacteria Deinococcus-Thermus Bacteroidetes Arthropoda Chordata Basidiomycota unclassified (derived from Viruses) Acidobacteria Tenericutes Ascomycota Fusobacteria Cyanobacteria Spirochaetes Chlorobi Aquificae Verrucomicrobia unclassified (derived from Eukaryota) Platyhelminthes Chloroflexi Synergistetes Gemmatimonadetes Bacillariophyta Planctomycetes Euryarchaeota Cnidaria unclassified (derived from Bacteria) Annelida Apicomplexa Chrysiogenetes unclassified (derived from unclassified sequences)
taxon abundance Bacilli: 4312 unclassified (derived from Streptophyta): 1843 Gammaproteobacteria: 507 Alphaproteobacteria: 415 Liliopsida: 342 Actinobacteria (class): 136 Deinococci: 136 Betaproteobacteria: 100 Clostridia: 71 Insecta: 38 Mammalia: 22 unclassified (derived from Viruses): 18 Exobasidiomycetes: 17 Bacteroidia: 12 Negativicutes: 11 Deltaproteobacteria: 9 Mollicutes: 8 Flavobacteria: 8 Fusobacteria (class): 6 Sphingobacteria: 6 Erysipelotrichi: 5 unclassified (derived from Cyanobacteria): 4 Chlorobia: 4 Spirochaetes (class): 4 Aquificae (class): 4 Sordariomycetes: 4 Acidobacteria (class): 4 Bryopsida: 4 Solibacteres: 3 Cestoda: 2 Cytophagia: 2 Coniferopsida: 2 Chloroflexi (class): 2 unclassified (derived from Eukaryota): 2 unclassified (derived from Verrucomicrobia): 2 Halobacteria: 1 Malacostraca: 1 Leotiomycetes: 1 Saccharomycetes: 1 Epsilonproteobacteria: 1 Bacillariophyceae: 1 Gemmatimonadetes (class): 1 Actinopterygii: 1 Planctomycetacia: 1 Hydrozoa: 1 Synergistia: 1 unclassified (derived from Acidobacteria): 1 Verrucomicrobiae: 1 Chrysiogenetes (class): 1 unclassified (derived from Bacteria): 1 0 101 102 103 104 Bacilli unclassified (derived from Streptophyta) Gammaproteobacteria Alphaproteobacteria Liliopsida Actinobacteria (class) Deinococci Betaproteobacteria Clostridia Insecta Mammalia unclassified (derived from Viruses) Exobasidiomycetes Bacteroidia Negativicutes Deltaproteobacteria Mollicutes Flavobacteria Fusobacteria (class) Sphingobacteria Erysipelotrichi unclassified (derived from Cyanobacteria) Chlorobia Spirochaetes (class) Aquificae (class) Sordariomycetes Acidobacteria (class) Bryopsida Solibacteres Cestoda Cytophagia Coniferopsida Chloroflexi (class) unclassified (derived from Eukaryota) unclassified (derived from Verrucomicrobia) Halobacteria Malacostraca Leotiomycetes Saccharomycetes Epsilonproteobacteria Bacillariophyceae Gemmatimonadetes (class) Actinopterygii Planctomycetacia Hydrozoa Synergistia unclassified (derived from Acidobacteria) Verrucomicrobiae Chrysiogenetes (class) unclassified (derived from Bacteria)
taxon abundance Lactobacillales: 3734 Vitales: 528 Poales: 296 Pseudomonadales: 254 Rhodospirillales: 246 Bacillales: 210 Enterobacteriales: 176 Brassicales: 152 Thermales: 133 Actinomycetales: 125 Malpighiales: 102 Sphingomonadales: 79 Clostridiales: 68 Fabales: 55 Burkholderiales: 47 Asparagales: 31 Hymenoptera: 30 Caryophyllales: 27 Primates: 22 Neisseriales: 20 Lamiales: 19 Asterales: 18 Malasseziales: 17 Solanales: 15 Rhizobiales: 14 Bacteroidales: 12 Caudovirales: 12 Rhodocyclales: 11 Cucurbitales: 11 Selenomonadales: 11 Ranunculales: 10 Pasteurellales: 8 Fusobacteriales: 6 Rosales: 6 unclassified (derived from Viruses): 6 Flavobacteriales: 6 Bifidobacteriales: 6 Sphingobacteriales: 6 Vibrionales: 5 Alteromonadales: 5 Erysipelotrichales: 5 Chlorobiales: 4 Acidobacteriales: 4 Hypocreales: 4 Chroococcales: 4 Spirochaetales: 4 Aquificales: 4 Rhodobacterales: 4 Acholeplasmatales: 4 Funariales: 4 0 101 102 103 104 Lactobacillales Vitales Poales Pseudomonadales Rhodospirillales Bacillales Enterobacteriales Brassicales Thermales Actinomycetales Malpighiales Sphingomonadales Clostridiales Fabales Burkholderiales Asparagales Hymenoptera Caryophyllales Primates Neisseriales Lamiales Asterales Malasseziales Solanales Rhizobiales Bacteroidales Caudovirales Rhodocyclales Cucurbitales Selenomonadales Ranunculales Pasteurellales Fusobacteriales Rosales unclassified (derived from Viruses) Flavobacteriales Bifidobacteriales Sphingobacteriales Vibrionales Alteromonadales Erysipelotrichales Chlorobiales Acidobacteriales Hypocreales Chroococcales Spirochaetales Aquificales Rhodobacterales Acholeplasmatales Funariales
taxon abundance Lactobacillaceae: 2495 Vitaceae: 528 Poaceae: 292 Acetobacteraceae: 239 Moraxellaceae: 233 Enterobacteriaceae: 176 Brassicaceae: 151 Thermaceae: 133 Leuconostocaceae: 113 Bacillaceae: 111 Streptococcaceae: 80 Propionibacteriaceae: 66 Sphingomonadaceae: 57 Salicaceae: 57 Fabaceae: 55 Enterococcaceae: 50 Staphylococcaceae: 45 Euphorbiaceae: 40 Comamonadaceae: 31 Asparagaceae: 30 Corynebacteriaceae: 28 Clostridiales Family XI. Incertae Sedis: 26 Apidae: 25 Listeriaceae: 24 Amaranthaceae: 24 Clostridiaceae: 21 Hominidae: 21 Neisseriaceae: 20 Malasseziaceae: 17 Asteraceae: 17 Pseudomonadaceae: 16 Solanaceae: 13 Cucurbitaceae: 11 Veillonellaceae: 11 Rhodocyclaceae: 11 Burkholderiaceae: 10 Pasteurellaceae: 8 Orobanchaceae: 8 Siphoviridae: 7 Carnobacteriaceae: 7 Papaveraceae: 6 unclassified (derived from Viruses): 6 Plantaginaceae: 6 Flavobacteriaceae: 6 Bifidobacteriaceae: 6 Fusobacteriaceae: 6 Rosaceae: 5 Prevotellaceae: 5 Erysipelotrichaceae: 5 Vibrionaceae: 5 0 101 102 103 104 Lactobacillaceae Vitaceae Poaceae Acetobacteraceae Moraxellaceae Enterobacteriaceae Brassicaceae Thermaceae Leuconostocaceae Bacillaceae Streptococcaceae Propionibacteriaceae Sphingomonadaceae Salicaceae Fabaceae Enterococcaceae Staphylococcaceae Euphorbiaceae Comamonadaceae Asparagaceae Corynebacteriaceae Clostridiales Family XI. Incertae Sedis Apidae Listeriaceae Amaranthaceae Clostridiaceae Hominidae Neisseriaceae Malasseziaceae Asteraceae Pseudomonadaceae Solanaceae Cucurbitaceae Veillonellaceae Rhodocyclaceae Burkholderiaceae Pasteurellaceae Orobanchaceae Siphoviridae Carnobacteriaceae Papaveraceae unclassified (derived from Viruses) Plantaginaceae Flavobacteriaceae Bifidobacteriaceae Fusobacteriaceae Rosaceae Prevotellaceae Erysipelotrichaceae Vibrionaceae
taxon abundance Lactobacillus: 1879 Vitis: 528 Acinetobacter: 222 Oryza: 160 Arabidopsis: 127 Meiothermus: 112 Gluconobacter: 81 Pediococcus: 70 Streptococcus: 66 Propionibacterium: 66 Zea: 61 Populus: 57 Leuconostoc: 55 Enterococcus: 50 Staphylococcus: 44 Bacillus: 36 Ricinus: 33 Geobacillus: 30 Asparagus: 30 Corynebacterium: 28 Medicago: 26 Apis: 25 Beta: 24 Listeria: 24 Novosphingobium: 22 Weissella: 21 Oenococcus: 20 Clostridium: 20 Gluconacetobacter: 17 Malassezia: 17 Homo: 17 Pseudomonas: 16 Sphingomonas: 14 Erwinia: 14 Delftia: 13 Cynara: 13 Neisseria: 12 Brassica: 11 Cucumis: 11 Solanum: 11 Lactococcus: 10 Pisum: 10 Finegoldia: 9 Arsenophonus: 9 Anaerococcus: 9 Acetobacter: 8 Burkholderia: 8 Thermus: 7 Euphorbia: 7 unclassified (derived from Siphoviridae): 7 0 101 102 103 104 Lactobacillus Vitis Acinetobacter Oryza Arabidopsis Meiothermus Gluconobacter Pediococcus Streptococcus Propionibacterium Zea Populus Leuconostoc Enterococcus Staphylococcus Bacillus Ricinus Geobacillus Asparagus Corynebacterium Medicago Apis Beta Listeria Novosphingobium Weissella Oenococcus Clostridium Gluconacetobacter Malassezia Homo Pseudomonas Sphingomonas Erwinia Delftia Cynara Neisseria Brassica Cucumis Solanum Lactococcus Pisum Finegoldia Arsenophonus Anaerococcus Acetobacter Burkholderia Thermus Euphorbia unclassified (derived from Siphoviridae)
The plot below shows the rarefaction curve of annotated species richness. This curve is a plot of the total number of distinct species annotations as a function of the number of sequences sampled. On the left, a steep slope indicates that a large fraction of the species diversity remains to be discovered. If the curve becomes flatter to the right, a reasonable number of individuals is sampled: more intensive sampling is likely to yield only few additional species.
Sampling curves generally rise very quickly at first and then level off toward an asymptote as fewer new species are found per unit of individuals collected. These rarefaction curves are calculated from the table of species abundance. The curves represent the average number of different species annotations for subsamples of the complete data set.
undefined number of reads species count 0 140 280 420 560 700 840 980 1,120 1,260 1,400 0 7,000 14,000 21,000 28,000 35,000 42,000 49,000 56,000 63,000 70,000
The α-diversity of this data set is 135 species.
2σ σ μ σ 2σ 134.56 134.56 134.56
The above image shows the range of α-diversity values in the study 'SPlab-run431-SPD3-asm'. The min, max, and mean values are shown, with the standard deviation ranges (σ and 2σ) in different shades. The α-diversity of this metagenome is shown in red.
Alpha diversity summarizes the diversity of organisms in a sample with a single number. The α-diversity of annotated samples can be estimated from the distribution of the species-level annotations.
Annotated species richness is the number of distinct species annotations in the combined MG-RAST data set. Shannon diversity is an abundance-weighted average of the logarithm of the relative abundances of annotated species. The species-level annotations are from all the annotation source databases used by MG-RAST.
The histograms below show the distribution of sequence lengths in basepairs for this metagenome. Each position represents the number of sequences within a length bp range.
The data used in these graphs is based on raw upload and post-QC sequences.
uploaded post-QC sequence length (bp) number of reads 0 140 280 420 560 700 840 980 1,120 1,260 1,400 52 347 642 936 1,231 1,526 1,821 2,116 2,410 2,705 3,000 uploaded post-QC
The histograms below show the distribution of the GC percentage for this metagenome. Each position represents the number of sequences within a GC percentage range. The data used in these graphs is based on raw upload and post-QC sequences.
uploaded post-QC sequence GC % number of reads 0 450 900 1,350 1,800 2,250 2,700 3,150 3,600 4,050 4,500 0 10 20 30 40 50 60 70 80 90 100 uploaded post-QC
This data set has no sample data