SPD4.k17.ForMGRAST
by undefined undefined, undefined .
Visibility private NCBI Project ID - ID 49d7d454086d676d343836313731342e33 GOLD ID - ENA Project ID ENA Library ID Static Link private metagenomes cannot be linked PubMed ID - Sample - Library -
The data set SPD4.k17.ForMGRAST was uploaded on 2019-09-29 at 10:44:45 and contains 661,413 sequences totaling 33,786,502 basepairs with an average length of 51 bps.
Of the sequences tested, 58,722 sequences (8.88%) failed to pass the QC pipeline. Of those, dereplication identified 28,524 sequences as artificial duplicate reads.
Of the sequences that passed QC, 771 sequences (1%) contain ribosomal RNA genes, 7,261 sequences (9.04%) contain predicted proteins with known functions, and 72,247 sequences (89.99%) contain predicted proteins with unknown function.
Data shown here is displayed as a quick way to assess the quality and contents of the data set. We note that the submitting authors may have performed their own analysis. The
analysis page provides the best way to perform in-depth analyses of this data set.
The data on this page represents the automated analysis generated by the MG-RAST automated processing pipeline. Details on the processing of this data set are in the
processing receipt .
failed QC - 58,722 (8.88%) unknown - 522,412 (78.98%) predicted feature - 80,279 (12.14%) unknown protein - 72,247 (89.99%) annotated protein - 7,261 (9.04%) ribosomal RNA - 771 (0.96%) Upload: bp Count 33,786,502 bp Upload: Sequences Count 661,413 Upload: Mean Sequence Length 51 ± 35 bp Upload: Mean GC percent 42 ± 10 % Artificial Duplicate Reads: Sequence Count 28,524 Post QC: bp Count 27,438,401 bp Post QC: Sequences Count 602,691 Post QC: Mean Sequence Length 46 ± 24 bp Post QC: Mean GC percent 42 ± 10 % Processed: Predicted Protein Features 78,273 Processed: Predicted rRNA Features 1,156 Alignment: Identified Protein Features 6,873 Alignment: Identified rRNA Features 798 Annotation: Identified Functional Categories undefined
Investigation Type WGS Study Name SPlab-run431-SPD4-asm Latitude and Longitude Country and/or Sea, Location Collection Date Environment (Biome) Environment (Feature) Environment (Material) Environmental Package Sequencing Method other More Metadata click for full table
The kmer rank abundance graph plots the kmer coverage as a function of abundance rank, with the most abundant sequences at the left.
undefined sequence size k-mer coverage 0 101 102 0 101 102 103 104 105 106 107 108
The graph below displays the number of hits in the different databases listed. These include protein databases, protein databases with functional hierarchy information, and ribosomal RNA databases. The bars representing annotated reads are colored by e-value range. Different databases have different numbers of hits but can also have different types of annotation data.
0 0 609 3,087 2,574 0 0 553 2,801 2,271 0 0 624 3,285 2,810 0 0 327 1,750 1,376 0 0 527 2,811 2,330 0 0 627 3,200 2,720 0 0 166 815 652 0 0 522 2,636 2,142 0 0 323 2,071 1,742 0 0 267 1,260 1,022 0 0 303 1,950 1,624 18 30 32 6 0 99 128 255 76 4 38 71 118 25 0 0 0 6 43 43 18 25 27 3 0 0 700 1,400 2,100 2,800 3,500 4,200 4,900 5,600 6,300 7,000 RefSeq IMG TrEMBL Subsystems KEGG GenBank SwissProt PATRIC eggNOG KO GO COG RDP LSU SSU NOG e^-30 & less e^-20 to e^-30 e^-10 to e^-20 e^-5 to e^-10 e^-3 to e^-5
The pie charts below illustrate the distribution of functional categories for COGs, KOs, NOGs, and Subsystems at the highest level supported by these functional hierarchies. Each slice indicates the percentage of reads with predicted protein functions annotated to the category for the given source.
METABOLISM - 1,782 (40.73%) INFORMATION STORAGE AND PROCESSING - 1,137 (25.99%) CELLULAR PROCESSES AND SIGNALING - 798 (18.24%) POORLY CHARACTERIZED - 658 (15.04%) POORLY CHARACTERIZED - 55 (56.70%) CELLULAR PROCESSES AND SIGNALING - 18 (18.56%) METABOLISM - 12 (12.37%) INFORMATION STORAGE AND PROCESSING - 12 (12.37%) Metabolism - 1,972 (55.10%) Genetic Information Processing - 922 (25.76%) Environmental Information Processing - 468 (13.08%) Cellular Processes - 137 (3.83%) Human Diseases - 61 (1.70%) Organismal Systems - 19 (0.53%) Clustering-based subsystems - 1,522 (15.43%) Carbohydrates - 1,152 (11.68%) Protein Metabolism - 933 (9.46%) Amino Acids and Derivatives - 916 (9.29%) Miscellaneous - 725 (7.35%) RNA Metabolism - 715 (7.25%) DNA Metabolism - 526 (5.33%) Cofactors, Vitamins, Prosthetic Groups, Pigments - 508 (5.15%) Cell Wall and Capsule - 405 (4.11%) Nucleosides and Nucleotides - 345 (3.50%) Fatty Acids, Lipids, and Isoprenoids - 259 (2.63%) Membrane Transport - 221 (2.24%) Virulence, Disease and Defense - 210 (2.13%) Respiration - 196 (1.99%) Stress Response - 190 (1.93%) Cell Division and Cell Cycle - 184 (1.87%) Regulation and Cell signaling - 154 (1.56%) Phages, Prophages, Transposable elements, Plasmids - 147 (1.49%) Phosphorus Metabolism - 105 (1.06%) Metabolism of Aromatic Compounds - 88 (0.89%) Iron acquisition and metabolism - 78 (0.79%) Motility and Chemotaxis - 61 (0.62%) Nitrogen Metabolism - 48 (0.49%) Sulfur Metabolism - 47 (0.48%) Dormancy and Sporulation - 40 (0.41%) Secondary Metabolism - 36 (0.37%) Photosynthesis - 31 (0.31%) Potassium metabolism - 21 (0.21%) The charts below represent the distribution of taxa using a
contigLCA algorithm finding a single consensus taxonomic entity for all features on each individual sequence.
Bacteria - 5,450 (73.50%) Eukaryota - 1,953 (26.34%) Viruses - 9 (0.12%) Archaea - 1 (0.01%) other sequences - 1 (0.01%) unclassified sequences - 1 (0.01%) Firmicutes - 2,265 (33.05%) Proteobacteria - 1,902 (27.75%) Streptophyta - 1,670 (24.37%) Deinococcus-Thermus - 475 (6.93%) Actinobacteria - 253 (3.69%) Arthropoda - 63 (0.92%) Chordata - 54 (0.79%) Bacteroidetes - 47 (0.69%) Basidiomycota - 40 (0.58%) Ascomycota - 15 (0.22%) Acidobacteria - 9 (0.13%) unclassified (derived from Viruses) - 9 (0.13%) Aquificae - 8 (0.12%) Spirochaetes - 7 (0.10%) Chloroflexi - 6 (0.09%) unclassified (derived from Bacteria) - 5 (0.07%) Fusobacteria - 4 (0.06%) Tenericutes - 3 (0.04%) unclassified (derived from Eukaryota) - 3 (0.04%) Chlorobi - 2 (0.03%) Planctomycetes - 2 (0.03%) Verrucomicrobia - 2 (0.03%) Cyanobacteria - 2 (0.03%) Apicomplexa - 1 (0.01%) Platyhelminthes - 1 (0.01%) Thermotogae - 1 (0.01%) Synergistetes - 1 (0.01%) Euryarchaeota - 1 (0.01%) unclassified (derived from other sequences) - 1 (0.01%) unclassified (derived from unclassified sequences) - 1 (0.01%) Bacilli - 2,032 (34.15%) unclassified (derived from Streptophyta) - 902 (15.16%) Gammaproteobacteria - 870 (14.62%) Alphaproteobacteria - 601 (10.10%) Deinococci - 475 (7.98%) Actinobacteria (class) - 253 (4.25%) Liliopsida - 233 (3.92%) Betaproteobacteria - 228 (3.83%) Clostridia - 66 (1.11%) Insecta - 58 (0.97%) Mammalia - 43 (0.72%) Exobasidiomycetes - 33 (0.55%) Negativicutes - 16 (0.27%) Bacteroidia - 13 (0.22%) unclassified (derived from Viruses) - 9 (0.15%) Aquificae (class) - 8 (0.13%) Spirochaetes (class) - 7 (0.12%) Coniferopsida - 6 (0.10%) Deltaproteobacteria - 6 (0.10%) Sphingobacteria - 6 (0.10%) Sordariomycetes - 5 (0.08%) Solibacteres - 5 (0.08%) Flavobacteria - 5 (0.08%) Erysipelotrichi - 5 (0.08%) unclassified (derived from Bacteria) - 5 (0.08%) Saccharomycetes - 4 (0.07%) Fusobacteria (class) - 4 (0.07%) Chloroflexi (class) - 4 (0.07%) Mollicutes - 3 (0.05%) Dothideomycetes - 3 (0.05%) Epsilonproteobacteria - 3 (0.05%) Bryopsida - 3 (0.05%) Cytophagia - 3 (0.05%) Eurotiomycetes - 2 (0.03%) unclassified (derived from Cyanobacteria) - 2 (0.03%) Chlorobia - 2 (0.03%) Planctomycetacia - 2 (0.03%) Arachnida - 2 (0.03%) unclassified (derived from Eukaryota) - 2 (0.03%) Actinopterygii - 2 (0.03%) Acidobacteria (class) - 1 (0.02%) Thermotogae (class) - 1 (0.02%) Agaricomycetes - 1 (0.02%) Spartobacteria - 1 (0.02%) Ustilaginomycetes - 1 (0.02%) Verrucomicrobiae - 1 (0.02%) Aves - 1 (0.02%) Dehalococcoidetes - 1 (0.02%) Trematoda - 1 (0.02%) Coccidia - 1 (0.02%) Thermomicrobia (class) - 1 (0.02%) Ginkgoopsida - 1 (0.02%) Isoetopsida - 1 (0.02%) Thermococci - 1 (0.02%) Malacostraca - 1 (0.02%) unclassified (derived from Acidobacteria) - 1 (0.02%) Synergistia - 1 (0.02%) unclassified (derived from other sequences) - 1 (0.02%) unclassified (derived from unclassified sequences) - 1 (0.02%) Lactobacillales - 1,577 (31.74%) Thermales - 470 (9.46%) Pseudomonadales - 381 (7.67%) Rhodospirillales - 327 (6.58%) Enterobacteriales - 293 (5.90%) Bacillales - 271 (5.45%) Actinomycetales - 229 (4.61%) Poales - 215 (4.33%) Vitales - 146 (2.94%) Burkholderiales - 141 (2.84%) Brassicales - 137 (2.76%) Sphingomonadales - 125 (2.52%) Malpighiales - 74 (1.49%) Clostridiales - 56 (1.13%) Hymenoptera - 43 (0.87%) Fabales - 42 (0.85%) Malasseziales - 33 (0.66%) Primates - 28 (0.56%) Neisseriales - 21 (0.42%) Asterales - 17 (0.34%) Selenomonadales - 16 (0.32%) Solanales - 14 (0.28%) Rhizobiales - 14 (0.28%) Bifidobacteriales - 14 (0.28%) Rhodocyclales - 13 (0.26%) Bacteroidales - 13 (0.26%) Ranunculales - 10 (0.20%) Pasteurellales - 10 (0.20%) Caudovirales - 8 (0.16%) Aquificales - 8 (0.16%) Caryophyllales - 8 (0.16%) Asparagales - 7 (0.14%) Spirochaetales - 7 (0.14%) Oceanospirillales - 7 (0.14%) Rhodobacterales - 6 (0.12%) Sphingobacteriales - 6 (0.12%) Coniferales - 6 (0.12%) unclassified (derived from Bacteria) - 5 (0.10%) Chromatiales - 5 (0.10%) Vibrionales - 5 (0.10%) Alismatales - 5 (0.10%) Solibacterales - 5 (0.10%) Erysipelotrichales - 5 (0.10%) Flavobacteriales - 5 (0.10%) unclassified (derived from Gammaproteobacteria) - 4 (0.08%) Fusobacteriales - 4 (0.08%) Saccharomycetales - 4 (0.08%) Thermoanaerobacterales - 4 (0.08%) unclassified (derived from Betaproteobacteria) - 4 (0.08%) Sapindales - 4 (0.08%) Caulobacterales - 4 (0.08%) Funariales - 3 (0.06%) Cucurbitales - 3 (0.06%) Gentianales - 3 (0.06%) Lamiales - 3 (0.06%) Pleosporales - 3 (0.06%) Rosales - 3 (0.06%) Campylobacterales - 3 (0.06%) Alteromonadales - 3 (0.06%) Cytophagales - 3 (0.06%) Apiales - 2 (0.04%) Xanthomonadales - 2 (0.04%) Hypocreales - 2 (0.04%) Sordariales - 2 (0.04%) Nitrosomonadales - 2 (0.04%) Chloroflexales - 2 (0.04%) Fagales - 2 (0.04%) Methylophilales - 2 (0.04%) unclassified (derived from Mammalia) - 2 (0.04%) Chlorobiales - 2 (0.04%) Malvales - 2 (0.04%) Myrtales - 2 (0.04%) Entomoplasmatales - 2 (0.04%) Hydrogenophilales - 2 (0.04%) Rickettsiales - 2 (0.04%) Legionellales - 2 (0.04%) Planctomycetales - 2 (0.04%) Peronosporales - 2 (0.04%) Acholeplasmatales - 1 (0.02%) Onygenales - 1 (0.02%) Aeromonadales - 1 (0.02%) unclassified (derived from Dehalococcoidetes) - 1 (0.02%) Cardiobacteriales - 1 (0.02%) Agaricales - 1 (0.02%) unclassified (derived from Alphaproteobacteria) - 1 (0.02%) Ustilaginales - 1 (0.02%) Tetraodontiformes - 1 (0.02%) Galliformes - 1 (0.02%) Thermomicrobiales - 1 (0.02%) Chroococcales - 1 (0.02%) Strigeidida - 1 (0.02%) Solirubrobacterales - 1 (0.02%) Thermococcales - 1 (0.02%) Ginkgoales - 1 (0.02%) Selaginellales - 1 (0.02%) unclassified (derived from Acidobacteria) - 1 (0.02%) Coriobacteriales - 1 (0.02%) Dioscoreales - 1 (0.02%) Halanaerobiales - 1 (0.02%) Bdellovibrionales - 1 (0.02%) Desulfovibrionales - 1 (0.02%) Synergistales - 1 (0.02%) Acidobacteriales - 1 (0.02%) Crossosomatales - 1 (0.02%) Desulfuromonadales - 1 (0.02%) Ericales - 1 (0.02%) Thermotogales - 1 (0.02%) unclassified (derived from Spartobacteria) - 1 (0.02%) Verrucomicrobiales - 1 (0.02%) Myxococcales - 1 (0.02%) Laurales - 1 (0.02%) Magnoliales - 1 (0.02%) Eucoccidiorida - 1 (0.02%) Ixodida - 1 (0.02%) Scorpiones - 1 (0.02%) Nymphaeales - 1 (0.02%) Proteales - 1 (0.02%) Coleoptera - 1 (0.02%) Diptera - 1 (0.02%) Herpetosiphonales - 1 (0.02%) Isoptera - 1 (0.02%) Mysida - 1 (0.02%) Acidithiobacillales - 1 (0.02%) Eurotiales - 1 (0.02%) unclassified (derived from Viruses) - 1 (0.02%) unclassified (derived from other sequences) - 1 (0.02%) unclassified (derived from unclassified sequences) - 1 (0.02%) Lactobacillaceae - 1,034 (23.59%) Thermaceae - 470 (10.72%) Moraxellaceae - 338 (7.71%) Acetobacteraceae - 320 (7.30%) Enterobacteriaceae - 293 (6.68%) Poaceae - 210 (4.79%) Bacillaceae - 181 (4.13%) Vitaceae - 146 (3.33%) Brassicaceae - 136 (3.10%) Propionibacteriaceae - 129 (2.94%) Comamonadaceae - 86 (1.96%) Sphingomonadaceae - 86 (1.96%) Streptococcaceae - 68 (1.55%) Leuconostocaceae - 60 (1.37%) Corynebacteriaceae - 58 (1.32%) Staphylococcaceae - 45 (1.03%) Fabaceae - 42 (0.96%) Pseudomonadaceae - 39 (0.89%) Euphorbiaceae - 38 (0.87%) Apidae - 35 (0.80%) Malasseziaceae - 33 (0.75%) Salicaceae - 29 (0.66%) Clostridiales Family XI. Incertae Sedis - 22 (0.50%) Neisseriaceae - 21 (0.48%) Burkholderiaceae - 19 (0.43%) Hominidae - 17 (0.39%) Asteraceae - 16 (0.37%) Veillonellaceae - 15 (0.34%) Bifidobacteriaceae - 14 (0.32%) Clostridiaceae - 13 (0.30%) Rhodocyclaceae - 13 (0.30%) Enterococcaceae - 12 (0.27%) Solanaceae - 11 (0.25%) Lachnospiraceae - 10 (0.23%) Paenibacillaceae - 10 (0.23%) Pasteurellaceae - 10 (0.23%) Myoviridae - 7 (0.16%) Erythrobacteraceae - 7 (0.16%) Papaveraceae - 7 (0.16%) Actinomycetaceae - 7 (0.16%) Listeriaceae - 6 (0.14%) Aquificaceae - 6 (0.14%) Asparagaceae - 6 (0.14%) Spirochaetaceae - 6 (0.14%) Erysipelotrichaceae - 5 (0.11%) unclassified (derived from Bacteria) - 5 (0.11%) Amaranthaceae - 5 (0.11%) Pinaceae - 5 (0.11%) Vibrionaceae - 5 (0.11%) Carnobacteriaceae - 5 (0.11%) Araceae - 5 (0.11%) Prevotellaceae - 5 (0.11%) Solibacteraceae - 5 (0.11%) unclassified (derived from Betaproteobacteria) - 4 (0.09%) Fusobacteriaceae - 4 (0.09%) unclassified (derived from Gammaproteobacteria) - 4 (0.09%) Flavobacteriaceae - 4 (0.09%) unclassified (derived from Sphingobacteriales) - 4 (0.09%) Caulobacteraceae - 4 (0.09%) Rhodobacteraceae - 4 (0.09%) unclassified (derived from Bacillales) - 4 (0.09%) Bromeliaceae - 3 (0.07%) Rubiaceae - 3 (0.07%) Micrococcaceae - 3 (0.07%) Cucurbitaceae - 3 (0.07%) Rosaceae - 3 (0.07%) Halomonadaceae - 3 (0.07%) Funariaceae - 3 (0.07%) Helicobacteraceae - 3 (0.07%) Ectothiorhodospiraceae - 3 (0.07%) Ruminococcaceae - 3 (0.07%) Thermoanaerobacteraceae - 3 (0.07%) Methylobacteriaceae - 2 (0.05%) Clavicipitaceae - 2 (0.05%) Formicidae - 2 (0.05%) Suidae - 2 (0.05%) Spiroplasmataceae - 2 (0.05%) Xanthomonadaceae - 2 (0.05%) Nitrosomonadaceae - 2 (0.05%) Methylophilaceae - 2 (0.05%) Hydrogenophilaceae - 2 (0.05%) Alcaligenaceae - 2 (0.05%) Rhodospirillaceae - 2 (0.05%) Hyphomonadaceae - 2 (0.05%) Rhizobiaceae - 2 (0.05%) Phyllobacteriaceae - 2 (0.05%) Cercopithecidae - 2 (0.05%) Fagaceae - 2 (0.05%) Bradyrhizobiaceae - 2 (0.05%) Planctomycetaceae - 2 (0.05%) Clostridiales Family XVII. Incertae Sedis - 2 (0.05%) Chloroflexaceae - 2 (0.05%) Chlorobiaceae - 2 (0.05%) Anacardiaceae - 2 (0.05%) Rutaceae - 2 (0.05%) Convolvulaceae - 2 (0.05%) Cytophagaceae - 2 (0.05%) Bacteroidaceae - 2 (0.05%) unclassified (derived from Peronosporales) - 2 (0.05%) Oxalobacteraceae - 2 (0.05%) unclassified (derived from Alphaproteobacteria) - 1 (0.02%) Leptospiraceae - 1 (0.02%) Thermoanaerobacterales Family III. Incertae Sedis - 1 (0.02%) Synergistaceae - 1 (0.02%) Acholeplasmataceae - 1 (0.02%) Dermacoccaceae - 1 (0.02%) Thermotogaceae - 1 (0.02%) unclassified (derived from Spartobacteria) - 1 (0.02%) Verrucomicrobia subdivision 3 - 1 (0.02%) unclassified (derived from Flavobacteriales) - 1 (0.02%) Cryptosporidiidae - 1 (0.02%) Ixodidae - 1 (0.02%) Buthidae - 1 (0.02%) Tenebrionidae - 1 (0.02%) Drosophilidae - 1 (0.02%) Rhodothermaceae - 1 (0.02%) Sphingobacteriaceae - 1 (0.02%) Rhinotermitidae - 1 (0.02%) Mysidae - 1 (0.02%) Trichocomaceae - 1 (0.02%) unclassified (derived from Onygenales) - 1 (0.02%) Metschnikowiaceae - 1 (0.02%) Saccharomycetaceae - 1 (0.02%) Coriobacteriaceae - 1 (0.02%) Chaetomiaceae - 1 (0.02%) Sordariaceae - 1 (0.02%) Marasmiaceae - 1 (0.02%) Conexibacteraceae - 1 (0.02%) Ustilaginaceae - 1 (0.02%) Tetraodontidae - 1 (0.02%) Phasianidae - 1 (0.02%) Thermococcaceae - 1 (0.02%) Herpetosiphonaceae - 1 (0.02%) unclassified (derived from Dehalococcoidetes) - 1 (0.02%) Schistosomatidae - 1 (0.02%) Bdellovibrionaceae - 1 (0.02%) Desulfovibrionaceae - 1 (0.02%) Ginkgoaceae - 1 (0.02%) Selaginellaceae - 1 (0.02%) Geobacteraceae - 1 (0.02%) Thermomicrobiaceae - 1 (0.02%) Orchidaceae - 1 (0.02%) Burmanniaceae - 1 (0.02%) Acidithiobacillaceae - 1 (0.02%) Cyperaceae - 1 (0.02%) Aeromonadaceae - 1 (0.02%) Apiaceae - 1 (0.02%) Alteromonadaceae - 1 (0.02%) Campanulaceae - 1 (0.02%) Shewanellaceae - 1 (0.02%) Cardiobacteriaceae - 1 (0.02%) Rhabdodendraceae - 1 (0.02%) Tamaricaceae - 1 (0.02%) Crossosomataceae - 1 (0.02%) Chromatiaceae - 1 (0.02%) Primulaceae - 1 (0.02%) unclassified (derived from Chroococcales) - 1 (0.02%) Halothiobacillaceae - 1 (0.02%) Eubacteriaceae - 1 (0.02%) Orobanchaceae - 1 (0.02%) Scrophulariaceae - 1 (0.02%) Lauraceae - 1 (0.02%) Magnoliaceae - 1 (0.02%) Coxiellaceae - 1 (0.02%) Legionellaceae - 1 (0.02%) Cistaceae - 1 (0.02%) Malvaceae - 1 (0.02%) Myrtaceae - 1 (0.02%) Onagraceae - 1 (0.02%) Nymphaeaceae - 1 (0.02%) Proteaceae - 1 (0.02%) Menispermaceae - 1 (0.02%) Alcanivoracaceae - 1 (0.02%) Ranunculaceae - 1 (0.02%) Hahellaceae - 1 (0.02%) Acidobacteriaceae - 1 (0.02%) Oceanospirillaceae - 1 (0.02%) Mycobacteriaceae - 1 (0.02%) Anaplasmataceae - 1 (0.02%) Rickettsiaceae - 1 (0.02%) unclassified (derived from Acidobacteria) - 1 (0.02%) Sanguibacteraceae - 1 (0.02%) Podoviridae - 1 (0.02%) unclassified (derived from Viruses) - 1 (0.02%) unclassified (derived from other sequences) - 1 (0.02%) unclassified (derived from unclassified sequences) - 1 (0.02%) Lactobacillus - 799 (22.41%) Meiothermus - 422 (11.83%) Acinetobacter - 312 (8.75%) Vitis - 146 (4.09%) Propionibacterium - 129 (3.62%) Gluconobacter - 116 (3.25%) Zea - 105 (2.94%) Arabidopsis - 99 (2.78%) Geobacillus - 63 (1.77%) Delftia - 59 (1.65%) Streptococcus - 59 (1.65%) Corynebacterium - 58 (1.63%) Oryza - 55 (1.54%) Staphylococcus - 44 (1.23%) Novosphingobium - 41 (1.15%) Gluconacetobacter - 40 (1.12%) Pseudomonas - 39 (1.09%) Pediococcus - 38 (1.07%) Leuconostoc - 37 (1.04%) Ricinus - 34 (0.95%) Malassezia - 33 (0.93%) Apis - 31 (0.87%) Populus - 29 (0.81%) Bacillus - 26 (0.73%) Thermus - 22 (0.62%) Arsenophonus - 21 (0.59%) Acetobacter - 21 (0.59%) Sphingomonas - 19 (0.53%) Erwinia - 18 (0.50%) Brassica - 17 (0.48%) Medicago - 17 (0.48%) Anoxybacillus - 15 (0.42%) Veillonella - 14 (0.39%) Homo - 13 (0.36%) Neisseria - 13 (0.36%) Enterococcus - 12 (0.34%) Clostridium - 12 (0.34%) Burkholderia - 11 (0.31%) Anaerococcus - 11 (0.31%) Serratia - 10 (0.28%) Pantoea - 10 (0.28%) Solanum - 9 (0.25%) Paenibacillus - 7 (0.20%) Bifidobacterium - 7 (0.20%) Lactococcus - 7 (0.20%) Trifolium - 7 (0.20%) Escherichia - 7 (0.20%) Erythrobacter - 7 (0.20%) Sorghum - 7 (0.20%) Asparagus - 6 (0.17%) Yersinia - 6 (0.17%) Lotus - 6 (0.17%) Oenococcus - 6 (0.17%) Listeria - 6 (0.17%) Finegoldia - 6 (0.17%) Providencia - 5 (0.14%) Catonella - 5 (0.14%) Papaver - 5 (0.14%) Prevotella - 5 (0.14%) Candidatus Solibacter - 5 (0.14%) Vibrio - 5 (0.14%) Actinomyces - 5 (0.14%) Lemna - 5 (0.14%) Psychrobacter - 5 (0.14%) unclassified (derived from Bacteria) - 5 (0.14%) Sphingobium - 5 (0.14%) Peptoniphilus - 5 (0.14%) Weissella - 5 (0.14%) Klebsiella - 5 (0.14%) Pisum - 4 (0.11%) Candidatus Accumulibacter - 4 (0.11%) Salmonella - 4 (0.11%) Beta - 4 (0.11%) Cicer - 4 (0.11%) Haemophilus - 4 (0.11%) Ralstonia - 4 (0.11%) Euphorbia - 4 (0.11%) Gemella - 4 (0.11%) Chitinophaga - 4 (0.11%) T4-like viruses - 4 (0.11%) Cronobacter - 4 (0.11%) Carboxydothermus - 3 (0.08%) Pectobacterium - 3 (0.08%) Aromatoleum - 3 (0.08%) Carnobacterium - 3 (0.08%) Fructobacillus - 3 (0.08%) Pinus - 3 (0.08%) unclassified (derived from Enterobacteriaceae) - 3 (0.08%) Glycine - 3 (0.08%) Physcomitrella - 3 (0.08%) Gardnerella - 3 (0.08%) Enhydrobacter - 3 (0.08%) Capnocytophaga - 3 (0.08%) Ananas - 3 (0.08%) Sisymbrium - 3 (0.08%) unclassified (derived from Gammaproteobacteria) - 3 (0.08%) unclassified (derived from Myoviridae) - 3 (0.08%) Treponema - 3 (0.08%) Methylobacterium - 2 (0.06%) Spiroplasma - 2 (0.06%) Pasteurella - 2 (0.06%) Metarhizium - 2 (0.06%) Enterobacter - 2 (0.06%) Macaca - 2 (0.06%) Dickeya - 2 (0.06%) Sus - 2 (0.06%) Citrobacter - 2 (0.06%) Alkalilimnicola - 2 (0.06%) Helicobacter - 2 (0.06%) Nitrosomonas - 2 (0.06%) Hydrogenophilus - 2 (0.06%) Comamonas - 2 (0.06%) Citromicrobium - 2 (0.06%) Rhodospirillum - 2 (0.06%) Crepis - 2 (0.06%) Granulibacter - 2 (0.06%) Agrobacterium - 2 (0.06%) Borrelia - 2 (0.06%) Asticcacaulis - 2 (0.06%) Leptotrichia - 2 (0.06%) Fusobacterium - 2 (0.06%) Holdemania - 2 (0.06%) Ruminococcus - 2 (0.06%) Coprococcus - 2 (0.06%) Thermaerobacter - 2 (0.06%) Granulicatella - 2 (0.06%) Brevibacillus - 2 (0.06%) Oceanobacillus - 2 (0.06%) Chloroflexus - 2 (0.06%) Citrus - 2 (0.06%) Ipomoea - 2 (0.06%) Nicotiana - 2 (0.06%) Microscilla - 2 (0.06%) Bacteroides - 2 (0.06%) Phytophthora - 2 (0.06%) Hydrogenobacter - 2 (0.06%) Simonsiella - 2 (0.06%) Acidithiobacillus - 1 (0.03%) Marinobacter - 1 (0.03%) Shewanella - 1 (0.03%) Cardiobacterium - 1 (0.03%) Allochromatium - 1 (0.03%) Solobacterium - 1 (0.03%) Halothiobacillus - 1 (0.03%) unclassified (derived from Erysipelotrichaceae) - 1 (0.03%) Selenomonas - 1 (0.03%) Acidobacterium - 1 (0.03%) Rhodothermus - 1 (0.03%) Pedobacter - 1 (0.03%) Thermococcus - 1 (0.03%) Planctomyces - 1 (0.03%) Rhodopirellula - 1 (0.03%) Sanguibacter - 1 (0.03%) Brevundimonas - 1 (0.03%) Proteus - 1 (0.03%) Phenylobacterium - 1 (0.03%) Rahnella - 1 (0.03%) Bradyrhizobium - 1 (0.03%) Herpetosiphon - 1 (0.03%) Sodalis - 1 (0.03%) Xenorhabdus - 1 (0.03%) Chelativorans - 1 (0.03%) Mesorhizobium - 1 (0.03%) Coxiella - 1 (0.03%) Legionella - 1 (0.03%) Kangiella - 1 (0.03%) Hahella - 1 (0.03%) Halomonas - 1 (0.03%) Zymobacter - 1 (0.03%) Oceanospirillum - 1 (0.03%) Dehalococcoides - 1 (0.03%) Hyphomonas - 1 (0.03%) Maricaulis - 1 (0.03%) Phaeobacter - 1 (0.03%) Moraxella - 1 (0.03%) Ruegeria - 1 (0.03%) Thermomicrobium - 1 (0.03%) Acidiphilium - 1 (0.03%) Xanthomonas - 1 (0.03%) Congregibacter - 1 (0.03%) Synechococcus - 1 (0.03%) Leptospira - 1 (0.03%) Arcanobacterium - 1 (0.03%) Spirochaeta - 1 (0.03%) Candidatus Koribacter - 1 (0.03%) Anaerobaculum - 1 (0.03%) Alkaliphilus - 1 (0.03%) Petrotoga - 1 (0.03%) Chthoniobacter - 1 (0.03%) unclassified (derived from Verrucomicrobia subdivision 3) - 1 (0.03%) Wolbachia - 1 (0.03%) Cryptosporidium - 1 (0.03%) Ixodes - 1 (0.03%) Tribolium - 1 (0.03%) Drosophila - 1 (0.03%) Rickettsia - 1 (0.03%) Myrmecia - 1 (0.03%) Coptotermes - 1 (0.03%) Paramysis - 1 (0.03%) Aspergillus - 1 (0.03%) Paracoccidioides - 1 (0.03%) Clavispora - 1 (0.03%) Collinsella - 1 (0.03%) Chaetomium - 1 (0.03%) Sordaria - 1 (0.03%) Moniliophthora - 1 (0.03%) Conexibacter - 1 (0.03%) Ustilago - 1 (0.03%) Tetraodon - 1 (0.03%) Gallus - 1 (0.03%) Aquifex - 1 (0.03%) Kytococcus - 1 (0.03%) Pan - 1 (0.03%) Arthrobacter - 1 (0.03%) Schistosoma - 1 (0.03%) Sphingopyxis - 1 (0.03%) Picea - 1 (0.03%) Zymomonas - 1 (0.03%) Ginkgo - 1 (0.03%) Selaginella - 1 (0.03%) unclassified (derived from Alphaproteobacteria) - 1 (0.03%) Sutterella - 1 (0.03%) Phalaenopsis - 1 (0.03%) Gymnosiphon - 1 (0.03%) Taylorella - 1 (0.03%) Eleocharis - 1 (0.03%) Festuca - 1 (0.03%) Hordeum - 1 (0.03%) Eubacterium - 1 (0.03%) Cupriavidus - 1 (0.03%) Limnobacter - 1 (0.03%) Platysace - 1 (0.03%) Chrysanthemum - 1 (0.03%) Anaerostipes - 1 (0.03%) Cynara - 1 (0.03%) Helianthus - 1 (0.03%) Lactuca - 1 (0.03%) Parthenium - 1 (0.03%) Saussurea - 1 (0.03%) Trachelium - 1 (0.03%) Alicycliphilus - 1 (0.03%) Micrococcus - 1 (0.03%) Eruca - 1 (0.03%) Rothia - 1 (0.03%) Amaranthus - 1 (0.03%) Polaromonas - 1 (0.03%) Rhabdodendron - 1 (0.03%) Tamarix - 1 (0.03%) Crossosoma - 1 (0.03%) Cucumis - 1 (0.03%) Trichosanthes - 1 (0.03%) Primula - 1 (0.03%) Herbaspirillum - 1 (0.03%) Dorea - 1 (0.03%) Methylobacillus - 1 (0.03%) Methylovorus - 1 (0.03%) Laribacter - 1 (0.03%) Roseburia - 1 (0.03%) Vicia - 1 (0.03%) Chrysolepis - 1 (0.03%) Coffea - 1 (0.03%) Neohymenopogon - 1 (0.03%) Phelipanche - 1 (0.03%) Aptosimum - 1 (0.03%) Cinnamomum - 1 (0.03%) Liriodendron - 1 (0.03%) Faecalibacterium - 1 (0.03%) Mycobacterium - 1 (0.03%) Chryseobacterium - 1 (0.03%) Helianthemum - 1 (0.03%) Malva - 1 (0.03%) Eucalyptus - 1 (0.03%) Oenothera - 1 (0.03%) Nymphaea - 1 (0.03%) Protea - 1 (0.03%) Cissampelos - 1 (0.03%) Azoarcus - 1 (0.03%) Fragaria - 1 (0.03%) Malus - 1 (0.03%) Prunus - 1 (0.03%) Dechloromonas - 1 (0.03%) Thauera - 1 (0.03%) Caldicellulosiruptor - 1 (0.03%) Bdellovibrio - 1 (0.03%) Desulfovibrio - 1 (0.03%) Geobacter - 1 (0.03%) Candidatus Sulcia - 1 (0.03%) Sulfuricurvum - 1 (0.03%) N4-like viruses - 1 (0.03%) unclassified (derived from Viruses) - 1 (0.03%) unclassified (derived from other sequences) - 1 (0.03%) unclassified (derived from unclassified sequences) - 1 (0.03%) The plots below show the taxonomic abundances ordered from the most abundant to least abundant. Only the top 50 most abundant are shown. The y-axis plots the abundances of annotations on a log scale.
The rank abundance chart is a tool for visually representing taxonomic richness and evenness.
taxon abundance Bacteria: 5450 Eukaryota: 1953 Viruses: 9 Archaea: 1 other sequences: 1 unclassified sequences: 1 0 101 102 103 104 Bacteria Eukaryota Viruses Archaea other sequences unclassified sequences
taxon abundance Firmicutes: 2265 Proteobacteria: 1902 Streptophyta: 1670 Deinococcus-Thermus: 475 Actinobacteria: 253 Arthropoda: 63 Chordata: 54 Bacteroidetes: 47 Basidiomycota: 40 Ascomycota: 15 Acidobacteria: 9 unclassified (derived from Viruses): 9 Aquificae: 8 Spirochaetes: 7 Chloroflexi: 6 unclassified (derived from Bacteria): 5 Fusobacteria: 4 Tenericutes: 3 unclassified (derived from Eukaryota): 3 Chlorobi: 2 Planctomycetes: 2 Verrucomicrobia: 2 Cyanobacteria: 2 Apicomplexa: 1 Platyhelminthes: 1 Thermotogae: 1 Synergistetes: 1 Euryarchaeota: 1 unclassified (derived from other sequences): 1 unclassified (derived from unclassified sequences): 1 0 101 102 103 104 Firmicutes Proteobacteria Streptophyta Deinococcus-Thermus Actinobacteria Arthropoda Chordata Bacteroidetes Basidiomycota Ascomycota Acidobacteria unclassified (derived from Viruses) Aquificae Spirochaetes Chloroflexi unclassified (derived from Bacteria) Fusobacteria Tenericutes unclassified (derived from Eukaryota) Chlorobi Planctomycetes Verrucomicrobia Cyanobacteria Apicomplexa Platyhelminthes Thermotogae Synergistetes Euryarchaeota unclassified (derived from other sequences) unclassified (derived from unclassified sequences)
taxon abundance Bacilli: 2032 unclassified (derived from Streptophyta): 902 Gammaproteobacteria: 870 Alphaproteobacteria: 601 Deinococci: 475 Actinobacteria (class): 253 Liliopsida: 233 Betaproteobacteria: 228 Clostridia: 66 Insecta: 58 Mammalia: 43 Exobasidiomycetes: 33 Negativicutes: 16 Bacteroidia: 13 unclassified (derived from Viruses): 9 Aquificae (class): 8 Spirochaetes (class): 7 Coniferopsida: 6 Deltaproteobacteria: 6 Sphingobacteria: 6 Sordariomycetes: 5 Solibacteres: 5 Flavobacteria: 5 Erysipelotrichi: 5 unclassified (derived from Bacteria): 5 Saccharomycetes: 4 Fusobacteria (class): 4 Chloroflexi (class): 4 Mollicutes: 3 Dothideomycetes: 3 Epsilonproteobacteria: 3 Bryopsida: 3 Cytophagia: 3 Eurotiomycetes: 2 unclassified (derived from Cyanobacteria): 2 Chlorobia: 2 Planctomycetacia: 2 Arachnida: 2 unclassified (derived from Eukaryota): 2 Actinopterygii: 2 Acidobacteria (class): 1 Thermotogae (class): 1 Agaricomycetes: 1 Spartobacteria: 1 Ustilaginomycetes: 1 Verrucomicrobiae: 1 Aves: 1 Dehalococcoidetes: 1 Trematoda: 1 Coccidia: 1 0 101 102 103 104 Bacilli unclassified (derived from Streptophyta) Gammaproteobacteria Alphaproteobacteria Deinococci Actinobacteria (class) Liliopsida Betaproteobacteria Clostridia Insecta Mammalia Exobasidiomycetes Negativicutes Bacteroidia unclassified (derived from Viruses) Aquificae (class) Spirochaetes (class) Coniferopsida Deltaproteobacteria Sphingobacteria Sordariomycetes Solibacteres Flavobacteria Erysipelotrichi unclassified (derived from Bacteria) Saccharomycetes Fusobacteria (class) Chloroflexi (class) Mollicutes Dothideomycetes Epsilonproteobacteria Bryopsida Cytophagia Eurotiomycetes unclassified (derived from Cyanobacteria) Chlorobia Planctomycetacia Arachnida unclassified (derived from Eukaryota) Actinopterygii Acidobacteria (class) Thermotogae (class) Agaricomycetes Spartobacteria Ustilaginomycetes Verrucomicrobiae Aves Dehalococcoidetes Trematoda Coccidia
taxon abundance Lactobacillales: 1577 Thermales: 470 Pseudomonadales: 381 Rhodospirillales: 327 Enterobacteriales: 293 Bacillales: 271 Actinomycetales: 229 Poales: 215 Vitales: 146 Burkholderiales: 141 Brassicales: 137 Sphingomonadales: 125 Malpighiales: 74 Clostridiales: 56 Hymenoptera: 43 Fabales: 42 Malasseziales: 33 Primates: 28 Neisseriales: 21 Asterales: 17 Selenomonadales: 16 Solanales: 14 Rhizobiales: 14 Bifidobacteriales: 14 Rhodocyclales: 13 Bacteroidales: 13 Ranunculales: 10 Pasteurellales: 10 Caudovirales: 8 Aquificales: 8 Caryophyllales: 8 Asparagales: 7 Spirochaetales: 7 Oceanospirillales: 7 Rhodobacterales: 6 Sphingobacteriales: 6 Coniferales: 6 unclassified (derived from Bacteria): 5 Chromatiales: 5 Vibrionales: 5 Alismatales: 5 Solibacterales: 5 Erysipelotrichales: 5 Flavobacteriales: 5 unclassified (derived from Gammaproteobacteria): 4 Fusobacteriales: 4 Saccharomycetales: 4 Thermoanaerobacterales: 4 unclassified (derived from Betaproteobacteria): 4 Sapindales: 4 0 101 102 103 104 Lactobacillales Thermales Pseudomonadales Rhodospirillales Enterobacteriales Bacillales Actinomycetales Poales Vitales Burkholderiales Brassicales Sphingomonadales Malpighiales Clostridiales Hymenoptera Fabales Malasseziales Primates Neisseriales Asterales Selenomonadales Solanales Rhizobiales Bifidobacteriales Rhodocyclales Bacteroidales Ranunculales Pasteurellales Caudovirales Aquificales Caryophyllales Asparagales Spirochaetales Oceanospirillales Rhodobacterales Sphingobacteriales Coniferales unclassified (derived from Bacteria) Chromatiales Vibrionales Alismatales Solibacterales Erysipelotrichales Flavobacteriales unclassified (derived from Gammaproteobacteria) Fusobacteriales Saccharomycetales Thermoanaerobacterales unclassified (derived from Betaproteobacteria) Sapindales
taxon abundance Lactobacillaceae: 1034 Thermaceae: 470 Moraxellaceae: 338 Acetobacteraceae: 320 Enterobacteriaceae: 293 Poaceae: 210 Bacillaceae: 181 Vitaceae: 146 Brassicaceae: 136 Propionibacteriaceae: 129 Comamonadaceae: 86 Sphingomonadaceae: 86 Streptococcaceae: 68 Leuconostocaceae: 60 Corynebacteriaceae: 58 Staphylococcaceae: 45 Fabaceae: 42 Pseudomonadaceae: 39 Euphorbiaceae: 38 Apidae: 35 Malasseziaceae: 33 Salicaceae: 29 Clostridiales Family XI. Incertae Sedis: 22 Neisseriaceae: 21 Burkholderiaceae: 19 Hominidae: 17 Asteraceae: 16 Veillonellaceae: 15 Bifidobacteriaceae: 14 Clostridiaceae: 13 Rhodocyclaceae: 13 Enterococcaceae: 12 Solanaceae: 11 Lachnospiraceae: 10 Paenibacillaceae: 10 Pasteurellaceae: 10 Myoviridae: 7 Erythrobacteraceae: 7 Papaveraceae: 7 Actinomycetaceae: 7 Listeriaceae: 6 Aquificaceae: 6 Asparagaceae: 6 Spirochaetaceae: 6 Erysipelotrichaceae: 5 unclassified (derived from Bacteria): 5 Amaranthaceae: 5 Pinaceae: 5 Vibrionaceae: 5 Carnobacteriaceae: 5 0 101 102 103 104 Lactobacillaceae Thermaceae Moraxellaceae Acetobacteraceae Enterobacteriaceae Poaceae Bacillaceae Vitaceae Brassicaceae Propionibacteriaceae Comamonadaceae Sphingomonadaceae Streptococcaceae Leuconostocaceae Corynebacteriaceae Staphylococcaceae Fabaceae Pseudomonadaceae Euphorbiaceae Apidae Malasseziaceae Salicaceae Clostridiales Family XI. Incertae Sedis Neisseriaceae Burkholderiaceae Hominidae Asteraceae Veillonellaceae Bifidobacteriaceae Clostridiaceae Rhodocyclaceae Enterococcaceae Solanaceae Lachnospiraceae Paenibacillaceae Pasteurellaceae Myoviridae Erythrobacteraceae Papaveraceae Actinomycetaceae Listeriaceae Aquificaceae Asparagaceae Spirochaetaceae Erysipelotrichaceae unclassified (derived from Bacteria) Amaranthaceae Pinaceae Vibrionaceae Carnobacteriaceae
taxon abundance Lactobacillus: 799 Meiothermus: 422 Acinetobacter: 312 Vitis: 146 Propionibacterium: 129 Gluconobacter: 116 Zea: 105 Arabidopsis: 99 Geobacillus: 63 Delftia: 59 Streptococcus: 59 Corynebacterium: 58 Oryza: 55 Staphylococcus: 44 Novosphingobium: 41 Gluconacetobacter: 40 Pseudomonas: 39 Pediococcus: 38 Leuconostoc: 37 Ricinus: 34 Malassezia: 33 Apis: 31 Populus: 29 Bacillus: 26 Thermus: 22 Arsenophonus: 21 Acetobacter: 21 Sphingomonas: 19 Erwinia: 18 Brassica: 17 Medicago: 17 Anoxybacillus: 15 Veillonella: 14 Homo: 13 Neisseria: 13 Enterococcus: 12 Clostridium: 12 Burkholderia: 11 Anaerococcus: 11 Serratia: 10 Pantoea: 10 Solanum: 9 Paenibacillus: 7 Bifidobacterium: 7 Lactococcus: 7 Trifolium: 7 Escherichia: 7 Erythrobacter: 7 Sorghum: 7 Asparagus: 6 0 101 102 103 Lactobacillus Meiothermus Acinetobacter Vitis Propionibacterium Gluconobacter Zea Arabidopsis Geobacillus Delftia Streptococcus Corynebacterium Oryza Staphylococcus Novosphingobium Gluconacetobacter Pseudomonas Pediococcus Leuconostoc Ricinus Malassezia Apis Populus Bacillus Thermus Arsenophonus Acetobacter Sphingomonas Erwinia Brassica Medicago Anoxybacillus Veillonella Homo Neisseria Enterococcus Clostridium Burkholderia Anaerococcus Serratia Pantoea Solanum Paenibacillus Bifidobacterium Lactococcus Trifolium Escherichia Erythrobacter Sorghum Asparagus
The plot below shows the rarefaction curve of annotated species richness. This curve is a plot of the total number of distinct species annotations as a function of the number of sequences sampled. On the left, a steep slope indicates that a large fraction of the species diversity remains to be discovered. If the curve becomes flatter to the right, a reasonable number of individuals is sampled: more intensive sampling is likely to yield only few additional species.
Sampling curves generally rise very quickly at first and then level off toward an asymptote as fewer new species are found per unit of individuals collected. These rarefaction curves are calculated from the table of species abundance. The curves represent the average number of different species annotations for subsamples of the complete data set.
undefined number of reads species count 0 160 320 480 640 800 960 1,120 1,280 1,440 1,600 0 70,000 140,000 210,000 280,000 350,000 420,000 490,000 560,000 630,000 700,000
The α-diversity of this data set is 210 species.
2σ σ μ σ 2σ 209.94 209.94 209.94
The above image shows the range of α-diversity values in the study 'SPlab-run431-SPD4-asm'. The min, max, and mean values are shown, with the standard deviation ranges (σ and 2σ) in different shades. The α-diversity of this metagenome is shown in red.
Alpha diversity summarizes the diversity of organisms in a sample with a single number. The α-diversity of annotated samples can be estimated from the distribution of the species-level annotations.
Annotated species richness is the number of distinct species annotations in the combined MG-RAST data set. Shannon diversity is an abundance-weighted average of the logarithm of the relative abundances of annotated species. The species-level annotations are from all the annotation source databases used by MG-RAST.
The histograms below show the distribution of sequence lengths in basepairs for this metagenome. Each position represents the number of sequences within a length bp range.
The data used in these graphs is based on raw upload and post-QC sequences.
uploaded post-QC sequence length (bp) number of reads 0 3,500 7,000 10,500 14,000 17,500 21,000 24,500 28,000 31,500 35,000 1 121 241 361 481 601 720 840 960 1,080 1,200 uploaded post-QC
The histograms below show the distribution of the GC percentage for this metagenome. Each position represents the number of sequences within a GC percentage range. The data used in these graphs is based on raw upload and post-QC sequences.
uploaded post-QC sequence GC % number of reads 0 4,000 8,000 12,000 16,000 20,000 24,000 28,000 32,000 36,000 40,000 0 10 20 30 40 50 60 70 80 90 100 uploaded post-QC
This data set has no sample data