assembled shotgun metagenome
This data is private and cannot be publicly linked. Click the share button for publication options. show sharing options
SPD3.k51.ForMGRAST
This assembled shotgun metagenome is part of the study 'SPlab-run431-SPD3-asm'
by undefined undefined, undefined.
VisibilityprivateNCBI Project ID-
ID7b76dc45d26d676d343836313733332e33GOLD ID-
ENA Project IDENA Library ID
Static Linkprivate metagenomes cannot be linkedPubMed ID-
Sample - Library -
The data set SPD3.k51.ForMGRAST was uploaded on 2019-09-30 at 14:19:02 and contains 67,077 sequences totaling 10,342,990 basepairs with an average length of 154 bps.
Of the sequences tested, 10,773 sequences (16.06%) failed to pass the QC pipeline. Of those, dereplication identified 6,655 sequences as artificial duplicate reads.
Of the sequences that passed QC, 707 sequences (2%) contain ribosomal RNA genes, 12,083 sequences (27.68%) contain predicted proteins with known functions, and 30,868 sequences (70.70%) contain predicted proteins with unknown function.
Data shown here is displayed as a quick way to assess the quality and contents of the data set. We note that the submitting authors may have performed their own analysis. The analysis page provides the best way to perform in-depth analyses of this data set.
Provenance
The data on this page represents the automated analysis generated by the MG-RAST automated processing pipeline. Details on the processing of this data set are in the processing receipt.
Sequence Breakdown
failed QC - 10,773 (16.06%)unknown - 12,646 (18.85%)predicted feature - 43,658 (65.09%)
Predicted Features
unknown protein - 30,868 (70.70%)annotated protein - 12,083 (27.68%)ribosomal RNA - 707 (1.62%)
Analysis Statistics
GSC MIxS Info
Upload: bp Count10,342,990 bp
Upload: Sequences Count67,077
Upload: Mean Sequence Length154 ± 95 bp
Upload: Mean GC percent42 ± 7 %
Artificial Duplicate Reads: Sequence Count6,655
Post QC: bp Count8,387,440 bp
Post QC: Sequences Count56,304
Post QC: Mean Sequence Length149 ± 62 bp
Post QC: Mean GC percent42 ± 7 %
Processed: Predicted Protein Features38,817
Processed: Predicted rRNA Features608
Alignment: Identified Protein Features8,748
Alignment: Identified rRNA Features464
Annotation: Identified Functional Categoriesundefined
Investigation TypeWGS
Study NameSPlab-run431-SPD3-asm
Latitude and Longitude
Country and/or Sea, Location
Collection Date
Environment (Biome)
Environment (Feature)
Environment (Material)
Environmental Package
Sequencing Methodother
More Metadataclick for full table
K-mer Profile
The kmer rank abundance graph plots the kmer coverage as a function of abundance rank, with the most abundant sequences at the left.
undefinedsequence sizek-mer coverage01011021030101102103104105106107
Source Hits Distribution
The graph below displays the number of hits in the different databases listed. These include protein databases, protein databases with functional hierarchy information, and ribosomal RNA databases. The bars representing annotated reads are colored by e-value range. Different databases have different numbers of hits but can also have different types of annotation data.
5831,4373,1571,8237945061,2532,6511,4456126351,5603,6472,2779623697851,7399843785071,2532,9601,7717506231,5243,4552,1618871593978104261665011,2212,5661,3985524139402,2451,3155182545641,2126762424119152,1381,221468369101033666193017429174011642481835410001,0002,0003,0004,0005,0006,0007,0008,0009,00010,000RefSeqIMGTrEMBLSubsystemsKEGGGenBankSwissProtPATRICeggNOGKOGOCOGRDPLSUSSUNOGe^-30 & lesse^-20 to e^-30e^-10 to e^-20e^-5 to e^-10e^-3 to e^-5
Functional Category Hits Distribution
The pie charts below illustrate the distribution of functional categories for COGs, KOs, NOGs, and Subsystems at the highest level supported by these functional hierarchies. Each slice indicates the percentage of reads with predicted protein functions annotated to the category for the given source.
COG
METABOLISM - 2,213 (37.58%)INFORMATION STORAGE AND PROCESSING - 1,490 (25.30%)CELLULAR PROCESSES AND SIGNALING - 1,134 (19.26%)POORLY CHARACTERIZED - 1,052 (17.86%)
NOG
POORLY CHARACTERIZED - 83 (62.41%)METABOLISM - 23 (17.29%)INFORMATION STORAGE AND PROCESSING - 15 (11.28%)CELLULAR PROCESSES AND SIGNALING - 12 (9.02%)
KO
Metabolism - 2,425 (58.31%)Genetic Information Processing - 1,018 (24.48%)Environmental Information Processing - 497 (11.95%)Cellular Processes - 123 (2.96%)Human Diseases - 79 (1.90%)Organismal Systems - 17 (0.41%)
Subsystems
Clustering-based subsystems - 1,962 (15.93%)Carbohydrates - 1,409 (11.44%)Miscellaneous - 1,099 (8.92%)Protein Metabolism - 1,014 (8.23%)Amino Acids and Derivatives - 931 (7.56%)RNA Metabolism - 928 (7.53%)DNA Metabolism - 703 (5.71%)Cofactors, Vitamins, Prosthetic Groups, Pigments - 659 (5.35%)Nucleosides and Nucleotides - 566 (4.60%)Cell Wall and Capsule - 562 (4.56%)Fatty Acids, Lipids, and Isoprenoids - 335 (2.72%)Virulence, Disease and Defense - 282 (2.29%)Membrane Transport - 260 (2.11%)Cell Division and Cell Cycle - 242 (1.96%)Regulation and Cell signaling - 236 (1.92%)Stress Response - 232 (1.88%)Phages, Prophages, Transposable elements, Plasmids - 198 (1.61%)Respiration - 168 (1.36%)Phosphorus Metabolism - 147 (1.19%)Iron acquisition and metabolism - 88 (0.71%)Motility and Chemotaxis - 59 (0.48%)Metabolism of Aromatic Compounds - 56 (0.45%)Sulfur Metabolism - 47 (0.38%)Dormancy and Sporulation - 33 (0.27%)Photosynthesis - 29 (0.24%)Potassium metabolism - 28 (0.23%)Nitrogen Metabolism - 23 (0.19%)Secondary Metabolism - 21 (0.17%)
Taxonomic Hits Distribution
The charts below represent the distribution of taxa using a contigLCA algorithm finding a single consensus taxonomic entity for all features on each individual sequence.
Domain
Bacteria - 6,554 (63.64%)Eukaryota - 3,724 (36.16%)Viruses - 18 (0.17%)Archaea - 1 (0.01%)unclassified sequences - 1 (0.01%)
Phylum
Firmicutes - 4,593 (47.55%)Streptophyta - 3,442 (35.63%)Proteobacteria - 1,141 (11.81%)Actinobacteria - 136 (1.41%)Deinococcus-Thermus - 136 (1.41%)Bacteroidetes - 42 (0.43%)Arthropoda - 40 (0.41%)Chordata - 25 (0.26%)Basidiomycota - 20 (0.21%)unclassified (derived from Viruses) - 18 (0.19%)Acidobacteria - 9 (0.09%)Tenericutes - 8 (0.08%)Ascomycota - 7 (0.07%)Fusobacteria - 6 (0.06%)Cyanobacteria - 4 (0.04%)Spirochaetes - 4 (0.04%)Chlorobi - 4 (0.04%)Aquificae - 4 (0.04%)Verrucomicrobia - 3 (0.03%)unclassified (derived from Eukaryota) - 3 (0.03%)Platyhelminthes - 2 (0.02%)Chloroflexi - 2 (0.02%)Synergistetes - 1 (0.01%)Gemmatimonadetes - 1 (0.01%)Bacillariophyta - 1 (0.01%)Planctomycetes - 1 (0.01%)Euryarchaeota - 1 (0.01%)Cnidaria - 1 (0.01%)unclassified (derived from Bacteria) - 1 (0.01%)Annelida - 1 (0.01%)Apicomplexa - 1 (0.01%)Chrysiogenetes - 1 (0.01%)unclassified (derived from unclassified sequences) - 1 (0.01%)
Class
Bacilli - 4,312 (53.34%)unclassified (derived from Streptophyta) - 1,843 (22.80%)Gammaproteobacteria - 507 (6.27%)Alphaproteobacteria - 415 (5.13%)Liliopsida - 342 (4.23%)Actinobacteria (class) - 136 (1.68%)Deinococci - 136 (1.68%)Betaproteobacteria - 100 (1.24%)Clostridia - 71 (0.88%)Insecta - 38 (0.47%)Mammalia - 22 (0.27%)unclassified (derived from Viruses) - 18 (0.22%)Exobasidiomycetes - 17 (0.21%)Bacteroidia - 12 (0.15%)Negativicutes - 11 (0.14%)Deltaproteobacteria - 9 (0.11%)Mollicutes - 8 (0.10%)Flavobacteria - 8 (0.10%)Fusobacteria (class) - 6 (0.07%)Sphingobacteria - 6 (0.07%)Erysipelotrichi - 5 (0.06%)unclassified (derived from Cyanobacteria) - 4 (0.05%)Chlorobia - 4 (0.05%)Spirochaetes (class) - 4 (0.05%)Aquificae (class) - 4 (0.05%)Sordariomycetes - 4 (0.05%)Acidobacteria (class) - 4 (0.05%)Bryopsida - 4 (0.05%)Solibacteres - 3 (0.04%)Cestoda - 2 (0.02%)Cytophagia - 2 (0.02%)Coniferopsida - 2 (0.02%)Chloroflexi (class) - 2 (0.02%)unclassified (derived from Eukaryota) - 2 (0.02%)unclassified (derived from Verrucomicrobia) - 2 (0.02%)Halobacteria - 1 (0.01%)Malacostraca - 1 (0.01%)Leotiomycetes - 1 (0.01%)Saccharomycetes - 1 (0.01%)Epsilonproteobacteria - 1 (0.01%)Bacillariophyceae - 1 (0.01%)Gemmatimonadetes (class) - 1 (0.01%)Actinopterygii - 1 (0.01%)Planctomycetacia - 1 (0.01%)Hydrozoa - 1 (0.01%)Synergistia - 1 (0.01%)unclassified (derived from Acidobacteria) - 1 (0.01%)Verrucomicrobiae - 1 (0.01%)Chrysiogenetes (class) - 1 (0.01%)unclassified (derived from Bacteria) - 1 (0.01%)Foraminifera - 1 (0.01%)Polychaeta - 1 (0.01%)Aconoidasida - 1 (0.01%)unclassified (derived from unclassified sequences) - 1 (0.01%)
Order
Lactobacillales - 3,734 (56.08%)Vitales - 528 (7.93%)Poales - 296 (4.45%)Pseudomonadales - 254 (3.81%)Rhodospirillales - 246 (3.69%)Bacillales - 210 (3.15%)Enterobacteriales - 176 (2.64%)Brassicales - 152 (2.28%)Thermales - 133 (2.00%)Actinomycetales - 125 (1.88%)Malpighiales - 102 (1.53%)Sphingomonadales - 79 (1.19%)Clostridiales - 68 (1.02%)Fabales - 55 (0.83%)Burkholderiales - 47 (0.71%)Asparagales - 31 (0.47%)Hymenoptera - 30 (0.45%)Caryophyllales - 27 (0.41%)Primates - 22 (0.33%)Neisseriales - 20 (0.30%)Lamiales - 19 (0.29%)Asterales - 18 (0.27%)Malasseziales - 17 (0.26%)Solanales - 15 (0.23%)Rhizobiales - 14 (0.21%)Bacteroidales - 12 (0.18%)Caudovirales - 12 (0.18%)Rhodocyclales - 11 (0.17%)Cucurbitales - 11 (0.17%)Selenomonadales - 11 (0.17%)Ranunculales - 10 (0.15%)Pasteurellales - 8 (0.12%)Fusobacteriales - 6 (0.09%)Rosales - 6 (0.09%)unclassified (derived from Viruses) - 6 (0.09%)Flavobacteriales - 6 (0.09%)Bifidobacteriales - 6 (0.09%)Sphingobacteriales - 6 (0.09%)Vibrionales - 5 (0.08%)Alteromonadales - 5 (0.08%)Erysipelotrichales - 5 (0.08%)Chlorobiales - 4 (0.06%)Acidobacteriales - 4 (0.06%)Hypocreales - 4 (0.06%)Chroococcales - 4 (0.06%)Spirochaetales - 4 (0.06%)Aquificales - 4 (0.06%)Rhodobacterales - 4 (0.06%)Acholeplasmatales - 4 (0.06%)Funariales - 4 (0.06%)Caulobacterales - 3 (0.05%)Solibacterales - 3 (0.05%)Diptera - 3 (0.05%)Alismatales - 2 (0.03%)unclassified (derived from Gammaproteobacteria) - 2 (0.03%)Entomoplasmatales - 2 (0.03%)Myxococcales - 2 (0.03%)Desulfuromonadales - 2 (0.03%)Mycoplasmatales - 2 (0.03%)Methylacidiphilales - 2 (0.03%)Legionellales - 2 (0.03%)Deinococcales - 2 (0.03%)Cyclophyllidea - 2 (0.03%)Malvales - 2 (0.03%)Chloroflexales - 2 (0.03%)Xanthomonadales - 2 (0.03%)unclassified (derived from Flavobacteria) - 2 (0.03%)Cytophagales - 2 (0.03%)Coniferales - 2 (0.03%)Chromatiales - 2 (0.03%)Saccharomycetales - 1 (0.02%)Gallionellales - 1 (0.02%)Naviculales - 1 (0.02%)Oceanospirillales - 1 (0.02%)Tetraodontiformes - 1 (0.02%)Methylophilales - 1 (0.02%)Hydroida - 1 (0.02%)Thermoanaerobacterales - 1 (0.02%)Thiotrichales - 1 (0.02%)Nitrosomonadales - 1 (0.02%)Parvularculales - 1 (0.02%)Bdellovibrionales - 1 (0.02%)Desulfobacterales - 1 (0.02%)Synergistales - 1 (0.02%)Halobacteriales - 1 (0.02%)unclassified (derived from Acidobacteria) - 1 (0.02%)Celastrales - 1 (0.02%)Campylobacterales - 1 (0.02%)Ericales - 1 (0.02%)Verrucomicrobiales - 1 (0.02%)Gentianales - 1 (0.02%)Chrysiogenales - 1 (0.02%)unclassified (derived from Bacteria) - 1 (0.02%)Phyllodocida - 1 (0.02%)Haemosporida - 1 (0.02%)Coleoptera - 1 (0.02%)Sapindales - 1 (0.02%)Gemmatimonadales - 1 (0.02%)Planctomycetales - 1 (0.02%)Rotaliida - 1 (0.02%)Kinetoplastida - 1 (0.02%)Peronosporales - 1 (0.02%)Decapoda - 1 (0.02%)Helotiales - 1 (0.02%)unclassified (derived from unclassified sequences) - 1 (0.02%)
Family
Lactobacillaceae - 2,495 (44.88%)Vitaceae - 528 (9.50%)Poaceae - 292 (5.25%)Acetobacteraceae - 239 (4.30%)Moraxellaceae - 233 (4.19%)Enterobacteriaceae - 176 (3.17%)Brassicaceae - 151 (2.72%)Thermaceae - 133 (2.39%)Leuconostocaceae - 113 (2.03%)Bacillaceae - 111 (2.00%)Streptococcaceae - 80 (1.44%)Propionibacteriaceae - 66 (1.19%)Sphingomonadaceae - 57 (1.03%)Salicaceae - 57 (1.03%)Fabaceae - 55 (0.99%)Enterococcaceae - 50 (0.90%)Staphylococcaceae - 45 (0.81%)Euphorbiaceae - 40 (0.72%)Comamonadaceae - 31 (0.56%)Asparagaceae - 30 (0.54%)Corynebacteriaceae - 28 (0.50%)Clostridiales Family XI. Incertae Sedis - 26 (0.47%)Apidae - 25 (0.45%)Listeriaceae - 24 (0.43%)Amaranthaceae - 24 (0.43%)Clostridiaceae - 21 (0.38%)Hominidae - 21 (0.38%)Neisseriaceae - 20 (0.36%)Malasseziaceae - 17 (0.31%)Asteraceae - 17 (0.31%)Pseudomonadaceae - 16 (0.29%)Solanaceae - 13 (0.23%)Cucurbitaceae - 11 (0.20%)Veillonellaceae - 11 (0.20%)Rhodocyclaceae - 11 (0.20%)Burkholderiaceae - 10 (0.18%)Pasteurellaceae - 8 (0.14%)Orobanchaceae - 8 (0.14%)Siphoviridae - 7 (0.13%)Carnobacteriaceae - 7 (0.13%)Papaveraceae - 6 (0.11%)unclassified (derived from Viruses) - 6 (0.11%)Plantaginaceae - 6 (0.11%)Flavobacteriaceae - 6 (0.11%)Bifidobacteriaceae - 6 (0.11%)Fusobacteriaceae - 6 (0.11%)Rosaceae - 5 (0.09%)Prevotellaceae - 5 (0.09%)Erysipelotrichaceae - 5 (0.09%)Vibrionaceae - 5 (0.09%)Aerococcaceae - 5 (0.09%)Eubacteriaceae - 4 (0.07%)Shewanellaceae - 4 (0.07%)Aquificaceae - 4 (0.07%)Myoviridae - 4 (0.07%)Micrococcaceae - 4 (0.07%)Sphingobacteriaceae - 4 (0.07%)Chlorobiaceae - 4 (0.07%)Acidobacteriaceae - 4 (0.07%)unclassified (derived from Chroococcales) - 4 (0.07%)Paenibacillaceae - 4 (0.07%)Spirochaetaceae - 4 (0.07%)Acholeplasmataceae - 4 (0.07%)Rhodobacteraceae - 4 (0.07%)Funariaceae - 4 (0.07%)Lachnospiraceae - 4 (0.07%)Clavicipitaceae - 3 (0.05%)Methylobacteriaceae - 3 (0.05%)Caryophyllaceae - 3 (0.05%)Peptococcaceae - 3 (0.05%)Caulobacteraceae - 3 (0.05%)Ranunculaceae - 3 (0.05%)Mycobacteriaceae - 3 (0.05%)Bradyrhizobiaceae - 3 (0.05%)Solibacteraceae - 3 (0.05%)Ruminococcaceae - 2 (0.04%)unclassified (derived from Gammaproteobacteria) - 2 (0.04%)Deinococcaceae - 2 (0.04%)Xanthomonadaceae - 2 (0.04%)Chloroflexaceae - 2 (0.04%)Culicidae - 2 (0.04%)Methylacidiphilaceae - 2 (0.04%)unclassified (derived from Clostridiales) - 2 (0.04%)unclassified (derived from Flavobacteria) - 2 (0.04%)Formicidae - 2 (0.04%)Bacteroidaceae - 2 (0.04%)Ectothiorhodospiraceae - 2 (0.04%)Nocardiaceae - 2 (0.04%)Mycoplasmataceae - 2 (0.04%)Geobacteraceae - 2 (0.04%)Convolvulaceae - 2 (0.04%)Pinaceae - 2 (0.04%)Araceae - 2 (0.04%)Spiroplasmataceae - 2 (0.04%)Gordoniaceae - 2 (0.04%)Desulfobacteraceae - 1 (0.02%)Streptomycetaceae - 1 (0.02%)Brucellaceae - 1 (0.02%)Synergistaceae - 1 (0.02%)Geodermatophilaceae - 1 (0.02%)Methylocystaceae - 1 (0.02%)Phyllobacteriaceae - 1 (0.02%)Verrucomicrobiaceae - 1 (0.02%)Rhizobiaceae - 1 (0.02%)unclassified (derived from Bacteria) - 1 (0.02%)Nereididae - 1 (0.02%)unclassified (derived from Haemosporida) - 1 (0.02%)Scarabaeidae - 1 (0.02%)Chrysiogenaceae - 1 (0.02%)Drosophilidae - 1 (0.02%)unclassified (derived from Acidobacteria) - 1 (0.02%)Erythrobacteraceae - 1 (0.02%)Sclerotiniaceae - 1 (0.02%)Metschnikowiaceae - 1 (0.02%)Microbacteriaceae - 1 (0.02%)Phaeodactylaceae - 1 (0.02%)Alcaligenaceae - 1 (0.02%)Tetraodontidae - 1 (0.02%)Porphyromonadaceae - 1 (0.02%)Hydridae - 1 (0.02%)Anoplocephalidae - 1 (0.02%)Taeniidae - 1 (0.02%)Actinomycetaceae - 1 (0.02%)Gallionellaceae - 1 (0.02%)Methylophilaceae - 1 (0.02%)Cytophagaceae - 1 (0.02%)Orchidaceae - 1 (0.02%)Nitrosomonadaceae - 1 (0.02%)Actinosynnemataceae - 1 (0.02%)Campanulaceae - 1 (0.02%)Bacteriovoracaceae - 1 (0.02%)Brevibacteriaceae - 1 (0.02%)Thermoanaerobacteraceae - 1 (0.02%)Celastraceae - 1 (0.02%)Myxococcaceae - 1 (0.02%)Helicobacteraceae - 1 (0.02%)Rubiaceae - 1 (0.02%)Alteromonadaceae - 1 (0.02%)unclassified (derived from Bacillales) - 1 (0.02%)Rhodothermaceae - 1 (0.02%)Halobacteriaceae - 1 (0.02%)Cistaceae - 1 (0.02%)Malvaceae - 1 (0.02%)Coxiellaceae - 1 (0.02%)Legionellaceae - 1 (0.02%)Oceanospirillaceae - 1 (0.02%)Urticaceae - 1 (0.02%)Rutaceae - 1 (0.02%)Gemmatimonadaceae - 1 (0.02%)Planctomycetaceae - 1 (0.02%)unclassified (derived from Sphingobacteriales) - 1 (0.02%)Nonionidae - 1 (0.02%)Trypanosomatidae - 1 (0.02%)unclassified (derived from Peronosporales) - 1 (0.02%)Piscirickettsiaceae - 1 (0.02%)Podoviridae - 1 (0.02%)Parvularculaceae - 1 (0.02%)Beijerinckiaceae - 1 (0.02%)unclassified (derived from unclassified sequences) - 1 (0.02%)
Genus
Lactobacillus - 1,879 (41.52%)Vitis - 528 (11.67%)Acinetobacter - 222 (4.91%)Oryza - 160 (3.54%)Arabidopsis - 127 (2.81%)Meiothermus - 112 (2.48%)Gluconobacter - 81 (1.79%)Pediococcus - 70 (1.55%)Streptococcus - 66 (1.46%)Propionibacterium - 66 (1.46%)Zea - 61 (1.35%)Populus - 57 (1.26%)Leuconostoc - 55 (1.22%)Enterococcus - 50 (1.10%)Staphylococcus - 44 (0.97%)Bacillus - 36 (0.80%)Ricinus - 33 (0.73%)Geobacillus - 30 (0.66%)Asparagus - 30 (0.66%)Corynebacterium - 28 (0.62%)Medicago - 26 (0.57%)Apis - 25 (0.55%)Beta - 24 (0.53%)Listeria - 24 (0.53%)Novosphingobium - 22 (0.49%)Weissella - 21 (0.46%)Oenococcus - 20 (0.44%)Clostridium - 20 (0.44%)Gluconacetobacter - 17 (0.38%)Malassezia - 17 (0.38%)Homo - 17 (0.38%)Pseudomonas - 16 (0.35%)Sphingomonas - 14 (0.31%)Erwinia - 14 (0.31%)Delftia - 13 (0.29%)Cynara - 13 (0.29%)Neisseria - 12 (0.27%)Brassica - 11 (0.24%)Cucumis - 11 (0.24%)Solanum - 11 (0.24%)Lactococcus - 10 (0.22%)Pisum - 10 (0.22%)Finegoldia - 9 (0.20%)Arsenophonus - 9 (0.20%)Anaerococcus - 9 (0.20%)Acetobacter - 8 (0.18%)Burkholderia - 8 (0.18%)Thermus - 7 (0.15%)Euphorbia - 7 (0.15%)unclassified (derived from Siphoviridae) - 7 (0.15%)Glycine - 7 (0.15%)Vicia - 6 (0.13%)Bifidobacterium - 6 (0.13%)Anoxybacillus - 6 (0.13%)Veillonella - 6 (0.13%)Peptoniphilus - 6 (0.13%)unclassified (derived from Viruses) - 6 (0.13%)Sorghum - 6 (0.13%)Antirrhinum - 6 (0.13%)Papaver - 5 (0.11%)Granulicatella - 5 (0.11%)Prevotella - 5 (0.11%)Serratia - 4 (0.09%)Vibrio - 4 (0.09%)Shewanella - 4 (0.09%)Eubacterium - 4 (0.09%)Physcomitrella - 4 (0.09%)Aerococcus - 4 (0.09%)Pantoea - 4 (0.09%)Fusobacterium - 4 (0.09%)Borrelia - 3 (0.07%)Thauera - 3 (0.07%)Metarhizium - 3 (0.07%)Acidobacterium - 3 (0.07%)Candidatus Solibacter - 3 (0.07%)Catonella - 3 (0.07%)Photorhabdus - 3 (0.07%)unclassified (derived from Myoviridae) - 3 (0.07%)Methylobacterium - 3 (0.07%)Paenibacillus - 3 (0.07%)Silene - 3 (0.07%)Lysinibacillus - 3 (0.07%)Cicer - 3 (0.07%)Pedobacter - 3 (0.07%)Enhydrobacter - 3 (0.07%)Klebsiella - 3 (0.07%)Prunus - 3 (0.07%)Mycobacterium - 3 (0.07%)Mitsuokella - 2 (0.04%)Verminephrobacter - 2 (0.04%)Geobacter - 2 (0.04%)Providencia - 2 (0.04%)Turicibacter - 2 (0.04%)Azoarcus - 2 (0.04%)Lemna - 2 (0.04%)unclassified (derived from Clostridiales) - 2 (0.04%)Desulfotomaculum - 2 (0.04%)Yersinia - 2 (0.04%)Haemophilus - 2 (0.04%)Pasteurella - 2 (0.04%)Carnobacterium - 2 (0.04%)Variovorax - 2 (0.04%)Sphingopyxis - 2 (0.04%)Psychrobacter - 2 (0.04%)Caulobacter - 2 (0.04%)Escherichia - 2 (0.04%)Deinococcus - 2 (0.04%)Cyanothece - 2 (0.04%)Chloroflexus - 2 (0.04%)Chloroherpeton - 2 (0.04%)Phelipanche - 2 (0.04%)unclassified (derived from Gammaproteobacteria) - 2 (0.04%)unclassified (derived from Flavobacteria) - 2 (0.04%)Dechloromonas - 2 (0.04%)Bacteroides - 2 (0.04%)Candidatus Phytoplasma - 2 (0.04%)Malus - 2 (0.04%)Spiroplasma - 2 (0.04%)Mycoplasma - 2 (0.04%)Rothia - 2 (0.04%)Methylacidiphilum - 2 (0.04%)Gordonia - 2 (0.04%)Lutiella - 1 (0.02%)Polaromonas - 1 (0.02%)Chryseobacterium - 1 (0.02%)Flavobacterium - 1 (0.02%)Gallionella - 1 (0.02%)Methylovorus - 1 (0.02%)Kordia - 1 (0.02%)Polaribacter - 1 (0.02%)Simonsiella - 1 (0.02%)Nitrosomonas - 1 (0.02%)Aromatoleum - 1 (0.02%)Zunongwangia - 1 (0.02%)Rathayibacter - 1 (0.02%)Rhodothermus - 1 (0.02%)Bacteriovorax - 1 (0.02%)Desulfatibacillum - 1 (0.02%)Arthrobacter - 1 (0.02%)Anaeromyxobacter - 1 (0.02%)Helicobacter - 1 (0.02%)Alteromonas - 1 (0.02%)Sphingobacterium - 1 (0.02%)Alkalilimnicola - 1 (0.02%)Thioalkalivibrio - 1 (0.02%)Chitinophaga - 1 (0.02%)Citrobacter - 1 (0.02%)Enterobacter - 1 (0.02%)Alkaliphilus - 1 (0.02%)Chlorobium - 1 (0.02%)Micrococcus - 1 (0.02%)Actinomyces - 1 (0.02%)Pectobacterium - 1 (0.02%)Desulfurispirillum - 1 (0.02%)Proteus - 1 (0.02%)Actinosynnema - 1 (0.02%)Rahnella - 1 (0.02%)Microcystis - 1 (0.02%)Sodalis - 1 (0.02%)Coprococcus - 1 (0.02%)Rickettsiella - 1 (0.02%)Legionella - 1 (0.02%)Neptuniibacter - 1 (0.02%)Rhodococcus - 1 (0.02%)Faecalibacterium - 1 (0.02%)Ruminococcus - 1 (0.02%)Brevibacterium - 1 (0.02%)Carboxydothermus - 1 (0.02%)Coprobacillus - 1 (0.02%)Methylophaga - 1 (0.02%)Photobacterium - 1 (0.02%)Solobacterium - 1 (0.02%)Xanthomonas - 1 (0.02%)Streptomyces - 1 (0.02%)unclassified (derived from Erysipelotrichaceae) - 1 (0.02%)Aminomonas - 1 (0.02%)Haloterrigena - 1 (0.02%)Selenomonas - 1 (0.02%)Hydrogenobacter - 1 (0.02%)Verrucomicrobium - 1 (0.02%)Thermocrinis - 1 (0.02%)unclassified (derived from Bacteria) - 1 (0.02%)Platynereis - 1 (0.02%)Plasmodium - 1 (0.02%)Rhyparus - 1 (0.02%)Aedes - 1 (0.02%)Anopheles - 1 (0.02%)Drosophila - 1 (0.02%)Ilyobacter - 1 (0.02%)Sclerotinia - 1 (0.02%)Clavispora - 1 (0.02%)Streptobacillus - 1 (0.02%)Phaeodactylum - 1 (0.02%)Gemmatimonas - 1 (0.02%)Tetraodon - 1 (0.02%)Rhodopirellula - 1 (0.02%)Pongo - 1 (0.02%)Hydra - 1 (0.02%)Moniezia - 1 (0.02%)Echinococcus - 1 (0.02%)Geodermatophilus - 1 (0.02%)Picea - 1 (0.02%)Pinus - 1 (0.02%)Phenylobacterium - 1 (0.02%)Parvularcula - 1 (0.02%)Phalaenopsis - 1 (0.02%)Beijerinckia - 1 (0.02%)Phyllostachys - 1 (0.02%)Bradyrhizobium - 1 (0.02%)Triticum - 1 (0.02%)Rhodopseudomonas - 1 (0.02%)Ochrobactrum - 1 (0.02%)Helianthus - 1 (0.02%)Lactuca - 1 (0.02%)Trachelium - 1 (0.02%)Oceanobacillus - 1 (0.02%)Methylosinus - 1 (0.02%)Raphanus - 1 (0.02%)Parvibaculum - 1 (0.02%)Agrobacterium - 1 (0.02%)Oceanicola - 1 (0.02%)Roseobacter - 1 (0.02%)Porphyromonas - 1 (0.02%)Brevibacillus - 1 (0.02%)Phaseolus - 1 (0.02%)Candidatus Koribacter - 1 (0.02%)Trifolium - 1 (0.02%)Granulibacter - 1 (0.02%)Coffea - 1 (0.02%)Erythrobacter - 1 (0.02%)Citromicrobium - 1 (0.02%)Macrococcus - 1 (0.02%)Sphingobium - 1 (0.02%)Microscilla - 1 (0.02%)Helianthemum - 1 (0.02%)Gossypium - 1 (0.02%)Gemella - 1 (0.02%)Coptis - 1 (0.02%)Zymomonas - 1 (0.02%)Capnocytophaga - 1 (0.02%)Boehmeria - 1 (0.02%)Citrus - 1 (0.02%)Convolvulus - 1 (0.02%)Ipomoea - 1 (0.02%)Capsicum - 1 (0.02%)Lautropia - 1 (0.02%)Ralstonia - 1 (0.02%)Nonionella - 1 (0.02%)Leishmania - 1 (0.02%)Phytophthora - 1 (0.02%)T4-like viruses - 1 (0.02%)Acidovorax - 1 (0.02%)T7-like viruses - 1 (0.02%)Curvibacter - 1 (0.02%)Eremococcus - 1 (0.02%)unclassified (derived from unclassified sequences) - 1 (0.02%)
Rank Abundance Plot
The plots below show the taxonomic abundances ordered from the most abundant to least abundant. Only the top 50 most abundant are shown. The y-axis plots the abundances of annotations on a log scale.
The rank abundance chart is a tool for visually representing taxonomic richness and evenness.
taxon abundanceBacteria: 6554Eukaryota: 3724Viruses: 18Archaea: 1unclassified sequences: 10101102103104BacteriaEukaryotaVirusesArchaeaunclassified sequences
taxon abundanceFirmicutes: 4593Streptophyta: 3442Proteobacteria: 1141Actinobacteria: 136Deinococcus-Thermus: 136Bacteroidetes: 42Arthropoda: 40Chordata: 25Basidiomycota: 20unclassified (derived from Viruses): 18Acidobacteria: 9Tenericutes: 8Ascomycota: 7Fusobacteria: 6Cyanobacteria: 4Spirochaetes: 4Chlorobi: 4Aquificae: 4Verrucomicrobia: 3unclassified (derived from Eukaryota): 3Platyhelminthes: 2Chloroflexi: 2Synergistetes: 1Gemmatimonadetes: 1Bacillariophyta: 1Planctomycetes: 1Euryarchaeota: 1Cnidaria: 1unclassified (derived from Bacteria): 1Annelida: 1Apicomplexa: 1Chrysiogenetes: 1unclassified (derived from unclassified sequences): 10101102103104FirmicutesStreptophytaProteobacteriaActinobacteriaDeinococcus-ThermusBacteroidetesArthropodaChordataBasidiomycotaunclassified (derived from Viruses)AcidobacteriaTenericutesAscomycotaFusobacteriaCyanobacteriaSpirochaetesChlorobiAquificaeVerrucomicrobiaunclassified (derived from Eukaryota)PlatyhelminthesChloroflexiSynergistetesGemmatimonadetesBacillariophytaPlanctomycetesEuryarchaeotaCnidariaunclassified (derived from Bacteria)AnnelidaApicomplexaChrysiogenetesunclassified (derived from unclassified sequences)
taxon abundanceBacilli: 4312unclassified (derived from Streptophyta): 1843Gammaproteobacteria: 507Alphaproteobacteria: 415Liliopsida: 342Actinobacteria (class): 136Deinococci: 136Betaproteobacteria: 100Clostridia: 71Insecta: 38Mammalia: 22unclassified (derived from Viruses): 18Exobasidiomycetes: 17Bacteroidia: 12Negativicutes: 11Deltaproteobacteria: 9Mollicutes: 8Flavobacteria: 8Fusobacteria (class): 6Sphingobacteria: 6Erysipelotrichi: 5unclassified (derived from Cyanobacteria): 4Chlorobia: 4Spirochaetes (class): 4Aquificae (class): 4Sordariomycetes: 4Acidobacteria (class): 4Bryopsida: 4Solibacteres: 3Cestoda: 2Cytophagia: 2Coniferopsida: 2Chloroflexi (class): 2unclassified (derived from Eukaryota): 2unclassified (derived from Verrucomicrobia): 2Halobacteria: 1Malacostraca: 1Leotiomycetes: 1Saccharomycetes: 1Epsilonproteobacteria: 1Bacillariophyceae: 1Gemmatimonadetes (class): 1Actinopterygii: 1Planctomycetacia: 1Hydrozoa: 1Synergistia: 1unclassified (derived from Acidobacteria): 1Verrucomicrobiae: 1Chrysiogenetes (class): 1unclassified (derived from Bacteria): 10101102103104Bacilliunclassified (derived from Streptophyta)GammaproteobacteriaAlphaproteobacteriaLiliopsidaActinobacteria (class)DeinococciBetaproteobacteriaClostridiaInsectaMammaliaunclassified (derived from Viruses)ExobasidiomycetesBacteroidiaNegativicutesDeltaproteobacteriaMollicutesFlavobacteriaFusobacteria (class)SphingobacteriaErysipelotrichiunclassified (derived from Cyanobacteria)ChlorobiaSpirochaetes (class)Aquificae (class)SordariomycetesAcidobacteria (class)BryopsidaSolibacteresCestodaCytophagiaConiferopsidaChloroflexi (class)unclassified (derived from Eukaryota)unclassified (derived from Verrucomicrobia)HalobacteriaMalacostracaLeotiomycetesSaccharomycetesEpsilonproteobacteriaBacillariophyceaeGemmatimonadetes (class)ActinopterygiiPlanctomycetaciaHydrozoaSynergistiaunclassified (derived from Acidobacteria)VerrucomicrobiaeChrysiogenetes (class)unclassified (derived from Bacteria)
taxon abundanceLactobacillales: 3734Vitales: 528Poales: 296Pseudomonadales: 254Rhodospirillales: 246Bacillales: 210Enterobacteriales: 176Brassicales: 152Thermales: 133Actinomycetales: 125Malpighiales: 102Sphingomonadales: 79Clostridiales: 68Fabales: 55Burkholderiales: 47Asparagales: 31Hymenoptera: 30Caryophyllales: 27Primates: 22Neisseriales: 20Lamiales: 19Asterales: 18Malasseziales: 17Solanales: 15Rhizobiales: 14Bacteroidales: 12Caudovirales: 12Rhodocyclales: 11Cucurbitales: 11Selenomonadales: 11Ranunculales: 10Pasteurellales: 8Fusobacteriales: 6Rosales: 6unclassified (derived from Viruses): 6Flavobacteriales: 6Bifidobacteriales: 6Sphingobacteriales: 6Vibrionales: 5Alteromonadales: 5Erysipelotrichales: 5Chlorobiales: 4Acidobacteriales: 4Hypocreales: 4Chroococcales: 4Spirochaetales: 4Aquificales: 4Rhodobacterales: 4Acholeplasmatales: 4Funariales: 40101102103104LactobacillalesVitalesPoalesPseudomonadalesRhodospirillalesBacillalesEnterobacterialesBrassicalesThermalesActinomycetalesMalpighialesSphingomonadalesClostridialesFabalesBurkholderialesAsparagalesHymenopteraCaryophyllalesPrimatesNeisserialesLamialesAsteralesMalassezialesSolanalesRhizobialesBacteroidalesCaudoviralesRhodocyclalesCucurbitalesSelenomonadalesRanunculalesPasteurellalesFusobacterialesRosalesunclassified (derived from Viruses)FlavobacterialesBifidobacterialesSphingobacterialesVibrionalesAlteromonadalesErysipelotrichalesChlorobialesAcidobacterialesHypocrealesChroococcalesSpirochaetalesAquificalesRhodobacteralesAcholeplasmatalesFunariales
taxon abundanceLactobacillaceae: 2495Vitaceae: 528Poaceae: 292Acetobacteraceae: 239Moraxellaceae: 233Enterobacteriaceae: 176Brassicaceae: 151Thermaceae: 133Leuconostocaceae: 113Bacillaceae: 111Streptococcaceae: 80Propionibacteriaceae: 66Sphingomonadaceae: 57Salicaceae: 57Fabaceae: 55Enterococcaceae: 50Staphylococcaceae: 45Euphorbiaceae: 40Comamonadaceae: 31Asparagaceae: 30Corynebacteriaceae: 28Clostridiales Family XI. Incertae Sedis: 26Apidae: 25Listeriaceae: 24Amaranthaceae: 24Clostridiaceae: 21Hominidae: 21Neisseriaceae: 20Malasseziaceae: 17Asteraceae: 17Pseudomonadaceae: 16Solanaceae: 13Cucurbitaceae: 11Veillonellaceae: 11Rhodocyclaceae: 11Burkholderiaceae: 10Pasteurellaceae: 8Orobanchaceae: 8Siphoviridae: 7Carnobacteriaceae: 7Papaveraceae: 6unclassified (derived from Viruses): 6Plantaginaceae: 6Flavobacteriaceae: 6Bifidobacteriaceae: 6Fusobacteriaceae: 6Rosaceae: 5Prevotellaceae: 5Erysipelotrichaceae: 5Vibrionaceae: 50101102103104LactobacillaceaeVitaceaePoaceaeAcetobacteraceaeMoraxellaceaeEnterobacteriaceaeBrassicaceaeThermaceaeLeuconostocaceaeBacillaceaeStreptococcaceaePropionibacteriaceaeSphingomonadaceaeSalicaceaeFabaceaeEnterococcaceaeStaphylococcaceaeEuphorbiaceaeComamonadaceaeAsparagaceaeCorynebacteriaceaeClostridiales Family XI. Incertae SedisApidaeListeriaceaeAmaranthaceaeClostridiaceaeHominidaeNeisseriaceaeMalasseziaceaeAsteraceaePseudomonadaceaeSolanaceaeCucurbitaceaeVeillonellaceaeRhodocyclaceaeBurkholderiaceaePasteurellaceaeOrobanchaceaeSiphoviridaeCarnobacteriaceaePapaveraceaeunclassified (derived from Viruses)PlantaginaceaeFlavobacteriaceaeBifidobacteriaceaeFusobacteriaceaeRosaceaePrevotellaceaeErysipelotrichaceaeVibrionaceae
taxon abundanceLactobacillus: 1879Vitis: 528Acinetobacter: 222Oryza: 160Arabidopsis: 127Meiothermus: 112Gluconobacter: 81Pediococcus: 70Streptococcus: 66Propionibacterium: 66Zea: 61Populus: 57Leuconostoc: 55Enterococcus: 50Staphylococcus: 44Bacillus: 36Ricinus: 33Geobacillus: 30Asparagus: 30Corynebacterium: 28Medicago: 26Apis: 25Beta: 24Listeria: 24Novosphingobium: 22Weissella: 21Oenococcus: 20Clostridium: 20Gluconacetobacter: 17Malassezia: 17Homo: 17Pseudomonas: 16Sphingomonas: 14Erwinia: 14Delftia: 13Cynara: 13Neisseria: 12Brassica: 11Cucumis: 11Solanum: 11Lactococcus: 10Pisum: 10Finegoldia: 9Arsenophonus: 9Anaerococcus: 9Acetobacter: 8Burkholderia: 8Thermus: 7Euphorbia: 7unclassified (derived from Siphoviridae): 70101102103104LactobacillusVitisAcinetobacterOryzaArabidopsisMeiothermusGluconobacterPediococcusStreptococcusPropionibacteriumZeaPopulusLeuconostocEnterococcusStaphylococcusBacillusRicinusGeobacillusAsparagusCorynebacteriumMedicagoApisBetaListeriaNovosphingobiumWeissellaOenococcusClostridiumGluconacetobacterMalasseziaHomoPseudomonasSphingomonasErwiniaDelftiaCynaraNeisseriaBrassicaCucumisSolanumLactococcusPisumFinegoldiaArsenophonusAnaerococcusAcetobacterBurkholderiaThermusEuphorbiaunclassified (derived from Siphoviridae)
Rarefaction Curve
The plot below shows the rarefaction curve of annotated species richness. This curve is a plot of the total number of distinct species annotations as a function of the number of sequences sampled. On the left, a steep slope indicates that a large fraction of the species diversity remains to be discovered. If the curve becomes flatter to the right, a reasonable number of individuals is sampled: more intensive sampling is likely to yield only few additional species.
Sampling curves generally rise very quickly at first and then level off toward an asymptote as fewer new species are found per unit of individuals collected. These rarefaction curves are calculated from the table of species abundance. The curves represent the average number of different species annotations for subsamples of the complete data set.
undefinednumber of readsspecies count01402804205607008409801,1201,2601,40007,00014,00021,00028,00035,00042,00049,00056,00063,00070,000
Alpha Diversity
The α-diversity of this data set is 135 species.
The above image shows the range of α-diversity values in the study 'SPlab-run431-SPD3-asm'. The min, max, and mean values are shown, with the standard deviation ranges (σ and 2σ) in different shades. The α-diversity of this metagenome is shown in red.
Alpha diversity summarizes the diversity of organisms in a sample with a single number. The α-diversity of annotated samples can be estimated from the distribution of the species-level annotations.
Annotated species richness is the number of distinct species annotations in the combined MG-RAST data set. Shannon diversity is an abundance-weighted average of the logarithm of the relative abundances of annotated species. The species-level annotations are from all the annotation source databases used by MG-RAST.
Sequence Length Histogram
The histograms below show the distribution of sequence lengths in basepairs for this metagenome. Each position represents the number of sequences within a length bp range.
The data used in these graphs is based on raw upload and post-QC sequences.
uploadedpost-QCsequence length (bp)number of reads01402804205607008409801,1201,2601,400523476429361,2311,5261,8212,1162,4102,7053,000uploadedpost-QC
Sequence GC Distribution
The histograms below show the distribution of the GC percentage for this metagenome. Each position represents the number of sequences within a GC percentage range. The data used in these graphs is based on raw upload and post-QC sequences.
uploadedpost-QCsequence GC %number of reads04509001,3501,8002,2502,7003,1503,6004,0504,5000102030405060708090100uploadedpost-QC
Sample Data
This data set has no sample data