assembled shotgun metagenome
This data is private and cannot be publicly linked. Click the share button for publication options. show sharing options
SPD4.k17.ForMGRAST
This assembled shotgun metagenome is part of the study 'SPlab-run431-SPD4-asm'
by undefined undefined, undefined.
VisibilityprivateNCBI Project ID-
ID49d7d454086d676d343836313731342e33GOLD ID-
ENA Project IDENA Library ID
Static Linkprivate metagenomes cannot be linkedPubMed ID-
Sample - Library -
The data set SPD4.k17.ForMGRAST was uploaded on 2019-09-29 at 10:44:45 and contains 661,413 sequences totaling 33,786,502 basepairs with an average length of 51 bps.
Of the sequences tested, 58,722 sequences (8.88%) failed to pass the QC pipeline. Of those, dereplication identified 28,524 sequences as artificial duplicate reads.
Of the sequences that passed QC, 771 sequences (1%) contain ribosomal RNA genes, 7,261 sequences (9.04%) contain predicted proteins with known functions, and 72,247 sequences (89.99%) contain predicted proteins with unknown function.
Data shown here is displayed as a quick way to assess the quality and contents of the data set. We note that the submitting authors may have performed their own analysis. The analysis page provides the best way to perform in-depth analyses of this data set.
Provenance
The data on this page represents the automated analysis generated by the MG-RAST automated processing pipeline. Details on the processing of this data set are in the processing receipt.
Sequence Breakdown
failed QC - 58,722 (8.88%)unknown - 522,412 (78.98%)predicted feature - 80,279 (12.14%)
Predicted Features
unknown protein - 72,247 (89.99%)annotated protein - 7,261 (9.04%)ribosomal RNA - 771 (0.96%)
Analysis Statistics
GSC MIxS Info
Upload: bp Count33,786,502 bp
Upload: Sequences Count661,413
Upload: Mean Sequence Length51 ± 35 bp
Upload: Mean GC percent42 ± 10 %
Artificial Duplicate Reads: Sequence Count28,524
Post QC: bp Count27,438,401 bp
Post QC: Sequences Count602,691
Post QC: Mean Sequence Length46 ± 24 bp
Post QC: Mean GC percent42 ± 10 %
Processed: Predicted Protein Features78,273
Processed: Predicted rRNA Features1,156
Alignment: Identified Protein Features6,873
Alignment: Identified rRNA Features798
Annotation: Identified Functional Categoriesundefined
Investigation TypeWGS
Study NameSPlab-run431-SPD4-asm
Latitude and Longitude
Country and/or Sea, Location
Collection Date
Environment (Biome)
Environment (Feature)
Environment (Material)
Environmental Package
Sequencing Methodother
More Metadataclick for full table
K-mer Profile
The kmer rank abundance graph plots the kmer coverage as a function of abundance rank, with the most abundant sequences at the left.
undefinedsequence sizek-mer coverage01011020101102103104105106107108
Source Hits Distribution
The graph below displays the number of hits in the different databases listed. These include protein databases, protein databases with functional hierarchy information, and ribosomal RNA databases. The bars representing annotated reads are colored by e-value range. Different databases have different numbers of hits but can also have different types of annotation data.
006093,0872,574005532,8012,271006243,2852,810003271,7501,376005272,8112,330006273,2002,72000166815652005222,6362,142003232,0711,742002671,2601,022003031,9501,6241830326099128255764387111825000643431825273007001,4002,1002,8003,5004,2004,9005,6006,3007,000RefSeqIMGTrEMBLSubsystemsKEGGGenBankSwissProtPATRICeggNOGKOGOCOGRDPLSUSSUNOGe^-30 & lesse^-20 to e^-30e^-10 to e^-20e^-5 to e^-10e^-3 to e^-5
Functional Category Hits Distribution
The pie charts below illustrate the distribution of functional categories for COGs, KOs, NOGs, and Subsystems at the highest level supported by these functional hierarchies. Each slice indicates the percentage of reads with predicted protein functions annotated to the category for the given source.
COG
METABOLISM - 1,782 (40.73%)INFORMATION STORAGE AND PROCESSING - 1,137 (25.99%)CELLULAR PROCESSES AND SIGNALING - 798 (18.24%)POORLY CHARACTERIZED - 658 (15.04%)
NOG
POORLY CHARACTERIZED - 55 (56.70%)CELLULAR PROCESSES AND SIGNALING - 18 (18.56%)METABOLISM - 12 (12.37%)INFORMATION STORAGE AND PROCESSING - 12 (12.37%)
KO
Metabolism - 1,972 (55.10%)Genetic Information Processing - 922 (25.76%)Environmental Information Processing - 468 (13.08%)Cellular Processes - 137 (3.83%)Human Diseases - 61 (1.70%)Organismal Systems - 19 (0.53%)
Subsystems
Clustering-based subsystems - 1,522 (15.43%)Carbohydrates - 1,152 (11.68%)Protein Metabolism - 933 (9.46%)Amino Acids and Derivatives - 916 (9.29%)Miscellaneous - 725 (7.35%)RNA Metabolism - 715 (7.25%)DNA Metabolism - 526 (5.33%)Cofactors, Vitamins, Prosthetic Groups, Pigments - 508 (5.15%)Cell Wall and Capsule - 405 (4.11%)Nucleosides and Nucleotides - 345 (3.50%)Fatty Acids, Lipids, and Isoprenoids - 259 (2.63%)Membrane Transport - 221 (2.24%)Virulence, Disease and Defense - 210 (2.13%)Respiration - 196 (1.99%)Stress Response - 190 (1.93%)Cell Division and Cell Cycle - 184 (1.87%)Regulation and Cell signaling - 154 (1.56%)Phages, Prophages, Transposable elements, Plasmids - 147 (1.49%)Phosphorus Metabolism - 105 (1.06%)Metabolism of Aromatic Compounds - 88 (0.89%)Iron acquisition and metabolism - 78 (0.79%)Motility and Chemotaxis - 61 (0.62%)Nitrogen Metabolism - 48 (0.49%)Sulfur Metabolism - 47 (0.48%)Dormancy and Sporulation - 40 (0.41%)Secondary Metabolism - 36 (0.37%)Photosynthesis - 31 (0.31%)Potassium metabolism - 21 (0.21%)
Taxonomic Hits Distribution
The charts below represent the distribution of taxa using a contigLCA algorithm finding a single consensus taxonomic entity for all features on each individual sequence.
Domain
Bacteria - 5,450 (73.50%)Eukaryota - 1,953 (26.34%)Viruses - 9 (0.12%)Archaea - 1 (0.01%)other sequences - 1 (0.01%)unclassified sequences - 1 (0.01%)
Phylum
Firmicutes - 2,265 (33.05%)Proteobacteria - 1,902 (27.75%)Streptophyta - 1,670 (24.37%)Deinococcus-Thermus - 475 (6.93%)Actinobacteria - 253 (3.69%)Arthropoda - 63 (0.92%)Chordata - 54 (0.79%)Bacteroidetes - 47 (0.69%)Basidiomycota - 40 (0.58%)Ascomycota - 15 (0.22%)Acidobacteria - 9 (0.13%)unclassified (derived from Viruses) - 9 (0.13%)Aquificae - 8 (0.12%)Spirochaetes - 7 (0.10%)Chloroflexi - 6 (0.09%)unclassified (derived from Bacteria) - 5 (0.07%)Fusobacteria - 4 (0.06%)Tenericutes - 3 (0.04%)unclassified (derived from Eukaryota) - 3 (0.04%)Chlorobi - 2 (0.03%)Planctomycetes - 2 (0.03%)Verrucomicrobia - 2 (0.03%)Cyanobacteria - 2 (0.03%)Apicomplexa - 1 (0.01%)Platyhelminthes - 1 (0.01%)Thermotogae - 1 (0.01%)Synergistetes - 1 (0.01%)Euryarchaeota - 1 (0.01%)unclassified (derived from other sequences) - 1 (0.01%)unclassified (derived from unclassified sequences) - 1 (0.01%)
Class
Bacilli - 2,032 (34.15%)unclassified (derived from Streptophyta) - 902 (15.16%)Gammaproteobacteria - 870 (14.62%)Alphaproteobacteria - 601 (10.10%)Deinococci - 475 (7.98%)Actinobacteria (class) - 253 (4.25%)Liliopsida - 233 (3.92%)Betaproteobacteria - 228 (3.83%)Clostridia - 66 (1.11%)Insecta - 58 (0.97%)Mammalia - 43 (0.72%)Exobasidiomycetes - 33 (0.55%)Negativicutes - 16 (0.27%)Bacteroidia - 13 (0.22%)unclassified (derived from Viruses) - 9 (0.15%)Aquificae (class) - 8 (0.13%)Spirochaetes (class) - 7 (0.12%)Coniferopsida - 6 (0.10%)Deltaproteobacteria - 6 (0.10%)Sphingobacteria - 6 (0.10%)Sordariomycetes - 5 (0.08%)Solibacteres - 5 (0.08%)Flavobacteria - 5 (0.08%)Erysipelotrichi - 5 (0.08%)unclassified (derived from Bacteria) - 5 (0.08%)Saccharomycetes - 4 (0.07%)Fusobacteria (class) - 4 (0.07%)Chloroflexi (class) - 4 (0.07%)Mollicutes - 3 (0.05%)Dothideomycetes - 3 (0.05%)Epsilonproteobacteria - 3 (0.05%)Bryopsida - 3 (0.05%)Cytophagia - 3 (0.05%)Eurotiomycetes - 2 (0.03%)unclassified (derived from Cyanobacteria) - 2 (0.03%)Chlorobia - 2 (0.03%)Planctomycetacia - 2 (0.03%)Arachnida - 2 (0.03%)unclassified (derived from Eukaryota) - 2 (0.03%)Actinopterygii - 2 (0.03%)Acidobacteria (class) - 1 (0.02%)Thermotogae (class) - 1 (0.02%)Agaricomycetes - 1 (0.02%)Spartobacteria - 1 (0.02%)Ustilaginomycetes - 1 (0.02%)Verrucomicrobiae - 1 (0.02%)Aves - 1 (0.02%)Dehalococcoidetes - 1 (0.02%)Trematoda - 1 (0.02%)Coccidia - 1 (0.02%)Thermomicrobia (class) - 1 (0.02%)Ginkgoopsida - 1 (0.02%)Isoetopsida - 1 (0.02%)Thermococci - 1 (0.02%)Malacostraca - 1 (0.02%)unclassified (derived from Acidobacteria) - 1 (0.02%)Synergistia - 1 (0.02%)unclassified (derived from other sequences) - 1 (0.02%)unclassified (derived from unclassified sequences) - 1 (0.02%)
Order
Lactobacillales - 1,577 (31.74%)Thermales - 470 (9.46%)Pseudomonadales - 381 (7.67%)Rhodospirillales - 327 (6.58%)Enterobacteriales - 293 (5.90%)Bacillales - 271 (5.45%)Actinomycetales - 229 (4.61%)Poales - 215 (4.33%)Vitales - 146 (2.94%)Burkholderiales - 141 (2.84%)Brassicales - 137 (2.76%)Sphingomonadales - 125 (2.52%)Malpighiales - 74 (1.49%)Clostridiales - 56 (1.13%)Hymenoptera - 43 (0.87%)Fabales - 42 (0.85%)Malasseziales - 33 (0.66%)Primates - 28 (0.56%)Neisseriales - 21 (0.42%)Asterales - 17 (0.34%)Selenomonadales - 16 (0.32%)Solanales - 14 (0.28%)Rhizobiales - 14 (0.28%)Bifidobacteriales - 14 (0.28%)Rhodocyclales - 13 (0.26%)Bacteroidales - 13 (0.26%)Ranunculales - 10 (0.20%)Pasteurellales - 10 (0.20%)Caudovirales - 8 (0.16%)Aquificales - 8 (0.16%)Caryophyllales - 8 (0.16%)Asparagales - 7 (0.14%)Spirochaetales - 7 (0.14%)Oceanospirillales - 7 (0.14%)Rhodobacterales - 6 (0.12%)Sphingobacteriales - 6 (0.12%)Coniferales - 6 (0.12%)unclassified (derived from Bacteria) - 5 (0.10%)Chromatiales - 5 (0.10%)Vibrionales - 5 (0.10%)Alismatales - 5 (0.10%)Solibacterales - 5 (0.10%)Erysipelotrichales - 5 (0.10%)Flavobacteriales - 5 (0.10%)unclassified (derived from Gammaproteobacteria) - 4 (0.08%)Fusobacteriales - 4 (0.08%)Saccharomycetales - 4 (0.08%)Thermoanaerobacterales - 4 (0.08%)unclassified (derived from Betaproteobacteria) - 4 (0.08%)Sapindales - 4 (0.08%)Caulobacterales - 4 (0.08%)Funariales - 3 (0.06%)Cucurbitales - 3 (0.06%)Gentianales - 3 (0.06%)Lamiales - 3 (0.06%)Pleosporales - 3 (0.06%)Rosales - 3 (0.06%)Campylobacterales - 3 (0.06%)Alteromonadales - 3 (0.06%)Cytophagales - 3 (0.06%)Apiales - 2 (0.04%)Xanthomonadales - 2 (0.04%)Hypocreales - 2 (0.04%)Sordariales - 2 (0.04%)Nitrosomonadales - 2 (0.04%)Chloroflexales - 2 (0.04%)Fagales - 2 (0.04%)Methylophilales - 2 (0.04%)unclassified (derived from Mammalia) - 2 (0.04%)Chlorobiales - 2 (0.04%)Malvales - 2 (0.04%)Myrtales - 2 (0.04%)Entomoplasmatales - 2 (0.04%)Hydrogenophilales - 2 (0.04%)Rickettsiales - 2 (0.04%)Legionellales - 2 (0.04%)Planctomycetales - 2 (0.04%)Peronosporales - 2 (0.04%)Acholeplasmatales - 1 (0.02%)Onygenales - 1 (0.02%)Aeromonadales - 1 (0.02%)unclassified (derived from Dehalococcoidetes) - 1 (0.02%)Cardiobacteriales - 1 (0.02%)Agaricales - 1 (0.02%)unclassified (derived from Alphaproteobacteria) - 1 (0.02%)Ustilaginales - 1 (0.02%)Tetraodontiformes - 1 (0.02%)Galliformes - 1 (0.02%)Thermomicrobiales - 1 (0.02%)Chroococcales - 1 (0.02%)Strigeidida - 1 (0.02%)Solirubrobacterales - 1 (0.02%)Thermococcales - 1 (0.02%)Ginkgoales - 1 (0.02%)Selaginellales - 1 (0.02%)unclassified (derived from Acidobacteria) - 1 (0.02%)Coriobacteriales - 1 (0.02%)Dioscoreales - 1 (0.02%)Halanaerobiales - 1 (0.02%)Bdellovibrionales - 1 (0.02%)Desulfovibrionales - 1 (0.02%)Synergistales - 1 (0.02%)Acidobacteriales - 1 (0.02%)Crossosomatales - 1 (0.02%)Desulfuromonadales - 1 (0.02%)Ericales - 1 (0.02%)Thermotogales - 1 (0.02%)unclassified (derived from Spartobacteria) - 1 (0.02%)Verrucomicrobiales - 1 (0.02%)Myxococcales - 1 (0.02%)Laurales - 1 (0.02%)Magnoliales - 1 (0.02%)Eucoccidiorida - 1 (0.02%)Ixodida - 1 (0.02%)Scorpiones - 1 (0.02%)Nymphaeales - 1 (0.02%)Proteales - 1 (0.02%)Coleoptera - 1 (0.02%)Diptera - 1 (0.02%)Herpetosiphonales - 1 (0.02%)Isoptera - 1 (0.02%)Mysida - 1 (0.02%)Acidithiobacillales - 1 (0.02%)Eurotiales - 1 (0.02%)unclassified (derived from Viruses) - 1 (0.02%)unclassified (derived from other sequences) - 1 (0.02%)unclassified (derived from unclassified sequences) - 1 (0.02%)
Family
Lactobacillaceae - 1,034 (23.59%)Thermaceae - 470 (10.72%)Moraxellaceae - 338 (7.71%)Acetobacteraceae - 320 (7.30%)Enterobacteriaceae - 293 (6.68%)Poaceae - 210 (4.79%)Bacillaceae - 181 (4.13%)Vitaceae - 146 (3.33%)Brassicaceae - 136 (3.10%)Propionibacteriaceae - 129 (2.94%)Comamonadaceae - 86 (1.96%)Sphingomonadaceae - 86 (1.96%)Streptococcaceae - 68 (1.55%)Leuconostocaceae - 60 (1.37%)Corynebacteriaceae - 58 (1.32%)Staphylococcaceae - 45 (1.03%)Fabaceae - 42 (0.96%)Pseudomonadaceae - 39 (0.89%)Euphorbiaceae - 38 (0.87%)Apidae - 35 (0.80%)Malasseziaceae - 33 (0.75%)Salicaceae - 29 (0.66%)Clostridiales Family XI. Incertae Sedis - 22 (0.50%)Neisseriaceae - 21 (0.48%)Burkholderiaceae - 19 (0.43%)Hominidae - 17 (0.39%)Asteraceae - 16 (0.37%)Veillonellaceae - 15 (0.34%)Bifidobacteriaceae - 14 (0.32%)Clostridiaceae - 13 (0.30%)Rhodocyclaceae - 13 (0.30%)Enterococcaceae - 12 (0.27%)Solanaceae - 11 (0.25%)Lachnospiraceae - 10 (0.23%)Paenibacillaceae - 10 (0.23%)Pasteurellaceae - 10 (0.23%)Myoviridae - 7 (0.16%)Erythrobacteraceae - 7 (0.16%)Papaveraceae - 7 (0.16%)Actinomycetaceae - 7 (0.16%)Listeriaceae - 6 (0.14%)Aquificaceae - 6 (0.14%)Asparagaceae - 6 (0.14%)Spirochaetaceae - 6 (0.14%)Erysipelotrichaceae - 5 (0.11%)unclassified (derived from Bacteria) - 5 (0.11%)Amaranthaceae - 5 (0.11%)Pinaceae - 5 (0.11%)Vibrionaceae - 5 (0.11%)Carnobacteriaceae - 5 (0.11%)Araceae - 5 (0.11%)Prevotellaceae - 5 (0.11%)Solibacteraceae - 5 (0.11%)unclassified (derived from Betaproteobacteria) - 4 (0.09%)Fusobacteriaceae - 4 (0.09%)unclassified (derived from Gammaproteobacteria) - 4 (0.09%)Flavobacteriaceae - 4 (0.09%)unclassified (derived from Sphingobacteriales) - 4 (0.09%)Caulobacteraceae - 4 (0.09%)Rhodobacteraceae - 4 (0.09%)unclassified (derived from Bacillales) - 4 (0.09%)Bromeliaceae - 3 (0.07%)Rubiaceae - 3 (0.07%)Micrococcaceae - 3 (0.07%)Cucurbitaceae - 3 (0.07%)Rosaceae - 3 (0.07%)Halomonadaceae - 3 (0.07%)Funariaceae - 3 (0.07%)Helicobacteraceae - 3 (0.07%)Ectothiorhodospiraceae - 3 (0.07%)Ruminococcaceae - 3 (0.07%)Thermoanaerobacteraceae - 3 (0.07%)Methylobacteriaceae - 2 (0.05%)Clavicipitaceae - 2 (0.05%)Formicidae - 2 (0.05%)Suidae - 2 (0.05%)Spiroplasmataceae - 2 (0.05%)Xanthomonadaceae - 2 (0.05%)Nitrosomonadaceae - 2 (0.05%)Methylophilaceae - 2 (0.05%)Hydrogenophilaceae - 2 (0.05%)Alcaligenaceae - 2 (0.05%)Rhodospirillaceae - 2 (0.05%)Hyphomonadaceae - 2 (0.05%)Rhizobiaceae - 2 (0.05%)Phyllobacteriaceae - 2 (0.05%)Cercopithecidae - 2 (0.05%)Fagaceae - 2 (0.05%)Bradyrhizobiaceae - 2 (0.05%)Planctomycetaceae - 2 (0.05%)Clostridiales Family XVII. Incertae Sedis - 2 (0.05%)Chloroflexaceae - 2 (0.05%)Chlorobiaceae - 2 (0.05%)Anacardiaceae - 2 (0.05%)Rutaceae - 2 (0.05%)Convolvulaceae - 2 (0.05%)Cytophagaceae - 2 (0.05%)Bacteroidaceae - 2 (0.05%)unclassified (derived from Peronosporales) - 2 (0.05%)Oxalobacteraceae - 2 (0.05%)unclassified (derived from Alphaproteobacteria) - 1 (0.02%)Leptospiraceae - 1 (0.02%)Thermoanaerobacterales Family III. Incertae Sedis - 1 (0.02%)Synergistaceae - 1 (0.02%)Acholeplasmataceae - 1 (0.02%)Dermacoccaceae - 1 (0.02%)Thermotogaceae - 1 (0.02%)unclassified (derived from Spartobacteria) - 1 (0.02%)Verrucomicrobia subdivision 3 - 1 (0.02%)unclassified (derived from Flavobacteriales) - 1 (0.02%)Cryptosporidiidae - 1 (0.02%)Ixodidae - 1 (0.02%)Buthidae - 1 (0.02%)Tenebrionidae - 1 (0.02%)Drosophilidae - 1 (0.02%)Rhodothermaceae - 1 (0.02%)Sphingobacteriaceae - 1 (0.02%)Rhinotermitidae - 1 (0.02%)Mysidae - 1 (0.02%)Trichocomaceae - 1 (0.02%)unclassified (derived from Onygenales) - 1 (0.02%)Metschnikowiaceae - 1 (0.02%)Saccharomycetaceae - 1 (0.02%)Coriobacteriaceae - 1 (0.02%)Chaetomiaceae - 1 (0.02%)Sordariaceae - 1 (0.02%)Marasmiaceae - 1 (0.02%)Conexibacteraceae - 1 (0.02%)Ustilaginaceae - 1 (0.02%)Tetraodontidae - 1 (0.02%)Phasianidae - 1 (0.02%)Thermococcaceae - 1 (0.02%)Herpetosiphonaceae - 1 (0.02%)unclassified (derived from Dehalococcoidetes) - 1 (0.02%)Schistosomatidae - 1 (0.02%)Bdellovibrionaceae - 1 (0.02%)Desulfovibrionaceae - 1 (0.02%)Ginkgoaceae - 1 (0.02%)Selaginellaceae - 1 (0.02%)Geobacteraceae - 1 (0.02%)Thermomicrobiaceae - 1 (0.02%)Orchidaceae - 1 (0.02%)Burmanniaceae - 1 (0.02%)Acidithiobacillaceae - 1 (0.02%)Cyperaceae - 1 (0.02%)Aeromonadaceae - 1 (0.02%)Apiaceae - 1 (0.02%)Alteromonadaceae - 1 (0.02%)Campanulaceae - 1 (0.02%)Shewanellaceae - 1 (0.02%)Cardiobacteriaceae - 1 (0.02%)Rhabdodendraceae - 1 (0.02%)Tamaricaceae - 1 (0.02%)Crossosomataceae - 1 (0.02%)Chromatiaceae - 1 (0.02%)Primulaceae - 1 (0.02%)unclassified (derived from Chroococcales) - 1 (0.02%)Halothiobacillaceae - 1 (0.02%)Eubacteriaceae - 1 (0.02%)Orobanchaceae - 1 (0.02%)Scrophulariaceae - 1 (0.02%)Lauraceae - 1 (0.02%)Magnoliaceae - 1 (0.02%)Coxiellaceae - 1 (0.02%)Legionellaceae - 1 (0.02%)Cistaceae - 1 (0.02%)Malvaceae - 1 (0.02%)Myrtaceae - 1 (0.02%)Onagraceae - 1 (0.02%)Nymphaeaceae - 1 (0.02%)Proteaceae - 1 (0.02%)Menispermaceae - 1 (0.02%)Alcanivoracaceae - 1 (0.02%)Ranunculaceae - 1 (0.02%)Hahellaceae - 1 (0.02%)Acidobacteriaceae - 1 (0.02%)Oceanospirillaceae - 1 (0.02%)Mycobacteriaceae - 1 (0.02%)Anaplasmataceae - 1 (0.02%)Rickettsiaceae - 1 (0.02%)unclassified (derived from Acidobacteria) - 1 (0.02%)Sanguibacteraceae - 1 (0.02%)Podoviridae - 1 (0.02%)unclassified (derived from Viruses) - 1 (0.02%)unclassified (derived from other sequences) - 1 (0.02%)unclassified (derived from unclassified sequences) - 1 (0.02%)
Genus
Lactobacillus - 799 (22.41%)Meiothermus - 422 (11.83%)Acinetobacter - 312 (8.75%)Vitis - 146 (4.09%)Propionibacterium - 129 (3.62%)Gluconobacter - 116 (3.25%)Zea - 105 (2.94%)Arabidopsis - 99 (2.78%)Geobacillus - 63 (1.77%)Delftia - 59 (1.65%)Streptococcus - 59 (1.65%)Corynebacterium - 58 (1.63%)Oryza - 55 (1.54%)Staphylococcus - 44 (1.23%)Novosphingobium - 41 (1.15%)Gluconacetobacter - 40 (1.12%)Pseudomonas - 39 (1.09%)Pediococcus - 38 (1.07%)Leuconostoc - 37 (1.04%)Ricinus - 34 (0.95%)Malassezia - 33 (0.93%)Apis - 31 (0.87%)Populus - 29 (0.81%)Bacillus - 26 (0.73%)Thermus - 22 (0.62%)Arsenophonus - 21 (0.59%)Acetobacter - 21 (0.59%)Sphingomonas - 19 (0.53%)Erwinia - 18 (0.50%)Brassica - 17 (0.48%)Medicago - 17 (0.48%)Anoxybacillus - 15 (0.42%)Veillonella - 14 (0.39%)Homo - 13 (0.36%)Neisseria - 13 (0.36%)Enterococcus - 12 (0.34%)Clostridium - 12 (0.34%)Burkholderia - 11 (0.31%)Anaerococcus - 11 (0.31%)Serratia - 10 (0.28%)Pantoea - 10 (0.28%)Solanum - 9 (0.25%)Paenibacillus - 7 (0.20%)Bifidobacterium - 7 (0.20%)Lactococcus - 7 (0.20%)Trifolium - 7 (0.20%)Escherichia - 7 (0.20%)Erythrobacter - 7 (0.20%)Sorghum - 7 (0.20%)Asparagus - 6 (0.17%)Yersinia - 6 (0.17%)Lotus - 6 (0.17%)Oenococcus - 6 (0.17%)Listeria - 6 (0.17%)Finegoldia - 6 (0.17%)Providencia - 5 (0.14%)Catonella - 5 (0.14%)Papaver - 5 (0.14%)Prevotella - 5 (0.14%)Candidatus Solibacter - 5 (0.14%)Vibrio - 5 (0.14%)Actinomyces - 5 (0.14%)Lemna - 5 (0.14%)Psychrobacter - 5 (0.14%)unclassified (derived from Bacteria) - 5 (0.14%)Sphingobium - 5 (0.14%)Peptoniphilus - 5 (0.14%)Weissella - 5 (0.14%)Klebsiella - 5 (0.14%)Pisum - 4 (0.11%)Candidatus Accumulibacter - 4 (0.11%)Salmonella - 4 (0.11%)Beta - 4 (0.11%)Cicer - 4 (0.11%)Haemophilus - 4 (0.11%)Ralstonia - 4 (0.11%)Euphorbia - 4 (0.11%)Gemella - 4 (0.11%)Chitinophaga - 4 (0.11%)T4-like viruses - 4 (0.11%)Cronobacter - 4 (0.11%)Carboxydothermus - 3 (0.08%)Pectobacterium - 3 (0.08%)Aromatoleum - 3 (0.08%)Carnobacterium - 3 (0.08%)Fructobacillus - 3 (0.08%)Pinus - 3 (0.08%)unclassified (derived from Enterobacteriaceae) - 3 (0.08%)Glycine - 3 (0.08%)Physcomitrella - 3 (0.08%)Gardnerella - 3 (0.08%)Enhydrobacter - 3 (0.08%)Capnocytophaga - 3 (0.08%)Ananas - 3 (0.08%)Sisymbrium - 3 (0.08%)unclassified (derived from Gammaproteobacteria) - 3 (0.08%)unclassified (derived from Myoviridae) - 3 (0.08%)Treponema - 3 (0.08%)Methylobacterium - 2 (0.06%)Spiroplasma - 2 (0.06%)Pasteurella - 2 (0.06%)Metarhizium - 2 (0.06%)Enterobacter - 2 (0.06%)Macaca - 2 (0.06%)Dickeya - 2 (0.06%)Sus - 2 (0.06%)Citrobacter - 2 (0.06%)Alkalilimnicola - 2 (0.06%)Helicobacter - 2 (0.06%)Nitrosomonas - 2 (0.06%)Hydrogenophilus - 2 (0.06%)Comamonas - 2 (0.06%)Citromicrobium - 2 (0.06%)Rhodospirillum - 2 (0.06%)Crepis - 2 (0.06%)Granulibacter - 2 (0.06%)Agrobacterium - 2 (0.06%)Borrelia - 2 (0.06%)Asticcacaulis - 2 (0.06%)Leptotrichia - 2 (0.06%)Fusobacterium - 2 (0.06%)Holdemania - 2 (0.06%)Ruminococcus - 2 (0.06%)Coprococcus - 2 (0.06%)Thermaerobacter - 2 (0.06%)Granulicatella - 2 (0.06%)Brevibacillus - 2 (0.06%)Oceanobacillus - 2 (0.06%)Chloroflexus - 2 (0.06%)Citrus - 2 (0.06%)Ipomoea - 2 (0.06%)Nicotiana - 2 (0.06%)Microscilla - 2 (0.06%)Bacteroides - 2 (0.06%)Phytophthora - 2 (0.06%)Hydrogenobacter - 2 (0.06%)Simonsiella - 2 (0.06%)Acidithiobacillus - 1 (0.03%)Marinobacter - 1 (0.03%)Shewanella - 1 (0.03%)Cardiobacterium - 1 (0.03%)Allochromatium - 1 (0.03%)Solobacterium - 1 (0.03%)Halothiobacillus - 1 (0.03%)unclassified (derived from Erysipelotrichaceae) - 1 (0.03%)Selenomonas - 1 (0.03%)Acidobacterium - 1 (0.03%)Rhodothermus - 1 (0.03%)Pedobacter - 1 (0.03%)Thermococcus - 1 (0.03%)Planctomyces - 1 (0.03%)Rhodopirellula - 1 (0.03%)Sanguibacter - 1 (0.03%)Brevundimonas - 1 (0.03%)Proteus - 1 (0.03%)Phenylobacterium - 1 (0.03%)Rahnella - 1 (0.03%)Bradyrhizobium - 1 (0.03%)Herpetosiphon - 1 (0.03%)Sodalis - 1 (0.03%)Xenorhabdus - 1 (0.03%)Chelativorans - 1 (0.03%)Mesorhizobium - 1 (0.03%)Coxiella - 1 (0.03%)Legionella - 1 (0.03%)Kangiella - 1 (0.03%)Hahella - 1 (0.03%)Halomonas - 1 (0.03%)Zymobacter - 1 (0.03%)Oceanospirillum - 1 (0.03%)Dehalococcoides - 1 (0.03%)Hyphomonas - 1 (0.03%)Maricaulis - 1 (0.03%)Phaeobacter - 1 (0.03%)Moraxella - 1 (0.03%)Ruegeria - 1 (0.03%)Thermomicrobium - 1 (0.03%)Acidiphilium - 1 (0.03%)Xanthomonas - 1 (0.03%)Congregibacter - 1 (0.03%)Synechococcus - 1 (0.03%)Leptospira - 1 (0.03%)Arcanobacterium - 1 (0.03%)Spirochaeta - 1 (0.03%)Candidatus Koribacter - 1 (0.03%)Anaerobaculum - 1 (0.03%)Alkaliphilus - 1 (0.03%)Petrotoga - 1 (0.03%)Chthoniobacter - 1 (0.03%)unclassified (derived from Verrucomicrobia subdivision 3) - 1 (0.03%)Wolbachia - 1 (0.03%)Cryptosporidium - 1 (0.03%)Ixodes - 1 (0.03%)Tribolium - 1 (0.03%)Drosophila - 1 (0.03%)Rickettsia - 1 (0.03%)Myrmecia - 1 (0.03%)Coptotermes - 1 (0.03%)Paramysis - 1 (0.03%)Aspergillus - 1 (0.03%)Paracoccidioides - 1 (0.03%)Clavispora - 1 (0.03%)Collinsella - 1 (0.03%)Chaetomium - 1 (0.03%)Sordaria - 1 (0.03%)Moniliophthora - 1 (0.03%)Conexibacter - 1 (0.03%)Ustilago - 1 (0.03%)Tetraodon - 1 (0.03%)Gallus - 1 (0.03%)Aquifex - 1 (0.03%)Kytococcus - 1 (0.03%)Pan - 1 (0.03%)Arthrobacter - 1 (0.03%)Schistosoma - 1 (0.03%)Sphingopyxis - 1 (0.03%)Picea - 1 (0.03%)Zymomonas - 1 (0.03%)Ginkgo - 1 (0.03%)Selaginella - 1 (0.03%)unclassified (derived from Alphaproteobacteria) - 1 (0.03%)Sutterella - 1 (0.03%)Phalaenopsis - 1 (0.03%)Gymnosiphon - 1 (0.03%)Taylorella - 1 (0.03%)Eleocharis - 1 (0.03%)Festuca - 1 (0.03%)Hordeum - 1 (0.03%)Eubacterium - 1 (0.03%)Cupriavidus - 1 (0.03%)Limnobacter - 1 (0.03%)Platysace - 1 (0.03%)Chrysanthemum - 1 (0.03%)Anaerostipes - 1 (0.03%)Cynara - 1 (0.03%)Helianthus - 1 (0.03%)Lactuca - 1 (0.03%)Parthenium - 1 (0.03%)Saussurea - 1 (0.03%)Trachelium - 1 (0.03%)Alicycliphilus - 1 (0.03%)Micrococcus - 1 (0.03%)Eruca - 1 (0.03%)Rothia - 1 (0.03%)Amaranthus - 1 (0.03%)Polaromonas - 1 (0.03%)Rhabdodendron - 1 (0.03%)Tamarix - 1 (0.03%)Crossosoma - 1 (0.03%)Cucumis - 1 (0.03%)Trichosanthes - 1 (0.03%)Primula - 1 (0.03%)Herbaspirillum - 1 (0.03%)Dorea - 1 (0.03%)Methylobacillus - 1 (0.03%)Methylovorus - 1 (0.03%)Laribacter - 1 (0.03%)Roseburia - 1 (0.03%)Vicia - 1 (0.03%)Chrysolepis - 1 (0.03%)Coffea - 1 (0.03%)Neohymenopogon - 1 (0.03%)Phelipanche - 1 (0.03%)Aptosimum - 1 (0.03%)Cinnamomum - 1 (0.03%)Liriodendron - 1 (0.03%)Faecalibacterium - 1 (0.03%)Mycobacterium - 1 (0.03%)Chryseobacterium - 1 (0.03%)Helianthemum - 1 (0.03%)Malva - 1 (0.03%)Eucalyptus - 1 (0.03%)Oenothera - 1 (0.03%)Nymphaea - 1 (0.03%)Protea - 1 (0.03%)Cissampelos - 1 (0.03%)Azoarcus - 1 (0.03%)Fragaria - 1 (0.03%)Malus - 1 (0.03%)Prunus - 1 (0.03%)Dechloromonas - 1 (0.03%)Thauera - 1 (0.03%)Caldicellulosiruptor - 1 (0.03%)Bdellovibrio - 1 (0.03%)Desulfovibrio - 1 (0.03%)Geobacter - 1 (0.03%)Candidatus Sulcia - 1 (0.03%)Sulfuricurvum - 1 (0.03%)N4-like viruses - 1 (0.03%)unclassified (derived from Viruses) - 1 (0.03%)unclassified (derived from other sequences) - 1 (0.03%)unclassified (derived from unclassified sequences) - 1 (0.03%)
Rank Abundance Plot
The plots below show the taxonomic abundances ordered from the most abundant to least abundant. Only the top 50 most abundant are shown. The y-axis plots the abundances of annotations on a log scale.
The rank abundance chart is a tool for visually representing taxonomic richness and evenness.
taxon abundanceBacteria: 5450Eukaryota: 1953Viruses: 9Archaea: 1other sequences: 1unclassified sequences: 10101102103104BacteriaEukaryotaVirusesArchaeaother sequencesunclassified sequences
taxon abundanceFirmicutes: 2265Proteobacteria: 1902Streptophyta: 1670Deinococcus-Thermus: 475Actinobacteria: 253Arthropoda: 63Chordata: 54Bacteroidetes: 47Basidiomycota: 40Ascomycota: 15Acidobacteria: 9unclassified (derived from Viruses): 9Aquificae: 8Spirochaetes: 7Chloroflexi: 6unclassified (derived from Bacteria): 5Fusobacteria: 4Tenericutes: 3unclassified (derived from Eukaryota): 3Chlorobi: 2Planctomycetes: 2Verrucomicrobia: 2Cyanobacteria: 2Apicomplexa: 1Platyhelminthes: 1Thermotogae: 1Synergistetes: 1Euryarchaeota: 1unclassified (derived from other sequences): 1unclassified (derived from unclassified sequences): 10101102103104FirmicutesProteobacteriaStreptophytaDeinococcus-ThermusActinobacteriaArthropodaChordataBacteroidetesBasidiomycotaAscomycotaAcidobacteriaunclassified (derived from Viruses)AquificaeSpirochaetesChloroflexiunclassified (derived from Bacteria)FusobacteriaTenericutesunclassified (derived from Eukaryota)ChlorobiPlanctomycetesVerrucomicrobiaCyanobacteriaApicomplexaPlatyhelminthesThermotogaeSynergistetesEuryarchaeotaunclassified (derived from other sequences)unclassified (derived from unclassified sequences)
taxon abundanceBacilli: 2032unclassified (derived from Streptophyta): 902Gammaproteobacteria: 870Alphaproteobacteria: 601Deinococci: 475Actinobacteria (class): 253Liliopsida: 233Betaproteobacteria: 228Clostridia: 66Insecta: 58Mammalia: 43Exobasidiomycetes: 33Negativicutes: 16Bacteroidia: 13unclassified (derived from Viruses): 9Aquificae (class): 8Spirochaetes (class): 7Coniferopsida: 6Deltaproteobacteria: 6Sphingobacteria: 6Sordariomycetes: 5Solibacteres: 5Flavobacteria: 5Erysipelotrichi: 5unclassified (derived from Bacteria): 5Saccharomycetes: 4Fusobacteria (class): 4Chloroflexi (class): 4Mollicutes: 3Dothideomycetes: 3Epsilonproteobacteria: 3Bryopsida: 3Cytophagia: 3Eurotiomycetes: 2unclassified (derived from Cyanobacteria): 2Chlorobia: 2Planctomycetacia: 2Arachnida: 2unclassified (derived from Eukaryota): 2Actinopterygii: 2Acidobacteria (class): 1Thermotogae (class): 1Agaricomycetes: 1Spartobacteria: 1Ustilaginomycetes: 1Verrucomicrobiae: 1Aves: 1Dehalococcoidetes: 1Trematoda: 1Coccidia: 10101102103104Bacilliunclassified (derived from Streptophyta)GammaproteobacteriaAlphaproteobacteriaDeinococciActinobacteria (class)LiliopsidaBetaproteobacteriaClostridiaInsectaMammaliaExobasidiomycetesNegativicutesBacteroidiaunclassified (derived from Viruses)Aquificae (class)Spirochaetes (class)ConiferopsidaDeltaproteobacteriaSphingobacteriaSordariomycetesSolibacteresFlavobacteriaErysipelotrichiunclassified (derived from Bacteria)SaccharomycetesFusobacteria (class)Chloroflexi (class)MollicutesDothideomycetesEpsilonproteobacteriaBryopsidaCytophagiaEurotiomycetesunclassified (derived from Cyanobacteria)ChlorobiaPlanctomycetaciaArachnidaunclassified (derived from Eukaryota)ActinopterygiiAcidobacteria (class)Thermotogae (class)AgaricomycetesSpartobacteriaUstilaginomycetesVerrucomicrobiaeAvesDehalococcoidetesTrematodaCoccidia
taxon abundanceLactobacillales: 1577Thermales: 470Pseudomonadales: 381Rhodospirillales: 327Enterobacteriales: 293Bacillales: 271Actinomycetales: 229Poales: 215Vitales: 146Burkholderiales: 141Brassicales: 137Sphingomonadales: 125Malpighiales: 74Clostridiales: 56Hymenoptera: 43Fabales: 42Malasseziales: 33Primates: 28Neisseriales: 21Asterales: 17Selenomonadales: 16Solanales: 14Rhizobiales: 14Bifidobacteriales: 14Rhodocyclales: 13Bacteroidales: 13Ranunculales: 10Pasteurellales: 10Caudovirales: 8Aquificales: 8Caryophyllales: 8Asparagales: 7Spirochaetales: 7Oceanospirillales: 7Rhodobacterales: 6Sphingobacteriales: 6Coniferales: 6unclassified (derived from Bacteria): 5Chromatiales: 5Vibrionales: 5Alismatales: 5Solibacterales: 5Erysipelotrichales: 5Flavobacteriales: 5unclassified (derived from Gammaproteobacteria): 4Fusobacteriales: 4Saccharomycetales: 4Thermoanaerobacterales: 4unclassified (derived from Betaproteobacteria): 4Sapindales: 40101102103104LactobacillalesThermalesPseudomonadalesRhodospirillalesEnterobacterialesBacillalesActinomycetalesPoalesVitalesBurkholderialesBrassicalesSphingomonadalesMalpighialesClostridialesHymenopteraFabalesMalassezialesPrimatesNeisserialesAsteralesSelenomonadalesSolanalesRhizobialesBifidobacterialesRhodocyclalesBacteroidalesRanunculalesPasteurellalesCaudoviralesAquificalesCaryophyllalesAsparagalesSpirochaetalesOceanospirillalesRhodobacteralesSphingobacterialesConiferalesunclassified (derived from Bacteria)ChromatialesVibrionalesAlismatalesSolibacteralesErysipelotrichalesFlavobacterialesunclassified (derived from Gammaproteobacteria)FusobacterialesSaccharomycetalesThermoanaerobacteralesunclassified (derived from Betaproteobacteria)Sapindales
taxon abundanceLactobacillaceae: 1034Thermaceae: 470Moraxellaceae: 338Acetobacteraceae: 320Enterobacteriaceae: 293Poaceae: 210Bacillaceae: 181Vitaceae: 146Brassicaceae: 136Propionibacteriaceae: 129Comamonadaceae: 86Sphingomonadaceae: 86Streptococcaceae: 68Leuconostocaceae: 60Corynebacteriaceae: 58Staphylococcaceae: 45Fabaceae: 42Pseudomonadaceae: 39Euphorbiaceae: 38Apidae: 35Malasseziaceae: 33Salicaceae: 29Clostridiales Family XI. Incertae Sedis: 22Neisseriaceae: 21Burkholderiaceae: 19Hominidae: 17Asteraceae: 16Veillonellaceae: 15Bifidobacteriaceae: 14Clostridiaceae: 13Rhodocyclaceae: 13Enterococcaceae: 12Solanaceae: 11Lachnospiraceae: 10Paenibacillaceae: 10Pasteurellaceae: 10Myoviridae: 7Erythrobacteraceae: 7Papaveraceae: 7Actinomycetaceae: 7Listeriaceae: 6Aquificaceae: 6Asparagaceae: 6Spirochaetaceae: 6Erysipelotrichaceae: 5unclassified (derived from Bacteria): 5Amaranthaceae: 5Pinaceae: 5Vibrionaceae: 5Carnobacteriaceae: 50101102103104LactobacillaceaeThermaceaeMoraxellaceaeAcetobacteraceaeEnterobacteriaceaePoaceaeBacillaceaeVitaceaeBrassicaceaePropionibacteriaceaeComamonadaceaeSphingomonadaceaeStreptococcaceaeLeuconostocaceaeCorynebacteriaceaeStaphylococcaceaeFabaceaePseudomonadaceaeEuphorbiaceaeApidaeMalasseziaceaeSalicaceaeClostridiales Family XI. Incertae SedisNeisseriaceaeBurkholderiaceaeHominidaeAsteraceaeVeillonellaceaeBifidobacteriaceaeClostridiaceaeRhodocyclaceaeEnterococcaceaeSolanaceaeLachnospiraceaePaenibacillaceaePasteurellaceaeMyoviridaeErythrobacteraceaePapaveraceaeActinomycetaceaeListeriaceaeAquificaceaeAsparagaceaeSpirochaetaceaeErysipelotrichaceaeunclassified (derived from Bacteria)AmaranthaceaePinaceaeVibrionaceaeCarnobacteriaceae
taxon abundanceLactobacillus: 799Meiothermus: 422Acinetobacter: 312Vitis: 146Propionibacterium: 129Gluconobacter: 116Zea: 105Arabidopsis: 99Geobacillus: 63Delftia: 59Streptococcus: 59Corynebacterium: 58Oryza: 55Staphylococcus: 44Novosphingobium: 41Gluconacetobacter: 40Pseudomonas: 39Pediococcus: 38Leuconostoc: 37Ricinus: 34Malassezia: 33Apis: 31Populus: 29Bacillus: 26Thermus: 22Arsenophonus: 21Acetobacter: 21Sphingomonas: 19Erwinia: 18Brassica: 17Medicago: 17Anoxybacillus: 15Veillonella: 14Homo: 13Neisseria: 13Enterococcus: 12Clostridium: 12Burkholderia: 11Anaerococcus: 11Serratia: 10Pantoea: 10Solanum: 9Paenibacillus: 7Bifidobacterium: 7Lactococcus: 7Trifolium: 7Escherichia: 7Erythrobacter: 7Sorghum: 7Asparagus: 60101102103LactobacillusMeiothermusAcinetobacterVitisPropionibacteriumGluconobacterZeaArabidopsisGeobacillusDelftiaStreptococcusCorynebacteriumOryzaStaphylococcusNovosphingobiumGluconacetobacterPseudomonasPediococcusLeuconostocRicinusMalasseziaApisPopulusBacillusThermusArsenophonusAcetobacterSphingomonasErwiniaBrassicaMedicagoAnoxybacillusVeillonellaHomoNeisseriaEnterococcusClostridiumBurkholderiaAnaerococcusSerratiaPantoeaSolanumPaenibacillusBifidobacteriumLactococcusTrifoliumEscherichiaErythrobacterSorghumAsparagus
Rarefaction Curve
The plot below shows the rarefaction curve of annotated species richness. This curve is a plot of the total number of distinct species annotations as a function of the number of sequences sampled. On the left, a steep slope indicates that a large fraction of the species diversity remains to be discovered. If the curve becomes flatter to the right, a reasonable number of individuals is sampled: more intensive sampling is likely to yield only few additional species.
Sampling curves generally rise very quickly at first and then level off toward an asymptote as fewer new species are found per unit of individuals collected. These rarefaction curves are calculated from the table of species abundance. The curves represent the average number of different species annotations for subsamples of the complete data set.
undefinednumber of readsspecies count01603204806408009601,1201,2801,4401,600070,000140,000210,000280,000350,000420,000490,000560,000630,000700,000
Alpha Diversity
The α-diversity of this data set is 210 species.
The above image shows the range of α-diversity values in the study 'SPlab-run431-SPD4-asm'. The min, max, and mean values are shown, with the standard deviation ranges (σ and 2σ) in different shades. The α-diversity of this metagenome is shown in red.
Alpha diversity summarizes the diversity of organisms in a sample with a single number. The α-diversity of annotated samples can be estimated from the distribution of the species-level annotations.
Annotated species richness is the number of distinct species annotations in the combined MG-RAST data set. Shannon diversity is an abundance-weighted average of the logarithm of the relative abundances of annotated species. The species-level annotations are from all the annotation source databases used by MG-RAST.
Sequence Length Histogram
The histograms below show the distribution of sequence lengths in basepairs for this metagenome. Each position represents the number of sequences within a length bp range.
The data used in these graphs is based on raw upload and post-QC sequences.
uploadedpost-QCsequence length (bp)number of reads03,5007,00010,50014,00017,50021,00024,50028,00031,50035,00011212413614816017208409601,0801,200uploadedpost-QC
Sequence GC Distribution
The histograms below show the distribution of the GC percentage for this metagenome. Each position represents the number of sequences within a GC percentage range. The data used in these graphs is based on raw upload and post-QC sequences.
uploadedpost-QCsequence GC %number of reads04,0008,00012,00016,00020,00024,00028,00032,00036,00040,0000102030405060708090100uploadedpost-QC
Sample Data
This data set has no sample data