Contents
SummaryFiguresResultsReferences
|
Summary
Analysis summary
Summary:
The raw bam files, one for each RNA-Seq sample, were summarized to a 3'UTR read counts table, using the Bioconductor package GenomicRanges. In the final read counts table, each row represented each column one RNA-Seq sample and each cell, the corresponding read counts associated with each row and column.The gene counts table was normalized for inherent systematic or experimental biases (e.g. sequencing depth, gene length, GC content bias etc.) using the Bioconductor package DESeq after removing gene s that had zero counts over all the RNA-Seq samples (28545 genes). The output of the normalization algorithm was a table with normalized counts, which can be used for differential expression analysis with statistical algorithms developed specifically for count data. Prior to the statistical testing procedure, the gene read counts were filtered for possible artifacts that could affect the subsequent statistical testing procedures. Genes/transcripts presenting any of the following were excluded from further analysis: i) genes with length less than 500 (4916 genes), ii) genes whose average reads per 100 bp was less than the 25th quantile of the total normalized distribution of average reads per 100bp (0 genes with cutoff value 0.03982 average reads per 100 bp), iii) genes with read counts below the median read counts of the total normalized count distribution (11674 genes with cutoff value 5 normalized read counts), iv) genes whose biotype matched the following: rRNA, TR_V_pseudogene, TR_J_pseudogene, IG_C_pseudogene, IG_J_pseudogene, IG_V_pseudogene (102 genes). The total number of genes excluded due to the application of gene filters was 6739. The total (unified) number of genes excluded due to the application of all filters was 41761. The resulting gene counts table was subjected to differential expression analysis for the contrasts Wt versus Tg using the Bioconductor package DESeq. The final numbers of differentially expressed gene s were (per contrast): for the contrast Wt versus Tg, 221 (4) statistically significant genes were found with a p-value (FDR or adjusted p-value) threshold of 0.05 and of these, 106 (4) were up-regulated, 99 (0) were down-regulated and 16 (0) were not differentially expressed according to an absolute fold change cutoff value of 1 in log2 scale. Literature references for all the algorithms used can be found at the end of this report.
Read counts file: imported sam/bam/bed files
Conditions: Tg, Wt
Samples included: MF3R23_Prox1-2, MF3R24_Prox1-new2, MF3R22_Prox1-1, MF3R19_GFP1, MF3R20_GFP2, MF3R21_GFPnew2
Samples excluded:
none
Requested contrasts: Wt_vs_Tg
Library sizes:
- MF3R23_Prox1-2: 3714593
- MF3R24_Prox1-new2: 5633486
- MF3R22_Prox1-1: 4527039
- MF3R19_GFP1: 4048341
- MF3R20_GFP2: 4321851
- MF3R21_GFPnew2: 3296533
Annotation: download
Organism: human ( Homo sapiens), genome version alias hg19
Annotation source: Ensembl genomes
Count type: utr
3' UTR flanking :: 500
Exon filters:
none applied
Gene filters:
length, avg.reads, expression, biotype - avg.reads
- average.per.bp: 100
- quantile: 0.25
- expression
- median: TRUE
- mean: FALSE
- quantile: NA
- known: NA
- custom: NA
- biotype
- pseudogene: FALSE
- lincRNA: FALSE
- protein_coding: FALSE
- antisense: FALSE
- processed_transcript: FALSE
- snRNA: FALSE
- sense_intronic: FALSE
- miRNA: FALSE
- misc_RNA: FALSE
- snoRNA: FALSE
- rRNA: TRUE
- polymorphic_pseudogene: FALSE
- sense_overlapping: FALSE
- three_prime_overlapping_ncrna: FALSE
- TR_V_gene: FALSE
- TR_V_pseudogene: TRUE
- TR_D_gene: FALSE
- TR_J_gene: FALSE
- TR_C_gene: FALSE
- TR_J_pseudogene: TRUE
- IG_C_gene: FALSE
- IG_C_pseudogene: TRUE
- IG_J_gene: FALSE
- IG_J_pseudogene: TRUE
- IG_D_gene: FALSE
- IG_V_gene: FALSE
- IG_V_pseudogene: TRUE
Filter application: after normalization
Normalization algorithm: DESeq
Normalization arguments:
locfunc - [[list(new("standardGeneric", .Data = function (x, na.rm = FALSE, ...)
standardGeneric("median"), generic = "median", package = "stats", group = list(), valueClass = character(0), signature = c("x", "na.rm"), default = new("derivedDefaultMethod", .Data = function (x, na.rm = FALSE, ...)
UseMethod("median"), target = new("signature", .Data = "ANY", names = "x", package = "methods"), defined = new("signature", .Data = "ANY", names = "x", package = "methods"), generic = "median"), skeleton = (new("derivedDefaultMethod", .Data = function (x, na.rm = FALSE, ...)
UseMethod("median"), target = new("signature", .Data = "ANY", names = "x", package = "methods"), defined = new("signature", .Data = "ANY", names = "x", package = "methods"), generic = "median"))(x, na.rm, ...)))locfunc
Statistical algorithm(s): DESeq
Statistical arguments for DESeq: method, sharingMode, fitType - method: blind
- sharingMode: fit-only
- fitType: local
Meta-analysis method: no meta-analysis, reported p-values from the first supplied statistical algorithm
Multiple testing correction: Benjamini-Hochberg FDR
p-value threshold: 0.05
Logarithmic tranformation offset: 1
Analysis preset: not available
Quality control plots: multidimensional scaling, biotype detection, biotype counts, sample and biotype saturation, filtered biotypes, correlation heatmap and correlogram, pairwise scatterplots between samples, boxplots, GC-content bias, transcript length bias, mean-difference plot, mean-variance plot, DEG biotype detection, volcano plot, DEG heatmap
Figure format: png, pdf
Output directory: /data/images/proton3/run451/metaseqr-mat/metaseqr_run451
Output data: Annotation, p-value, Adjusted p-value (FDR), Fold change, Read counts
Output scale(s): log2 scale, Reads per Gene Model
Output values: Normalized values
Output statistics: Mean, Median, Standard deviation, Median Absolute Deviation (MAD), Coefficient of Variation, Robust Coefficient of Variation
Total run time: 22 minutes 07 seconds
Filtered genes
Number of filtered gene s: 41761 which is the union of
- Filtered because of zero reads: 28545
-
Filtered because of exon filters: 0
-
Filtered because of gene filters: 13216
which is the union of
- length: 4916 genes with filter cutoff value 500
- avg.reads: 1720 genes with filter cutoff value 0.03981684
- expression: 11674 genes further decomposed to (filter name, filtered genes, filter cutoff):
- median: 11674 genes with filter cutoff value 5
- biotype: 102 genes with filter cutoff value rRNA, TR_V_pseudogene, TR_J_pseudogene, IG_C_pseudogene, IG_J_pseudogene, IG_V_pseudogene
Differentially expressed genes
Number of differentially expressed genes per contrast:
- Wt_vs_Tg: 221 (4) statistically significant genes of which 106 (4) up regulated, 99 (0) down regulated and 16 (0) not differentially expressed according to a p-value (FDR or adjusted p-value) threshold of 0.05 and an absolute fold change cutoff value of 1 in log2 scale.
Run logINFO [2020-10-15 15:48:06] 2020-10-15 15:48:06: Data processing started...INFO [2020-10-15 15:48:06] Read counts file: imported sam/bam/bed filesINFO [2020-10-15 15:48:06] Conditions: Tg, WtINFO [2020-10-15 15:48:06] Samples to include: MF3R23_Prox1-2, MF3R24_Prox1-new2, MF3R22_Prox1-1, MF3R19_GFP1, MF3R20_GFP2, MF3R21_GFPnew2INFO [2020-10-15 15:48:06] Samples to exclude: noneINFO [2020-10-15 15:48:06] Requested contrasts: Wt_vs_TgINFO [2020-10-15 15:48:06] Annotation: downloadINFO [2020-10-15 15:48:06] Organism: hg19INFO [2020-10-15 15:48:06] Reference source: ensemblINFO [2020-10-15 15:48:06] Count type: utrINFO [2020-10-15 15:48:06] 3' UTR flanking: 500INFO [2020-10-15 15:48:06] Transcriptional level: geneINFO [2020-10-15 15:48:06] Exon filters: none appliedINFO [2020-10-15 15:48:06] Gene filters: length, avg.reads, expression, biotypeINFO [2020-10-15 15:48:06] length: INFO [2020-10-15 15:48:06] length: 500INFO [2020-10-15 15:48:06] avg.reads: INFO [2020-10-15 15:48:06] average.per.bp: 100INFO [2020-10-15 15:48:06] quantile: 0.25INFO [2020-10-15 15:48:06] expression: INFO [2020-10-15 15:48:06] median: TRUEINFO [2020-10-15 15:48:06] mean: FALSEINFO [2020-10-15 15:48:06] quantile: NAINFO [2020-10-15 15:48:06] known: NAINFO [2020-10-15 15:48:06] custom: NAINFO [2020-10-15 15:48:06] biotype: INFO [2020-10-15 15:48:06] pseudogene: FALSEINFO [2020-10-15 15:48:06] lincRNA: FALSEINFO [2020-10-15 15:48:06] protein_coding: FALSEINFO [2020-10-15 15:48:06] antisense: FALSEINFO [2020-10-15 15:48:06] processed_transcript: FALSEINFO [2020-10-15 15:48:06] snRNA: FALSEINFO [2020-10-15 15:48:06] sense_intronic: FALSEINFO [2020-10-15 15:48:06] miRNA: FALSEINFO [2020-10-15 15:48:06] misc_RNA: FALSEINFO [2020-10-15 15:48:06] snoRNA: FALSEINFO [2020-10-15 15:48:06] rRNA: TRUEINFO [2020-10-15 15:48:06] polymorphic_pseudogene: FALSEINFO [2020-10-15 15:48:06] sense_overlapping: FALSEINFO [2020-10-15 15:48:06] three_prime_overlapping_ncrna: FALSEINFO [2020-10-15 15:48:06] TR_V_gene: FALSEINFO [2020-10-15 15:48:06] TR_V_pseudogene: TRUEINFO [2020-10-15 15:48:06] TR_D_gene: FALSEINFO [2020-10-15 15:48:06] TR_J_gene: FALSEINFO [2020-10-15 15:48:06] TR_C_gene: FALSEINFO [2020-10-15 15:48:06] TR_J_pseudogene: TRUEINFO [2020-10-15 15:48:06] IG_C_gene: FALSEINFO [2020-10-15 15:48:06] IG_C_pseudogene: TRUEINFO [2020-10-15 15:48:06] IG_J_gene: FALSEINFO [2020-10-15 15:48:06] IG_J_pseudogene: TRUEINFO [2020-10-15 15:48:06] IG_D_gene: FALSEINFO [2020-10-15 15:48:06] IG_V_gene: FALSEINFO [2020-10-15 15:48:06] IG_V_pseudogene: TRUEINFO [2020-10-15 15:48:06] Filter application: postnormINFO [2020-10-15 15:48:06] Normalization algorithm: deseqINFO [2020-10-15 15:48:06] Normalization arguments: INFO [2020-10-15 15:48:06] locfunc: INFO [2020-10-15 15:48:06] [[list(new("standardGeneric", .Data = function (x, na.rm = FALSE, ...) standardGeneric("median"), generic = "median", package = "stats", group = list(), valueClass = character(0), signature = c("x", "na.rm"), default = new("derivedDefaultMethod", .Data = function (x, na.rm = FALSE, ...) UseMethod("median"), target = new("signature", .Data = "ANY", names = "x", package = "methods"), defined = new("signature", .Data = "ANY", names = "x", package = "methods"), generic = "median"), skeleton = (new("derivedDefaultMethod", .Data = function (x, na.rm = FALSE, ...) UseMethod("median"), target = new("signature", .Data = "ANY", names = "x", package = "methods"), defined = new("signature", .Data = "ANY", names = "x", package = "methods"), generic = "median"))(x, na.rm, ...)))locfuncINFO [2020-10-15 15:48:06] Statistical algorithm: deseqINFO [2020-10-15 15:48:06] Statistical arguments: INFO [2020-10-15 15:48:06] deseq: blind, fit-only, localINFO [2020-10-15 15:48:06] Meta-analysis method: noneINFO [2020-10-15 15:48:06] Multiple testing correction: BHINFO [2020-10-15 15:48:06] p-value threshold: 0.05INFO [2020-10-15 15:48:06] Logarithmic transformation offset: 1INFO [2020-10-15 15:48:06] Quality control plots: mds, biodetection, countsbio, saturation, readnoise, filtered, correl, pairwise, boxplot, gcbias, lengthbias, meandiff, meanvar, biodist, volcano, deheatmapINFO [2020-10-15 15:48:06] Figure format: png, pdfINFO [2020-10-15 15:48:06] Output directory: /data/images/proton3/run451/metaseqr-mat/metaseqr_run451INFO [2020-10-15 15:48:06] Output data: annotation, p.value, adj.p.value, fold.change, counts, flagsINFO [2020-10-15 15:48:06] Output scale(s): log2, rpgmINFO [2020-10-15 15:48:06] Output values: normalizedINFO [2020-10-15 15:48:06] Downloading gene annotation for hg19...WARN [2020-10-15 16:25:53] "yes" and "no" for read strandedness have been deprecated. Please use "forward", "forward" or "no". Replacing "yes" with "forward"...INFO [2020-10-15 16:25:54] 2020-10-15 16:25:54: Data processing started...INFO [2020-10-15 16:25:54] Read counts file: imported sam/bam/bed filesINFO [2020-10-15 16:25:54] Conditions: Tg, WtINFO [2020-10-15 16:25:54] Samples to include: MF3R23_Prox1-2, MF3R24_Prox1-new2, MF3R22_Prox1-1, MF3R19_GFP1, MF3R20_GFP2, MF3R21_GFPnew2INFO [2020-10-15 16:25:54] Samples to exclude: noneINFO [2020-10-15 16:25:54] Requested contrasts: Wt_vs_TgINFO [2020-10-15 16:25:54] Annotation: downloadINFO [2020-10-15 16:25:54] Organism: hg19INFO [2020-10-15 16:25:54] Reference source: ensemblINFO [2020-10-15 16:25:54] Count type: utrINFO [2020-10-15 16:25:54] 3' UTR flanking: 500INFO [2020-10-15 16:25:54] Transcriptional level: geneINFO [2020-10-15 16:25:54] Exon filters: none appliedINFO [2020-10-15 16:25:54] Gene filters: length, avg.reads, expression, biotypeINFO [2020-10-15 16:25:54] length: INFO [2020-10-15 16:25:54] length: 500INFO [2020-10-15 16:25:54] avg.reads: INFO [2020-10-15 16:25:54] average.per.bp: 100INFO [2020-10-15 16:25:54] quantile: 0.25INFO [2020-10-15 16:25:54] expression: INFO [2020-10-15 16:25:54] median: TRUEINFO [2020-10-15 16:25:54] mean: FALSEINFO [2020-10-15 16:25:54] quantile: NAINFO [2020-10-15 16:25:54] known: NAINFO [2020-10-15 16:25:54] custom: NAINFO [2020-10-15 16:25:54] biotype: INFO [2020-10-15 16:25:54] pseudogene: FALSEINFO [2020-10-15 16:25:54] lincRNA: FALSEINFO [2020-10-15 16:25:54] protein_coding: FALSEINFO [2020-10-15 16:25:54] antisense: FALSEINFO [2020-10-15 16:25:54] processed_transcript: FALSEINFO [2020-10-15 16:25:54] snRNA: FALSEINFO [2020-10-15 16:25:54] sense_intronic: FALSEINFO [2020-10-15 16:25:54] miRNA: FALSEINFO [2020-10-15 16:25:54] misc_RNA: FALSEINFO [2020-10-15 16:25:54] snoRNA: FALSEINFO [2020-10-15 16:25:54] rRNA: TRUEINFO [2020-10-15 16:25:54] polymorphic_pseudogene: FALSEINFO [2020-10-15 16:25:54] sense_overlapping: FALSEINFO [2020-10-15 16:25:54] three_prime_overlapping_ncrna: FALSEINFO [2020-10-15 16:25:54] TR_V_gene: FALSEINFO [2020-10-15 16:25:54] TR_V_pseudogene: TRUEINFO [2020-10-15 16:25:54] TR_D_gene: FALSEINFO [2020-10-15 16:25:54] TR_J_gene: FALSEINFO [2020-10-15 16:25:54] TR_C_gene: FALSEINFO [2020-10-15 16:25:54] TR_J_pseudogene: TRUEINFO [2020-10-15 16:25:54] IG_C_gene: FALSEINFO [2020-10-15 16:25:54] IG_C_pseudogene: TRUEINFO [2020-10-15 16:25:54] IG_J_gene: FALSEINFO [2020-10-15 16:25:54] IG_J_pseudogene: TRUEINFO [2020-10-15 16:25:54] IG_D_gene: FALSEINFO [2020-10-15 16:25:54] IG_V_gene: FALSEINFO [2020-10-15 16:25:54] IG_V_pseudogene: TRUEINFO [2020-10-15 16:25:54] Filter application: postnormINFO [2020-10-15 16:25:54] Normalization algorithm: deseqINFO [2020-10-15 16:25:54] Normalization arguments: INFO [2020-10-15 16:25:54] locfunc: INFO [2020-10-15 16:25:54] [[list(new("standardGeneric", .Data = function (x, na.rm = FALSE, ...) standardGeneric("median"), generic = "median", package = "stats", group = list(), valueClass = character(0), signature = c("x", "na.rm"), default = new("derivedDefaultMethod", .Data = function (x, na.rm = FALSE, ...) UseMethod("median"), target = new("signature", .Data = "ANY", names = "x", package = "methods"), defined = new("signature", .Data = "ANY", names = "x", package = "methods"), generic = "median"), skeleton = (new("derivedDefaultMethod", .Data = function (x, na.rm = FALSE, ...) UseMethod("median"), target = new("signature", .Data = "ANY", names = "x", package = "methods"), defined = new("signature", .Data = "ANY", names = "x", package = "methods"), generic = "median"))(x, na.rm, ...)))locfuncINFO [2020-10-15 16:25:54] Statistical algorithm: deseqINFO [2020-10-15 16:25:54] Statistical arguments: INFO [2020-10-15 16:25:54] deseq: blind, fit-only, localINFO [2020-10-15 16:25:54] Meta-analysis method: noneINFO [2020-10-15 16:25:54] Multiple testing correction: BHINFO [2020-10-15 16:25:54] p-value threshold: 0.05INFO [2020-10-15 16:25:54] Logarithmic transformation offset: 1INFO [2020-10-15 16:25:54] Quality control plots: mds, biodetection, countsbio, saturation, readnoise, filtered, correl, pairwise, boxplot, gcbias, lengthbias, meandiff, meanvar, biodist, volcano, deheatmapINFO [2020-10-15 16:25:54] Figure format: png, pdfINFO [2020-10-15 16:25:54] Output directory: /data/images/proton3/run451/metaseqr-mat/metaseqr_run451INFO [2020-10-15 16:25:54] Output data: annotation, p.value, adj.p.value, fold.change, counts, flagsINFO [2020-10-15 16:25:54] Output scale(s): log2, rpgmINFO [2020-10-15 16:25:54] Output values: normalizedINFO [2020-10-15 16:25:54] Downloading gene annotation for hg19...WARN [2020-10-15 17:42:31] "yes" and "no" for read strandedness have been deprecated. Please use "forward", "forward" or "no". Replacing "yes" with "forward"...INFO [2020-10-15 17:42:31] 2020-10-15 17:42:31: Data processing started...INFO [2020-10-15 17:42:31] Read counts file: imported sam/bam/bed filesINFO [2020-10-15 17:42:31] Conditions: Tg, WtINFO [2020-10-15 17:42:31] Samples to include: MF3R23_Prox1-2, MF3R24_Prox1-new2, MF3R22_Prox1-1, MF3R19_GFP1, MF3R20_GFP2, MF3R21_GFPnew2INFO [2020-10-15 17:42:31] Samples to exclude: noneINFO [2020-10-15 17:42:31] Requested contrasts: Wt_vs_TgINFO [2020-10-15 17:42:31] Annotation: downloadINFO [2020-10-15 17:42:31] Organism: hg19INFO [2020-10-15 17:42:31] Reference source: ensemblINFO [2020-10-15 17:42:31] Count type: utrINFO [2020-10-15 17:42:31] 3' UTR flanking: 500INFO [2020-10-15 17:42:31] Transcriptional level: geneINFO [2020-10-15 17:42:31] Exon filters: none appliedINFO [2020-10-15 17:42:31] Gene filters: length, avg.reads, expression, biotypeINFO [2020-10-15 17:42:31] length: INFO [2020-10-15 17:42:31] length: 500INFO [2020-10-15 17:42:31] avg.reads: INFO [2020-10-15 17:42:31] average.per.bp: 100INFO [2020-10-15 17:42:31] quantile: 0.25INFO [2020-10-15 17:42:31] expression: INFO [2020-10-15 17:42:31] median: TRUEINFO [2020-10-15 17:42:31] mean: FALSEINFO [2020-10-15 17:42:31] quantile: NAINFO [2020-10-15 17:42:31] known: NAINFO [2020-10-15 17:42:31] custom: NAINFO [2020-10-15 17:42:31] biotype: INFO [2020-10-15 17:42:31] pseudogene: FALSEINFO [2020-10-15 17:42:31] lincRNA: FALSEINFO [2020-10-15 17:42:31] protein_coding: FALSEINFO [2020-10-15 17:42:31] antisense: FALSEINFO [2020-10-15 17:42:31] processed_transcript: FALSEINFO [2020-10-15 17:42:31] snRNA: FALSEINFO [2020-10-15 17:42:31] sense_intronic: FALSEINFO [2020-10-15 17:42:31] miRNA: FALSEINFO [2020-10-15 17:42:31] misc_RNA: FALSEINFO [2020-10-15 17:42:31] snoRNA: FALSEINFO [2020-10-15 17:42:31] rRNA: TRUEINFO [2020-10-15 17:42:31] polymorphic_pseudogene: FALSEINFO [2020-10-15 17:42:31] sense_overlapping: FALSEINFO [2020-10-15 17:42:31] three_prime_overlapping_ncrna: FALSEINFO [2020-10-15 17:42:31] TR_V_gene: FALSEINFO [2020-10-15 17:42:31] TR_V_pseudogene: TRUEINFO [2020-10-15 17:42:31] TR_D_gene: FALSEINFO [2020-10-15 17:42:31] TR_J_gene: FALSEINFO [2020-10-15 17:42:31] TR_C_gene: FALSEINFO [2020-10-15 17:42:31] TR_J_pseudogene: TRUEINFO [2020-10-15 17:42:31] IG_C_gene: FALSEINFO [2020-10-15 17:42:31] IG_C_pseudogene: TRUEINFO [2020-10-15 17:42:31] IG_J_gene: FALSEINFO [2020-10-15 17:42:31] IG_J_pseudogene: TRUEINFO [2020-10-15 17:42:31] IG_D_gene: FALSEINFO [2020-10-15 17:42:31] IG_V_gene: FALSEINFO [2020-10-15 17:42:31] IG_V_pseudogene: TRUEINFO [2020-10-15 17:42:31] Filter application: postnormINFO [2020-10-15 17:42:31] Normalization algorithm: deseqINFO [2020-10-15 17:42:31] Normalization arguments: INFO [2020-10-15 17:42:31] locfunc: INFO [2020-10-15 17:42:31] [[list(new("standardGeneric", .Data = function (x, na.rm = FALSE, ...) standardGeneric("median"), generic = "median", package = "stats", group = list(), valueClass = character(0), signature = c("x", "na.rm"), default = new("derivedDefaultMethod", .Data = function (x, na.rm = FALSE, ...) UseMethod("median"), target = new("signature", .Data = "ANY", names = "x", package = "methods"), defined = new("signature", .Data = "ANY", names = "x", package = "methods"), generic = "median"), skeleton = (new("derivedDefaultMethod", .Data = function (x, na.rm = FALSE, ...) UseMethod("median"), target = new("signature", .Data = "ANY", names = "x", package = "methods"), defined = new("signature", .Data = "ANY", names = "x", package = "methods"), generic = "median"))(x, na.rm, ...)))locfuncINFO [2020-10-15 17:42:31] Statistical algorithm: deseqINFO [2020-10-15 17:42:31] Statistical arguments: INFO [2020-10-15 17:42:31] deseq: blind, fit-only, localINFO [2020-10-15 17:42:31] Meta-analysis method: noneINFO [2020-10-15 17:42:31] Multiple testing correction: BHINFO [2020-10-15 17:42:31] p-value threshold: 0.05INFO [2020-10-15 17:42:31] Logarithmic transformation offset: 1INFO [2020-10-15 17:42:31] Quality control plots: mds, biodetection, countsbio, saturation, readnoise, filtered, correl, pairwise, boxplot, gcbias, lengthbias, meandiff, meanvar, biodist, volcano, deheatmapINFO [2020-10-15 17:42:31] Figure format: png, pdfINFO [2020-10-15 17:42:31] Output directory: /data/images/proton3/run451/metaseqr-mat/metaseqr_run451INFO [2020-10-15 17:42:31] Output data: annotation, p.value, adj.p.value, fold.change, counts, flagsINFO [2020-10-15 17:42:31] Output scale(s): log2, rpgmINFO [2020-10-15 17:42:31] Output values: normalizedINFO [2020-10-15 17:42:31] Downloading gene annotation for hg19...WARN [2020-10-16 11:39:55] "yes" and "no" for read strandedness have been deprecated. Please use "forward", "forward" or "no". Replacing "yes" with "forward"...INFO [2020-10-16 11:39:55] 2020-10-16 11:39:55: Data processing started...INFO [2020-10-16 11:39:55] Read counts file: imported sam/bam/bed filesINFO [2020-10-16 11:39:55] Conditions: Tg, WtINFO [2020-10-16 11:39:55] Samples to include: MF3R23_Prox1-2, MF3R24_Prox1-new2, MF3R22_Prox1-1, MF3R19_GFP1, MF3R20_GFP2, MF3R21_GFPnew2INFO [2020-10-16 11:39:55] Samples to exclude: noneINFO [2020-10-16 11:39:55] Requested contrasts: Wt_vs_TgINFO [2020-10-16 11:39:55] Annotation: downloadINFO [2020-10-16 11:39:55] Organism: hg19INFO [2020-10-16 11:39:55] Reference source: ensemblINFO [2020-10-16 11:39:55] Count type: utrINFO [2020-10-16 11:39:55] 3' UTR flanking: 500INFO [2020-10-16 11:39:55] Transcriptional level: geneINFO [2020-10-16 11:39:55] Exon filters: none appliedINFO [2020-10-16 11:39:55] Gene filters: length, avg.reads, expression, biotypeINFO [2020-10-16 11:39:55] length: INFO [2020-10-16 11:39:55] length: 500INFO [2020-10-16 11:39:55] avg.reads: INFO [2020-10-16 11:39:55] average.per.bp: 100INFO [2020-10-16 11:39:55] quantile: 0.25INFO [2020-10-16 11:39:55] expression: INFO [2020-10-16 11:39:55] median: TRUEINFO [2020-10-16 11:39:55] mean: FALSEINFO [2020-10-16 11:39:55] quantile: NAINFO [2020-10-16 11:39:55] known: NAINFO [2020-10-16 11:39:55] custom: NAINFO [2020-10-16 11:39:55] biotype: INFO [2020-10-16 11:39:55] pseudogene: FALSEINFO [2020-10-16 11:39:55] lincRNA: FALSEINFO [2020-10-16 11:39:55] protein_coding: FALSEINFO [2020-10-16 11:39:55] antisense: FALSEINFO [2020-10-16 11:39:55] processed_transcript: FALSEINFO [2020-10-16 11:39:55] snRNA: FALSEINFO [2020-10-16 11:39:55] sense_intronic: FALSEINFO [2020-10-16 11:39:55] miRNA: FALSEINFO [2020-10-16 11:39:55] misc_RNA: FALSEINFO [2020-10-16 11:39:55] snoRNA: FALSEINFO [2020-10-16 11:39:55] rRNA: TRUEINFO [2020-10-16 11:39:55] polymorphic_pseudogene: FALSEINFO [2020-10-16 11:39:55] sense_overlapping: FALSEINFO [2020-10-16 11:39:55] three_prime_overlapping_ncrna: FALSEINFO [2020-10-16 11:39:55] TR_V_gene: FALSEINFO [2020-10-16 11:39:55] TR_V_pseudogene: TRUEINFO [2020-10-16 11:39:55] TR_D_gene: FALSEINFO [2020-10-16 11:39:55] TR_J_gene: FALSEINFO [2020-10-16 11:39:55] TR_C_gene: FALSEINFO [2020-10-16 11:39:55] TR_J_pseudogene: TRUEINFO [2020-10-16 11:39:55] IG_C_gene: FALSEINFO [2020-10-16 11:39:55] IG_C_pseudogene: TRUEINFO [2020-10-16 11:39:55] IG_J_gene: FALSEINFO [2020-10-16 11:39:55] IG_J_pseudogene: TRUEINFO [2020-10-16 11:39:55] IG_D_gene: FALSEINFO [2020-10-16 11:39:55] IG_V_gene: FALSEINFO [2020-10-16 11:39:55] IG_V_pseudogene: TRUEINFO [2020-10-16 11:39:55] Filter application: postnormINFO [2020-10-16 11:39:55] Normalization algorithm: deseqINFO [2020-10-16 11:39:55] Normalization arguments: INFO [2020-10-16 11:39:55] locfunc: INFO [2020-10-16 11:39:55] [[list(new("standardGeneric", .Data = function (x, na.rm = FALSE, ...) standardGeneric("median"), generic = "median", package = "stats", group = list(), valueClass = character(0), signature = c("x", "na.rm"), default = new("derivedDefaultMethod", .Data = function (x, na.rm = FALSE, ...) UseMethod("median"), target = new("signature", .Data = "ANY", names = "x", package = "methods"), defined = new("signature", .Data = "ANY", names = "x", package = "methods"), generic = "median"), skeleton = (new("derivedDefaultMethod", .Data = function (x, na.rm = FALSE, ...) UseMethod("median"), target = new("signature", .Data = "ANY", names = "x", package = "methods"), defined = new("signature", .Data = "ANY", names = "x", package = "methods"), generic = "median"))(x, na.rm, ...)))locfuncINFO [2020-10-16 11:39:55] Statistical algorithm: deseqINFO [2020-10-16 11:39:55] Statistical arguments: INFO [2020-10-16 11:39:55] deseq: blind, fit-only, localINFO [2020-10-16 11:39:55] Meta-analysis method: noneINFO [2020-10-16 11:39:55] Multiple testing correction: BHINFO [2020-10-16 11:39:55] p-value threshold: 0.05INFO [2020-10-16 11:39:55] Logarithmic transformation offset: 1INFO [2020-10-16 11:39:55] Quality control plots: mds, biodetection, countsbio, saturation, readnoise, filtered, correl, pairwise, boxplot, gcbias, lengthbias, meandiff, meanvar, biodist, volcano, deheatmapINFO [2020-10-16 11:39:55] Figure format: png, pdfINFO [2020-10-16 11:39:55] Output directory: /data/images/proton3/run451/metaseqr-mat/metaseqr_run451INFO [2020-10-16 11:39:55] Output data: annotation, p.value, adj.p.value, fold.change, counts, flagsINFO [2020-10-16 11:39:55] Output scale(s): log2, rpgmINFO [2020-10-16 11:39:55] Output values: normalizedINFO [2020-10-16 11:39:55] Downloading gene annotation for hg19...INFO [2020-10-16 11:40:02] Downloading transcript annotation for hg19...INFO [2020-10-16 11:43:02] Converting annotation to GenomicRanges object...INFO [2020-10-16 11:43:02] Merging transcript 3' UTRs to create unique gene models...INFO [2020-10-16 11:52:32] Flanking merged transcript 3' UTRs per 500bp...INFO [2020-10-16 11:52:35] Reading bam file MF3R23_Prox1-2.bam for sample with name MF3R23_Prox1-2. This might take some time...INFO [2020-10-16 11:52:35] Reading bam file MF3R24_Prox1-new2.bam for sample with name MF3R24_Prox1-new2. This might take some time...INFO [2020-10-16 11:52:35] Reading bam file MF3R22_Prox1-1.bam for sample with name MF3R22_Prox1-1. This might take some time...INFO [2020-10-16 11:52:35] Reading bam file MF3R19_GFP1.bam for sample with name MF3R19_GFP1. This might take some time...INFO [2020-10-16 11:52:35] Reading bam file MF3R20_GFP2.bam for sample with name MF3R20_GFP2. This might take some time...INFO [2020-10-16 11:52:35] Reading bam file MF3R21_GFPnew2.bam for sample with name MF3R21_GFPnew2. This might take some time...INFO [2020-10-16 11:52:44] Counting reads overlapping with given annotation...INFO [2020-10-16 11:52:44] ...for single-end reads...INFO [2020-10-16 11:52:44] ...assuming forward sequenced reads...INFO [2020-10-16 11:52:46] Counting reads overlapping with given annotation...INFO [2020-10-16 11:52:46] ...for single-end reads...INFO [2020-10-16 11:52:46] ...assuming forward sequenced reads...INFO [2020-10-16 11:52:46] Counting reads overlapping with given annotation...INFO [2020-10-16 11:52:46] ...for single-end reads...INFO [2020-10-16 11:52:46] ...assuming forward sequenced reads...INFO [2020-10-16 11:52:46] Counting reads overlapping with given annotation...INFO [2020-10-16 11:52:46] ...for single-end reads...INFO [2020-10-16 11:52:46] ...assuming forward sequenced reads...INFO [2020-10-16 11:52:48] Counting reads overlapping with given annotation...INFO [2020-10-16 11:52:48] ...for single-end reads...INFO [2020-10-16 11:52:48] ...assuming forward sequenced reads...INFO [2020-10-16 11:52:51] Counting reads overlapping with given annotation...INFO [2020-10-16 11:52:51] ...for single-end reads...INFO [2020-10-16 11:52:51] ...assuming forward sequenced reads...INFO [2020-10-16 11:53:58] Exporting raw read counts table to /data/images/proton3/run451/metaseqr-mat/metaseqr_run451/lists/raw_counts_table.txt.gzINFO [2020-10-16 11:54:02] Checking chromosomes in transcript counts and gene annotation...INFO [2020-10-16 11:54:02] Processing transcripts...INFO [2020-10-16 11:54:02] Separating transcripts (UTR regions) per for MF3R23_Prox1-2...INFO [2020-10-16 11:54:44] Separating transcripts (UTR regions) per for MF3R24_Prox1-new2...INFO [2020-10-16 11:55:24] Separating transcripts (UTR regions) per for MF3R22_Prox1-1...INFO [2020-10-16 11:56:04] Separating transcripts (UTR regions) per for MF3R19_GFP1...INFO [2020-10-16 11:56:44] Separating transcripts (UTR regions) per for MF3R20_GFP2...INFO [2020-10-16 11:57:24] Separating transcripts (UTR regions) per for MF3R21_GFPnew2...INFO [2020-10-16 11:58:07] Saving gene model to /data/images/proton3/run451/metaseqr-mat/metaseqr_run451/data/gene_model.RDataINFO [2020-10-16 11:58:12] Summarizing count data...INFO [2020-10-16 11:58:26] Removing genes with zero counts in all samples...INFO [2020-10-16 11:58:26] Normalizing with: deseqINFO [2020-10-16 11:58:29] Applying gene filter length...INFO [2020-10-16 11:58:30] Threshold below which ignored: 500INFO [2020-10-16 11:58:30] Applying gene filter avg.reads...INFO [2020-10-16 11:58:30] Threshold below which ignored: 0.0398168441025095INFO [2020-10-16 11:58:30] Applying gene filter expression...INFO [2020-10-16 11:58:30] Threshold below which ignored: 5INFO [2020-10-16 11:58:30] Applying gene filter biotype...INFO [2020-10-16 11:58:30] Biotypes ignored: rRNA, TR_V_pseudogene, TR_J_pseudogene, IG_C_pseudogene, IG_J_pseudogene, IG_V_pseudogeneINFO [2020-10-16 11:58:30] 41761 genes filtered outINFO [2020-10-16 11:58:30] 15975 genes remain after filteringINFO [2020-10-16 11:58:30] Running statistical tests with: deseqINFO [2020-10-16 11:58:31] Contrast: Wt_vs_TgINFO [2020-10-16 11:58:45] Contrast Wt_vs_Tg: found 221 genesINFO [2020-10-16 11:58:45] Exporting and compressing normalized read counts table to /data/images/proton3/run451/metaseqr-mat/metaseqr_run451/lists/normalized_counts_table.txtINFO [2020-10-16 11:58:46] Building output files...INFO [2020-10-16 11:58:50] Contrast: Wt_vs_TgINFO [2020-10-16 11:58:50] Adding non-filtered data...INFO [2020-10-16 11:58:50] binding annotation...INFO [2020-10-16 11:58:51] binding p-values...INFO [2020-10-16 11:58:51] binding FDRs...INFO [2020-10-16 11:58:52] binding log2 normalized fold changes...INFO [2020-10-16 11:58:52] binding all normalized counts for Wt...INFO [2020-10-16 11:58:52] binding all normalized counts for Tg...INFO [2020-10-16 11:58:53] binding filtering flags...INFO [2020-10-16 11:58:53] Writing output...INFO [2020-10-16 11:58:53] Adding filtered data...INFO [2020-10-16 11:58:53] binding annotation...INFO [2020-10-16 11:58:53] binding p-values...INFO [2020-10-16 11:58:53] binding FDRs...INFO [2020-10-16 11:58:54] binding log2 normalized fold changes...INFO [2020-10-16 11:58:54] binding all normalized counts for Wt...INFO [2020-10-16 11:58:54] binding all normalized counts for Tg...INFO [2020-10-16 11:58:54] binding filtering flags...INFO [2020-10-16 11:58:55] Writing output...INFO [2020-10-16 11:58:57] Creating quality control graphs...INFO [2020-10-16 11:58:57] Plotting in png format...INFO [2020-10-16 11:58:57] Plotting mds...INFO [2020-10-16 11:58:57] Plotting biodetection...INFO [2020-10-16 11:58:58] Plotting countsbio...INFO [2020-10-16 11:59:00] Plotting saturation...INFO [2020-10-16 11:59:08] Plotting readnoise...INFO [2020-10-16 11:59:08] Plotting correl...INFO [2020-10-16 11:59:09] Plotting pairwise...INFO [2020-10-16 11:59:21] Plotting boxplot...INFO [2020-10-16 11:59:21] Plotting gcbias...INFO [2020-10-16 11:59:22] Plotting lengthbias...INFO [2020-10-16 11:59:23] Plotting meandiff...INFO [2020-10-16 11:59:24] Plotting meanvar...INFO [2020-10-16 11:59:38] Plotting boxplot...INFO [2020-10-16 11:59:38] Plotting gcbias...INFO [2020-10-16 11:59:38] Plotting lengthbias...INFO [2020-10-16 11:59:39] Plotting meandiff...INFO [2020-10-16 11:59:40] Plotting meanvar...INFO [2020-10-16 11:59:54] Plotting biodist...INFO [2020-10-16 11:59:54] Contrast: Wt_vs_TgINFO [2020-10-16 11:59:54] Plotting volcano...INFO [2020-10-16 11:59:54] Contrast: Wt_vs_TgINFO [2020-10-16 11:59:55] Plotting deheatmap...INFO [2020-10-16 11:59:55] Contrast: Wt_vs_TgINFO [2020-10-16 12:00:36] Plotting filtered...INFO [2020-10-16 12:00:36] Plotting in pdf format...INFO [2020-10-16 12:00:36] Plotting mds...INFO [2020-10-16 12:00:36] Plotting biodetection...INFO [2020-10-16 12:00:37] Plotting countsbio...INFO [2020-10-16 12:00:38] Plotting saturation...INFO [2020-10-16 12:00:42] Plotting readnoise...INFO [2020-10-16 12:00:43] Plotting correl...INFO [2020-10-16 12:00:43] Plotting pairwise...INFO [2020-10-16 12:00:50] Plotting boxplot...INFO [2020-10-16 12:00:50] Plotting gcbias...INFO [2020-10-16 12:00:51] Plotting lengthbias...INFO [2020-10-16 12:00:52] Plotting meandiff...INFO [2020-10-16 12:00:53] Plotting meanvar...INFO [2020-10-16 12:01:07] Plotting boxplot...INFO [2020-10-16 12:01:07] Plotting gcbias...INFO [2020-10-16 12:01:07] Plotting lengthbias...INFO [2020-10-16 12:01:08] Plotting meandiff...INFO [2020-10-16 12:01:10] Plotting meanvar...INFO [2020-10-16 12:01:23] Plotting biodist...INFO [2020-10-16 12:01:23] Contrast: Wt_vs_TgINFO [2020-10-16 12:01:24] Plotting volcano...INFO [2020-10-16 12:01:24] Contrast: Wt_vs_TgINFO [2020-10-16 12:01:24] Plotting deheatmap...INFO [2020-10-16 12:01:24] Contrast: Wt_vs_TgINFO [2020-10-16 12:02:01] Plotting filtered...INFO [2020-10-16 12:02:01] Creating HTML report...INFO [2020-10-16 12:02:01] Compressing figures...
Multidimensional scaling
Biotype detection
Biotype detection counts
Read and biotype saturation
RNA-Seq reads noise
Correlation plots
Pairwise scatterplots
Boxplots
GC content bias
Gene/transcript length bias
Mean-difference
Mean-variance
Chromosome and biotype distribution of filtered gene s
Volcano plots
DEG heatmaps
Chromosome and biotype distributions of DEGs
Results
DEG table for the contrast Wt_vs_TgThe following table presents the top 5% statistically significant genes (use the download links below the table to retrieve the whole list) for the contrast Wt_vs_Tg. The fields of the table correspond to the requested features to be exported. The table can be searched using the search field on the top right. | chromosome | start | end | gene_id | gc_content | strand | gene_name | biotype | p-value_deseq | FDR_deseq | log2_normalized_fold_change_Wt_vs_Tg | log2_normalized_counts_MF3R19_GFP1 | log2_normalized_counts_MF3R20_GFP2 | log2_normalized_counts_MF3R21_GFPnew2 | rpgm_normalized_counts_MF3R19_GFP1 | rpgm_normalized_counts_MF3R20_GFP2 | rpgm_normalized_counts_MF3R21_GFPnew2 | log2_normalized_counts_MF3R23_Prox1-2 | log2_normalized_counts_MF3R24_Prox1-new2 | log2_normalized_counts_MF3R22_Prox1-1 | rpgm_normalized_counts_MF3R23_Prox1-2 | rpgm_normalized_counts_MF3R24_Prox1-new2 | rpgm_normalized_counts_MF3R22_Prox1-1 | LN | AR | MD | MN | QN | KN | CM | BT | PR |
|---|
| chr14 | 29241910 | 29282493 | ENSG00000186960 | 37.43 | + | C14orf23 | protein_coding | 1.94e-07 | 0.00247 | 3.595610 | 5.00e+00 | 3.58e+00 | 0.00e+00 | 5.98e-03 | 2.12e-03 | 0.00e+00 | 7.75e+00 | 5.78e+00 | 8.10e+00 | 4.13e-02 | 1.04e-02 | 5.27e-02 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | chr19 | 29097595 | 29106136 | ENSG00000266976 | 48.27 | + | AC079466.1 | lincRNA | 3.09e-07 | 0.00247 | 4.523562 | 0.00e+00 | 0.00e+00 | 1.00e+00 | 0.00e+00 | 0.00e+00 | 8.36e-04 | 3.00e+00 | 6.36e+00 | 1.00e+00 | 5.85e-03 | 6.77e-02 | 8.36e-04 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | chr1 | 214156524 | 214214595 | ENSG00000117707 | 41.19 | + | PROX1 | protein_coding | 1.51e-06 | 0.00806 | 4.292782 | 1.00e+00 | 1.00e+00 | 0.00e+00 | 9.80e-05 | 9.80e-05 | 0.00e+00 | 5.21e+00 | 5.88e+00 | 1.00e+00 | 3.53e-03 | 5.69e-03 | 9.80e-05 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | chr13 | 21950263 | 22033509 | ENSG00000180776 | 39.79 | - | ZDHHC20 | protein_coding | 4.12e-06 | 0.01644 | 4.058894 | 1.00e+00 | 1.00e+00 | 1.00e+00 | 1.55e-04 | 1.55e-04 | 1.55e-04 | 5.46e+00 | 5.75e+00 | 1.00e+00 | 6.66e-03 | 8.21e-03 | 1.55e-04 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | chr15 | 41245160 | 41248710 | ENSG00000128965 | 57.90 | + | CHAC1 | protein_coding | 3.79e-05 | 0.12113 | 2.123199 | 4.32e+00 | 4.91e+00 | 6.30e+00 | 9.82e-03 | 1.50e-02 | 4.03e-02 | 6.17e+00 | 8.83e+00 | 5.13e+00 | 3.67e-02 | 2.35e-01 | 1.76e-02 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | chr14 | 24099324 | 24114848 | ENSG00000100867 | 49.50 | + | DHRS2 | protein_coding | 6.48e-05 | 0.17244 | 2.283172 | 6.58e+00 | 4.91e+00 | 5.32e+00 | 2.21e-02 | 6.76e-03 | 9.09e-03 | 7.80e+00 | 8.92e+00 | 6.67e+00 | 5.17e-02 | 1.12e-01 | 2.35e-02 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | chr1 | 161192082 | 161193421 | ENSG00000158874 | 53.51 | - | APOA2 | protein_coding | 1.50e-04 | 0.34248 | -2.606024 | 6.19e+00 | 5.52e+00 | 6.46e+00 | 2.76e-02 | 1.73e-02 | 3.34e-02 | 0.00e+00 | 0.00e+00 | 5.00e+00 | 0.00e+00 | 0.00e+00 | 1.19e-02 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | chr19 | 45449243 | 45452822 | ENSG00000234906 | 53.32 | + | APOC2 | protein_coding | 3.23e-04 | 0.64045 | -3.058894 | 4.91e+00 | 5.36e+00 | 2.00e+00 | 9.98e-03 | 1.38e-02 | 1.03e-03 | 1.00e+00 | 0.00e+00 | 2.58e+00 | 3.44e-04 | 0.00e+00 | 1.72e-03 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | chr19 | 45445495 | 45452822 | ENSG00000224916 | 54.07 | + | APOC4-APOC2 | protein_coding | 4.17e-04 | 0.64045 | -2.944858 | 4.95e+00 | 5.36e+00 | 2.32e+00 | 5.36e-03 | 7.15e-03 | 7.15e-04 | 1.00e+00 | 0.00e+00 | 2.81e+00 | 1.79e-04 | 0.00e+00 | 1.07e-03 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | chrX | 132669773 | 133119922 | ENSG00000147257 | 39.66 | - | GPC3 | protein_coding | 4.42e-04 | 0.64045 | -3.115477 | 4.81e+00 | 4.17e+00 | 2.58e+00 | 9.52e-03 | 5.99e-03 | 1.76e-03 | 0.00e+00 | 0.00e+00 | 2.00e+00 | 0.00e+00 | 0.00e+00 | 1.06e-03 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | chr19 | 45445495 | 45452820 | ENSG00000267467 | 54.08 | + | APOC4 | protein_coding | 4.70e-04 | 0.64045 | -2.862496 | 5.04e+00 | 5.36e+00 | 2.58e+00 | 4.63e-03 | 5.79e-03 | 7.23e-04 | 1.00e+00 | 0.00e+00 | 3.00e+00 | 1.45e-04 | 0.00e+00 | 1.01e-03 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | chr12 | 6647541 | 6665239 | ENSG00000010295 | 54.31 | - | IFFO1 | protein_coding | 4.81e-04 | 0.64045 | -3.536053 | 1.00e+00 | 5.09e+00 | 4.46e+00 | 7.61e-05 | 2.51e-03 | 1.60e-03 | 1.00e+00 | 1.00e+00 | 0.00e+00 | 7.61e-05 | 7.61e-05 | 0.00e+00 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | Download the DEG result list for Wt_vs_Tg.Download the whole result list for Wt_vs_Tg.Download the raw read counts table for the experiment.Download the normalized read counts table for the experiment.
References
|
|