Cufflinks report, contrast wt_vs_SET9ko

The samples have the following names, conditions, replicate number and number of reads:

sample_name       condition_replicateID    n_reads
__________________________________________________
IT3R9-wt2         wt_0                     6178908
IT3R10-wt3        wt_1                     9087907
IT3R11-wt4        wt_2                     5052072
IT3R12-wt5        wt_3                     6024589
IT3R13-SET9ko3    SET9ko_0                 6968696
IT3R14-SET9ko4    SET9ko_1                 5116923
IT3R15-SET9ko5    SET9ko_2                 4335051
IT3R16-SET9ko7    SET9ko_3                 8983036





Normalized UCSC genome browser tracks

Tracks normalized to the smallest library
IT3R15-SET9ko5 with 4335051 reads are at:

http://genomics-lab.fleming.gr/cgi-bin/hgTracks?db=mm10&hubUrl=http://genomics-lab.fleming.gr/fleming/ITlab/run297/sofia/hub.txt


Differentially expressed genes (DEGs)

A speadsheet with differential expression information for the contrast wt_vs_SET9ko for all known gense and novel genes identified by cufflinks are at:

http://genomics-lab.fleming.gr/fleming/ITlab/run297/sofia/cuffdiff_wt_vs_SET9ko/gene_exp.diff.xlsx


The number of known and novel DEG up to q-value 0.05 is 256
The number of known DEG up to q-value 0.05 is 177
The number of known and novel DEG up to q-value 0.01 is 152
The number of known DEG up to q-value 0.01 is 106



Differentially expressed isoforms

A speadsheet with differential expression information for the contrast wt_vs_SET9ko for all known isoforms and novel isoforms found by cufflinks including biotype information is at:

http://genomics-lab.fleming.gr/fleming/ITlab/run297/sofia/cuffdiff_wt_vs_SET9ko/isoform_exp.diff.xlsx




Multi-Dimensional Scaling plot

Multi-Dimensional Scaling (MDS) plots comprise a means of visualizing the level of similarity of individual cases of a dataset. MDS uses absolute distance metrics such as the classical Euclidean distance as a similarity measure. MDS serves as a quality control and it can be interpreted as follows: when the distance among samples of the same biological condition in the MDS space is small, this is an indication of high correlation and reproducibility among them. When this distance is larger, this constitutes an indication of low correlation and reproducibility among samples. It can help to identify mislabled samples or to exclude poor samples from further analysis.



MDS plot based on gene expression




Heatmaps

Differentially Expressed Genes (DEGs) heatmaps depict how well samples from different conditions cluster together according to their expression values after normalization and statistical testing, for each requested statistical contrast. If samples from the same biological condition do not cluster together, this would constitute a warning sign regarding the quality of the samples. In addition, DEG heatmaps provide an initial view of possible clusters of co-expressed genes.



DEG heatmap with q-value cutoff = 0.01




Volcano plot

A volcano plot is a scatterplot to give an overview of interesting genes. The log2 fold change is plotted on the x-axis and the negative log10 p-value is plotted on the y-axis. A volcano plot combines the results of a statistical test (aka, p-values) with the magnitude of the change enabling quick visual identification of those genes that display large-magnitude changes that are also statistically significant.




Additional files

All files of the cufflinks/cuffdiff analysis can be found at:

http://genomics-lab.fleming.gr/fleming/ITlab/run297/sofia/cuffdiff_wt_vs_SET9ko

The format of all files is explained at http://cole-trapnell-lab.github.io/cufflinks/cuffdiff/index.html#cuffdiff-output-files.
References and more information can be found at http://cole-trapnell-lab.github.io/cufflinks/papers.




This report was made on Tue Jul 19 14:51:10 2016 with the Cufflinks report generator by Martin Reczko.