Cufflinks report, contrast wt_vs_ko

The samples have the following names, conditions, replicate number and number of reads:

sample_name    condition_replicateID    n_reads
_______________________________________________
IT3R23-wt6     wt_0                     6790633
IT3R24-wt7     wt_1                     4263775
IT3R25-wt8     wt_2                     7113821
IT3R26-wt9     wt_3                     5519916
IT3R27-ko4     ko_0                     5635739
IT3R28-ko5     ko_1                     6095915
IT3R29-ko6     ko_2                     4001402
IT3R30-ko7     ko_3                     4687169





Normalized UCSC genome browser tracks

Tracks normalized to the smallest library
IT3R33-ko4TCP36 with 3670897 reads are at:

http://genomics-lab.fleming.gr/cgi-bin/hgTracks?db=mm10&hubUrl=http://genomics-lab.fleming.gr/fleming/ITlab/run299evangelia/hub.txt


Differentially expressed genes (DEGs)

A speadsheet with differential expression information for the contrast wt_vs_ko for all known gense and novel genes identified by cufflinks are at:

http://genomics-lab.fleming.gr/fleming/ITlab/run299evangelia/cuffdiff_wt_vs_ko/gene_exp.diff.xlsx


The number of known and novel DEG up to q-value 0.05 is 1017
The number of known DEG up to q-value 0.05 is 672
The number of known and novel DEG up to q-value 0.01 is 500
The number of known DEG up to q-value 0.01 is 342



Differentially expressed isoforms

A speadsheet with differential expression information for the contrast wt_vs_ko for all known isoforms and novel isoforms found by cufflinks including biotype information is at:

http://genomics-lab.fleming.gr/fleming/ITlab/run299evangelia/cuffdiff_wt_vs_ko/isoform_exp.diff.xlsx




Multi-Dimensional Scaling plot

Multi-Dimensional Scaling (MDS) plots comprise a means of visualizing the level of similarity of individual cases of a dataset. MDS uses absolute distance metrics such as the classical Euclidean distance as a similarity measure. MDS serves as a quality control and it can be interpreted as follows: when the distance among samples of the same biological condition in the MDS space is small, this is an indication of high correlation and reproducibility among them. When this distance is larger, this constitutes an indication of low correlation and reproducibility among samples. It can help to identify mislabled samples or to exclude poor samples from further analysis.



MDS plot based on gene expression




Heatmaps

Differentially Expressed Genes (DEGs) heatmaps depict how well samples from different conditions cluster together according to their expression values after normalization and statistical testing, for each requested statistical contrast. If samples from the same biological condition do not cluster together, this would constitute a warning sign regarding the quality of the samples. In addition, DEG heatmaps provide an initial view of possible clusters of co-expressed genes.



DEG heatmap with q-value cutoff = 0.01




Volcano plot

A volcano plot is a scatterplot to give an overview of interesting genes. The log2 fold change is plotted on the x-axis and the negative log10 p-value is plotted on the y-axis. A volcano plot combines the results of a statistical test (aka, p-values) with the magnitude of the change enabling quick visual identification of those genes that display large-magnitude changes that are also statistically significant.




Additional files

All files of the cufflinks/cuffdiff analysis can be found at:

http://genomics-lab.fleming.gr/fleming/ITlab/run299evangelia/cuffdiff_wt_vs_ko

The format of all files is explained at http://cole-trapnell-lab.github.io/cufflinks/cuffdiff/index.html#cuffdiff-output-files.
References and more information can be founf at http://cole-trapnell-lab.github.io/cufflinks/papers.




This report was made on Tue Jul 19 14:18:29 2016 with the Cufflinks report generator by Martin Reczko.