Setup

Before you begin, make sure the required R packages are installed. If they are not, follow the install guide for the relevant packages.
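A quick way to see whether anything is missing is to compare the packages loaded in DESeq_STAR.R against what R reports as installed. This is only a sketch: the package list is taken from the `library()` calls in the script, and the install hint in the comment is an assumption that may differ from the course install guide.

```r
# Packages loaded by DESeq_STAR.R (from its library() calls)
needed <- c("DESeq2", "pheatmap", "apeglm", "EnhancedVolcano", "ggpubr")

# Compare against the packages R can currently see
missing_pkgs <- needed[!(needed %in% rownames(installed.packages()))]

if (length(missing_pkgs) == 0) {
  message("All required packages are installed.")
} else {
  # Assumption: DESeq2, apeglm and EnhancedVolcano come from Bioconductor
  # (BiocManager::install), pheatmap and ggpubr from CRAN (install.packages).
  message("Missing packages: ", paste(missing_pkgs, collapse = ", "))
}
```

If the message lists any packages, install them before moving on; the optional volcano plot only needs the last three.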

The R script for this lab (DESeq_STAR.R) is in the course GitHub repository, and you should already have it downloaded from our previous exercises.

If you don't already have the GitHub data downloaded:

git clone https://github.com/ryandkuster/EPP_575_RNA.git

You'll need your count file from the featureCounts step. Use Open OnDemand or scp to copy it to your local device:

Your own output:

/lustre/isaac24/proj/UTK0386/analysis/<your_folder>/05_count/combined.counts.txt

The completed copy:

/lustre/isaac24/proj/UTK0386/completed/05_counts/combined.counts.txt

scp <your_netid>@dtn2.isaac.utk.edu:/lustre/isaac24/proj/UTK0386/analysis/<your_folder>/05_count/combined.counts.txt .

If you don't have the file, you can grab it from the completed directory:

scp <your_netid>@dtn2.isaac.utk.edu:/lustre/isaac24/proj/UTK0386/completed/05_count/combined.counts.txt .

Wherever you copy the counts file, update the following line in the R script so that the path points to the folder on your computer that contains it:

setwd("~/Downloads/05_counts/")
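Before running the full script, it can help to confirm that the folder really contains the counts file. The sketch below assumes the file kept its name, combined.counts.txt, and uses the same example folder as the line above; `counts_dir` and `counts_file` are illustrative names that do not appear in the lab script.

```r
# Example folder from the lab; change this to wherever you copied the file
counts_dir <- "~/Downloads/05_counts"

# Build the full path to the counts file (path.expand resolves the "~")
counts_file <- file.path(path.expand(counts_dir), "combined.counts.txt")

if (file.exists(counts_file)) {
  # Only change the working directory once we know the file is there
  setwd(dirname(counts_file))
  message("Working directory set to: ", getwd())
} else {
  message("Could not find ", counts_file,
          "; check where you copied the counts file.")
}
```

If the second message appears, fix the path before continuing, since `read.table` in the script will otherwise fail with a "cannot open file" error.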

The R script DESeq_STAR.R can be found in EPP_575_RNA/data/R_materials/DESeq_STAR.R, or you can copy the text below into a new R script in RStudio.

DESeq_STAR.R file contents

library(DESeq2)
library(pheatmap)


setwd("~/Downloads/05_counts/")

# read the featureCounts table (a data frame for now; converted to a matrix below)
counts <- read.table("combined.counts.txt", sep="\t", header=TRUE, row.names="Geneid")

# keep only the last six columns, which hold the sample counts (assumes six samples)
counts <- counts[, (ncol(counts)-5):ncol(counts)]

sample_id <- colnames(counts)
sample_id <- sapply(strsplit(sample_id, "[.]"), "[", 1)
colnames(counts) <- sample_id
counts <- as.matrix(counts)

# create a sample dataframe where sample ids are in the SAME ORDER as counts matrix above
hours_cold <- sapply(strsplit(sample_id, "_"), "[", 2)
rep <- sapply(strsplit(sample_id, "_"), "[", 3)
samples <- data.frame(sample_id, hours_cold, rep)
rownames(samples) <- samples$sample_id
samples <- samples[, -1]

# create a DESeq object from our matrix
dds <- DESeqDataSetFromMatrix(countData = counts,
                              colData = samples,
                              design = ~ hours_cold)

# run the DESeq!
dds <- DESeq(dds)
plotDispEsts(dds)

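# regularized log transform for visualization; blind = FALSE uses the design when estimating dispersions
rld <- rlog(dds, blind = FALSE)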
plotPCA(rld, intgroup = c("hours_cold"))

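# contrast = c(factor, numerator, denominator): positive log2FoldChange means higher expression at 0h than at 3h
res <- results(dds, alpha = 0.05, contrast = c("hours_cold", "0h", "3h"))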
res_ord <- res[order(res$padj, -abs(res$log2FoldChange)),]
write.csv(res_ord, file="star_featurecounts_results.csv")
summary(res)
plotMA(res, ylim=c(-10, 10))

res_sig <- as.data.frame(res[ which(res$padj < 0.05),])
res_sig <- res_sig[order(res_sig$padj, -abs(res_sig$log2FoldChange)),]

write.csv(res_sig, file="star_featurecounts_sig_results.csv")
summary(res_sig)

# these steps can be used to find individual points on the graph
# after running "identify", click on the plot, then hit "finish" button in top right of plot
plotMA(res, ylim=c(-10, 10))
idx <- identify(res$baseMean, res$log2FoldChange)
rownames(res)[idx]


# optional volcano plot
library(apeglm)
library(EnhancedVolcano)
library(ggpubr)

EnhancedVolcano(res, x = 'log2FoldChange',
                lab = row.names(res),
                pCutoff = 1e-100,
                FCcutoff = 2,
                y = 'padj')

# create a plot for a single gene
plotCounts(dds, gene="gene-AT4G25480", intgroup="hours_cold")

# create a heatmap
# first run vst (quickly estimate dispersion trend and apply a variance stabilizing transformation)
vsd <- vst(dds)

mat <- assay(vsd)[ head(order(-abs(res$log2FoldChange)), 20), ]
mat <- mat - rowMeans(mat) 
mat <- data.frame(mat)

pheatmap(mat, cluster_rows=TRUE, show_rownames=TRUE,
         cluster_cols=TRUE)
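The column selection and sample-sheet steps near the top of the script can be tried on a small table before touching real data. The sketch below is a toy example, not part of the lab script: the gene names, counts, and sample column names (such as `s_0h_r1.bam`) are made up, and it drops the featureCounts annotation columns by name instead of keeping the last six columns, which is an alternative to the approach used in the script.

```r
# Toy stand-in for the featureCounts table (all names and values are made up)
toy <- data.frame(
  Chr = c("1", "1", "2"),
  Start = c(100, 500, 900),
  End = c(400, 800, 1200),
  Strand = c("+", "-", "+"),
  Length = c(301, 301, 301),
  s_0h_r1.bam = c(10, 0, 5),
  s_0h_r2.bam = c(12, 1, 4),
  s_3h_r1.bam = c(30, 0, 6),
  s_3h_r2.bam = c(28, 2, 7),
  row.names = c("gene-A", "gene-B", "gene-C")
)

# Drop the annotation columns by name, so the number of samples does not matter
annotation_cols <- c("Chr", "Start", "End", "Strand", "Length")
counts <- as.matrix(toy[, !(colnames(toy) %in% annotation_cols)])

# Same parsing as the lab script: strip everything after the first ".",
# then split on "_" to pull out the time point and the replicate
sample_id <- sapply(strsplit(colnames(counts), "[.]"), "[", 1)
colnames(counts) <- sample_id
hours_cold <- sapply(strsplit(sample_id, "_"), "[", 2)
replicate_id <- sapply(strsplit(sample_id, "_"), "[", 3)

samples <- data.frame(hours_cold, replicate_id, row.names = sample_id)

# DESeq2 expects the sample sheet rows in the same order as the count columns
stopifnot(identical(rownames(samples), colnames(counts)))
print(samples)
```

The final `stopifnot` is the useful part to carry over: if the sample sheet and the count matrix ever get out of order, it stops with an error instead of letting the analysis run on mislabeled samples.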

Go further!

Look up any genes with interesting patterns in the TAIR database.

https://www.arabidopsis.org/locus?name=AT4G25480
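To look up several genes at once, the row names from the results table can be turned into TAIR links. This sketch assumes the gene IDs carry the `gene-` prefix seen in the `plotCounts` call and that the URL pattern matches the example link above; `gene_ids` here is a hand-typed example, not output from the script.

```r
# Example gene ID in the format used by the count matrix row names
gene_ids <- c("gene-AT4G25480")

# Strip the "gene-" prefix to get the bare locus identifier
locus <- sub("^gene-", "", gene_ids)

# Build one TAIR locus URL per gene, following the example link
tair_urls <- paste0("https://www.arabidopsis.org/locus?name=", locus)
print(tair_urls)
```

In an interactive session, `browseURL(tair_urls[1])` opens the first link in your default browser. To use your own results, replace the example vector with something like `head(rownames(res_sig), 5)`.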

Lab 06: DESeq2 - ryandkuster/EPP_575_RNA_25 GitHub Wiki
