Assignment #2 ‐ Differential Gene expression and Preliminary ORA
Differential Gene expression and Preliminary ORA
Date: March 05, 2024
These notes are not included in the final submission. For a coherent account of the data, please refer to the HTML file generated for this assignment.
Links to Assignment 1
The R Markdown file: https://github.com/bcb420-2024/Anna_Lai/blob/main/A1_AnnaLai.Rmd
The HTML file: https://github.com/bcb420-2024/Anna_Lai/blob/main/A1_AnnaLai.html
Data annotation update
There are 9 conditions with 3 biological replicates each, for a total of 27 samples.
Differential Gene expression
The second model I tried included both the experimental condition and the cell line. However, the number of combinations is too large: there are already 9 different conditions. Fitting the model produced the following error and warnings:
Coefficients not estimable: exp_groups$sample_groupR168H.RG.Un
Warning: Partial NA coefficients for 15865 probe(s)
Warning: number of items to replace is not a multiple of replacement length
Warning: Estimation of var.prior failed - set to default value
Hence, I used only the condition group as the grouping factor for further analysis.
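One possible reading of the "Coefficients not estimable" message is that the cell line is already encoded in the condition group, so adding it as a second factor makes the design matrix rank-deficient. The Python sketch below illustrates that idea on a made-up layout. The names `lineA`, `lineB`, `lineC`, `t1`, `t2`, `t3` and the 3 x 3 split of the 9 conditions are assumptions for illustration only, not the actual sample annotation, and this is not the R code used in the assignment.

```python
from fractions import Fraction


def matrix_rank(rows):
    """Rank of a matrix via Gaussian elimination with exact fractions."""
    m = [[Fraction(x) for x in row] for row in rows]
    rank = 0
    for col in range(len(m[0])):
        pivot = next((r for r in range(rank, len(m)) if m[r][col] != 0), None)
        if pivot is None:
            continue  # no pivot here: this column depends on earlier ones
        m[rank], m[pivot] = m[pivot], m[rank]
        for r in range(rank + 1, len(m)):
            factor = m[r][col] / m[rank][col]
            m[r] = [a - factor * b for a, b in zip(m[r], m[rank])]
        rank += 1
    return rank


def indicator_columns(labels, drop_first=False):
    """One 0/1 column per factor level; optionally drop the first level."""
    levels = sorted(set(labels))
    if drop_first:
        levels = levels[1:]
    return [[int(label == level) for level in levels] for label in labels]


# Hypothetical layout: 3 cell lines x 3 treatments = 9 groups, 3 replicates each.
cell_lines = ["lineA", "lineB", "lineC"]
treatments = ["t1", "t2", "t3"]
samples = [
    (line, f"{line}.{trt}")
    for line in cell_lines
    for trt in treatments
    for _ in range(3)
]
line_labels = [s[0] for s in samples]
group_labels = [s[1] for s in samples]

# Model 1: one column per condition group (no intercept).
design_group = indicator_columns(group_labels)

# Model 2: the same group columns plus cell-line columns.
line_cols = indicator_columns(line_labels, drop_first=True)
design_both = [g + l for g, l in zip(design_group, line_cols)]

print(len(samples), len(design_group[0]), matrix_rank(design_group))  # 27 9 9
print(len(design_both[0]), matrix_rank(design_both))                  # 11 9
```

In this toy layout the second design has 11 columns but still only rank 9, because each cell-line column is the sum of that line's group columns. Two coefficients therefore cannot be estimated, which is the same kind of situation the message above describes.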
Threshold over-representation analysis
I used g:Profiler. There is an R package for it, but I did not use it; I completed the ORA on the web platform instead.
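For reference, the statistic behind a thresholded ORA is a one-sided hypergeometric test: given a gene universe, a term's gene set, and a thresholded query list, it asks how likely an overlap at least as large as the observed one would be by chance. The sketch below is a minimal Python illustration of that test only. The function name `ora_p_value` and all the numbers are hypothetical, and it omits the multiple-testing correction that g:Profiler applies on the web platform.

```python
from math import comb


def ora_p_value(n_universe, n_term, n_query, n_overlap):
    """P(overlap >= n_overlap) when n_query genes are drawn at random
    from a universe of n_universe genes, n_term of which carry the term."""
    upper = min(n_term, n_query)
    total = comb(n_universe, n_query)
    tail = sum(
        comb(n_term, k) * comb(n_universe - n_term, n_query - k)
        for k in range(n_overlap, upper + 1)
    )
    return tail / total


# Hypothetical example: 20 genes in the universe, 5 annotated to a term,
# 5 genes pass the threshold, and all 5 of them carry the term.
p = ora_p_value(n_universe=20, n_term=5, n_query=5, n_overlap=5)
print(p)  # 1 / 15504, about 6.45e-05
```

A small p-value means the term is over-represented in the thresholded list relative to the chosen universe, so the choice of universe (for example, all genes that passed filtering) directly affects the result.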
Interpretation
Please refer to the RNotebook.
Links to Assignment 2
The R Markdown file: https://github.com/bcb420-2024/Anna_Lai/blob/main/A2_AnnaLai.Rmd
The HTML file: https://github.com/bcb420-2024/Anna_Lai/blob/main/A2_AnnaLai.html
Citations
For the RNotebook, I used the research aid application Zotero to generate the bib file, which I highly recommend.
- Dorison A, Ghobrial I, Graham A, Peiris T et al. Kidney Organoids Generated Using an Allelic Series of NPHS2 Point Variants Reveal Distinct Intracellular Podocin Mistrafficking. J Am Soc Nephrol 2023 Jan 1;34(1):88-109. PMID: 36167728