Factor_Level_Summary.R Tutorial - sciencesharon/MicrobialSeq GitHub Wiki

Factor_Level_Summary.R Tutorial

Ever wanted to make a really nice demographic summary table for publication?

One that perhaps looks like this:

This is the script for you!

It takes metadata with samples in rows and character data with factor levels in columns.
For each input column, it returns the count (number of samples) of each factor level per group.
It can be used to compare the factor level characteristics of control and experimental groups.
It also tests for significant differences between the input groups using Fisher's exact test and returns the p-values.
The results are written directly to a .csv file.
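
To make the counting and testing step concrete, here is a minimal base-R sketch of the same idea on a toy data frame. It is not the script's actual implementation, and the column names (`Group`, `Sex`) and values are made up for illustration.

```r
# Toy metadata: samples in rows, character columns holding factor levels.
# Column names and values are hypothetical, for illustration only.
toy <- data.frame(
  Group = c("Control", "Control", "Control", "Treated", "Treated", "Treated"),
  Sex   = c("F", "F", "M", "M", "M", "F"),
  stringsAsFactors = TRUE
)

# Count samples per factor level (rows) within each group (columns).
counts <- table(toy$Sex, toy$Group)
print(counts)

# Fisher's exact test on the contingency table gives a p-value for the
# association between factor level and group.
p_value <- fisher.test(counts)$p.value
print(p_value)
```

The script does this kind of tabulation for every column and group you pass in, so you do not need to build the tables by hand.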

library(dplyr)  # provides %>%, mutate(), across() and all_of() used below

metadata <- read.csv("/path/to/your/file/Example_Metadata.csv")

character_cols <- colnames(metadata)[1:5]

metadata <- metadata %>% mutate(across(all_of(character_cols), as.factor))
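
Before going further, it can help to confirm that the conversion to factors worked. The sketch below uses a toy data frame (hypothetical column names and values) and a base-R equivalent of the `mutate(across(...))` call, so it runs without dplyr.

```r
# Toy stand-in for `metadata`; columns and values are hypothetical.
toy <- data.frame(
  Fruit = c("Cherry", "Banana", "Cherry"),
  Sex   = c("F", "M", "F"),
  Age   = c(3, 5, 4)
)

# Convert only the chosen columns to factors (base-R equivalent of
# mutate(across(all_of(character_cols), as.factor))).
character_cols <- colnames(toy)[1:2]
toy[character_cols] <- lapply(toy[character_cols], as.factor)

# Check which columns are now factors and inspect one column's levels.
print(sapply(toy, is.factor))
print(levels(toy$Fruit))
```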

Cherry <- metadata[metadata$Fruit %in% c("Cherry"),]

Banana <- metadata[metadata$Fruit %in% c("Banana"),]

Apple <- metadata[metadata$Fruit %in% c("Apple"),]

Date <- metadata[metadata$Fruit %in% c("Date"),]

Elderberry <- metadata[metadata$Fruit %in% c("Elderberry"),]

datasets <- list(Cherry, Banana, Apple, Date, Elderberry)

dataset_names <- c("Cherry", "Banana", "Apple", "Date", "Elderberry")
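
When there are many groups, writing one subset line per group gets repetitive. As a sketch of an alternative (not part of the original script), base R's `split()` builds a named list with one data frame per level of the grouping column. The toy data frame below is an assumption standing in for `metadata`.

```r
# Toy stand-in for `metadata`; values are hypothetical.
toy_metadata <- data.frame(
  Fruit  = c("Cherry", "Banana", "Apple", "Cherry", "Banana"),
  Colour = c("Red", "Yellow", "Green", "Red", "Green")
)

# One data frame per level of Fruit, returned as a named list.
datasets <- split(toy_metadata, toy_metadata$Fruit)

# The list names double as the dataset names.
dataset_names <- names(datasets)
print(dataset_names)
```

Note that `split()` orders the groups alphabetically rather than in the order you would type them. Whether `factor_levels_summaries` accepts a named list has not been checked here; `unname(datasets)` gives a plain list like the one built by hand.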

columns <- colnames(metadata)[1:5]

file_path <- "/path/to/summary.csv"

summary <- factor_levels_summaries(datasets = datasets, dataset_names = dataset_names, columns = columns, file_path = file_path)

You can easily format the output table in Excel or LibreOffice to look like the image above: