Statistical Recipes in Python - mauriceling/mauriceling.github.io GitHub Wiki

Descriptive Statistics

Methods to summarize a given set of data.

Methods to examine whether a set of values are normally distributed.

Methods to examine whether two or more sets of values have the same variance (or standard deviation).

Methods to examine the trend of 2 sets of values.

Kendall's tau: correlation measure for ordinal data
Pearson's correlation coefficient: correlation measure for normally distributed data
Point biserial correlation coefficient: correlation measure between a binary variable and a continuous variable
Somer's D: asymmetric measure of ordinal association
Spearman's rank correlation coefficient: correlation measure for ranked data

Statistical tests assuming normality in data set(s).

Statistical tests without assuming normality in data set(s), also known as distribution-free tests.

Brunner-Munzel test: non-parametric version of 2-samples (independent samples) t-test without assuming equal variances
Chi-Square test: test whether 2 distributions are equal
Cramér-von Mises test: test whether 2 distributions are equal
Cressie-Read power divergence test: test whether 2 distributions are equal
Epps-Singleton test: test whether 2 distributions are equal
Freeman-Tukey test: test whether 2 distributions are equal
G-test: test whether 2 distributions are equal
Kolmogorov-Smirnov test: test whether 2 distributions are equal
Kruskal-Wallis H-test: non-parametric version of ANOVA - One-way
Mann-Whitney U test: non-parametric version of 2-samples (independent samples) t-test assuming equal variances
Neyman's test of goodness of fit: test whether 2 distributions are equal
Page's L test: measure of trend in observations between treatments
Wilcoxon rank-sum test: non-parametric version of 2-samples (independent samples) t-test assuming equal variances
Wilcoxon signed-rank test: non-parametric version of t-test - paired (dependent samples)

Methods to analyze 2x2 and MxN contingency tables.

Barnard exact test: test whether variable for columns is independent to the variable for rows on 2x2 contingency table
Boschloo’s exact test: test whether variable for columns is independent to the variable for rows on 2x2 contingency table
Chi-square test of independence: test whether variable for columns is independent to the variable for rows on MxN contingency table
Cramer’s V: measure the degree of association between two nominal variables on MxN contingency table
Fisher exact test: test whether variable for columns is independent to the variable for rows on 2x2 contingency table
G-test of independence: test whether variable for columns is independent to the variable for rows on MxN contingency table
Pearson’s contingency coefficient: measure the degree of association between two nominal variables on MxN contingency table
Relative risk: test whether exposure increases the risk of an outcome using 2x2 contingency table
Odds ratio: test whether exposure increases the odds of an outcome using 2x2 contingency table
Tschuprow’s T: measure the degree of association between two nominal variables on MxN contingency table

Methods to combine p-values from independent tests on the same hypothesis.