Basic Statistics in R

This section covers basic aspects of statistical analysis that students encounter again and again in the course of their work. Most of the time, the goal is to compare two or more groups to determine whether there is a difference or an association with respect to a specific parameter or variable. But it isn't only about that. It is also important to understand a few things about the data you are analysing. What is your dataset made of? What do you expect? What do you want to know? What do you already know about the data?

Two-sample analysis

It's now time for things to get serious: we have more than one sample! Actually two! But what do we want to do with these two samples? Compare their variances? Their means? And what about correlating two variables? Here is a series of useful tests and functions in R to take care of these two groups of data.

Learn about: Fisher's F test, Student's t-test, Wilcoxon's rank test, Pearson's correlation coefficient, Spearman's rank correlation, χ² test (chi-square test), ...
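
As a rough illustration of the tests listed above, here is a minimal sketch using base R functions on simulated data. The object names (group_a, group_b, x, y, counts) and all the numbers are made up for the example; they are assumptions, not data from this section.

```r
# Simulated data: all names and values below are illustrative assumptions
set.seed(42)
group_a <- rnorm(30, mean = 10, sd = 2)
group_b <- rnorm(30, mean = 11, sd = 2)

# Fisher's F test: compares the variances of the two samples
f_res <- var.test(group_a, group_b)

# Student's t-test: compares the means of the two samples
# (var.equal = TRUE gives the classic Student version; R's default is Welch's)
t_res <- t.test(group_a, group_b, var.equal = TRUE)

# Wilcoxon rank-sum test: non-parametric comparison of the two samples
w_res <- wilcox.test(group_a, group_b)

# Correlation between two continuous variables
x <- rnorm(30)
y <- 0.5 * x + rnorm(30, sd = 0.5)
pearson_res  <- cor.test(x, y, method = "pearson")
spearman_res <- cor.test(x, y, method = "spearman")

# Chi-square test on a 2 x 2 contingency table of counts
counts <- matrix(c(20, 15, 10, 25), nrow = 2,
                 dimnames = list(treatment = c("control", "treated"),
                                 outcome   = c("yes", "no")))
chi_res <- chisq.test(counts)

# Each result is an "htest" object; the p-value is stored in $p.value
print(c(F = f_res$p.value, t = t_res$p.value, wilcoxon = w_res$p.value,
        pearson = pearson_res$p.value, spearman = spearman_res$p.value,
        chisq = chi_res$p.value))
```

Which test is appropriate depends on the data (normality, equal variances, paired or independent samples), so this should be read as a tour of the function calls rather than a recommended workflow.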

ANalysis Of VAriance

Analysis of variance (a.k.a. ANOVA) is used to compare the means of two or more groups. Unsurprisingly, the way ANOVA works is by comparing variances (hence the name Analysis of Variance…). The explanatory variables that define the groups must be categorical, and will often be called factors; the response variable is continuous. There are several designs for ANOVA, depending on the number of factors involved and on whether samples are measured several times during the course of an experiment.

Learn about: one-way ANOVA, factorial design, repeated measures, ...
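
To make the idea of factors and designs concrete, here is a minimal sketch of a one-way and a two-factor (factorial) ANOVA with the base R function aov(). The data frame dat and its columns diet, sex and weight are hypothetical and simulated for the example.

```r
# Simulated data: column names and values are illustrative assumptions
set.seed(1)
dat <- data.frame(
  diet = factor(rep(c("A", "B", "C"), each = 10)),  # factor with 3 levels
  sex  = factor(rep(c("F", "M"), times = 15))       # factor with 2 levels
)
# Continuous response whose mean depends on the diet
dat$weight <- rnorm(30, mean = rep(c(20, 22, 25), each = 10), sd = 2)

# One-way ANOVA: one factor (diet)
one_way <- aov(weight ~ diet, data = dat)
print(summary(one_way))

# Factorial design: two factors and their interaction (diet * sex)
two_way <- aov(weight ~ diet * sex, data = dat)
print(summary(two_way))
```

The formula is where the design is declared: weight ~ diet fits a single factor, while diet * sex expands to both main effects plus their interaction.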

Post Hoc Tests

After an F-test (one-way ANOVA, two-way ANOVA, ...) has indicated that there is a difference between the group means, post hoc tests help determine which group means are significantly different from each other. Such tests are never to be used when the F-test does not show the existence of significant differences. Note that some parameters in post hoc tests must be chosen carefully, based on the experimental design, to avoid false positives.

Learn about: pairwise t-tests, Tukey HSD, ...
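
The sequence described above (F-test first, post hoc tests only if it is significant) could be sketched as follows with the base R functions pairwise.t.test() and TukeyHSD(). The data are invented so that the groups clearly differ, and the choice of the Holm adjustment and of a 0.05 threshold are assumptions made for the example, not recommendations from this section.

```r
# Made-up data: three diets, five measurements each (illustrative values only)
dat <- data.frame(
  diet   = factor(rep(c("A", "B", "C"), each = 5)),
  weight = c(19, 20, 21, 20, 20,
             24, 25, 26, 25, 25,
             30, 31, 32, 31, 31)
)

# Step 1: one-way ANOVA and the p-value of its F-test
fit <- aov(weight ~ diet, data = dat)
p_overall <- summary(fit)[[1]][["Pr(>F)"]][1]

# Step 2: post hoc tests, run only if the F-test is significant
pw <- NULL
tk <- NULL
if (p_overall < 0.05) {
  # Pairwise t-tests with p-values adjusted for multiple comparisons
  pw <- pairwise.t.test(dat$weight, dat$diet, p.adjust.method = "holm")
  # Tukey's Honest Significant Differences on the fitted model
  tk <- TukeyHSD(fit)
  print(pw)
  print(tk)
}
```

The p.adjust.method argument is one example of a parameter that affects the false positive rate: leaving p-values unadjusted ("none") makes spurious differences more likely as the number of pairwise comparisons grows.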

When biology adds up, at last…
