Handling matched data

Matched data

In contrast to independent measurements, matched data consist of measurements that should be considered together. For example, matching can be used in clinical studies. Here, patients that exhibit similar characteristics are paired in order to remove confounding effects. Matched data can also arise naturally when multiple measurements are performed on the same entity. For example, matched data can arise when a clinical marker is measured once before and once after a treatment intervention. Irrespective of how the matched data were generated, their structure should be taken into account through the use of appropriate statistical tests.

Paired data

The most common type of matched data are paired measurements, which consist of two data points. For this type of data, the following significance tests are available:

Type of dependent variable	Tests
Quantitative	Paired t-test, Wilcoxon signed rank test
Categorical	McNemar’s test

Click on a variable type in the table to obtain more information on how to use the corresponding significance tests in R.

Repeated-measures data

If you have more than two matched measurements, then you are dealing with repeated-measures data. An example of a significance test that handles such data is repeated-measures one-way ANOVA.

Posts that deal with matched data

In the following posts, you can find more specific information on how you can handle matched data.

McNemar’s test is a non-parametric test for contingency tables that arise from paired measurements. In contrast to the chi-squared test, which is a test for independence, McNemar’s test is a test for symmetry (also called marginal homogeneity). Still, McNemar’s test is related to the chi-squared test because its test static also follows a chi-squared distribution.

Testing Significance on Paired Measurements: What Can Go Wrong?

In a previous post, I’ve contrasted two tests for pairs of measurements. Here, I’d like to show why it is important to choose a test that appropriately accounts for such dependent measurements.

Wilcoxon Signed Rank Test vs Paired Student's t-test

In this post, we will explore tests for comparing two groups of dependent (i.e. paired) quantitative data: the Wilcoxon signed rank test and the paired Student’s t-test. The critical difference between these tests is that the test from Wilcoxon is a non-parametric test, while the t-test is a parametric test. In the following, we will explore the ramifications of this difference.