Summary of "Correlation | Karl Pearson’s coefficient of correlation | Multiple correlation | Biostatistics"

Topic

Correlation — definition and types — and methods to calculate Karl Pearson’s coefficient of correlation; brief note on multiple correlation.

Key lessons

Definitions and types

Methods to compute Karl Pearson’s coefficient

1) Actual‑Mean Method

When to use:

Formula:

r = Σ[(xi − x̄)(yi − ȳ)] / sqrt[ Σ(xi − x̄)² × Σ(yi − ȳ)² ]

Steps:

  1. List paired data (xi, yi).
  2. Compute x̄ = (Σxi)/n and ȳ = (Σyi)/n.
  3. For each pair compute deviations: (xi − x̄) and (yi − ȳ).
  4. Compute the product for each pair: (xi − x̄)(yi − ȳ).
  5. Sum those products → numerator = Σ[(xi − x̄)(yi − ȳ)].
  6. Compute squared deviations for x and y separately: Σ(xi − x̄)² and Σ(yi − ȳ)².
  7. Denominator = sqrt[Σ(xi − x̄)² × Σ(yi − ȳ)²].
  8. Divide numerator by denominator to get r.

Worked example (from lecture):

2) Assumed‑Mean Method

When to use:

Basic idea:

Formula (using coded deviations):

r = [ n·Σ(dx·dy) − (Σdx)(Σdy) ] / sqrt{ [ n·Σ(dx²) − (Σdx)² ] × [ n·Σ(dy²) − (Σdy)² ] }

Steps:

  1. Choose assumed means a and b (central values) so dx and dy are small integers.
  2. For each observation compute dx = xi − a and dy = yi − b.
  3. Compute Σdx, Σdy, Σ(dx·dy), Σ(dx²), Σ(dy²), and note n.
  4. Substitute into the coded‑deviation form of Pearson’s formula.
  5. Compute numerator and denominator, then r.

Tips:

3) Multiple correlation (brief)

Additional practical tips and pedagogical points

Resources mentioned

Speakers / sources

Category ?

Educational


Share this summary


Is the summary off?

If you think the summary is inaccurate, you can reprocess it with the latest model.

Video