Tiber Tutor

definitions

IB Maths AA Topic 4 Definitions

This page contains our IB Maths AA definitions for topic 4. By learning each one of these definitions, you will fully cover the content for IB Maths AA 'Stats & Probability'.

0

bias

A tendency in the sampling or data collection process that favours certain outcomes or groups, so the results are not representative.

bivariate

Involving two numerical variables measured together as paired values, usually written as (x,y)(x,y).

class

A range of values used to group continuous data, usually written as an interval such as '40x<5040\leq x\lt 50'.

complement

The event that AA does not happen, written as AA', with probability P(A)=1P(A)P\left(A'\right)=1-P\left(A\right).

conditional probability

The probability that one event happens given that another event has already happened, written as
P(AB)=P(AB)P(B)P(A \mid B) = \frac{P(A \cap B)}{P(B)} provided that P(B)0P(B) \neq 0.

continuous

Able to take any value within an interval.

correlation

A description of the direction and strength of the relationship between two variables, as seen in a scatter diagram.

cumulative

A running total found by adding successive values as you move through a table or list.

discrete

Taking separate, countable values, usually whole numbers.

dispersion

The extent to which data values vary about a central value, quantified using measures such as interquartile range, standard deviation, and variance.

event

A set of outcomes from the sample space.

exclusive

Describes events that cannot happen at the same time, so P(AB)=0P\left(A\cap B\right)=0 and P(AB)=P(A)+P(B)P\left(A\cup B\right)=P\left(A\right)+P\left(B\right).

extrapolation

Using a regression line to predict a value that lies outside the range of the observed data, which is generally less reliable.

frequency

A table that shows how often each value (discrete) or each class interval (continuous) occurs.

gradient

The coefficient aa in y=ax+by=ax+b, giving the predicted change in yy when xx increases by 11.

histogram

A diagram for grouped continuous data in which each class interval is shown by a touching bar, allowing the shape and spread of the distribution to be seen.

independent probability

Describes events or random variables where knowing one outcome gives no information about the other; for Poisson variables this condition allows their totals to remain Poisson.

interpolation

Estimating a value at an unknown point using known values at nearby points; in nearest neighbour interpolation, the estimate is taken from the closest site.

interquartile

Referring to the middle 50%50\% of the data, between the lower quartile Q1Q_1 and upper quartile Q3Q_3.

interquartile range

Measures the spread of the middle 50%50\% of an ordered data set, calculated as the difference between the upper and lower quartiles: IQR=Q3Q1IQR=Q_3-Q_1.
HL

joint probability

Describing the probability that two events occur together, written as P(AB)P(A\cap B).

mean

The expected value of a random variable; if XPoisson(λ)X \sim \mathrm{Poisson}(\lambda), then E(X)=λE(X) = \lambda

median

The value mm such that mf(x)dx=12\int_{-\infty}^{m} f(x)\,dx=\frac{1}{2}, so half the total area under the density curve lies on each side.

modal

The class interval with the highest frequency in grouped data (used when class intervals are equal).
HL

mode

The value of xx where the probability density function has its greatest value.

negative correlation

Describing correlation where points tend to fall from left to right as xx increases.

outcome

One possible result of a trial.

outlier

A value unusually far from the rest of the data; in this course, one that is below Q11.5×IQRQ_1-1.5\times IQR or above Q3+1.5×IQRQ_3+1.5\times IQR, where IQR=Q3Q1IQR=Q_3-Q_1.
HL

partition

A collection of events (such as A1A_1, A2A_2, A3A_3) that are mutually exclusive and whose union is the whole sample space, so exactly one must occur.

Pearson's

Referring to the product-moment correlation coefficient rr, a statistic that measures the strength and direction of a linear relationship and lies between 1-1 and 11.

percentiles

Values that split ordered data into 100100 equal parts; the ppth percentile has about pp% of the data below it and can be estimated from a cumulative frequency graph.

population

The full set of all possible values of interest, described by parameters such as the mean 'μ\mu' and variance 'σ2\sigma^2'.

positive correlation

Describing a correlation where the points in a scatter diagram tend to rise from left to right.
HL

prior probability

The probability assigned to an event before any new information is taken into account, often written as P(A)P(A).

probability

Measures how likely an event is to happen, taking a value between 00 and 11, where 00 means impossible and 11 means certain.

HL

probability density function

A function f(x)f(x) used to describe a continuous random variable, where f(x)0f(x)\geq0 and f(x)dx=1\int_{-\infty}^{\infty} f(x)\,dx=1, so probabilities correspond to areas under its graph.

quartiles

Values that split ordered data into four equal parts; on a cumulative frequency graph with nn values, they are estimated at cumulative frequencies n4\frac{n}{4} and 3n4\frac{3n}{4}.

random

Describing an outcome that cannot be predicted with certainty in advance, even though long-run patterns may be modelled using probabilities.

range

The set of possible outputs a function can produce.

regression

A method for modelling a linear relationship with a line used to make predictions from bivariate data.

sample

A set of 'nn' observed values taken from a population and used to compute statistics such as 'xˉ\bar{x}' and 'sn12s_{n-1}^2'.

spread

How dispersed the data values are, described using measures of dispersion such as interquartile range, standard deviation, and variance.

standard deviation

The square root of the variance, σ=Var(X)\sigma=\sqrt{\mathrm{Var}(X)}, giving spread in the same units as the random variable.
HL

theorem

A mathematical result that can be proved logically from definitions and earlier results, then used to solve problems.
HL

tree

A connected graph with no cycles; if it has nn vertices then it has exactly n1n-1 edges.
HL

updated probability

Describing a probability that has been revised after learning that another event has occurred, often written as P(AB)P(A|B).

variance

Measures spread using the mean of squared distances from the mean, so values far from the mean have greater influence; measured in squared units and equal to the square of the standard deviation.

Next Up

You have completed the topic 4 definitions for IB Maths AA - continue with related resources below or explore the full IB Maths AA course from the IBO.

Other topic 4 resources