Probability and Statistics

⌊x⌋ = the greatest integer not greater than x (floor)

sequence

index variable

explicit formulas

recurrence relation

arithmetic sequences

geometric sequences

bounded above/below

(un)bounded

convergent ⇒ bounded

bounded, monotone ⇒ convergent

series

convergent vs divergent series

harmonic series: ∑_{n=1}^∞ 1/n is divergent

p-series: ∑_{n=1}^∞ 1/n^p is convergent ⇔ p > 1

convergent ⇒ lim_{n→∞} a_n = 0

lim_{n→∞} a_n ≠ 0 ⇒ divergent

tests

divergence test

partial sum: s_n = a_1 + a_2 + … + a_n

geometric: ∑_{n=0}^∞ a q^n convergent ⇔ |q| < 1, with sum a/(1 − q)

integral: f continuous, positive, ultimately decreasing, f(n) = a_n ⇒ ∑ a_n and ∫_1^∞ f(x) dx converge or diverge together

comparison: 0 ≤ a_n ≤ b_n ∀ n > n_0, ∑ b_n convergent ⇒ ∑ a_n convergent; ∑ a_n divergent ⇒ ∑ b_n divergent

scaling by a positive constant, or limit comparison (a_n/b_n → c with 0 < c < ∞), gives the same conclusion

alternating: |a_{n+1}| ≤ |a_n| ∀ n > n_0 and lim_{n→∞} a_n = 0 ⇒ convergent

absolute convergence: ∑ |a_n| convergent

conditional convergence: ∑ a_n convergent, but ∑ |a_n| divergent

∑ |a_n| convergent ⇒ ∑ a_n convergent

ratio: L = lim_{n→∞} |a_{n+1}/a_n|; L < 1 ⇒ absolutely convergent, L > 1 ⇒ divergent, L = 1 inconclusive

root: L = lim_{n→∞} |a_n|^{1/n}; same conclusions as the ratio test
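A quick numeric sketch of these tests (my addition, plain Python): partial sums of ∑ 1/n keep growing while ∑ 1/n² settles, and the ratio a_{k+1}/a_k for ∑ k/2^k approaches 1/2 < 1.

```python
# Sketch: watching partial sums and the ratio-test limit numerically.
def partial_sum(a, n):
    """s_n = a(1) + ... + a(n)."""
    return sum(a(k) for k in range(1, n + 1))

harmonic = lambda k: 1.0 / k          # p = 1: divergent
p2       = lambda k: 1.0 / k**2       # p = 2: convergent (to pi^2/6)
geo_like = lambda k: k / 2.0**k       # ratio test: a_{k+1}/a_k -> 1/2 < 1

for n in (10, 100, 1000, 10000):
    print(n, partial_sum(harmonic, n), partial_sum(p2, n))

# ratio-test limit estimate for sum k/2^k
for k in (10, 20, 40):
    print(k, geo_like(k + 1) / geo_like(k))   # approaches 0.5
```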

uniform convergence:

f_n(x) converges to f(x) pointwise: f_n(x) → f(x) as n → ∞ for each fixed x

N(ε) independent of x: ∀ n > N and ∀ x, |f_n(x) − f(x)| < ε ⇔ uniform convergence: f_n ⇉ f

dominated convergence: f_i(x) → f(x) as i → ∞, ∃ g integrable with |f_i(x)| ≤ g(x) ∀ x, i ⇒ lim_{i→∞} ∫_a^b f_i(x) dx = ∫_a^b f(x) dx

power series: ∑_{n=0}^∞ c_n (x − a)^n

center a, coefficients c_n

either: convergent only at x = a,

or ∀x ∈ ℝ, convergent

or ∃ R > 0, |x − a| < R ⇒ convergent, |x − a| > R ⇒ divergent

method: ratio test on the terms, R = lim_{n→∞} |c_n / c_{n+1}| = radius of convergence

|x − a| < R ⇒ absolutely convergent, |x − a| > R ⇒ divergent, from the ratio test

Bessel function of the first kind: J_0(x) = ∑_{n=0}^∞ (−1)^n x^{2n} / (2^{2n} (n!)²), convergent ∀ x ∈ ℝ

basic formula: 1/(1 − x) = ∑_{n=0}^∞ x^n for |x| < 1

pull out factors, create powers of (x − a), rewrite part of the expression in the form 1/(1 − u), expand it as a geometric series, then state the range of convergence

Taylor and Maclaurin Series: f(x) = ∑_{n=0}^∞ (f^{(n)}(a)/n!)(x − a)^n; Maclaurin: a = 0

for |x| < 1

for |x| < 1

differentiation and integration: term by term inside the radius of convergence, (∑ c_n(x − a)^n)′ = ∑ n c_n (x − a)^{n−1}, ∫ ∑ c_n(x − a)^n dx = C + ∑ c_n (x − a)^{n+1}/(n + 1), same R

may lose convergence at the endpoints of the interval

Cauchy product: (∑_{n=0}^∞ a_n)(∑_{n=0}^∞ b_n) = ∑_{n=0}^∞ c_n, where c_n = ∑_{k=0}^n a_k b_{n−k}
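Sketch (my addition): the Maclaurin polynomials T_N(x) = ∑_{n=0}^N x^n/n! of e^x illustrate the Taylor formula above; the error shrinks as N grows.

```python
# Sketch: Maclaurin polynomial of exp(x) compared with math.exp.
import math

def maclaurin_exp(x, N):
    """T_N(x) = sum_{n=0}^{N} x^n / n!"""
    return sum(x**n / math.factorial(n) for n in range(N + 1))

x = 1.5
for N in (2, 4, 8, 16):
    approx = maclaurin_exp(x, N)
    print(N, approx, abs(approx - math.exp(x)))   # error -> 0 as N grows
```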

probability

sample space S: nonempty set

collection of events: subsets of S, i.e. a subcollection of the power set P(S)

probability measure: P(A) ∈ [0, 1], P(∅) = 0, P(S) = 1

P(A_1 ∪ A_2 ∪ …) = P(A_1) + P(A_2) + … if A_1, A_2, … are mutually disjoint

symmetric difference: AΔB = (A ∖ B) ∪ (B ∖ A)

complement: A^c := S ∖ A

disjoint: A ∩ B = ∅ ⇔ A ⊆ B^c; in general A ∖ B = A ∩ B^c

De Morgan Laws: (A ∪ B)c = Ac ∩ Bc, (A ∩ B)c = Ac ∪ Bc

A = B ⇔ A ⊆ B and B ⊆ A

cardinality: |A|= # of elements in A

inclusion-exclusion principle: |A ∪ B| = |A| + |B| − |A ∩ B|, |A ∪ B ∪ C| = |A| + |B| + |C| − |A ∩ B| − |A ∩ C| − |B ∩ C| + |A ∩ B ∩ C|
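A tiny sanity check of inclusion-exclusion on concrete finite sets (illustrative sketch only, my addition):

```python
# Sketch: |A ∪ B ∪ C| = |A|+|B|+|C| - |A∩B| - |A∩C| - |B∩C| + |A∩B∩C|
A = {1, 2, 3, 4}
B = {3, 4, 5, 6}
C = {1, 4, 6, 7}

lhs = len(A | B | C)
rhs = (len(A) + len(B) + len(C)
       - len(A & B) - len(A & C) - len(B & C)
       + len(A & B & C))
print(lhs, rhs, lhs == rhs)   # both 7, True
```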

partition of the sample space: B_i disjoint, S = B_1 ∪ B_2 ∪ …

P(A) = P(A ∩ B_1) + P(A ∩ B_2) + …

P(A ∖ B) = P(A ∩ B^c) = P(A) − P(A ∩ B); equals P(A) − P(B) if B ⊆ A

not necessarily disjoint: P(A_1 ∪ A_2 ∪ …) ≤ P(A_1) + P(A_2) + … (union bound)

independent: P(A ∩ B) = P(A)P(B) for two events; for more events, P(A_{i_1} ∩ … ∩ A_{i_k}) = P(A_{i_1})⋯P(A_{i_k}) for every finite subcollection

uniform probability: P(A) = |A| / |S| for finite S with equally likely outcomes

continuous uniform distribution: P([x, y]) = (y − x)/(b − a) for [x, y] ⊆ [a, b]

combinatorics: strings of length k over n symbols ⇒ n^k possibilities

ordered subset: # of ordered k-subsets of an n-set = n(n − 1)⋯(n − k + 1) = n!/(n − k)!

binomial: C(n, k) = n!/(k!(n − k)!) = # of k-element subsets of an n-element set

generating function: G(x) = ∑_n a_n x^n

exponential generating function: ∑_n a_n x^n / n!

Bayes’ theorem: P(B_i | A) = P(A | B_i) P(B_i) / ∑_j P(A | B_j) P(B_j) for a partition {B_j}; in particular P(B | A) = P(A | B) P(B) / P(A)
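Worked Bayes example with hypothetical disease-testing numbers (values assumed, not from the notes):

```python
# Sketch: Bayes' theorem P(D|+) = P(+|D) P(D) / P(+), with P(+) from total probability.
p_d       = 0.01   # prior P(D)              (assumed value)
p_pos_d   = 0.95   # P(+ | D)                (assumed value)
p_pos_not = 0.05   # P(+ | D^c)              (assumed value)

p_pos = p_pos_d * p_d + p_pos_not * (1 - p_d)   # law of total probability
p_d_pos = p_pos_d * p_d / p_pos                 # posterior
print(round(p_d_pos, 4))   # ~0.161: a positive test is still far from certain
```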

random variable: a function X : S → ℝ, e.g. X(outcome) = 2

discrete: X takes values in a finite or countable set

PMF/PDF probability mass/density function: p_X(x) = P(X = x) in the discrete case

law of total probability: P(A) = ∑_i P(A | B_i) P(B_i) for a partition {B_i}

Bernoulli(p): p_X(1) = p, p_X(0) = q = 1 − p

Binomial B(n, p): p_X(k) = C(n, k) p^k q^{n−k}, k = 0, 1, …, n, E(X) = np, var(X) = npq

r_X(t) = E(t^X) = (pt + q)^n

geometric: p_X(k) = p q^k, k = 0, 1, 2, …

negative-binomial(r, p): p_X(k) = C(k + r − 1, k) p^r q^k, k = 0, 1, 2, … (# of failures before the r-th success)

Poisson(λ): p_X(k) = e^{−λ} λ^k / k!, k = 0, 1, 2, …

E(X) = var(X) = λ

hypergeometric(N, K, n): p_X(k) = C(K, k) C(N − K, n − k) / C(N, n)
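Sketch (assumes scipy is installed) comparing a few of the PMFs and moments above with scipy.stats:

```python
# Sketch: check binomial, Poisson and hypergeometric formulas against scipy.stats.
from math import comb, exp, factorial
from scipy import stats

n, p, lam, k = 10, 0.3, 2.5, 4
q = 1 - p

# binomial PMF, mean, variance
print(comb(n, k) * p**k * q**(n - k), stats.binom.pmf(k, n, p))
print(n * p, n * p * q, stats.binom.mean(n, p), stats.binom.var(n, p))

# Poisson PMF and E = var = lambda
print(exp(-lam) * lam**k / factorial(k), stats.poisson.pmf(k, lam))
print(stats.poisson.mean(lam), stats.poisson.var(lam))

# hypergeometric: N objects, K marked, draw nn without replacement
N, K, nn = 20, 7, 5
print(comb(K, k) * comb(N - K, nn - k) / comb(N, nn),
      stats.hypergeom.pmf(k, N, K, nn))
```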

continuous distribution: P(X = x) = 0 ∀ x ∈ ℝ, ∫_{−∞}^{∞} f_X(x) dx = 1

uniform on [a, b]: f_X(x) = 1/(b − a) for x ∈ [a, b], 0 otherwise

exponential(λ): f_X(x) = λ e^{−λx} for x ≥ 0, E(X) = 1/λ, var(X) = 1/λ²

P(X ≥ t) = e^{−λt}

normal N(μ, σ²): f_X(x) = (1/(σ√(2π))) e^{−(x − μ)²/(2σ²)}

gamma function: Γ(x) = ∫_0^∞ t^{x−1} e^{−t} dt

Γ(x + 1) = x Γ(x), so Γ(n + 1) = n! for n ∈ ℕ

chi-squared: Χ²(n) is the distribution of Z = X_1² + … + X_n² with X_i ∼ N(0, 1) i.i.d.

E(Z) = n

f_Z(z) = e^{−z/2} / √(2πz) for z ≥ 0 if n = 1

t (or Student): t(n) is the distribution of T = X_0 / √((X_1² + … + X_n²)/n) with X_0, X_1, …, X_n ∼ N(0, 1) i.i.d.
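Sketch (scipy assumed) checking the exponential tail P(X ≥ t) = e^{−λt}, the Χ²(n) mean, and the normal density at 0:

```python
# Sketch: continuous-distribution facts via scipy.stats.
import math
from scipy import stats

lam, t = 1.5, 2.0
# scipy parametrizes the exponential by scale = 1/lambda
print(math.exp(-lam * t), stats.expon.sf(t, scale=1 / lam))   # both ~0.0498

n = 7
print(stats.chi2.mean(n))    # 7.0, matches E(Z) = n

# standard normal density at 0 is 1/sqrt(2*pi)
print(1 / math.sqrt(2 * math.pi), stats.norm.pdf(0))
```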


CDF cumulative distribution function: F_X(x) = P(X ≤ x)

function of a random variable: Y = h(X)

if X discrete, then p_Y(y) = ∑_{x : h(x) = y} p_X(x)

if X continuous, h(x) monotone where f_X(x) > 0, then f_Y(y) = f_X(h^{−1}(y)) |d h^{−1}(y)/dy|
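Simulation sketch (numpy assumed, my addition) of the monotone change-of-variables formula, with X ∼ Exponential(1) and Y = X²:

```python
# Sketch: check f_Y(y) = f_X(h^{-1}(y)) |d h^{-1}(y)/dy| by simulation,
# with X ~ Exponential(1) and Y = X^2 (h monotone for x > 0).
import numpy as np

rng = np.random.default_rng(0)
x = rng.exponential(scale=1.0, size=200_000)
y = x**2

# empirical density of Y near y0 vs the formula
y0, h = 1.5, 0.05
empirical = np.mean((y > y0 - h) & (y < y0 + h)) / (2 * h)
formula = np.exp(-np.sqrt(y0)) / (2 * np.sqrt(y0))   # f_X(sqrt(y)) * |d sqrt(y)/dy|
print(empirical, formula)   # close to each other
```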

joint distributions

marginal distribution: F_X(x) = lim_{y→∞} F_{X,Y}(x, y)

joint probability fun: p_{X,Y}(x, y) = P(X = x, Y = y)

marginal PMF: p_X(x) = ∑_y p_{X,Y}(x, y) (row/column sums of the table)

joint density fun: f_{X,Y}(x, y), ∫_{−∞}^{∞} ∫_{−∞}^{∞} f_{X,Y}(x, y) dx dy = 1

marginal PDF: f_X(x) = ∫_{−∞}^{∞} f_{X,Y}(x, y) dy

independent: p_{X,Y}(x, y) = p_X(x) p_Y(y)

or f_{X,Y}(x, y) = f_X(x) f_Y(y)

continuous conditional density: f_{Y|X}(y | x) = f_{X,Y}(x, y) / f_X(x)

law of total prob: P((X, Y) ∈ B) = ∬_B f_X(x) f_{Y|X}(y|x) dx dy

independent random variables: ∀ B_1, B_2, P(X ∈ B_1, Y ∈ B_2) = P(X ∈ B_1) P(Y ∈ B_2)

expectation: E(X) = ∑_x x p_X(x) (discrete) or ∫ x f_X(x) dx (continuous); E(XY) = E(X)E(Y) if X, Y independent

variance: var(X) = E((X − μ_X)²) = E(X²) − (E(X))², var(X + Y) = var(X) + var(Y) + 2 cov(X, Y)

covariance: cov(X, Y) = E((X − μ_X)(Y − μ_Y)) = E(XY) − E(X)E(Y)

cov(aX + bY, Z) = a cov(X, Z) + b cov(Y, Z)

correlation: ρ(X, Y) = cov(X, Y) / (σ_X σ_Y) ∈ [−1, 1]

i.i.d. mutually independent and identically distributed

X_1, …, X_n ∼ Bernoulli(p) i.i.d. ⇒ X_1 + … + X_n ∼ B(n, p)

X_1 ∼ B(n_1, p), X_2 ∼ B(n_2, p) independent ⇒ X_1 + X_2 ∼ B(n_1 + n_2, p)

X_1 ∼ Poisson(λ_1), X_2 ∼ Poisson(λ_2) independent ⇒ X_1 + X_2 ∼ Poisson(λ_1 + λ_2)
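Simulation sketch (numpy assumed) of the Poisson additivity fact above:

```python
# Sketch: X1 ~ Poisson(2), X2 ~ Poisson(3) independent; compare X1 + X2 with Poisson(5).
import numpy as np

rng = np.random.default_rng(1)
n = 100_000
s = rng.poisson(2.0, n) + rng.poisson(3.0, n)
direct = rng.poisson(5.0, n)

print(s.mean(), direct.mean())                 # both ~5
print(s.var(), direct.var())                   # both ~5
print(np.mean(s == 4), np.mean(direct == 4))   # P(S = 4) = e^-5 * 5^4 / 4! ~ 0.175
```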

generating fun: r_X(t) = E(t^X)

r_{X+Y}(t) = r_X(t) r_Y(t) for X, Y independent

kth moment: E(X^k)

kth central moment: E((X − μ_X)^k)

moment-generating fun: M_X(s) = E(e^{sX})

M_X^{(k)}(0) = E(X^k)

M_{X+Y}(s) = M_X(s) M_Y(s) for X, Y independent; the same works for r_X(t)

uniqueness: if ∃ s_0 > 0 with M_X(s) < ∞ ∀ s ∈ (−s_0, s_0) and M_X(s) = M_Y(s) there, then X, Y have the same distribution; also works for r

characteristic fun: c_X(s) = E(e^{isX})

inequalities

Markov’s: P(X ≥ a) ≤ E(X)/a for X ≥ 0, a > 0

Chebyshev’s: P(|X − μ| ≥ a) ≤ var(X)/a² for a > 0

Cauchy–Schwarz: (E(XY))² ≤ E(X²) E(Y²), hence |cov(X, Y)| ≤ σ_X σ_Y

equality (|ρ| = 1) iff X = aY + b with probability 1, provided var(Y) > 0
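Simulation sketch (numpy assumed) showing Markov's and Chebyshev's bounds hold (often loosely) for an Exponential(1) sample:

```python
# Sketch: exponential(1) sample; compare tail probabilities with the bounds.
import numpy as np

rng = np.random.default_rng(2)
x = rng.exponential(1.0, 500_000)   # E(X) = 1, var(X) = 1
a = 3.0

print(np.mean(x >= a), 1.0 / a)                    # Markov: P(X >= a) <= E(X)/a
print(np.mean(np.abs(x - 1.0) >= a), 1.0 / a**2)   # Chebyshev: P(|X-mu| >= a) <= var/a^2
```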

law of large numbers

sample sum: S_n = X_1 + … + X_n for {X_i} i.i.d.

sample average: M_n = S_n / n

WLLN weak: lim_{n→∞} P(|M_n − μ| ≥ ε) = 0 ∀ ε > 0, for independent {X_i} with common mean μ and var(X_i) ≤ v < ∞ (not necessarily identically distributed)

SLLN strong: P(lim_{n→∞} M_n = μ) = 1 for {X_i} i.i.d.


central limit theorem: lim_{n→∞} P(Z_n ≤ x) = P(Z ≤ x) for {X_i} i.i.d., Z ∼ N(0, 1), Z_n = (S_n − nμ)/(σ√n)
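Simulation sketch (numpy assumed) of the CLT: standardized sums of i.i.d. Uniform(0, 1) variables look N(0, 1):

```python
# Sketch: CLT for Uniform(0,1) samples, mu = 1/2, sigma^2 = 1/12.
import numpy as np

rng = np.random.default_rng(3)
n, reps = 50, 100_000
mu, sigma = 0.5, np.sqrt(1 / 12)

s = rng.uniform(0.0, 1.0, size=(reps, n)).sum(axis=1)
z = (s - n * mu) / (sigma * np.sqrt(n))

print(z.mean(), z.std())    # ~0 and ~1
print(np.mean(z <= 1.96))   # ~P(Z <= 1.96) ~ 0.975
```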

convolution: for X, Y independent, Z = X + Y

discrete: p_Z(z) = ∑_w p_X(z − w) p_Y(w)

continuous: f_Z(z) = ∫_{−∞}^{∞} f_X(z − w) f_Y(w) dw
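Sketch (numpy assumed) of the discrete convolution formula: the PMF of the sum of two fair dice is the convolution of their PMFs:

```python
# Sketch: p_Z = p_X * p_Y (convolution) for Z = X + Y, two fair six-sided dice.
import numpy as np

p_die = np.full(6, 1 / 6)            # support 1..6
p_sum = np.convolve(p_die, p_die)    # support 2..12

for total, prob in zip(range(2, 13), p_sum):
    print(total, round(prob, 4))     # e.g. P(Z = 7) = 6/36 ~ 0.1667
```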

different modes of convergence

in probability: {X_n} converges in probability to Y, if lim_{n→∞} P(|X_n − Y| ≥ ε) = 0 ∀ ε > 0

with probability 1: {X_n} converges with probability 1 (almost surely) to Y, if P(lim_{n→∞} X_n = Y) = 1

in distribution: {X_n} converges in distribution to Y, if lim_{n→∞} P(X_n ≤ x) = P(Y ≤ x) ∀ x with P(Y = x) = 0

sample

sample variance: S_n² = (1/(n − 1)) ∑_{i=1}^n (X_i − M_n)²

kth sample moment: (1/n) ∑_{i=1}^n X_i^k

point estimation

the method of moments: set sample moments equal to theoretical moments, (1/n) ∑_i X_i^k = E(X^k; θ), and solve for θ

MLE maximum likelihood estimation: choose θ̂ that makes L as large as possible

likelihood fun: L(θ; x_1, …, x_n) = P(X_1 = x_1, …, X_n = x_n | θ) or f(x_1, …, x_n | θ)

x_1, …, x_n independent: L(θ) = ∏_i p_X(x_i | θ); equivalently maximize log L(θ) = ∑_i log p_X(x_i | θ)
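Sketch (numpy assumed) of MLE for an Exponential(λ) sample: maximizing log L(λ) = n log λ − λ ∑ x_i gives λ̂ = 1/x̄, confirmed here by a crude grid search:

```python
# Sketch: MLE for the rate of an exponential sample, closed form vs grid search.
import numpy as np

rng = np.random.default_rng(4)
true_lam = 2.0
x = rng.exponential(scale=1 / true_lam, size=5_000)

lam_hat_closed = 1.0 / x.mean()                    # analytic MLE

lams = np.linspace(0.1, 5.0, 2_000)
loglik = len(x) * np.log(lams) - lams * x.sum()    # log L(lambda)
lam_hat_grid = lams[np.argmax(loglik)]

print(lam_hat_closed, lam_hat_grid)                # both ~2
```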

good point estimator

unbiased: E(θ̂_n) = θ

consistent: θ̂_n → θ in probability as n → ∞

mean squared error E((θ̂_n − θ)²) is small

confidence interval: P(θ̂_n^− ≤ θ ≤ θ̂_n^+) ≥ 1 − α

⇐ find a pivot g(θ) with known distribution and P(a ≤ g(θ) ≤ b) ≥ 1 − α, then solve for θ

N, t: by symmetry let a = −b, b the upper α/2 quantile

Χ²: let a = Χ²_{1−α/2}, b = Χ²_{α/2} (not symmetric)

deal with X_i ∼ N(μ, σ²) i.i.d.:

(M_n − μ)/(σ/√n) ∼ N(0, 1), (M_n − μ)/(S_n/√n) ∼ t(n − 1), (n − 1)S_n²/σ² ∼ Χ²(n − 1)

μ = μ_0: under this hypothesis, (M_n − μ_0)/(S_n/√n) ∼ t(n − 1)
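Sketch (numpy/scipy assumed) of a 1 − α t-interval for μ with σ unknown, using (M_n − μ)/(S_n/√n) ∼ t(n − 1); the data values are simulated, not from the notes:

```python
# Sketch: 95% t-interval for the mean of a normal sample.
import numpy as np
from scipy import stats

rng = np.random.default_rng(5)
x = rng.normal(loc=10.0, scale=3.0, size=40)   # assumed data
n, alpha = len(x), 0.05

m = x.mean()
s = x.std(ddof=1)                              # sample standard deviation
t_crit = stats.t.ppf(1 - alpha / 2, df=n - 1)
half = t_crit * s / np.sqrt(n)
print(m - half, m + half)   # should cover mu = 10 in about 95% of repetitions
```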

hypothesis testing

null hypothesis H0

type I/II error: rejecting H_0 when it is true / failing to reject H_0 when it is false

level of significance α: probability of a type I error

power of the test 1 − β, β: probability of a type II error

p-value: the probability, computed assuming H_0 is true, of a test statistic at least as extreme as the one observed

reject H_0 if p-value < α

for {X_i ∼ N(μ, σ_0²)} i.i.d., use Z = (M_n − μ_0)/(σ_0/√n) ∼ N(0, 1) under H_0 (or T = (M_n − μ_0)/(S_n/√n) ∼ t(n − 1) if σ is unknown)

for two samples, Z = ((M_n^X − M_m^Y) − (μ_X − μ_Y)) / √(σ_X²/n + σ_Y²/m)
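Sketch (numpy/scipy assumed) of the one-sample z-test above with known σ_0 and simulated data:

```python
# Sketch: one-sample z-test of H0: mu = mu0 with known sigma0.
import numpy as np
from scipy import stats

rng = np.random.default_rng(6)
sigma0, mu0 = 2.0, 5.0
x = rng.normal(loc=5.6, scale=sigma0, size=30)   # data actually drawn with mu = 5.6

z = (x.mean() - mu0) / (sigma0 / np.sqrt(len(x)))
p_value = 2 * stats.norm.sf(abs(z))              # two-sided p-value
alpha = 0.05
print(z, p_value, "reject H0" if p_value < alpha else "do not reject H0")
```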

linear regression

linear least squares

correlation coefficient: r = S_xy / √(S_xx S_yy), with S_xy = ∑_j (x_j − x̄)(y_j − ȳ), S_xx = ∑_j (x_j − x̄)², S_yy = ∑_j (y_j − ȳ)²

standard statistical model: Y_j = β_0 + β_1 x_j + e_j, j = 1, 2, …, n

E(e_j) = 0, var(e_j) = σ²

intercept β_0, slope β_1, residual e_j

estimators: β̂_1 = S_xy / S_xx, β̂_0 = ȳ − β̂_1 x̄

E(β̂_0) = β_0, E(β̂_1) = β_1

if e_j ∼ N(0, σ²) i.i.d., then β̂_0, β̂_1 are normally distributed

sum of squared errors SSE: SSE = ∑_j (y_j − β̂_0 − β̂_1 x_j)², s² = SSE/(n − 2) estimates σ²

SSE is independent of β̂_0, β̂_1; SSE/σ² ∼ Χ²(n − 2)

1 − α confidence interval for β_0, β_1:

β̂_1 ± t_{α/2, n−2} · s/√S_xx, β̂_0 ± t_{α/2, n−2} · s √(1/n + x̄²/S_xx)

1 − α confidence interval for σ²: SSE/Χ²_{α/2, n−2} ≤ σ² ≤ SSE/Χ²_{1−α/2, n−2}

1 − α confidence interval for y_0 = E(Y_0) at x = x_0: β̂_0 + β̂_1 x_0 ± t_{α/2, n−2} · s √(1/n + (x_0 − x̄)²/S_xx)
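Sketch (numpy/scipy assumed) of the least-squares estimators and the β_0, β_1 confidence intervals above, on simulated data:

```python
# Sketch: simple linear regression on hypothetical data, plus 95% CIs for beta0, beta1.
import numpy as np
from scipy import stats

rng = np.random.default_rng(7)
n = 30
x = np.linspace(0, 10, n)
y = 1.0 + 2.0 * x + rng.normal(0, 1.5, n)   # assumed model: beta0=1, beta1=2, sigma=1.5

xbar, ybar = x.mean(), y.mean()
Sxx = np.sum((x - xbar) ** 2)
Sxy = np.sum((x - xbar) * (y - ybar))

b1 = Sxy / Sxx                              # slope estimator
b0 = ybar - b1 * xbar                       # intercept estimator
sse = np.sum((y - b0 - b1 * x) ** 2)
s = np.sqrt(sse / (n - 2))                  # estimate of sigma

t_crit = stats.t.ppf(0.975, df=n - 2)
ci_b1 = (b1 - t_crit * s / np.sqrt(Sxx), b1 + t_crit * s / np.sqrt(Sxx))
ci_b0 = (b0 - t_crit * s * np.sqrt(1 / n + xbar**2 / Sxx),
         b0 + t_crit * s * np.sqrt(1 / n + xbar**2 / Sxx))
print(b0, b1)
print(ci_b0, ci_b1)
```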