Householder QR Factorization

A complete reference for the Householder QR factorization algorithm (Golub & Van Loan, Matrix Computations, Algorithm 5.2.1). Five parts covering the same algorithm from different angles — definitions, invariants, a question-and-answer walkthrough, a traditional narrative, and a concept map. They are not meant to be read in sequence; they are meant to be used as a reference you return to.

How to use this document

Part 2 (Definitions) and Part 3 (Invariants) are the core: every claim in the document is grounded there. Part 4 (Dialogue) is the recommended first read for a new reader — each question is an independent entry point. Part 5 (Narrative) is the synthesis, with a worked example and code. Part 6 (Concept Map) shows the dependency graph of ideas — which concepts are load-bearing, and which connections are non-obvious.

Part 1 — Context

The Householder QR factorization is Algorithm 5.2.1 in Golub and Van Loan, Matrix Computations (4th ed.). It is the standard method for computing the QR factorization of a dense matrix. The storage scheme it uses — packing Householder vectors into the zeros they create — is the format returned by the LAPACK routine DGEQRF. Anyone reading that output without knowing the storage scheme will be confused by the lower triangle.

Part 2 — Definitions

Every term used in Parts 3–5 is defined here. No term in the invariants requires knowledge from outside this section.

Matrix.: A rectangular array of numbers with $m$ rows and $n$ columns. Written $A$ , size $m \times n$ . Entry in row $i$ , column $j$ is written $A_{ij}$ .
Vector.: A column of numbers. Written $x$ . The $i$ -th entry is $x_{i}$ . A vector of length $m$ is an $m \times 1$ matrix.
Length of a vector.: $∥ x ∥ = \sqrt{x_{1}^{2} + x_{2}^{2} + \dots + x_{m}^{2}}$ . Also called the Euclidean norm.
Dot product.: $v^{T} x = v_{1} x_{1} + v_{2} x_{2} + \dots + v_{m} x_{m}$ . Always a single number.
Outer product.: $v v^{T}$ is an $m \times m$ matrix whose $(i, j)$ entry is $v_{i} v_{j}$ .
Identity matrix.: $I$ is the square matrix with 1s on the diagonal and 0s elsewhere. $I x = x$ for any vector $x$ .
Transpose.: $A^{T}$ has rows and columns of $A$ swapped: $(A^{T})_{ij} = A_{ji}$ .
Orthogonal matrix.: A square matrix $Q$ satisfying $Q^{T} Q = I$ , equivalently $Q^{- 1} = Q^{T}$ . Applying $Q$ to any vector preserves its length: $∥ Q x ∥ = ∥ x ∥$ for all $x$ .
Upper triangular matrix.: A matrix $R$ where $R_{ij} = 0$ whenever $i > j$ . Everything below the main diagonal is zero.
$e_{1}$ .: The vector $[1, 0, 0, \dots, 0]^{T}$ . Points along the first coordinate axis.
$sign (x_{1})$ .: Returns $+ 1$ if $x_{1} \geq 0$ , returns $- 1$ if $x_{1} < 0$ .
Submatrix $A (j : m, j : n)$ .: The block of $A$ formed by rows $j$ through $m$ and columns $j$ through $n$ .
Rank-1 update.: An operation of the form $B \leftarrow B - τv (v^{T} B)$ , where $v$ is a vector and $τ$ is a scalar. It modifies every column of $B$ by subtracting a scalar multiple of $v$ . Cost: $O (mn)$ .

Part 3 — Glossary of Invariants

Every statement below is necessarily true. Each uses only terms from Part 2 or from earlier invariants. Statements are ordered by dependency — no invariant uses a term introduced later.

The object

Every $m \times n$ matrix $A$ with $m \geq n$ can be written exactly as $A = QR$ , where $Q$ is an $m \times m$ orthogonal matrix and $R$ is an $m \times n$ upper triangular matrix. This is not an approximation.
$R$ being upper triangular means every entry below the main diagonal is zero. The diagonal and above are generally nonzero.
$Q$ being orthogonal means $Q^{T} Q = I$ : applying $Q$ changes the direction of vectors without changing their length.
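Because $Q^{- 1} = Q^{T}$ , applying $Q^{T}$ to a vector $b$ (reorienting it) destroys no information: $b$ can always be recovered as $Q (Q^{T} b)$ . When $Q$ is kept as a product of $n$ reflectors, as this algorithm does, the operation costs $O (mn)$ .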
The factorization $A = QR$ always exists for any $m \times n$ matrix with $m \geq n$ .
The factorization is not unique: negating the $j$ -th column of $Q$ and the $j$ -th row of $R$ simultaneously gives a different but equally valid factorization. The signs of the diagonal of $R$ are free.
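
The sign freedom can be checked numerically. The sketch below uses NumPy's `np.linalg.qr` with `mode="complete"` (so that $Q$ is $m \times m$, as in this section) only as a source of one valid factorization; the test matrix and the flipped index `j` are arbitrary choices made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((5, 3))          # arbitrary 5 x 3 test matrix

# One valid factorization: Q is 5 x 5 orthogonal, R is 5 x 3 upper triangular.
Q, R = np.linalg.qr(A, mode="complete")

# Negate column j of Q and row j of R together (j chosen arbitrarily).
j = 1
Q2, R2 = Q.copy(), R.copy()
Q2[:, j] *= -1.0
R2[j, :] *= -1.0

print(np.allclose(Q2 @ R2, A))             # the product is unchanged
print(np.allclose(Q2.T @ Q2, np.eye(5)))   # Q2 is still orthogonal
print(np.allclose(np.tril(R2, -1), 0.0))   # R2 is still upper triangular
```

The two sign flips cancel in the product $QR$, which is why the diagonal signs of $R$ are a convention rather than a property of $A$.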

Why it is useful

The system $A x = b$ is equivalent to $R x = Q^{T} b$ , obtained by substituting $A = QR$ and multiplying both sides by $Q^{T}$ .
Because $R$ is upper triangular, $R x = Q^{T} b$ is solved from the bottom row upward, one unknown at a time. This is back-substitution; it costs $O (n^{2})$ operations.
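The least squares problem (find $x$ minimizing $∥ A x - b ∥$ ) has the solution obtained by: computing $Q^{T} b$ , taking its first $n$ entries, then solving $R_{1 : n, 1 : n} \, x = (Q^{T} b)_{1 : n}$ by back-substitution, where $R_{1 : n, 1 : n}$ is the top $n \times n$ block of $R$ . This requires $A$ to have full column rank, so that the diagonal of $R$ is nonzero. No other steps are required.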
Applying $Q^{T}$ to $b$ does not change $∥ b ∥$ , because $Q^{T}$ is orthogonal. It reorients $b$ without distorting it.
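
The least squares recipe above can be sketched in a few lines. This is a minimal illustration, not the packed-format solver developed later: it takes $Q$ and $R$ from `np.linalg.qr`, assumes $A$ has full column rank, and uses a hypothetical helper `back_substitute` written for this example.

```python
import numpy as np

def back_substitute(R, c):
    """Solve R x = c for an n x n upper triangular R with nonzero diagonal."""
    n = R.shape[0]
    x = np.zeros(n)
    for i in range(n - 1, -1, -1):                 # bottom row upward
        x[i] = (c[i] - R[i, i + 1:] @ x[i + 1:]) / R[i, i]
    return x

rng = np.random.default_rng(1)
A = rng.standard_normal((6, 3))    # tall matrix; full column rank assumed
b = rng.standard_normal(6)

Q, R = np.linalg.qr(A, mode="complete")   # Q: 6 x 6, R: 6 x 3
c = Q.T @ b                               # reorient b; its length is unchanged
x = back_substitute(R[:3, :], c[:3])      # top n rows of R, first n entries of c

x_ref = np.linalg.lstsq(A, b, rcond=None)[0]   # reference solution
print(np.allclose(x, x_ref))
```

The entries of `c` that are not used, `c[3:]`, carry the part of $b$ that no choice of $x$ can reach; their norm equals the least squares residual.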

The key constraint

Any transformation applied to $A$ during the factorization must be orthogonal — it must preserve lengths. Setting entries to zero directly changes the matrix itself, not just its coordinate representation, and is not permitted.
Zeroing all entries below the first in a column vector $x$ is equivalent to finding an orthogonal transformation $H$ such that $H x = [\pm ∥ x ∥, 0, 0, \dots, 0]^{T}$ . The first entry must be $\pm ∥ x ∥$ because $H$ preserves length and the right-hand side must have the same norm as $x$ .

The Householder reflector

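For any nonzero vector $v$ , the scalar $τ = 2/ (v^{T} v)$ is well defined and positive. Once $v$ is scaled so that its first entry is 1 (the normalization used by the storage scheme), $v^{T} v \geq 1$ and therefore $0 < τ \leq 2$ .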
The matrix $H = I - τv v^{T}$ , where $τ = 2/ (v^{T} v)$ , is called a Householder reflector.
$H$ is orthogonal: $H^{T} H = I$ . Applying $H$ to any vector preserves its length.
$H$ is symmetric: $H^{T} = H$ .
$H$ is its own inverse: $H^{2} = I$ . Applying $H$ twice returns any vector exactly to where it started.
$H$ reflects space across the hyperplane $\{w : v^{T} w = 0\}$ , the set of all vectors perpendicular to $v$ . The component of any vector along $v$ is negated; components perpendicular to $v$ are unchanged.
$H$ is never stored or formed as an explicit $m \times m$ matrix. Applying $H$ to a vector $b$ uses: $H b = b - τv (v^{T} b)$ . This requires one dot product, one scaling, and one vector subtraction. Cost: $O (m)$ .
Applying $H$ to an $m \times n$ matrix $B$ uses: $H B = B - τv (v^{T} B)$ , where $v^{T} B$ is a row vector of $n$ numbers. Cost: $O (mn)$ .
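
The reflector properties listed above are easy to confirm numerically. The sketch below forms $H$ explicitly only to check it against the implicit formulas; the random `v`, `b`, and `B` are arbitrary test data, not part of the algorithm.

```python
import numpy as np

rng = np.random.default_rng(2)
m, n = 5, 3
v = rng.standard_normal(m)          # any nonzero vector
tau = 2.0 / (v @ v)

# Explicit H, formed here only for checking; the algorithm never needs it.
H = np.eye(m) - tau * np.outer(v, v)
print(np.allclose(H.T, H))                 # symmetric
print(np.allclose(H.T @ H, np.eye(m)))     # orthogonal
print(np.allclose(H @ H, np.eye(m)))       # its own inverse
print(np.allclose(H @ v, -v))              # component along v is negated

# Implicit application to a vector: one dot product, one scaling, one subtraction.
b = rng.standard_normal(m)
Hb = b - tau * v * (v @ b)
print(np.allclose(Hb, H @ b))

# Implicit application to a matrix: v @ B is a row of n numbers.
B = rng.standard_normal((m, n))
HB = B - tau * np.outer(v, v @ B)
print(np.allclose(HB, H @ B))
```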

Constructing $v$ to zero a column

Given a vector $x$ of length $m$ , define $v = x + sign (x_{1}) ∥ x ∥ e_{1}$ . The reflector $H = I - τv v^{T}$ with $τ = 2/ (v^{T} v)$ then satisfies $H x = - sign (x_{1}) ∥ x ∥ e_{1}$ : the first entry is $- sign (x_{1}) ∥ x ∥$ and all other entries are zero.
The vector $v$ from invariant 21 points from the target $- sign (x_{1}) ∥ x ∥ e_{1}$ to the current position $x$ . The hyperplane perpendicular to $v$ bisects $x$ and its target. Reflecting $x$ across this bisector maps it exactly onto the target.
The zeros in $H x$ are not set directly. They appear because $x$ has been rotated to point along $e_{1}$ . A vector pointing along $e_{1}$ has zero in every position except the first by definition.
The sign $sign (x_{1})$ in invariant 21 is chosen so that $v_{1} = x_{1} + sign (x_{1}) ∥ x ∥$ adds two numbers of the same sign, so $|v_{1}| \geq ∥ x ∥$ and no cancellation can occur. The alternative sign gives (for $x_{1} > 0$ ) $v_{1} = x_{1} - ∥ x ∥$ , which is nearly zero when $x_{1} \approx ∥ x ∥$ . Subtracting two nearly equal floating point numbers cancels their leading digits, so the computed $v_{1}$ carries a large relative error, and that error propagates into $v$ and $τ = 2/ (v^{T} v)$ . Example: $x = [0.9999, 0.01, 0.01]^{T}$ , $∥ x ∥ = \sqrt{1.00000001} \approx 1.000000005$ . Wrong sign: $v_{1} = 0.9999 - 1.000000005 \approx - 0.0001$ (about four leading decimal digits cancel). Correct sign: $v_{1} = 0.9999 + 1.000000005 \approx 1.9999$ (safe).
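
The construction of $v$ can be sketched directly from the formula. The helper name `house` and its return convention are assumptions of this example; it leaves $v$ unnormalized and assumes $x$ is nonzero.

```python
import numpy as np

def house(x):
    """Return (v, tau, beta) such that (I - tau v v^T) x = beta * e_1.

    Assumes x is nonzero. v is left unnormalized here.
    """
    x = np.asarray(x, dtype=float)
    s = 1.0 if x[0] >= 0 else -1.0       # sign(x_1), with sign(0) = +1
    norm_x = np.linalg.norm(x)
    v = x.copy()
    v[0] += s * norm_x                   # same-sign addition: no cancellation
    tau = 2.0 / (v @ v)
    return v, tau, -s * norm_x

x = np.array([0.9999, 0.01, 0.01])
v, tau, beta = house(x)
Hx = x - tau * v * (v @ x)               # apply H without forming it

print(Hx)                                # first entry is beta, the rest ~ 0
print(v[0])                              # large in magnitude: at least ||x||
print(x[0] - np.linalg.norm(x))          # the alternative sign: tiny difference
```

The last line shows the quantity the sign rule avoids: the difference of two numbers that agree in their leading digits.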

The algorithm

The algorithm performs $n$ steps. At step $j$ (for $j = 1, 2, \dots, n$ ), it operates on the submatrix $A (j : m, j : n)$ . When $m = n$ , the last step acts on a single entry and has nothing to zero.
At step $j$ : extract $x = A (j : m, j)$ . Construct $v_{j}$ from $x$ using invariant 21. The reflector $H_{j} = I - τ_{j} v_{j} v_{j}^{T}$ maps $x$ to $- sign (x_{1}) ∥ x ∥ e_{1}$ .
Apply $H_{j}$ to the entire submatrix $A (j : m, j : n)$ — not just column $j$ . Applying it to one column but not the others would leave the matrix inconsistent.
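$H_{j}$ acts only on rows $j$ through $m$ , and it is applied only to columns $j$ through $n$ . In columns $1$ through $j - 1$ the entries in those rows are (mathematically) zero, and a reflector maps zero to zero, so skipping them loses nothing. Columns $1$ through $j - 1$ and rows $1$ through $j - 1$ therefore stay in their final form. This is what makes the column-by-column approach correct, not merely plausible.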
After applying $H_{j}$ : entry $A (j, j)$ becomes $r_{jj} = - sign (x_{1}) ∥ x ∥$ . Entries $A (j + 1 : m, j)$ become exactly zero.
After all $n$ steps: the upper triangle of the working matrix (including diagonal) contains $R$ . The product $Q = H_{1} H_{2} \dots H_{n}$ is the orthogonal factor. $Q$ is never computed explicitly.
The total cost of the factorization is $2 m n^{2} - \frac{2}{3} n^{3}$ floating point operations. For a square $m = n$ matrix this is $\frac{4}{3} n^{3}$ — twice the cost of LU factorization ( $\frac{2}{3} n^{3}$ ). QR is preferred not for square systems but for least squares and rank-deficient problems, where its orthogonality guarantees are essential.
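
The leading terms of this count can be recovered from the per-step cost. The derivation below assumes the usual convention that a multiply-add pair counts as two flops, and it drops lower-order terms. At step $j$ the block has $m - j + 1$ rows and $n - j + 1$ columns; forming $v_{j}^{T} B$ and then subtracting $τ_{j} v_{j} (v_{j}^{T} B)$ each cost about $2 (m - j + 1) (n - j + 1)$ flops. Substituting $k = n - j + 1$ :

```latex
\sum_{j=1}^{n} 4\,(m-j+1)(n-j+1)
  = 4 \sum_{k=1}^{n} k\,(m-n+k)
  \approx 4\left[ \frac{(m-n)\,n^{2}}{2} + \frac{n^{3}}{3} \right]
  = 2mn^{2} - \frac{2}{3}\,n^{3}
```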

The storage scheme

After step $j$ , entries $A (j + 1 : m, j)$ are zero. A known zero carries no information and these entries are not read again by later steps, so the slots are free to reuse.
By normalizing $v_{j}$ so that $v_{j} (1) = 1$ (dividing $v_{j}$ by its first component after construction), the first entry is always exactly 1. Only $v_{j} (2), \dots, v_{j} (m - j + 1)$ — a total of $m - j$ entries — need to be stored.
The number of free slots created at step $j$ (from invariant 32) is $m - j$ . The number of entries of $v_{j}$ that need storing (from invariant 33) is also $m - j$ . These counts are equal. This exact coincidence is what enables the storage trick.
The free slots in the lower triangle of column $j$ store $v_{j} (2)$ through $v_{j} (m - j + 1)$ . The diagonal entry stores $r_{jj}$ . The upper triangle stores $R$ .
One scalar $τ_{j} = 2/ (v_{j}^{T} v_{j})$ is stored per step in a separate length- $n$ vector $τ$ . It could be recomputed from the stored entries of $v_{j}$ , but only at the price of an extra dot product every time $H_{j}$ is applied; storing it avoids that recomputation.
The output is: (a) the overwritten array, upper triangle = $R$ , strictly lower triangle = essential parts of $v_{1}, \dots, v_{n}$ ; and (b) the vector $τ$ of length $n$ . Together these are a complete lossless encoding of both $Q$ and $R$ .
The lower triangle of the output is not part of $R$ , not zero, and not padding. It is the compressed representation of $Q$ . This is the LAPACK DGEQRF format.
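
To make the packed layout concrete, the sketch below factors a small matrix into the packed form and then unpacks it into explicit $Q$ and $R$. The names `householder_qr_packed` and `unpack` are hypothetical, chosen for this example. Skipping a step when nothing lies below the diagonal (leaving $τ_{j} = 0$ , so $H_{j} = I$ ) is an assumption of this sketch; LAPACK's DLARFG makes the same choice. Forming $Q$ explicitly is done here only to verify the encoding.

```python
import numpy as np

def householder_qr_packed(A):
    """Packed Householder QR (sketch). Returns (packed, tau)."""
    A = np.array(A, dtype=float)
    m, n = A.shape
    tau = np.zeros(n)
    for j in range(n):
        x = A[j:, j]
        if np.linalg.norm(x[1:]) == 0.0:   # nothing to zero: H_j = I, tau_j = 0
            continue
        v = x.copy()
        v[0] += (1.0 if x[0] >= 0 else -1.0) * np.linalg.norm(x)
        v /= v[0]                          # v[0] = 1 is implied, never stored
        tau[j] = 2.0 / (v @ v)
        A[j:, j:] -= tau[j] * np.outer(v, v @ A[j:, j:])
        A[j + 1:, j] = v[1:]               # reuse the freed slots
    return A, tau

def unpack(packed, tau):
    """Rebuild explicit Q (m x m) and R (m x n) from the packed format."""
    m, n = packed.shape
    R = np.triu(packed)                    # upper triangle, including diagonal
    Q = np.eye(m)
    for j in range(n):
        v = np.concatenate(([1.0], packed[j + 1:, j]))
        # Q <- Q H_j; H_j only touches columns j..m-1 of Q
        Q[:, j:] -= tau[j] * np.outer(Q[:, j:] @ v, v)
    return Q, R

rng = np.random.default_rng(3)
A = rng.standard_normal((5, 3))
packed, tau = householder_qr_packed(A)
Q, R = unpack(packed, tau)
print(np.allclose(Q @ R, A), np.allclose(Q.T @ Q, np.eye(5)))
```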

Recovering and applying $Q$

To apply $Q^{T}$ to a vector $b$ : for $j = 1, 2, \dots, n$ (forward), reconstruct $v_{j}$ by prepending the implied 1 to the stored subdiagonal of column $j$ , then compute $b (j : m) \leftarrow b (j : m) - τ_{j} \cdot v_{j} \cdot (v_{j}^{T} b (j : m))$ .
To apply $Q$ to a vector $b$ : run the same loop in reverse, $j = n, n - 1, \dots, 1$ . This is correct because $(H_{1} H_{2} \dots H_{n})^{T} = H_{n}^{T} \dots H_{1}^{T} = H_{n} \dots H_{1}$ , since each $H_{j}$ is symmetric by invariant 16.
Forming $Q$ as an explicit $m \times m$ matrix costs $O (m^{2} n)$ . Applying $Q$ or $Q^{T}$ to a single vector using the stored representation costs $O (mn)$ . Explicit $Q$ is almost never needed.

What the algorithm is not doing

It is not setting entries to zero directly. Zeros appear because columns are rotated onto coordinate axes.
It is not building $Q$ first and then computing $Q^{T} A$ . It applies each reflector directly to the working matrix as it goes.
It is not storing Householder vectors in a separate data structure. They live in the zeros they created.
The lower triangle of the output is not a mistake. It is the compressed recipe for $Q$ .

Part 4 — Dialogue

Each question is an independent entry point. Answers are self-contained.

What are we trying to do?

We have a matrix $A$ . We want to write it as $A = QR$ , where $R$ is upper triangular and $Q$ is orthogonal.

Why upper triangular?

Because triangular systems are trivial to solve. If $A = QR$ , then $A x = b$ becomes $R x = Q^{T} b$ — and that is just back-substitution.

Why orthogonal $Q$ ?

Because orthogonal matrices don't distort lengths. Applying $Q^{T}$ to both sides of an equation changes coordinates without changing geometry. It is the safest possible transformation.

How do you make a matrix triangular?

Column by column. Take column 1. Make everything below the diagonal zero. Then column 2, and so on.

But you said we can't distort lengths. Doesn't zeroing entries distort things?

Yes — setting entries to zero directly changes the matrix, not just its representation. The trick is to rotate the column so it naturally has zeros below the diagonal.

What does it mean to rotate a column so it has zeros below the diagonal?

It means finding an orthogonal $H$ such that:

$$H \begin{bmatrix} x_{1} \\ x_{2} \\ x_{3} \end{bmatrix} = \begin{bmatrix} \pm ∥ x ∥ \\ 0 \\ 0 \end{bmatrix}$$

The result must be $\pm ∥ x ∥$ in the first position because $H$ preserves length.

So we're not zeroing entries. We're rotating the whole column onto an axis.

Precisely. The zeros appear as a consequence of alignment, not by being set directly.

What kind of transformation achieves this?

A Householder reflector: $H = I - τv v^{T}$ , $τ = 2/ (v^{T} v)$ .

Why a reflection and not a rotation?

Both work. Rotations (Givens) zero one entry at a time. Reflections (Householder) zero an entire subvector at once. For a dense column, one reflection beats $m - 1$ rotations.

How does $H$ work geometrically?

It flips space across the hyperplane perpendicular to $v$ . The component of any vector along $v$ is negated; everything perpendicular to $v$ is unchanged.

How do you choose $v$ ?

$$v = x + sign (x_{1}) ∥ x ∥ e_{1}$$

This vector points from the target ( $- sign (x_{1}) ∥ x ∥ e_{1}$ ) to the current position ( $x$ ). The hyperplane perpendicular to $v$ bisects $x$ and its target. Reflecting across this bisector maps $x$ exactly onto the target.

Why that particular sign?

Numerical stability. The wrong sign gives (for $x_{1} > 0$ ) $v_{1} = x_{1} - ∥ x ∥$ , nearly zero when $x$ already points near $e_{1}$ . That subtraction cancels the leading digits of two nearly equal numbers, so $v$ and $τ = 2/ (v^{T} v)$ are computed with a large relative error. The correct sign makes $v_{1} = x_{1} + sign (x_{1}) ∥ x ∥$ , whose magnitude is always at least $∥ x ∥$ .

You apply $H$ to one column. What about the rest of the matrix?

You apply $H$ to all remaining columns simultaneously: $A \leftarrow A - τv (v^{T} A)$ .

Why?

Because you're changing the coordinate system for the whole matrix. Transforming one column but not the others would leave them inconsistent.

Is that expensive?

No. $H$ is never formed explicitly. The rank-1 update $A \leftarrow A - τv (v^{T} A)$ costs $O (mn)$ per step. Total: $O (m n^{2})$ .

After applying $H$ , where do the zeros go?

The entries below the diagonal of column $j$ are now exactly zero — created by the reflection.

So those memory slots are free?

Exactly. And we need to store $v_{j}$ for later use in applying $Q$ . So we store it there.

But $v_{j}$ has one more entry than the number of zeros.

The first entry of $v_{j}$ is always 1 by convention. It is never stored. The remaining entries fit exactly into the zeros.

What is stored separately?

Just $τ_{j}$ — one scalar per column, a vector of length $n$ .

What is $Q$ in the end?

$$Q = H_{1} H_{2} \dots H_{n}$$

We never formed any $H_{j}$ . So how do we use $Q$ ?

for j = 1 to n:
    reconstruct v_j: prepend 1 to stored subdiagonal of column j
    b[j:] ← b[j:] − τⱼ · vⱼ · (vⱼᵀ b[j:])

That applies $Q^{T}$ . Reverse the loop for $Q$ .

Why does reversing give $Q$ and not $Q^{T}$ ?

Because each $H_{j}$ is symmetric: $H_{j}^{T} = H_{j}$ . So $(H_{1} H_{2} \dots H_{n})^{T} = H_{n} \dots H_{2} H_{1}$ . The forward loop applies them left to right ( $Q^{T}$ ); the reverse loop applies them right to left ( $Q$ ).

Part 5 — Traditional Narrative

What QR factorization is

Given an $m \times n$ matrix $A$ with $m \geq n$ , QR factorization produces $Q$ and $R$ such that $A = QR$ . $Q$ is $m \times m$ orthogonal ( $Q^{T} Q = I$ ); $R$ is $m \times n$ upper triangular. The factorization always exists. When $A$ has full column rank, $R$ and the first $n$ columns of $Q$ are unique up to sign choices on the diagonal of $R$ .

The system $A x = b$ becomes $R x = Q^{T} b$ , solved by back-substitution. The least squares minimizer of $∥ A x - b ∥$ satisfies the same triangular system. In both cases, $Q^{T}$ is applied without forming $Q$ explicitly.

The Householder reflector

For any nonzero vector $v$ , the matrix

$$H = I - τ v v^{T}, \quad τ = \frac{2}{v^{T} v}$$

is symmetric and orthogonal. It reflects space across the hyperplane $\{w : v^{T} w = 0\}$ . It is its own inverse: $H^{2} = I$ .

The key property: given any vector $x$ , choose

$$v = x + sign (x_{1}) ∥ x ∥ e_{1}$$

Then $H x = - sign (x_{1}) ∥ x ∥ e_{1}$ : first entry $- sign (x_{1}) ∥ x ∥$ , all other entries zero.

The sign convention ensures $|v_{1}| = |x_{1}| + ∥ x ∥ \geq ∥ x ∥$ , avoiding catastrophic cancellation. The wrong sign gives $v_{1} \approx 0$ when $x$ nearly points along $e_{1}$ ; the subtraction loses the leading digits of $v_{1}$ , and the error carries into $τ = 2/ (v^{T} v)$ .

$H$ is never formed explicitly. Applied to matrix $B$ :

$$H B = B - τ v (v^{T} B)$$

Cost: $O (mn)$ rather than $O (m^{2} n)$ .

The algorithm

At step $j$ , extract $x = A (j : m, j)$ . Construct $v_{j}$ and $τ_{j}$ . Apply:

$$A (j : m, j : n) \leftarrow H_{j} \cdot A (j : m, j : n)$$

After this update: $A (j, j) = r_{jj}$ , entries $A (j + 1 : m, j) = 0$ . Earlier columns are untouched because $H_{j}$ acts only on rows $j$ through $m$ .

After all $n$ steps: $A$ has been overwritten with $R$ , and $Q = H_{1} H_{2} \dots H_{n}$ .

The compact storage scheme

After step $j$ , the entries $A (j + 1 : m, j)$ are zero and free. Normalizing so $v_{j} (1) = 1$ , only $v_{j} (2 : \text{end})$ needs storing: exactly $m - j$ entries, matching the $m - j$ free slots. The diagonal stores $r_{jj}$ .

The output contains two things simultaneously: upper triangle = $R$ , strictly lower triangle = essential Householder vectors. A separate length- $n$ vector $τ$ completes the encoding. This is the LAPACK DGEQRF format.

Applying $Q$ without forming it

To apply $Q^{T}$ to $b$ :

$$\text{for } j = 1, \dots, n : \quad b (j : m) \leftarrow b (j : m) - τ_{j} v_{j} (v_{j}^{T} b (j : m))$$

To apply $Q$ : same loop, reversed. Cost: $O (mn)$ . Forming $Q$ explicitly costs $O (m^{2} n)$ and is almost never needed.

Complete numeric example

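$$A = \begin{bmatrix} 1 & 1 & 0 \\ 1 & 0 & 1 \\ 0 & 1 & 1 \end{bmatrix}$$

Step 1. $x = [1, 1, 0]^{T}$ , $∥ x ∥ = \sqrt{2}$ , $sign (x_{1}) = + 1$ .

$$v_{1} = \begin{bmatrix} 1 + \sqrt{2} \\ 1 \\ 0 \end{bmatrix}, \quad τ_{1} = \frac{2}{(1 + \sqrt{2})^{2} + 1} = \frac{2}{4 + 2 \sqrt{2}} = \frac{1}{2 + \sqrt{2}}$$

Apply $H_{1}$ to all three columns via $A \leftarrow A - τ_{1} v_{1} (v_{1}^{T} A)$ .

$v_{1}^{T} A$ : a row vector with one entry per column.

Column 1: $v_{1}^{T} x = (1 + \sqrt{2}) (1) + (1) (1) + 0 = 2 + \sqrt{2}$
Column 2: $(1 + \sqrt{2}) (1) + (1) (0) + 0 = 1 + \sqrt{2}$
Column 3: $(1 + \sqrt{2}) (0) + (1) (1) + 0 = 1$

So $τ_{1} (v_{1}^{T} A) = \frac{1}{2 + \sqrt{2}} [2 + \sqrt{2}, 1 + \sqrt{2}, 1] = [1, \frac{\sqrt{2}}{2}, 1 - \frac{\sqrt{2}}{2}]$ .

After update, with the essential part of $v_{1}$ packed below the diagonal of column 1:

$$A^{(1)} = \begin{bmatrix} - \sqrt{2} & - \frac{\sqrt{2}}{2} & - \frac{\sqrt{2}}{2} \\ v_{1} (2) & - \frac{\sqrt{2}}{2} & \frac{\sqrt{2}}{2} \\ v_{1} (3) & 1 & 1 \end{bmatrix}$$

where $r_{11} = - \sqrt{2}$ . Dividing $v_{1}$ by its first entry $1 + \sqrt{2}$ gives the stored values $v_{1} (2) = 1/ (1 + \sqrt{2}) = \sqrt{2} - 1$ and $v_{1} (3) = 0$ . The stored scalar belongs to this normalized vector: $τ_{1} = 2/ (1 + (\sqrt{2} - 1)^{2}) = 1 + \frac{\sqrt{2}}{2}$ .

Step 2. Subvector: $x = [- \frac{\sqrt{2}}{2}, 1]^{T}$ , $∥ x ∥ = \sqrt{\frac{1}{2} + 1} = \frac{\sqrt{6}}{2}$ .

Since $x_{1} = - \frac{\sqrt{2}}{2} < 0$ , $sign (x_{1}) = - 1$ :

$$v_{2} = \begin{bmatrix} - \frac{\sqrt{2}}{2} - \frac{\sqrt{6}}{2} \\ 1 \end{bmatrix} = \begin{bmatrix} - \frac{\sqrt{2} + \sqrt{6}}{2} \\ 1 \end{bmatrix}$$

$$τ_{2} = \frac{2}{v_{2}^{T} v_{2}} = \frac{2}{\frac{(\sqrt{2} + \sqrt{6})^{2}}{4} + 1} = \frac{2}{\frac{8 + 4 \sqrt{3}}{4} + 1} = \frac{2}{3 + \sqrt{3}}$$

Apply $H_{2}$ to the $2 \times 2$ subblock. After this step: $r_{22} = \frac{\sqrt{6}}{2}$ , $r_{23} = \frac{\sqrt{6}}{6}$ , $A (3, 2) = 0$ , and $A (3, 3) = \frac{2 \sqrt{3}}{3}$ . Normalizing $v_{2}$ by its first entry, $v_{2} (2) = - \frac{2}{\sqrt{2} + \sqrt{6}} = \frac{\sqrt{2} - \sqrt{6}}{2}$ is packed into position $(3, 2)$ , and the stored scalar for the normalized vector is $τ_{2} = 1 + \frac{\sqrt{3}}{3}$ .

Step 3. Single entry: $A (3, 3)$ is already in upper triangular position. $r_{33} = \frac{2 \sqrt{3}}{3}$ is read directly. No reflection is needed, which corresponds to $H_{3} = I$ and $τ_{3} = 0$ .

Final packed array:

$$\begin{bmatrix} r_{11} & r_{12} & r_{13} \\ v_{1} (2) & r_{22} & r_{23} \\ v_{1} (3) & v_{2} (2) & r_{33} \end{bmatrix} = \begin{bmatrix} - \sqrt{2} & - \frac{\sqrt{2}}{2} & - \frac{\sqrt{2}}{2} \\ \sqrt{2} - 1 & \frac{\sqrt{6}}{2} & \frac{\sqrt{6}}{6} \\ 0 & \frac{\sqrt{2} - \sqrt{6}}{2} & \frac{2 \sqrt{3}}{3} \end{bmatrix}, \quad τ = \begin{bmatrix} τ_{1} \\ τ_{2} \\ τ_{3} \end{bmatrix} = \begin{bmatrix} 1 + \frac{\sqrt{2}}{2} \\ 1 + \frac{\sqrt{3}}{3} \\ 0 \end{bmatrix}$$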

Upper triangle: the answer ( $R$ ). Lower triangle: the recipe for $Q$ . These two objects share one array.
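
The worked example can be checked numerically. The sketch below repeats the two reflections on the same $3 \times 3$ matrix, stops before the last column (where nothing remains to be zeroed), and compares the packed array with closed-form values. The names `W` and `expected` are illustrative, and the closed-form entries were derived by hand for this sketch.

```python
import numpy as np

A = np.array([[1.0, 1.0, 0.0],
              [1.0, 0.0, 1.0],
              [0.0, 1.0, 1.0]])
m, n = A.shape
W = A.copy()                 # working array, overwritten in packed form
tau = np.zeros(n)

for j in range(n - 1):       # square case: the last column needs no reflection
    x = W[j:, j].copy()
    s = 1.0 if x[0] >= 0 else -1.0
    v = x.copy()
    v[0] += s * np.linalg.norm(x)
    v /= v[0]                # normalize so v[0] = 1
    tau[j] = 2.0 / (v @ v)
    W[j:, j:] -= tau[j] * np.outer(v, v @ W[j:, j:])
    W[j + 1:, j] = v[1:]     # pack the essential part of v

s2, s3, s6 = np.sqrt(2.0), np.sqrt(3.0), np.sqrt(6.0)
expected = np.array([[-s2,     -s2 / 2,        -s2 / 2],
                     [s2 - 1,   s6 / 2,         s6 / 6],
                     [0.0,     (s2 - s6) / 2,   2 * s3 / 3]])

print(np.round(W, 4))
print(np.allclose(W, expected))
print(np.allclose(tau, [1 + s2 / 2, 1 + s3 / 3, 0.0]))
```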

Implementation

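import numpy as np

def householder_qr(A):
    """Return (packed, tau): upper triangle = R, strictly lower = v's."""
    A = A.astype(float).copy()
    m, n = A.shape
    tau = np.zeros(n)
    for j in range(n):
        x = A[j:, j]
        # nothing below the diagonal to zero (last column of a square matrix,
        # or a zero column): take H_j = I, i.e. leave tau[j] = 0
        if np.linalg.norm(x[1:]) == 0.0:
            continue
        v = x.copy()
        # sign convention: avoid cancellation when x[0] ≈ ||x||
        v[0] += (1.0 if v[0] >= 0 else -1.0) * np.linalg.norm(x)
        v /= v[0]                                      # normalize so v[0] = 1
        tau[j] = 2.0 / (v @ v)
        A[j:, j:] -= tau[j] * np.outer(v, v @ A[j:, j:])  # rank-1 update
        A[j+1:, j] = v[1:]                            # pack v into lower triangle
    return A, tau                                      # upper = R, lower = v's, tau separate

The `v[0] += ...` line is the sign convention (`np.sign(0)` is 0 in NumPy, so an explicit test is used instead; it defaults to $+ 1$ when $v [0] = 0$ ). The `A[j:, j:] -= ...` line is the entire reflector application. The last line of the loop, `A[j+1:, j] = v[1:]`, together with `tau[j]`, is the storage trick. The early `continue` leaves `tau[j] = 0` (so $H_{j} = I$ ) when there is nothing below the diagonal to zero, which also avoids a division by zero for a zero column.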

To apply $Q^{T}$ to a vector $b$ given the packed output:

def apply_Qt(QR, tau, b):
    b = b.copy().astype(float)
    m, n = QR.shape
    for j in range(n):
        v = np.concatenate([[1.0], QR[j+1:, j]])      # reconstruct v with implied 1
        b[j:] -= tau[j] * v * (v @ b[j:])
    return b
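
Applying $Q$ instead of $Q^{T}$ only requires running the loop in reverse. The sketch below adds a hypothetical `apply_Q` next to a copy of `apply_Qt` and checks the round trip. To stay self-contained it does not factor a matrix: it builds a synthetic packed array whose strictly lower triangle holds arbitrary vector tails, which is enough because any $v_{j}$ with implied leading 1 and $τ_{j} = 2/ (v_{j}^{T} v_{j})$ defines a valid reflector.

```python
import numpy as np

def apply_Qt(QR, tau, b):
    """Apply Q^T = H_n ... H_1 to b (H_1 first)."""
    b = np.array(b, dtype=float)
    m, n = QR.shape
    for j in range(n):
        v = np.concatenate(([1.0], QR[j + 1:, j]))   # implied leading 1
        b[j:] -= tau[j] * v * (v @ b[j:])
    return b

def apply_Q(QR, tau, b):
    """Apply Q = H_1 ... H_n to b: the same loop, reversed (H_n first)."""
    b = np.array(b, dtype=float)
    m, n = QR.shape
    for j in range(n - 1, -1, -1):
        v = np.concatenate(([1.0], QR[j + 1:, j]))
        b[j:] -= tau[j] * v * (v @ b[j:])
    return b

# Synthetic packed input (an assumption of this sketch, not a factorization).
rng = np.random.default_rng(4)
m, n = 5, 3
QR = rng.standard_normal((m, n))     # only the strictly lower triangle is read
tau = np.array([2.0 / (1.0 + QR[j + 1:, j] @ QR[j + 1:, j]) for j in range(n)])

# Explicit Q, built only to check the implicit loops.
Q = np.eye(m)
for j in range(n):
    v = np.zeros(m)
    v[j] = 1.0
    v[j + 1:] = QR[j + 1:, j]
    Q = Q @ (np.eye(m) - tau[j] * np.outer(v, v))

b = rng.standard_normal(m)
print(np.allclose(apply_Qt(QR, tau, b), Q.T @ b))
print(np.allclose(apply_Q(QR, tau, b), Q @ b))
print(np.allclose(apply_Q(QR, tau, apply_Qt(QR, tau, b)), b))   # round trip
```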

Part 6 — Concept Map

A diagram of the dependency graph of ideas across three clusters — geometry, mechanism, and storage & use — with labelled cross-cluster connections (sign rule → numerical stability; rank-1 update → packed array; column-by-column → zeros are free; the exact coincidence zero count = v entry count). SVG is pending.

Known remaining issues

The following items are identified for completion:

Geometric diagram. A 2D SVG illustration showing: vector $x$ as an arrow from origin; target $\pm ∥ x ∥ e_{1}$ on the horizontal axis; $v$ as the vector from target to $x$ ; the hyperplane as the perpendicular bisector; $H x$ landing on the axis. Should make visually obvious that zeros appear from alignment, not force.
Numeric example step 2. The values of $r_{22}$ , $r_{23}$ , $r_{33}$ , and $v_{2} (2)$ produced by the $2 \times 2$ subblock update should be carried through exactly and cross-checked against the output of the implementation.
Concept map SVG. The SVG described in Part 6 needs to be built and embedded.
Invariant ordering audit. Confirm that no invariant uses a term introduced in a later invariant. Current known issue: check that the ordering of 13–24 is strict (definitions before use).
