Fast Fourier Transform

Explore the most important algorithm in signal processing: how roots of unity enable O(n log n) polynomial multiplication

O(n log n) Polynomial Multiplication

The Fast Fourier Transform is one of the most important algorithms ever discovered. It transforms signals between time and frequency domains in O(n log n) instead of O(n²), revolutionizing signal processing, image compression, and polynomial arithmetic.

The key insight: evaluating a polynomial at the nth roots of unityhas special structure. Because ω^k+n/2 = -ω^k, we can split the problem in half at each step — the classic divide-and-conquer pattern.

In this section, you'll explore roots of unity on the complex plane, watch the butterfly diagram unfold, and see how FFT makes convolution efficient.

Demo 1: Roots of Unity

The nth roots of unity are complex numbers ω where ωⁿ = 1. They sit evenly spaced on the unit circle and satisfy remarkable properties: the cancellation, halving, and summation lemmas that make FFT possible.

n (roots):

k (power):

Selected: ω₈⁰

Value: 1.00

Angle: 0.0°

Formula: e^2πi·0/8 = cos(0°) + i·sin(0°)

All 8th Roots of Unity

ω⁰ = 1.00

ω¹ = 0.71 + 0.71i

ω² = 1.00i

ω³ = -0.71 + 0.71i

ω⁴ = -1.00

ω⁵ = -0.71 - 0.71i

ω⁶ = -1.00i

ω⁷ = 0.71 - 0.71i

Cancellation Lemma

ω_dn^dk = ω_n^k for any positive integer d

With d = 2:

ω₁₆⁰ = 1.00

ω₈⁰ = 1.00

✓ Equal!

This lemma lets us relate roots of different orders, crucial for FFT recursion.

Why Roots of Unity Matter

Evaluating a degree-(n-1) polynomial at all n roots of unity takes O(n²) naively. But the roots have special structure: ω^k+n/2 = -ω^k. This means half the evaluations share work with the other half, giving O(n log n)!

Demo 2: Coefficient vs Point-Value

A polynomial can be represented as coefficients [a₀, a₁, ...] or as point-values [(x₀, y₀), (x₁, y₁), ...]. Multiplication is O(n²) in coefficient form but O(n) in point-value form. FFT converts between them in O(n log n)!

Polynomial A (coefficients, lowest degree first):

A(x) = 1 +2x +3x^2

Polynomial B (coefficients):

B(x) = 1 +x

Naive O(n²) Multiplication

Multiply each coefficient of A with each coefficient of B, then collect like terms.

A × B:

1·x⁰ × 1·x⁰ = 1·x⁰

1·x⁰ × 1·x¹ = 1·x¹

2·x¹ × 1·x⁰ = 2·x¹

2·x¹ × 1·x¹ = 2·x²

3·x² × 1·x⁰ = 3·x²

3·x² × 1·x¹ = 3·x³

Operations: 6

FFT O(n log n) Multiplication

Transform to point-value form, multiply pointwise, transform back.

1. FFT(A) → point-values

2. FFT(B) → point-values

3. Multiply pointwise

4. Inverse FFT → coefficients

Operations: ~72

Result: A(x) × B(x)

1 +3x +5x^2 +3x^3

Coefficients: [1, 3, 5, 3]

✓ Naive and FFT results match

Complexity Comparison

Operation

Coefficient

Point-Value

Addition

O(n)

Multiplication

O(n²)

O(n)

Evaluation

O(n) per point

O(1) per point

Conversion (FFT)

O(n log n)

Demo 3: The Butterfly Diagram

The Cooley-Tukey FFT algorithm uses butterfly operations that combine pairs of values using twiddle factors (powers of ω). Step through the iterative FFT and watch how data flows through log₂(n) stages.

Input sequence:

n = 4, log₂(n) = 2

Stage 0 / 2

Bit-Reverse Permutation

First, reorder input by reversing the binary representation of each index.

Index	Binary	Reversed	New Index	Value
0	00	00	0	1.00
1	01	10	2	2.00
2	10	01	1	3.00
3	11	11	3	4.00

Butterfly Diagram

Add path

Subtract path

Multiply by twiddle factor

Why O(n log n)?

There are log₂(n) = 2 stages. Each stage performs n/2 = 2 butterfly operations. Total: 2 × 2 = 4 operations = O(n log n).

Demo 4: Convolution via FFT

Convolution in time = multiplication in frequency. Instead of O(n²) direct convolution, we FFT both signals, multiply pointwise, then inverse FFT — all in O(n log n). This powers audio processing, image filters, and neural network layers.

Signal:

Kernel:

Kernel size:5

Input Signal (sine)

Kernel (gaussian)

Convolution Result

Convolution Formula

(f * g)[n] = Σ_k f[k] · g[n - k]

Gaussian blur: Smooths the signal by weighted averaging with a bell-shaped kernel.

The Convolution Theorem

Convolution in time = multiplication in frequency.
Instead of O(n²) direct convolution, we can: FFT both signals, multiply pointwise, then inverse FFT — all in O(n log n). This is the foundation of modern signal processing.

Demo 5: Number Theoretic Transform

The NTT is FFT over finite fields — using roots of unity mod a prime instead of complex numbers. No floating-point errors! NTT powers big integer multiplication, cryptography (lattice-based schemes), and competitive programming.

Number Theoretic Transform (NTT)

FFT over finite fields instead of complex numbers. NTT avoids floating-point errors entirely, making it perfect for exact integer arithmetic, cryptography, and competitive programming.

Prime modulus:

Why These Primes?

p = 998,244,353 has a special form: p = k·2^m + 1

This means (p-1) is divisible by large powers of 2, allowing NTT of size up to 2²³.

g = 3 is a primitive root mod p, meaning g generates the multiplicative group Z_p*.

8th root of unity ω = g^(p-1)/8 mod p:

ω = 372528824

Powers of ω (should cycle back to 1):

ω⁰=1ω¹=372528824ω²=911660635ω³=488723995ω⁴=998244352ω⁵=625715529ω⁶=86583718ω⁷=509520358

Polynomial A (integer coefficients):

Polynomial B (integer coefficients):

NTT Multiplication Result

[1, 3, 5, 3]

Exact integer multiplication with no floating-point errors!

FFT vs NTT Comparison

Property	FFT	NTT
Domain	Complex numbers	Integers mod p
Roots of unity	e^2πi/n	g^(p-1)/n mod p
Precision	Floating-point errors	Exact (modular)
Size constraint	Power of 2	n \| (p-1)
Use cases	Signal processing	Cryptography, big integers

NTT Algorithm

1. Choose prime p = k·2^m + 1 where n ≤ 2^m

2. Find primitive root g mod p

3. Compute ω = g^(p-1)/n mod p (nth root of unity mod p)

4. Apply same Cooley-Tukey algorithm, but all arithmetic is mod p

5. Inverse NTT uses ω^-1 = g^p-1-(p-1)/n mod p

Applications

Big integer multiplication: Multiply million-digit numbers exactly
Polynomial arithmetic: GCD, interpolation, division
Lattice cryptography: NTRU, Ring-LWE use NTT for efficiency
Error-correcting codes: Reed-Solomon encoding

FFT Mastered!

You now understand one of the most transformative algorithms:

Roots of unity: ωⁿ = 1, evenly spaced on unit circle
Halving lemma: Squares of n roots = n/2 roots
Butterfly operations: Combine pairs with twiddle factors
Convolution theorem: Multiply in frequency domain
NTT: Exact arithmetic over finite fields

Next: We'll explore advanced tree structures — B-trees for disk optimization, persistent trees for version control, and van Emde Boas trees for O(log log U) integer operations.