
But what is a convolution?

Below is a short summary and detailed review of this video, written by FutureFactual:

Convolution Demystified: Intuition, Visuals, and a Fast FFT-Based Algorithm

Convolution explained in plain terms

This video unpacks convolution as a fundamental operation for combining two lists or functions, showing how flipping, sliding, and summing products creates a new sequence. It starts with probability visuals using dice, moves through moving averages and image blurring, and ends with a fast FFT-based method that scales to large inputs. Along the way, the presenter connects discrete convolutions to polynomials and to practical computing tricks, offering intuition behind a powerful mathematical tool.

  • Convolution extends beyond simple addition and multiplication to mix two lists or functions.
  • Moving averages and Gaussian kernels illustrate smoothing and weighting effects in data and images.
  • Convolution relates to polynomials, enabling a backdoor view of polynomial multiplication via coefficient convolution.
  • FFT-based convolution reduces complexity to O(n log n), enabling efficient large-scale computation.

Overview

The video introduces convolution as a truly fundamental operation for combining two sequences or functions. Rather than simply adding or multiplying term by term, convolution blends the two sources by reversing one and overlaying it across the other, summing the pairwise products at each alignment. This perspective is motivated through probability before turning to signal processing and polynomials. The central idea is that convolution creates a new sequence whose nth term is a sum of products where the indices sum to n. This simple rule explains a wide range of phenomena from probability distributions to image processing kernels.
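
To make that index-sum rule concrete, here is a minimal sketch of discrete convolution in plain Python; the function name `convolve` and the example lists are illustrative, not taken from the video:

```python
def convolve(a, b):
    """Discrete convolution: out[n] is the sum of a[i] * b[j] over all i + j == n."""
    out = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out

print(convolve([1, 2, 3], [4, 5, 6]))  # [4, 13, 28, 27, 18]
```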

Visualizing Convolution

One vivid visualization uses rolling dice probabilities. By flipping and sliding the probability arrays relative to one another, different sums line up along diagonals in a grid of pairwise products. Offsetting the second array reveals all combinations that yield a given sum. This diagonal summation of products is exactly a convolution. The same operation can be imagined as a sliding window over the data, where the kernel weights the values inside the window before they are summed. This creates a new data sequence that reflects how the two inputs interact across positions.
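
As a sketch of the dice picture, convolving a fair die's probability array with itself yields the distribution of the sum of two dice; here numpy's `np.convolve` stands in for the grid-of-products construction:

```python
import numpy as np

die = np.full(6, 1 / 6)           # probability of each face of a fair die
sum_dist = np.convolve(die, die)  # distribution of the sum of two dice

for total, p in enumerate(sum_dist, start=2):  # sums run from 2 to 12
    print(f"P(sum = {total:2d}) = {p:.4f}")    # peaks at 7, as expected
```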

Discrete Case and Notation

The convolution of two sequences A and B is denoted with an asterisk, A * B. The nth term is a sum over all index pairs whose sum equals n: (A * B)[n] = Σ over i + j = n of A[i]·B[j]. The video walks through a concrete example with small lists to illustrate how the partial sums are formed and how the output grows in length relative to the inputs. The core takeaway is that convolution is a structured sum of products governed by index sums, and flipping the second input falls naturally out of this definition rather than being an arbitrary choice.
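
A worked trace of those partial sums, using the same illustrative lists as the sketch above (note the output length is len(a) + len(b) − 1):

```python
import numpy as np

a = [1, 2, 3]
b = [4, 5, 6]

# Each output term gathers the products whose indices sum to n:
# n=0: 1*4             = 4
# n=1: 1*5 + 2*4       = 13
# n=2: 1*6 + 2*5 + 3*4 = 28
# n=3: 2*6 + 3*5       = 27
# n=4: 3*6             = 18
print(np.convolve(a, b))  # [ 4 13 28 27 18], length 3 + 3 - 1 = 5
```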

Moving Averages and Kernels

Convolution can implement moving averages: if the second list sums to one, the convolution at each position is a weighted average of nearby data points. Different kernels produce different smoothing effects; a uniform kernel yields a basic moving average, while a Gaussian kernel yields a weighted blur that emphasizes the center. The video demonstrates 2D convolution by applying kernels to images, yielding blurring and other effects such as edge detection or sharpening depending on the kernel design. Kernels are the heart of many image processing pipelines and even connect to the idea of convolutional neural networks where kernels are learned from data.
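
A sketch of both kinds of kernel in one dimension; the window size and Gaussian width here are arbitrary choices for illustration, not values from the video:

```python
import numpy as np

data = np.array([1.0, 4.0, 2.0, 8.0, 5.0, 7.0, 3.0])

# Uniform kernel summing to one: a plain three-point moving average.
uniform = np.full(3, 1 / 3)
print(np.convolve(data, uniform, mode="same"))

# Gaussian kernel summing to one: a weighted blur that favors the center.
x = np.arange(-2, 3)
gaussian = np.exp(-(x ** 2) / 2)
gaussian /= gaussian.sum()
print(np.convolve(data, gaussian, mode="same"))
```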

Convolution and Polynomials

A second viewpoint connects convolution to polynomial multiplication. If you treat input sequences as polynomial coefficients, convolving them corresponds to multiplying the polynomials and then collecting like terms. This perspective foreshadows the FFT approach by linking discrete convolution to polynomial arithmetic, hinting at why fast algorithms for convolution exist beyond naive O(n^2) methods.
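
A quick sketch of that correspondence, with coefficients listed lowest degree first (the polynomials are made up for illustration):

```python
import numpy as np

# (1 + 2x) * (3 + 4x + 5x^2) = 3 + 10x + 13x^2 + 10x^3
p = [1, 2]     # coefficients of 1 + 2x
q = [3, 4, 5]  # coefficients of 3 + 4x + 5x^2
print(np.convolve(p, q))  # [ 3 10 13 10]
```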

The Fast Way: FFT Convolution

The presenter then explains how the fast Fourier transform (FFT) can compute convolutions in O(n log n) time. By evaluating the polynomials (treating the inputs as coefficients) at special points on the unit circle, multiplying pointwise, and then applying an inverse transform, convolution becomes a cheap operation. This trick is the essence of the FFT-based approach and is widely used in large-scale signal processing, image processing, and scientific computing. Because the inverse transform is just as fast, recovering the output coefficients from the pointwise products costs no more than the forward step, giving a fast path through the whole pipeline.
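
A sketch of that transform-multiply-invert recipe using numpy's real FFT; zero-padding to the full output length is an implementation detail assumed here, not a step spelled out in the video:

```python
import numpy as np

def fft_convolve(a, b):
    """Convolve by evaluating both inputs (as polynomials) on the unit circle,
    multiplying pointwise, and inverting the transform to get coefficients."""
    n = len(a) + len(b) - 1        # length of the full convolution
    fa = np.fft.rfft(a, n)         # forward transform, zero-padded to length n
    fb = np.fft.rfft(b, n)
    return np.fft.irfft(fa * fb, n)

print(fft_convolve([1, 2, 3], [4, 5, 6]))  # ~[ 4. 13. 28. 27. 18.]
print(np.convolve([1, 2, 3], [4, 5, 6]))   # exact result, for comparison
```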

Practical Contexts and Extensions

Beyond probability distributions, convolution appears in many algorithms, including image blurring, edge detection, and even the inner workings of neural networks. The video also touches on the mathematical relationship between convolution and polynomial multiplication, clarifying why convolution is richer than a simple term-by-term product. It emphasizes the practical advantage of FFT-based convolution: a dramatic speedup that makes large-scale data analysis feasible. It also hints at the continuous case and other applications to be explored in a subsequent part.

Takeaways and Homework

The takeaway is that convolution is a flexible and ubiquitous operation that arises naturally from both discrete and continuous perspectives. Its connections to probability, statistics, image processing, and polynomial algebra illuminate why the operation is so central in mathematics and computer science. As a practical note, the FFT enables fast computation by exploiting redundancy in the transform domain, turning a potentially quadratic task into a near-linear one in the size of the input.

Impact and Related Concepts

Understanding convolution not only clarifies how data is blended over space and time but also uncovers why Gaussian blur often produces natural-looking results in photography and imaging. It also provides a foundation for more advanced topics such as convolutional neural networks, where learning kernels from data becomes a powerful tool for feature extraction. The interconnections across probability, numerical methods, and signal processing showcase the unifying power of convolution as a core mathematical construct.

To find out more about the video and 3Blue1Brown, go to: But what is a convolution?