Intuitive overview of Kolmogorov Complexity

In the mid-nineties, Ray Solomonoff and Andrey Kolmogorov independently researched and published work that ultimately lead to the foundation of Algorithmic Information Theory. While closely related, the purpose of their research was different, creating two branches: Algorithmic Probability and Algorithmic Complexity. Kolmogorov was concerned primarily with Complexity.

Kolmogorov's paper titled "Three approaches to defining Quantity of Information" proposed an alternative approach to the existing Combinatorial and Probabilistic ones: Algorithmic. Kolmogorov theorized that the complexity (quantity of information) of a data object is relative to the smallest program that can generate it (in a fixed language):

K(x) of a string x is the smallest k, such that there exists a representation (L,y) of x, such that:

|(L,y)|≤ k, where L is a fixed language and y is a representation of x in language L.

It is possible to fix the language to be a Turing-complete, general purpose language and perform an analysis in that context.

Given two strings of length 100:

  1. A finite sequence of repeating numbers: 01460146...0146,

  2. A statistically random finite sequence of numbers: 82095163...2472,

a 43 byte program can be written in Python to generate the first string:

''.join([str(2*(i%4)) for i in range(100)])

However, there is no program that can generate the second string whose length is shorter than the string literal representation. That is, it cannot be compressed. An incompressible string is one whose K(x) equals the length of the string itself.

Most strings are either highly complex, or incompressible. The probability that a string of length n is incompressible by c is at least 1 − 2 − c + 1 + 2 − n.

For any given string length there exist incompressible strings, which means that universal lossless compression is impossible.

It is natural to consider what influence the choice of language has on the result of K(x). The Invariance Theorem proves that it bears no meaningful impact for theoretical analysis. The difference is constant between the optimal language description of the data object and that of a description in another language. Unfortunately, K(x) is uncomputable, therefore it is impossible to know what the optimal compression is for a data object.

The theory of Kolmogorov's complexity has interesting philosophical implications for the concept of "randomness". It is commonly understood that programs (without an entropy source exhibiting quantum randomness, e.g., radioactive decay) can generate only pseudorandom data. This can be intuitively derived if to consider the Kolmogorov complexity of the output of such a program. A program is known to exist that can generate that output, therefore it is not Kolmogorov-random. It might be statistically random, though, depending on the implementation.

Postscript

The incompressibility insight can be applied to simplify a variety of proofs, using a method named The Incompressibility Method. The book "An Introduction to Kolmogorov Complexity and Its Applications" by Ming Li and Paul Vitaniy is a seminal text which delves into applying this proof method.

Previous
Previous

In praise of Sony Digital Paper