Skip to content

Information Theory

WARNING

Most of these notes are from Claude. Review them carefully after you're done.

NOTATION

  • All logarithms (log) and exponentials (exp) will be in base-2 (unless explicitly specified otherwise) since a "natural" unit of information is bits.
  • Sets will be denoted with calligraphic letters, e.g. X.
  • Vectors will be denoted with underlines, e.g. x.
  • Random variables will be denoted by capital letters, e.g. X. So, combined with the previous convention, random vectors will be denoted by underlined capital letters, e.g. X.
  • Notation such as pX() references the probability mass function (p.m.f.) of random variable X. So pX(i) equals the probability that the random variable X takes the value i.
  • For any vector vRn, the 1 norm v1=i=1n|vi|.
  • We denote the length of a vector v as len(v)
  • is for definitions. So ab means a is defined to be b.
  • is approximate. So ab is the same as ab.
  • When given a vector, say (X1,...,Xn), if we want to reference only a (consecutive) subset of its components, say Xi,Xi+1,...,Xj1,Xj, we denote this as Xij. By convention, Xn should be taken as meaning X1n.
  • "iff" means "if and only if"

Table of Contents: