We want to formalize the notion of uncertainty. Consider two experiments. In the first, you flip a fair coin. In the second, you roll a fair dice. It is reasonable to say that the outcome of the second experiment is more uncertain than the outcome of the first, simply because there are more possibilities to choose from.
Consider the uncertainty (whatever that is) of a random variable
which takes on only the distinct values
, ...,
with non-zero probabilities
, ...,
, resp., where
.
The uncertainty of
should not depend on the values
of
but only on the probabilites
,
.
Likely events should be less uncertain than rare events.
With this motivation, we present Claude Shannon's
definition.
It is intuitively obvious that when the channel creates a lot of errors then there is a limitation to the information which can be sent. The next definition makes this idea more precise.
In the case of the binary symmetric channel, the capacity is
. The minimum capacity is when
.
To justify the formula which defines the capacity, we need Shannon's fundamental theorem of information theory (also called the noisy channel theorem). It will be stated in section §3.3.2.
(a) What is the probability of exactly
one error? (Ans:
. Why?)
(b) What is the probability of exactly
two errors? (Ans:
. Why?)
(c) What is the probability of exactly
errors?