Probablity Theory
A key concept in the field of pattern recognition is that of uncertainity. It arrises through:
- noise on measurements
- finite size of datasets.
Probability theory provides a consistent framework for the quantification and manipulation of uncertainity and forms one of the central foundations of Pattern Recognition.
When combined with Decision Theory, it allows us to make optimal predictions given all the informtion available to us, even though that information may be incomplete or ambiguous.
Example: Red/Blue boxes
The setup
Imagine we have two boxes red_box
and blue_box
.
Each box contains different number of fruits oranges
and apples
.
1 2 3 4 5 6 |
|
The Experiment
Now, we do the following steps:
- Select a box: Randomly pick one of the boxes (either
red_box
orblue_box
) - Pick a fruit: Then, randomly select an item of fruit for the box.
- Replace the fruit: Having observed the type of the picked fruit (
apple
ororange
), now, replace it in the box from which it came. - Repeat steps 1-to-3 multiple times.
The Pre-conditions / Assumptions
Now, let's suppose in doing the above experiment,
- we pick the
red_box
40% of the times and theblue_box
60% of the times. - Also, assume that that when we remove an item of fruit from a box, we are equally likely to select any of the pices of fruits in the box.
The Theory
In this experiemnt:
- the identity of the box to be chosen is a random variable \(B\) which can take
two possible values
r
andb
(corresponding to red or blue boxes). - Similarly, the identity of fruit is also a random variable \(F\) and it can
take either of the values
a
ando
(corresponding to apple an dorange respectively).
Definition:
The Probability of an event is defined as the fraction of times, that event occurs out of the total number of trials (in the limit that the total number of trials goes to infinity). All probabilities must lie in the interval [0, 1]
Thus , the probability of selecting the red_box
is \(4/10\) and that of the blue_box
is
\(6/10\).