
A simple averaging technique to supplement the Bayes equation


Main article: Aggregation techniques

Background

As we saw previously, the Bayes equation can easily be misapplied to situations that are not based on rigorous probabilistic studies. The example given was along the lines of 100 people who are unsure whether it will be sunny or cloudy tomorrow, either because they individually don't know (P = 50%) or because their probabilities cancel out to 50%. The 101st person, however, is certain that it will be sunny. According to Bayes, this gives you 100% certainty that tomorrow will be sunny, because a 50% opinion, no matter how it is arrived at, simply has no influence.
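This behavior follows if the Bayes equation is applied in its product (odds) form with a uniform prior, an assumption made here because it reproduces exactly the behavior described: the hundred 50% opinions cancel out of the product, and the single 100% opinion forces the result to 1:

$$P_b = \frac{\prod_i p_i}{\prod_i p_i + \prod_i (1 - p_i)} = \frac{0.5^{100} \cdot 1}{0.5^{100} \cdot 1 + 0.5^{100} \cdot 0} = 1$$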

Another, simpler example: 10 people give you a 60% chance of rain tomorrow and thereby, via the Bayes equation, lead you to believe with near certainty that it will rain. The problem here is that the 10 people are not conducting independent tests (or simulations) to judge the probability of rain. More likely, they are simply repeating the single weather report they all watched on TV.
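Working the arithmetic through with the same product form, the ten 60% estimates compound to near certainty:

$$P_b = \frac{0.6^{10}}{0.6^{10} + 0.4^{10}} \approx \frac{0.00605}{0.00605 + 0.000105} \approx 0.983$$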

It is clear that Bayes cannot be used in cases where the source offers no better than a hand-waving estimate of probability arising from a casual opinion. Since this will happen in a large number of cases, we need a more realistic way to combine these opinions.

One way is to simply average the probabilities given by each source. For the 10 people who watched the same weather report, the average is then 60%, reflecting only the single source from which they obtained their information.

So now we would have two methods for combining probabilities: simple averaging and Bayes. It would be left to the user to choose which of these to use.

A weighting factor between simple averaging and Bayes

But these two choices seem like opposite endpoints on a continuum: on one end we have Bayes for rigorous probabilistic tests, and on the other we have simple averaging for the most unrigorous opinions. Unlike Bayes, the averaging technique provides no reinforcement (i.e., the averaged $P_a$ can never be higher than the highest input $p_i$). But it seems like a large crowd should provide some reinforcement. If 10 people say 60%, isn't that sometimes better than a single person saying 60%? What if there were two independent weather predictions that each put the chance of rain at 60%, and the 10 people were divided between these two groups? Then you could apply Bayes to two sources at 60% and get a reinforced result of 69%, as worked out below.
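Using the same product form as above for the two independent 60% sources:

$$P_b = \frac{0.6 \times 0.6}{0.6 \times 0.6 + 0.4 \times 0.4} = \frac{0.36}{0.52} \approx 0.69$$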

A straightforward way to do this is to simply have a user-chosen weighting factor between simple averaging and Bayes:

$$P_c = (1 - w)\,P_a + w\,P_b$$

where:

$P_c$ is the combined probability

$P_a$ is the simple-averaged probability

$P_b$ is the Bayesian combined probability

$w$ is the weighting toward Bayes. If $w = 0$ then only simple averaging is used. If $w = 1$ then the algorithm uses Bayes only.
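For illustration, take the hypothetical rain numbers from above ($P_a = 0.6$ from simple averaging, $P_b \approx 0.983$ from Bayes) with a weighting of $w = 0.2$:

$$P_c = (1 - 0.2)(0.6) + (0.2)(0.983) \approx 0.68$$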

The input probabilities for $P_a$ and $P_b$ are the same. That is, they are modified using trust in exactly the same way (using Sapienza's equation, or using the modified forms of this equation described here).

Combining input probabilities in simple averaging

However, the input probabilities are not rolled up in exactly the same way to create $P_a$ and $P_b$. For Bayes, nodes are combined with their parent to create a $P_b$ for that parent. The $P_b$ values of the parents are then combined with their own parent to create a new $P_b$, and so on all the way up to the topmost node. For simple averaging, doing this would double-count nodes, so the technique is instead to append probabilities to a list as we work up the tree and then take the average once the topmost node is reached, dividing the sum by the number of nodes.
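A minimal sketch of the two roll-up rules in Python (the dictionary tree representation and the function names are my own, and the input probabilities are assumed to be trust-modified already):

from math import prod

def bayes_combine(ps):
    # Product (odds) form of Bayes with a uniform prior: neutral 0.5
    # inputs cancel out, and any 1.0 input forces the result to 1.
    num = prod(ps)
    return num / (num + prod(1 - p for p in ps))

def collect(node):
    # Flatten a subtree into one list of probabilities (no double-counting).
    ps = [node["p"]]
    for child in node.get("children", []):
        ps.extend(collect(child))
    return ps

def rollup_average(node):
    # Simple averaging: append everything on the way up, divide once at the top.
    ps = collect(node)
    return sum(ps) / len(ps)

def rollup_bayes(node):
    # Bayes: combine each node with its children's rolled-up values, bottom-up.
    children = node.get("children", [])
    if not children:
        return node["p"]
    return bayes_combine([node["p"]] + [rollup_bayes(c) for c in children])

The Bayes roll-up replaces each subtree with a single combined value as it works upward, while the averaging roll-up keeps every node's probability in one flat list so that no node is counted twice.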

Example

The following example shows how this works for a small tree in which Node 0 is the topmost node, Nodes 1 and 2 report to it, Nodes 3 and 4 report to Node 1, and Nodes 5 and 6 report to Node 2.

We start at the bottom and find the trust-modified probabilities for the leaf nodes. As noted above, this is no different from what we've always done.

The modified probabilities for Nodes 3 and 4 are then appended to Node 1's probability.

The modified probabilities for Nodes 5 and 6 are appended to Node 2's probability in the same way.

These lists of probabilities are modified again by trust (for the 0-1 and 0-2 connections) and appended to Node 0's own probability.

The average $P_a$ for Node 0 can now be found by dividing the sum of the full list by the number of nodes.

The Bayesian combined probability $P_b$ for this case is computed by the usual bottom-up roll-up.

If we apply a weighting factor of, say, $w = 0.2$ (20%) as discussed above, we obtain the blended probability $P_c = 0.8\,P_a + 0.2\,P_b$.

The snippet below performs this calculation and allows you to change the values and the tree configuration.
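(The original interactive snippet is not preserved here; the following is a minimal stand-in with hypothetical probability values, reusing the helper functions from the sketch above.)

# Hypothetical trust-modified probabilities for the example tree.
# Any trust re-modification along the 0-1 and 0-2 connections is
# assumed to have been applied to these values already.
tree = {"p": 0.70, "children": [
    {"p": 0.60, "children": [{"p": 0.55}, {"p": 0.65}]},  # Node 1 with Nodes 3, 4
    {"p": 0.80, "children": [{"p": 0.75}, {"p": 0.50}]},  # Node 2 with Nodes 5, 6
]}

w = 0.2                          # weighting toward Bayes
p_a = rollup_average(tree)       # simple-averaged probability
p_b = rollup_bayes(tree)         # Bayesian combined probability
p_c = (1 - w) * p_a + w * p_b    # weighted combination
print(f"P_a = {p_a:.3f}, P_b = {p_b:.3f}, P_c = {p_c:.3f}")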

This algorithm can be modified to enhance the privacy of information transmitted up the nodes.