Straight average algorithm with continuous input distributions, complex trust, and intermediate results: Difference between revisions

Content deleted Content added

Inline

Revision as of 17:56, 26 September 2024

Main article: [[Aggregation techniques]]

In this previous post we discussed an algorithm for modifying probabilities using a more complex trust factor, one involving random lying, bias, and biased lying. We also discussed a simple case of combining continuous distributions using Bayes, straight averaging and trust-weighted averaging. However, the continuous distributions in this case did not use the complex trust factor.

In this post from last week we discussed a way to modify probabilities using the complex trust factor with continuous distributions. Doing so requires that we break up the continuous distribution into the set of choices being offered in the question (eg rainy, cloudy, sunny would be 3 choices) and apply the lying and biased lying portion of the trust modification to each choice accordingly.

None of this previous work, however, allowed us to calculate a full multi-level tree of nodes where we use intermediate_results to transfer calculations up to the parent node so the calculation can continue. This post is about an algorithm that rectifies this shortcoming and yields the ability to do straight averaging with complex trust, continuous input distributions, and multi-level calculations.

The algorithm, at its essence, keeps track of each node’s a) personal opinion and b) the sum of the opinions of all its children (after they have been modified by trust). These two pieces of information become the intermediate_results which are used to calculate the average for that node and are passed up to the parent node. Each parent, in turn, sums all its children’s intermediate_results together to form, along with its personal opinion, its own intermediate_results.

First, let’s start by reminding ourselves of how intermediate_results is constructed in this context:

[personal opinion, sum of child opinions, population of children]

Since we are dealing with continuous distributions, we include the x-values of the points in this list:

[[x points of personal opinion, pdf points of personal opinion], [xpoints of child opinions, sum of pdf points of child opinions], population of children]]

The following is an example of this:

inter_results012 =  [[[0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0], [2.00051038563241, 2.00051038563241, 2.00051038563241, 2.00051038563241, 2.00051038563241, 2.00051038563241, 1.5350918071000005e-06, 1.5350918071000005e-06, 1.5350918071000005e-06, 1.5350918071000005e-06, 1.5350918071000005e-06]], [[0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.0], [5.821628442333012, 5.821628442333012, 5.821628442333012, 5.821628442333012, 5.821628442333012, 5.821628442333012, 6.144719133017774, 6.144719133017774, 6.144719133017774, 6.144719133017774, 6.144719133017774]], 6]

Let’s also remind ourselves of the complex trust_factor which is broken up into a judgement trust and a communication trust:

trust_factor = [ [0.8, 0.1, 0.0, 0.02, 0.03, 0.02, 0.03],    #Tj, judgement trust
                 [0.9, 0.1, 0.0, 0.0, 0.0, 0.0, 0.0] ]  #Tc, communication trust

Here there are two choices (say rainy and sunny) so each Tj and Tc could be described as:

[Ttruth, Trandom, Tlierandom, Tliebias_rainy, Tliebias_sunny, Tbias_rainy, Tbias_sunny]

This is discussed at some length in Exercising the algorithms.py interface with more complex data types

The new algorithm is encapsulated in the function straight_ave_points_continuous_complex_trust_intermediate which looks like the following and is available in the following snippet:

@register_algorithm()
def straight_ave_points_continuous_complex_trust_intermediate(data: AlgorithmInput):
    convert_functions_to_points_continuous(data)
    set_initial_intermediate_results_sapccti(data)
    modify_prob_data_for_continuous_complex_trust_intermediate(data)
    sum_kids_intermediate_results_and_putinto_parent(data)
    x_list, Pdave_list = ave_points_continuous_complex_trust_intermediate4(data)
    opinion_output = OpinionData([x_list, Pdave_list], 1)
    intermediate_results = data.components[0].intermediate_results #not really necessary
    algout = AlgorithmOutput(opinion_output, intermediate_results)
    return algout

At this outermost level the algorithm is composed of 5 basic steps, the first five function calls in the script above:

Converting input continuous distributions (provided by a Weibull distribution) to discrete points. This is governed by the misc_input parameter, particularly the number of points, n, desired. The function convert_functions_to_points_continuous handles this.
Setting initial intermediate_results to zero for leaf nodes. These nodes only have personal opinions. However, by forcing them to have intermediate_results of the same form as all other nodes, the calculations can be applied consistently to all nodes (with no need for special casing the leaf nodes). The function set_initial_intermediate_results_sapccti handles this.
Modifying the probability data using complex trust as discussed in this previous post. modify_prob_data_for_continuous_complex_trust_intermediate handles this.
Take the sum of all the children intermediate_results and place this result as part of the parent intermediate_results (along with the parent personal opinion). This is done in sum_kids_intermediate_results_and_putinto_parent.
Use the result of 4 to take the straight average for the node, which consists of its personal opinion summed with the intermediate_results of all the children. The function ave_points_continuous_complex_trust_intermediate4 handles this. Incidentally, the 1-3 variants of this are earlier versions of the function which are not important to this discussion. They are kept for reference since they have properties of interest for future calculations.

The function convert_functions_to_points_continuous takes continuous functions given by a Python function name, eg:

def weibull(k, lamb, x):
    w = (k/lamb)*(x/lamb)**(k-1.0)*math.exp(-(x/lamb)**k)
    return w

def Pd0(x):
    k = 3.0
    lamb = 0.2
    return weibull(k, lamb, x)

and converts them to a discrete set of (x, p) points depending on the misc_input, eg:

misc_input = {'xmin': 0.0, 'xmax': 1.0, 'n': 10}

The continuous function is sampled at each of the n intervals between xmin and xmax to create the corresponding set of discrete points. If the input is already a set of discrete points, convert_functions_to_points_continuous does nothing.

The function set_initial_intermediate_results_sapccti provides an empty set of intermediate_results to nodes (leaf nodes) that do not already have them:

def set_initial_intermediate_results_sapccti(data):
    for comp in data.components:
        if(comp.intermediate_results == []):
            x_points_zero = comp.opinion.pdf_points[0]
            pdf_points_zero = [0.0]*len(x_points_zero)
            comp.intermediate_results = [[comp.opinion.pdf_points[0], comp.opinion.pdf_points[1]], [x_points_zero, pdf_points_zero], 0]

The last line in the above provides the intermediate_results for a particular node, that is the (x, p) points for its personal opinion, the (x, p) points of the nonexistent children where p is a list of zeros, and the population of the children which, in this case, is zero.

modify_prob_data_for_continuous_complex_trust_intermediate is an outer function for doing the probability modification using the complex trust. It breaks up each node’s intermediate_results into its corresponding personal opinion and the summed opinion of its children. The personal opinion is then modified using the judgement trust portion of the trust_factor and the summed children opinions are modified using the communication portion of the trust_factor. Since in most cases the population of the children will be greater than 1, the summed probability of the children is divided by this population to create an average probability for the trust-modification. After modification, the result is multiplied by the population to recreate the sum for continued calculation:

def modify_prob_data_for_continuous_complex_trust_intermediate(data):
    for comp in data.components:
        # personal opinion
        xp_points_personal = comp.intermediate_results[0]
        x_points_personal = xp_points_personal[0]
        pdf_points_personal = xp_points_personal[1]
        T_personal = comp.trust_factor[0] #use Tj
        comp.intermediate_results[0][1] = calc_pdfmod_from_complex_trust_intermediate(x_points_personal, pdf_points_personal, T_personal)
        # computed opinion of children
        xp_points_children = comp.intermediate_results[1]
        x_points_children = xp_points_children[0]
        pdf_points_children = xp_points_children[1] #this is really a sum
        T_children = comp.trust_factor[1] #use Tc
        Npop = comp.intermediate_results[2]
        #Need to modify the function to handle Npop > 1 PM_090623
        if(Npop == 1):
            comp.intermediate_results[1][1] = calc_pdfmod_from_complex_trust_intermediate(x_points_children, pdf_points_children, T_children, Npop)
        elif(Npop > 1):
            #account for the fact that we have a sum here, not an individual probability
            pdf_points_children = (np.array(pdf_points_children) / Npop).tolist()
            inter_results = calc_pdfmod_from_complex_trust_intermediate(x_points_children, pdf_points_children, T_children, Npop)
            inter_results = (np.array(inter_results) * Npop).tolist()
            comp.intermediate_results[1][1] = inter_results
        else: #(Npop == 0)
            continue

Within this function another function, calc_pdfmod_from_complex_trust_intermediate is called to perform the actual modification calculation. This follows exactly the description set out in the last post starting at the section titled “Trust modification for continuous algorithms using complex trust factors”.

The function sum_kids_intermediate_results_and_putinto_parent goes through each child node and adds up all their intermediate_results. This sum then becomes this parent’s child portion of its intermediate_results, alongside its personal opinion:

def sum_kids_intermediate_results_and_putinto_parent(data):
    #parent is the first one (index 0) and kids are the rest
    x_points = data.components[0].opinion.pdf_points[0]
    kids = data.components[1:]
    tot_kids = [0.0]*len(x_points)
    tot_npop_kids = 0
    for kid in kids:
        sum_kid = addlists(kid.intermediate_results[0][1], kid.intermediate_results[1][1])
        npop_kid = 1 + kid.intermediate_results[2]
        tot_kids = addlists(tot_kids, sum_kid)
        tot_npop_kids = tot_npop_kids + npop_kid
    #put this into the parent intermediate result
    data.components[0].intermediate_results[1][1] = tot_kids
    data.components[0].intermediate_results[2] = tot_npop_kids

Given a finalized set of intermediate_results, the only remaining task is to take the average for the node which is done in ave_points_continuous_complex_trust_intermediate4. This is fairly straightforward since it only involves adding up the node’s personal opinion with the already calculated sum of its children and dividing by the total population:

def ave_points_continuous_complex_trust_intermediate4(data):
    inter_results_parent = data.components[0].intermediate_results
    x_data = inter_results_parent[0][0]
    p_parent = inter_results_parent[0][1]
    p_sum_kids = inter_results_parent[1][1]
    npop_kids = inter_results_parent[2]
    p_sum = addlists(p_parent, p_sum_kids)
    npop = npop_kids + 1
    p_ave = (np.array(p_sum) / npop).tolist()
    return [x_data, p_ave]

Revision as of 14:32, 25 September 2024 view source Lembot (talk \| contribs) associate, Bots 44 edits m Pywikibot 9.3.1 ← Older edit		Revision as of 17:56, 26 September 2024 view source Pete (talk \| contribs) associate 2,150 edits No edit summary Newer edit →
Line 1: {{Main\|[[Aggregation techniques\|Aggregation techniques]]}} In [[~~Internal:FromGitlab/~~Exercising_the_algorithms_py_interface_with_more_complex_data_types\|this previous post]] we discussed an algorithm for modifying probabilities using a more complex [[trust]] factor, one involving random [[Deception\|lying]], [[Bias\|bias]], and biased lying. We also discussed a simple case of combining [[Continuous distribution\|continuous distribution]]s using [[Bayes' theorem\|Bayes]], straight averaging and [[Trust-weighted averaging\|trust-weighted averaging]]. However, the continuous distributions in this case did not use the complex [[Trust factor\|trust factor]]. In [[Internal:FromGitlab/Notes_on_setting_up_&_using_the_sandbox_and_algorithmic_improvements\|this post from last week]] we discussed a way to modify probabilities using the complex trust factor with continuous distributions. Doing so requires that we break up the continuous distribution into the set of choices being offered in the question (eg rainy, cloudy, sunny would be 3 choices) and apply the lying and biased lying portion of the [[Trust\|trust]] modification to each choice accordingly. Line 7: None of this previous work, however, allowed us to calculate a full multi-level tree of nodes where we use <code>intermediate_results</code> to transfer calculations up to the parent node so the calculation can continue. This post is about an algorithm that rectifies this shortcoming and yields the ability to do straight averaging with complex trust, continuous input distributions, and multi-level calculations. The algorithm, at its essence, keeps track of each node’s a) [[Personal opinion\|personal opinion]] and b) the sum of the [[Opinion\|opinion]]s of all its children (after they have been modified by trust). These two pieces of information become the <code>intermediate_results</code> which are used to calculate the average for that node and are passed up to the parent node. Each parent, in turn, sums all its children’s <code>intermediate_results</code> together to form, along with its personal [[opinion]], its own <code>intermediate_results</code>. First, let’s start by reminding ourselves of how <code>intermediate_results</code> is constructed in this context: