Exemplos de DecisionTrees SetHelper.missingDataFraction em C# (CSharp)

Linguagem de programação: C# (CSharp)

Espaço para nome / nome do pacote: DecisionTrees

Classe / Tipo: SetHelper

Método / Função: missingDataFraction

Exemplos em hotexamples.com: 2

DecisionTrees SetHelper.missingDataFraction em C# (CSharp) - 2 exemplos encontrados. Esses são os exemplos do mundo real mais bem avaliados de DecisionTrees.SetHelper.missingDataFraction em C# (CSharp) extraídos de projetos de código aberto. Você pode avaliar os exemplos para nos ajudar a melhorar a qualidade deles.

Métodos Frequentes

Exibir Ocultar

mostCommonClassifier(4)

hasUniformClassifier(2)

missingDataFraction(2)

subset_errors(2)

all_leaf_children(1)

attributePossibilities(1)

subsetOnAttributeContinuous(1)

subsetOnAttributeNominal(1)

total_subset_of_data_locations(1)

Métodos Frequentes

mostCommonClassifier (4)

hasUniformClassifier (2)

missingDataFraction (2)

subset_errors (2)

all_leaf_children (1)

attributePossibilities (1)

subsetOnAttributeContinuous (1)

subsetOnAttributeNominal (1)

total_subset_of_data_locations (1)

Exemplo n.º 1

0

Exibir arquivo

Arquivo: Calculator.cs Projeto: MaxGroot/AlgorithmVerbalization

public static double gainRatio(List <DataInstance> S, string wanted_attribute, string targetAttribute, List <string> possible_values) { // Adjust for missing data double missingFraction = SetHelper.missingDataFraction(S, wanted_attribute); double gain_result = gain(S, wanted_attribute, targetAttribute, possible_values); double splitinfo = splitInfo(S, wanted_attribute, possible_values); if (splitinfo == 0) { return(0); } double ret = gain_result / splitinfo; return(ret); // return (1 - missingFraction) * gain(S, wanted_attribute, targetAttribute, possible_values) / splitInfo(S, wanted_attribute, possible_values); }

Exemplo n.º 2

0

Exibir arquivo

Arquivo: Calculator.cs Projeto: MaxGroot/AlgorithmVerbalization

public static double[] best_split_and_ratio_for_continuous(List <DataInstance> S, string wanted_attribute, string target_attribute, List <double> supplied_values) { double total_set_entropy = entropy(S, target_attribute); // Sort by the wanted attribute List <DataInstance> s_sorted = S.OrderBy(o => o.getProperty(wanted_attribute)).ToList(); List <double> possible_values = null; if (supplied_values.Count == 0) { possible_values = new List <double>(); // If the supplied list of possible values is empty, we have to fill it ourselves. // The list becomes empty if the user opts to keep considering all values of an attribute, not just the possible values of subsets. // Add posisble attribute values based on instances supplied. foreach (DataInstance instance in s_sorted) { if (instance.getProperty(wanted_attribute) != null) { double my_value = instance.getPropertyAsDouble(wanted_attribute); if (!possible_values.Contains(my_value)) { possible_values.Add(my_value); } } } } else { // If we a list of possible values supplied to us, we'll use that one. // It's safe to just reference to that list since we won't make changes to it. possible_values = supplied_values; } // Loop through possible splits and calculate their gain ratios double best_split = 0; double best_split_gain = -1; double best_split_gain_ratio = -1; bool found_better_than_nothing = false; foreach (double binary_split in possible_values) { // Create subsets below or equal, and above the wanted attribute's current binary split. List <DataInstance> s_below_or_equal = S.Where(o => (o.getProperty(wanted_attribute) != null) ? o.getPropertyAsDouble(wanted_attribute) <= binary_split : false).ToList(); List <DataInstance> s_above = S.Where(o => (o.getProperty(wanted_attribute) != null) ? o.getPropertyAsDouble(wanted_attribute) > binary_split : false).ToList(); double entropy_below_or_equal = entropy(s_below_or_equal, target_attribute); double entropy_above = entropy(s_above, target_attribute); double proportion_below_or_equal = ((double)s_below_or_equal.Count()) / ((double)S.Count()); double proportion_above = ((double)s_above.Count()) / ((double)S.Count()); // Calculare gain of splitting on this binary split double gain_on_this_split = total_set_entropy - (proportion_below_or_equal * entropy_below_or_equal) - (proportion_above * entropy_above); double splitinfo_on_this_split = -(proportion_below_or_equal * Math.Log(proportion_below_or_equal, 2)) - (proportion_above * Math.Log(proportion_above, 2)); double gain_ratio_on_this_split = gain_on_this_split / splitinfo_on_this_split; // Finally all calculations are done! Lets find out if this one is the best one yet. if (gain_on_this_split > best_split_gain) { found_better_than_nothing = true; best_split_gain = gain_on_this_split; best_split_gain_ratio = gain_ratio_on_this_split; best_split = binary_split; } } if (!found_better_than_nothing) { Console.WriteLine($"No gain ratio could be found for this attribute {wanted_attribute}"); } // Adjust for missing data double missingFraction = SetHelper.missingDataFraction(S, wanted_attribute); best_split_gain_ratio = (1 - missingFraction) * best_split_gain_ratio; // We want to select by the best gain, not by the best gain ratio, just like J48 does it. return(new double[] { best_split, best_split_gain }); }