C# (CSharp) ReinforcementLearning Qmatrix Examples

Programming language: C# (CSharp)

Namespace/package name: ReinforcementLearning

Class/type: Qmatrix

Examples on hotexamples.com: 3

C# (CSharp) ReinforcementLearning Qmatrix - 3 examples found. These are the top-rated real-world C# (CSharp) examples of ReinforcementLearning.Qmatrix, extracted from open source projects.

Frequently used methods

GenerateStep(1)

ProcessNewEpisode(1)

UpdateState(1)

Example #1

        //Copy constructor: builds the state for a new step from the previous step's state.
        //Some values are carried over; others are reset so they don't reflect data that is no longer true for the new state.
        public AlgorithmState(AlgorithmState set_from)
        {
            cans_collected = set_from.cans_collected;

            episode_rewards = set_from.episode_rewards; //Reward data
            total_rewards   = set_from.total_rewards;

            board_data = new GameBoard(set_from.board_data); //Copy the board

            //Copy the Q-matrix; its step counter is advanced at the end of this constructor
            live_qmatrix = new Qmatrix(set_from.live_qmatrix);

            //The initial location will be the resulting location of the last step
            location_initial = new int[2] {
                set_from.location_result[0], set_from.location_result[1]
            };

            bender_perception_starting = set_from.bender_perception_ending;

            //If this episode has reached its step limit, start a new one; otherwise count this step
            if (live_qmatrix.step_number == Qmatrix.step_limit)
            {
                StartNewEpisode();
            }
            else
            {
                live_qmatrix.step_number++;
            }
        }
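
The step/episode rollover at the end of this constructor can be tried in isolation. The following is only a minimal sketch: `StepCounter`, `StepLimit`, and `Advance` are hypothetical names, the limit of 3 is arbitrary, and resetting the step number to 0 on rollover is an assumption, since the body of `StartNewEpisode` is not shown in the example.

```csharp
using System;

// Hypothetical stand-in for the step and episode counters that Qmatrix appears to track.
public class StepCounter
{
    public const int StepLimit = 3; // arbitrary value, for illustration only

    public int StepNumber { get; private set; }
    public int EpisodeNumber { get; private set; } = 1;

    // Same branch shape as the constructor above: roll over at the limit, otherwise count the step.
    public void Advance()
    {
        if (StepNumber == StepLimit)
        {
            // Assumption: a new episode restarts the step count at 0.
            EpisodeNumber++;
            StepNumber = 0;
        }
        else
        {
            StepNumber++;
        }
    }
}

public static class RolloverDemo
{
    public static void Main()
    {
        var counter = new StepCounter();
        for (int i = 0; i < 5; i++)
        {
            counter.Advance();
            Console.WriteLine($"episode {counter.EpisodeNumber}, step {counter.StepNumber}");
        }
    }
}
```

With this branch shape, the call that detects the limit does not count as a step itself; whether that matches the original depends on what `StartNewEpisode` does.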

Example #2

        //Called from the constructor and from create_empty_board (after a reset).
        //Resets values when we want to start over; creating a whole new state instead would lose Bender's position.
        private void InitializeValues()
        {
            board_data.ClearCans(); //Clear the board for the initial launch (this removes only the cans, not Bender)
            live_qmatrix = new Qmatrix();

            location_initial = new int[2] {
                0, 0
            };
            location_result = new int[2] {
                0, 0
            };
        }

Example #3

        //Copy constructor: copies the Q-matrix entries and the current counters and parameters.
        public Qmatrix(Qmatrix copy_from)
        {
            matrix_data = new Dictionary<PerceptionState, ValueSet>();
            foreach (var entry in copy_from.matrix_data)
            {
                //Each ValueSet is copied through its own copy constructor, so the new matrix
                //does not share value objects with the source. The PerceptionState keys are reused as-is.
                matrix_data.Add(entry.Key, new ValueSet(entry.Value));
            }
            did_we_update  = copy_from.did_we_update;
            step_number    = copy_from.step_number;
            episode_number = copy_from.episode_number;
            n_current      = copy_from.n_current;
            y_current      = copy_from.y_current;
            e_current      = copy_from.e_current;
        }
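
The copy-per-entry pattern in this constructor can be checked with a small self-contained sketch. All names here (`ActionValues`, `QTable`, the `"wall-north"` key) are hypothetical stand-ins, and a `string` key replaces `PerceptionState`, whose definition is not shown. The sketch assumes each value set is backed by an array of doubles, which the original may or may not do.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical stand-in for ValueSet: one value per action.
public class ActionValues
{
    public double[] Values;

    public ActionValues(int actionCount) { Values = new double[actionCount]; }

    // Copy constructor: clone the array so the copy owns its own storage.
    public ActionValues(ActionValues other) { Values = (double[])other.Values.Clone(); }
}

// Hypothetical stand-in for Qmatrix, reduced to its dictionary.
public class QTable
{
    public Dictionary<string, ActionValues> Data = new Dictionary<string, ActionValues>();

    public QTable() { }

    // Copy each entry through the value's copy constructor, as the example above does.
    public QTable(QTable other)
    {
        foreach (var entry in other.Data)
        {
            Data.Add(entry.Key, new ActionValues(entry.Value));
        }
    }
}

public static class CopyDemo
{
    public static void Main()
    {
        var original = new QTable();
        original.Data["wall-north"] = new ActionValues(5);

        var copy = new QTable(original);
        copy.Data["wall-north"].Values[0] = 1.5; // modify only the copy

        // The original entry keeps its initial value; the copy holds the new one.
        Console.WriteLine(original.Data["wall-north"].Values[0]);
        Console.WriteLine(copy.Data["wall-north"].Values[0]);
    }
}
```

If the value's copy constructor only copied a reference to the same array, the write through `copy` would also show up in `original`, so a check like this is a quick way to confirm that a copy is as deep as intended.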