C# (CSharp) Assets.Scripts.QLearning.Models ActionReward Examples

Programming language: C# (CSharp)

Namespace/package name: Assets.Scripts.QLearning.Models

Class/type: ActionReward

Examples at hotexamples.com: 3

3 examples found. These are the top-rated real-world C# (CSharp) examples of Assets.Scripts.QLearning.Models.ActionReward, extracted from open source projects.

Example #1

File: State.cs Project: yschuurmans/Machine-Learning-Tests

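        // Moves the stored reward of the named action toward the target value `inc`,
        // weighted by the learning rate `alpha`.
        public void IncreaseReward_Q(string actionName, double alpha, double inc)
        {
            ActionReward action = FindAction(actionName);

            // Keep a (1 - alpha) share of the old value, then add an alpha share of the target.
            action.Reward *= 1 - alpha;
            action.Reward += alpha * inc;
        }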
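
The two assignments in this method amount to an exponential moving average of the stored reward toward the target `inc`. As a small worked form, writing $R$ for `action.Reward` and $\alpha$ for `alpha` (symbols introduced here for illustration only, not taken from the project):

```latex
R \leftarrow (1 - \alpha)\,R + \alpha \cdot \mathrm{inc}
  \;=\; R + \alpha\,(\mathrm{inc} - R)
```

With $\alpha = 1$ the old value is replaced by `inc` outright; with $\alpha = 0$ it is left unchanged.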

Example #2

File: State.cs Project: yschuurmans/Machine-Learning-Tests

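        // Returns the action with the highest stored reward, or null if there are no actions.
        public ActionReward BestAction()
        {
            ActionReward bestAction = null;

            foreach (var actionReward in ActionRewards)
            {
                // Strict '>' keeps the first action encountered when rewards tie.
                if (bestAction == null || actionReward.Reward > bestAction.Reward)
                {
                    bestAction = actionReward;
                }
            }

            return bestAction;
        }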

Example #3

File: Learner.cs Project: yschuurmans/Machine-Learning-Tests

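        // Performs one learning step. Returns true when the episode has ended.
        public bool Learn(string chosenAction, string newState, double reward)
        {
            current_iteration += 1;
            if (current_iteration > maxIterations)
            {
                EndEpisode();
                return true;
            }

            State        prevState           = QTable.GetState(currentState);
            ActionReward bestActionNextState = QTable.GetState(newState).BestAction();

            // BestAction() returns null for a state with no actions; value such a state at 0.
            double nextValue = bestActionNextState != null ? bestActionNextState.Reward : 0.0;

            // Target: immediate reward plus the discounted best value of the next state.
            prevState.IncreaseReward_Q(chosenAction, alpha, reward + discount * nextValue);

            // Decay the learning rate as alpha = time^(-0.1), then advance to the new state.
            time        += 1;
            alpha        = Math.Pow(time, -0.1);
            currentState = newState;
            return false;
        }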
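
Taken together with the `IncreaseReward_Q` method shown earlier, this step has the shape of the standard tabular Q-learning update. The sketch below assumes that the `Reward` stored in an `ActionReward` plays the role of $Q(s, a)$; the symbols are introduced here for illustration: $s$ for `currentState`, $a$ for `chosenAction`, $s'$ for `newState`, $r$ for `reward`, $\gamma$ for `discount`, and $t$ for `time`.

```latex
Q(s, a) \leftarrow (1 - \alpha)\, Q(s, a)
  + \alpha \left( r + \gamma \max_{a'} Q(s', a') \right)

% after the update, the learning rate is recomputed from the incremented step counter
\alpha \leftarrow t^{-0.1}
```

The learning rate decays slowly under this schedule: $t = 1$ gives $\alpha = 1$, $t = 10$ gives $\alpha \approx 0.794$, and $t = 100$ gives $\alpha \approx 0.631$.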