C# (CSharp) social_learning StateActionReward Examples

Programming language: C# (CSharp)

Namespace/package name: social_learning

Class/type: StateActionReward

Examples at hotexamples.com: 1

C# (CSharp) social_learning StateActionReward - 1 example found. This is a top-rated real-world C# (CSharp) example of social_learning.StateActionReward, extracted from open source projects.


Example #1


        void pollAlternativeAction(IAgent agent)
        {
            // This will be a negative (bad) reward.
            // We need to handle this by polling other agents to see if they
            // could have avoided it.
            if (TeachParadigm != TeachingParadigm.EveryonePolling &&
                TeachParadigm != TeachingParadigm.EveryoneRewardsAndPolling &&
                TeachParadigm != TeachingParadigm.SubculturePolling &&
                TeachParadigm != TeachingParadigm.SubcultureRewardsAndPolling)
            {
                return;
            }

            // If we are using EveryonePolling, the agent can ask the entire population.
            // If we are using SubculturePolling, the agent can only ask agents in its subculture.
            // Note: as written, only EveryonePolling selects the whole population;
            // EveryoneRewardsAndPolling falls through to the subculture branch.
            var available = TeachParadigm == TeachingParadigm.EveryonePolling
                ? _agents.Where((a, i) => i != agent.Id)
                : _agents.Where((a, i) => _agentGroups[agent.Id] == _agentGroups[i] && i != agent.Id);

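            // The teacher is the highest fitness agent available to the eaten agent.
            var teacher = (SocialAgent)available.OrderByDescending(a => a.Fitness).FirstOrDefault();
            if (teacher == null)
            {
                // No other agent is available to poll, so there is nothing to learn from.
                return;
            }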

            // Get the corrected moves for every action in the eaten agent's memory.
            LinkedList<StateActionReward> badTrajectory = ((SocialAgent)agent).Memory;
            LinkedList<StateActionReward> correctTrajectory = new LinkedList<StateActionReward>();

            foreach (var bad in badTrajectory)
            {
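                // Keep the remembered state, but replace the action with the
                // teacher's network output for that same state.
                double[] state = bad.State;
                double[] action = new double[bad.Action.Length];
                teacher.activateNetworkWithoutMemory(bad.State).CopyTo(action, 0, action.Length);

                // The corrected step is stored with a reward of 0.
                StateActionReward good = new StateActionReward(state, action, 0);
                correctTrajectory.AddLast(good);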
            }

            // Train the agent with the correct trajectory according to the teacher.
            TeachAgent(agent, correctTrajectory);
        }
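
The snippet above uses StateActionReward but does not show its definition. The following is a minimal, self-contained sketch of what such a type and the relabeling loop might look like, inferred only from how the example uses them (a State array, an Action array, and a three-argument constructor). The Reward property name, the TrajectoryRelabeling helper, and the use of a plain Func<double[], double[]> in place of the teacher's network are assumptions made for illustration, not the project's actual code.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical stand-in for social_learning.StateActionReward, inferred from
// the example: it exposes State and Action and is built from (state, action, reward).
public sealed class StateActionReward
{
    public double[] State { get; }
    public double[] Action { get; }
    public double Reward { get; }   // property name is an assumption

    public StateActionReward(double[] state, double[] action, double reward)
    {
        State = state;
        Action = action;
        Reward = reward;
    }
}

// Hypothetical helper that mirrors the foreach loop in pollAlternativeAction.
public static class TrajectoryRelabeling
{
    // For each remembered step, keep the state and replace the action with
    // whatever the teacher policy outputs for that state. Reward is set to 0,
    // as in the example.
    public static LinkedList<StateActionReward> Relabel(
        IEnumerable<StateActionReward> badTrajectory,
        Func<double[], double[]> teacherPolicy)
    {
        var corrected = new LinkedList<StateActionReward>();
        foreach (var bad in badTrajectory)
        {
            double[] output = teacherPolicy(bad.State);
            double[] action = new double[bad.Action.Length];
            Array.Copy(output, action, action.Length);
            corrected.AddLast(new StateActionReward(bad.State, action, 0));
        }
        return corrected;
    }
}

public static class Program
{
    public static void Main()
    {
        // Two remembered steps that ended in a negative reward.
        var memory = new LinkedList<StateActionReward>();
        memory.AddLast(new StateActionReward(new[] { 1.0, 2.0 }, new[] { 0.9 }, -1.0));
        memory.AddLast(new StateActionReward(new[] { 3.0, 4.0 }, new[] { 0.1 }, -1.0));

        // Toy teacher: the single action value is the mean of the state vector.
        Func<double[], double[]> teacher = s => new[] { s.Average() };

        foreach (var step in TrajectoryRelabeling.Relabel(memory, teacher))
        {
            Console.WriteLine($"{string.Join(",", step.State)} -> {step.Action[0]}");
        }
    }
}
```

Passing the teacher as a delegate keeps the sketch independent of SocialAgent and its network; in the original method the same role is played by teacher.activateNetworkWithoutMemory.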