/// <summary>
/// Sends one final, terminal AgentInfo to the Brain and any demonstration
/// writers, then resets the per-episode bookkeeping and assigns a new episode id.
/// </summary>
/// <param name="doneReason">Why the episode ended (Done, MaxStepReached, or Disabled).</param>
void NotifyAgentDone(DoneReason doneReason)
{
    m_Info.episodeId = m_EpisodeId;
    m_Info.reward = m_Reward;
    m_Info.done = true;
    m_Info.maxStepReached = doneReason == DoneReason.MaxStepReached;
    // Request the last decision with no callbacks
    // We request a decision so Python knows the Agent is done immediately
    m_Brain?.RequestDecision(m_Info, sensors);

    // We also have to write to any DemonstrationWriters so that they get the "done" flag.
    foreach (var demoWriter in DemonstrationWriters)
    {
        demoWriter.Record(m_Info, sensors);
    }

    if (doneReason != DoneReason.Disabled)
    {
        // We don't want to update the reward stats when the Agent is disabled, because this will make
        // the rewards look lower than they actually are during shutdown.
        UpdateRewardStats();
    }

    // The Agent is done, so we give it a new episode Id
    m_EpisodeId = EpisodeIdCounter.GetEpisodeId();
    m_Reward = 0f;
    m_CumulativeReward = 0f;
    m_RequestAction = false;
    m_RequestDecision = false;
}
/// <summary>
/// Collects the current observations, snapshots the Agent's episode state into
/// m_Info, and forwards everything to the linked Brain to request a decision.
/// Optionally records the experience when a Recorder is active in the editor.
/// </summary>
void SendInfoToBrain()
{
    if (m_Brain == null)
    {
        return;
    }

    m_Info.storedVectorActions = m_Action.vectorActions;
    m_ActionMasker.ResetMask();
    UpdateSensors();
    using (TimerStack.Instance.Scoped("CollectObservations"))
    {
        CollectObservations();
    }
    m_Info.actionMasks = m_ActionMasker.GetMask();

    // Snapshot the episode state that accompanies this decision request.
    m_Info.id = m_Id;
    m_Info.maxStepReached = m_MaxStepReached;
    m_Info.done = m_Done;
    m_Info.reward = m_Reward;

    m_Brain.RequestDecision(m_Info, sensors, UpdateAgentAction);

    var shouldRecord = m_Recorder != null && m_Recorder.record && Application.isEditor;
    if (shouldRecord)
    {
        m_Recorder.WriteExperience(m_Info, sensors);
    }
}
/// <summary>
/// Gathers observations, fills in the mid-episode AgentInfo, and asks the
/// linked Brain for the next decision. The experience is also mirrored to any
/// attached demonstration writers.
/// </summary>
void SendInfoToBrain()
{
    if (m_Brain == null)
    {
        return;
    }

    m_Info.storedVectorActions = m_Action.vectorActions;
    m_ActionMasker.ResetMask();
    UpdateSensors();
    using (TimerStack.Instance.Scoped("CollectObservations"))
    {
        CollectObservations(collectObservationsSensor, m_ActionMasker);
    }
    m_Info.actionMasks = m_ActionMasker.GetMask();

    // This is a mid-episode request: done/maxStepReached are always cleared here.
    m_Info.episodeId = m_EpisodeId;
    m_Info.maxStepReached = false;
    m_Info.done = false;
    m_Info.reward = m_Reward;

    m_Brain.RequestDecision(m_Info, sensors);

    // Mirror the AgentInfo and sensors to every attached demonstration writer.
    foreach (var writer in DemonstrationWriters)
    {
        writer.Record(m_Info, sensors);
    }
}
/// <summary>
/// Pushes one last "done" experience to the Brain so the Python side learns
/// the Agent is disabled.
/// </summary>
void NotifyAgentDone()
{
    m_Info.done = true;
    // Fire a final decision request with a no-op handler; no action from this
    // request will be consumed, the trainer only needs to see the done flag.
    m_Brain?.RequestDecision(m_Info, sensors, _ => { });
}
/// <summary>
/// Sends the Agent info to the linked Brain: copies the previous step's
/// actions/memories, collects fresh vector observations, validates their size
/// against the BrainParameters, shifts them into the stacked-observation
/// buffer, and requests a decision. Also writes the experience to the
/// Recorder when one is active in the editor.
/// </summary>
/// <exception cref="UnityAgentsException">
/// Thrown when CollectObservations produced a different number of floats than
/// the configured vectorObservationSize.
/// </exception>
void SendInfoToBrain()
{
    if (m_Brain == null)
    {
        return;
    }

    // Carry over the previous step's outputs so the trainer can pair them
    // with the new observation.
    m_Info.memories = m_Action.memories;
    m_Info.storedVectorActions = m_Action.vectorActions;
    m_Info.storedTextActions = m_Action.textActions;
    // Clear before CollectObservations() so the user callback appends into
    // empty buffers.
    m_Info.vectorObservation.Clear();
    m_Info.compressedObservations.Clear();
    m_ActionMasker.ResetMask();
    using (TimerStack.Instance.Scoped("CollectObservations"))
    {
        CollectObservations();
    }
    m_Info.actionMasks = m_ActionMasker.GetMask();

    var param = m_PolicyFactory.brainParameters;
    if (m_Info.vectorObservation.Count != param.vectorObservationSize)
    {
        throw new UnityAgentsException(string.Format(
            "Vector Observation size mismatch in continuous " +
            "agent {0}. " +
            "Was Expecting {1} but received {2}. ",
            gameObject.name,
            param.vectorObservationSize,
            m_Info.vectorObservation.Count));
    }

    // Shift the stacked buffer left, then overwrite its tail with the newest
    // observation (order matters: shift before replace).
    Utilities.ShiftLeft(m_Info.stackedVectorObservation, param.vectorObservationSize);
    Utilities.ReplaceRange(m_Info.stackedVectorObservation, m_Info.vectorObservation,
        m_Info.stackedVectorObservation.Count - m_Info.vectorObservation.Count);

    m_Info.reward = m_Reward;
    m_Info.done = m_Done;
    m_Info.maxStepReached = m_MaxStepReached;
    m_Info.id = m_Id;

    m_Brain.RequestDecision(this);

    if (m_Recorder != null && m_Recorder.record && Application.isEditor)
    {
        // This is a bit of a hack - if we're in inference mode, compressed observations won't be generated
        // But we need these to be generated for the recorder. So generate them here.
        if (m_Info.compressedObservations.Count == 0)
        {
            GenerateSensorData();
        }

        m_Recorder.WriteExperience(m_Info);
    }

    m_Info.textObservation = "";
}
/// <summary>
/// Sends the Agent info to the linked Brain: stores the previous step's
/// actions (or zeros them when the last step was terminal), collects
/// observations and discrete action masks, and requests a decision. The
/// experience is also mirrored to any attached demonstration writers.
/// </summary>
/// <exception cref="UnityAgentsException">
/// Thrown when the Agent has not been initialized (base.OnEnable not called).
/// </exception>
void SendInfoToBrain()
{
    if (!m_Initialized)
    {
        // Fixed message: the original concatenation was missing a space,
        // producing "...initialized.Please ensure...".
        throw new UnityAgentsException("Call to SendInfoToBrain when Agent hasn't been initialized. " +
            "Please ensure that you are calling 'base.OnEnable()' if you have overridden OnEnable.");
    }

    if (m_Brain == null)
    {
        return;
    }

    // After a terminal step the previous actions are meaningless, so zero
    // them; otherwise carry the last actions over for the trainer.
    if (m_Info.done)
    {
        Array.Clear(m_Info.storedVectorActions, 0, m_Info.storedVectorActions.Length);
    }
    else
    {
        Array.Copy(m_Action.vectorActions, m_Info.storedVectorActions, m_Action.vectorActions.Length);
    }
    m_ActionMasker.ResetMask();
    UpdateSensors();
    using (TimerStack.Instance.Scoped("CollectObservations"))
    {
        CollectObservations(collectObservationsSensor);
    }
    using (TimerStack.Instance.Scoped("CollectDiscreteActionMasks"))
    {
        // Masks only apply to discrete action spaces.
        if (m_PolicyFactory.brainParameters.vectorActionSpaceType == SpaceType.Discrete)
        {
            CollectDiscreteActionMasks(m_ActionMasker);
        }
    }
    m_Info.discreteActionMasks = m_ActionMasker.GetMask();

    // Mid-episode request: done/maxStepReached are always false here.
    m_Info.reward = m_Reward;
    m_Info.done = false;
    m_Info.maxStepReached = false;
    m_Info.episodeId = m_EpisodeId;

    m_Brain.RequestDecision(m_Info, sensors);

    // If we have any DemonstrationWriters, write the AgentInfo and sensors to them.
    foreach (var demoWriter in DemonstrationWriters)
    {
        demoWriter.Record(m_Info, sensors);
    }
}
/// <summary>
/// Sends a final terminal AgentInfo to the Brain, then resets the
/// per-episode bookkeeping and starts a fresh episode id.
/// </summary>
/// <param name="maxStepReached">True when the episode ended by hitting the step limit.</param>
void NotifyAgentDone(bool maxStepReached = false)
{
    // Final snapshot of the episode for the trainer.
    m_Info.done = true;
    m_Info.maxStepReached = maxStepReached;
    m_Info.reward = m_Reward;

    // One last decision request with a no-op callback so Python sees the
    // done flag immediately; its action is never consumed.
    m_Brain?.RequestDecision(m_Info, sensors, _ => { });

    // Begin a new episode with cleared per-episode state.
    m_EpisodeId = EpisodeIdCounter.GetEpisodeId();
    m_CumulativeReward = 0f;
    m_Reward = 0f;
    m_RequestDecision = false;
    m_RequestAction = false;
}
/// <summary>
/// Sends one final, terminal AgentInfo (with a fresh observation) to the
/// Brain and any demonstration writers, updates episode statistics unless the
/// Agent was disabled, and clears the per-episode state. Safe to call more
/// than once; repeat calls are ignored via the done-flag guard.
/// </summary>
/// <param name="doneReason">Why the episode ended (Done, MaxStepReached, or Disabled).</param>
void NotifyAgentDone(DoneReason doneReason)
{
    if (m_Info.done)
    {
        // The Agent was already marked as Done and should not be notified again
        return;
    }

    m_Info.episodeId = m_EpisodeId;
    m_Info.reward = m_Reward;
    m_Info.done = true;
    m_Info.maxStepReached = doneReason == DoneReason.MaxStepReached;
    if (collectObservationsSensor != null)
    {
        // Make sure the latest observations are being passed to training.
        collectObservationsSensor.Reset();
        CollectObservations(collectObservationsSensor);
    }
    // Request the last decision with no callbacks
    // We request a decision so Python knows the Agent is done immediately
    m_Brain?.RequestDecision(m_Info, sensors);
    ResetSensors();

    // We also have to write to any DemonstrationWriters so that they get the "done" flag.
    foreach (var demoWriter in DemonstrationWriters)
    {
        demoWriter.Record(m_Info, sensors);
    }

    if (doneReason != DoneReason.Disabled)
    {
        // We don't want to update the reward stats when the Agent is disabled, because this will make
        // the rewards look lower than they actually are during shutdown.
        m_CompletedEpisodes++;
        UpdateRewardStats();
    }

    m_Reward = 0f;
    m_CumulativeReward = 0f;
    m_RequestAction = false;
    m_RequestDecision = false;
    // Zero the stored actions so the next episode does not see stale values.
    Array.Clear(m_Info.storedVectorActions, 0, m_Info.storedVectorActions.Length);
}
/// <summary>
/// Collects observations and action masks, snapshots the episode state into
/// m_Info, and requests a decision from the linked Brain. When a Recorder is
/// active in the editor, sensor data is generated on demand and recorded.
/// </summary>
void SendInfoToBrain()
{
    if (m_Brain == null)
    {
        return;
    }

    m_Info.storedVectorActions = m_Action.vectorActions;
    m_ActionMasker.ResetMask();
    UpdateSensors();
    using (TimerStack.Instance.Scoped("CollectObservations"))
    {
        CollectObservations();
    }
    m_Info.actionMasks = m_ActionMasker.GetMask();

    m_Info.id = m_Id;
    m_Info.maxStepReached = m_MaxStepReached;
    m_Info.done = m_Done;
    m_Info.reward = m_Reward;

    m_Brain.RequestDecision(m_Info, sensors, UpdateAgentAction);

    if (m_Recorder == null || !m_Recorder.record || !Application.isEditor)
    {
        return;
    }

    if (m_VectorSensorBuffer == null)
    {
        // Lazily allocate a buffer for writing uncompressed (float) sensor data.
        m_VectorSensorBuffer = new float[sensors.GetSensorFloatObservationSize()];
    }

    // This is a bit of a hack - in inference mode observations won't be
    // generated, but the recorder needs them, so generate them here.
    var observations = new List<Observation>();
    GenerateSensorData(sensors, m_VectorSensorBuffer, m_WriteAdapter, observations);
    m_Recorder.WriteExperience(m_Info, observations);
}
/// <summary>
/// Collects observations and action masks, snapshots the episode state into
/// m_Info, and requests a decision from the linked Brain. When a Recorder is
/// active in the editor, missing observations are generated and recorded.
/// </summary>
void SendInfoToBrain()
{
    if (m_Brain == null)
    {
        return;
    }

    m_Info.storedVectorActions = m_Action.vectorActions;
    // Start from empty observations so CollectObservations appends cleanly.
    m_Info.observations.Clear();
    m_ActionMasker.ResetMask();
    UpdateSensors();
    using (TimerStack.Instance.Scoped("CollectObservations"))
    {
        CollectObservations();
    }
    m_Info.actionMasks = m_ActionMasker.GetMask();

    m_Info.id = m_Id;
    m_Info.maxStepReached = m_MaxStepReached;
    m_Info.done = m_Done;
    m_Info.reward = m_Reward;

    m_Brain.RequestDecision(this);

    var recording = m_Recorder != null && m_Recorder.record && Application.isEditor;
    if (!recording)
    {
        return;
    }

    // This is a bit of a hack - in inference mode observations won't be
    // generated, but the recorder needs them, so generate them here.
    if (m_Info.observations.Count == 0)
    {
        GenerateSensorData();
    }
    m_Recorder.WriteExperience(m_Info);
}
/// <summary>
/// Sends a final terminal AgentInfo to the Brain (and the Recorder when one
/// is active in the editor), updates reward statistics, then resets the
/// per-episode bookkeeping under a fresh episode id.
/// </summary>
/// <param name="maxStepReached">True when the episode ended by hitting the step limit.</param>
void NotifyAgentDone(bool maxStepReached = false)
{
    // Final snapshot of the episode for the trainer.
    m_Info.done = true;
    m_Info.maxStepReached = maxStepReached;
    m_Info.reward = m_Reward;

    // Last decision request so Python observes the terminal state immediately.
    m_Brain?.RequestDecision(m_Info, sensors);

    var recording = m_Recorder != null && m_Recorder.record && Application.isEditor;
    if (recording)
    {
        m_Recorder.WriteExperience(m_Info, sensors);
    }

    UpdateRewardStats();

    // Begin a new episode with cleared per-episode state.
    m_EpisodeId = EpisodeIdCounter.GetEpisodeId();
    m_CumulativeReward = 0f;
    m_Reward = 0f;
    m_RequestDecision = false;
    m_RequestAction = false;
}
/// <summary>
/// Sends a final terminal AgentInfo to the Brain and every attached
/// demonstration writer, updates reward statistics, then resets the
/// per-episode bookkeeping under a fresh episode id.
/// </summary>
/// <param name="maxStepReached">True when the episode ended by hitting the step limit.</param>
void NotifyAgentDone(bool maxStepReached = false)
{
    // Final snapshot of the episode for the trainer.
    m_Info.done = true;
    m_Info.maxStepReached = maxStepReached;
    m_Info.reward = m_Reward;

    // Last decision request so Python observes the terminal state immediately.
    m_Brain?.RequestDecision(m_Info, sensors);

    // Mirror the terminal experience so demonstration writers see the "done" flag.
    foreach (var writer in DemonstrationWriters)
    {
        writer.Record(m_Info, sensors);
    }

    UpdateRewardStats();

    // Begin a new episode with cleared per-episode state.
    m_EpisodeId = EpisodeIdCounter.GetEpisodeId();
    m_CumulativeReward = 0f;
    m_Reward = 0f;
    m_RequestDecision = false;
    m_RequestAction = false;
}