C# (CSharp) ActIdx Examples

Programming language: C# (CSharp)

Class/Type: ActIdx

Examples at hotexamples.com: 4

C# (CSharp) ActIdx - 4 examples found. These are the top-rated real-world C# (CSharp) examples of ActIdx, extracted from open source projects. You can rate the examples to help improve their quality.

Frequently used methods

Actions(1)

Idx(1)

Example #1

File: AgentPlayer1.cs Project: watabe951/RLFighter

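    // One-step Q-learning update for the transition (s, a) -> next_s.
    void Learn(Stat s, ActIdx a, Stat next_s, float reward)
    {
        // Best Q-value (and its action) available from the next state.
        QActIdx qa = MaxQActIdx(next_s);

        // Move Q(s, a) toward the target reward + gamma * max Q(next_s, .).
        q_table[s.Idx(), a.Idx()] +=
            alpha * (reward + gamma * qa.q - q_table[s.Idx(), a.Idx()]);
    }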
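
The update in `Learn` has the form of the standard tabular Q-learning rule. As a minimal, self-contained sketch of that rule: the class `TabularQ` and its members are hypothetical names invented for illustration, states and actions are plain `int` indices instead of the project's `Stat` and `ActIdx`, and the table is assumed to start at zero.

```csharp
using System;

// Hypothetical stand-in for the agent's learner; not part of RLFighter.
public class TabularQ
{
    private readonly float[,] qTable;
    private readonly float alpha; // learning rate
    private readonly float gamma; // discount factor

    public TabularQ(int stateCount, int actionCount, float alpha, float gamma)
    {
        qTable = new float[stateCount, actionCount]; // zero-initialized by the runtime
        this.alpha = alpha;
        this.gamma = gamma;
    }

    public float Get(int state, int action) => qTable[state, action];

    // Largest Q-value over all actions in the given state.
    public float MaxQ(int state)
    {
        float best = float.NegativeInfinity;
        for (int a = 0; a < qTable.GetLength(1); a++)
        {
            best = Math.Max(best, qTable[state, a]);
        }
        return best;
    }

    // Q(s, a) += alpha * (reward + gamma * max Q(s', .) - Q(s, a))
    public void Learn(int state, int action, int nextState, float reward)
    {
        float target = reward + gamma * MaxQ(nextState);
        qTable[state, action] += alpha * (target - qTable[state, action]);
    }
}
```

With `alpha = 0.5f`, `gamma = 0.9f` and an all-zero table, calling `Learn(0, 1, 1, 1.0f)` moves `Q(0, 1)` halfway toward the target of 1, giving 0.5.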

Example #2

File: AgentPlayer1.cs Project: watabe951/RLFighter

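    // Called once per step: learn from the previous transition, then pick the next action.
    public Actions RunStep(States states)
    {
        next_stat = new Stat(states, isRed);

        // Skip learning on the first step, when no previous state/action exists yet.
        if (is_started)
        {
            float reward = Reward(stat, next_stat);
            Learn(stat, act_idx, next_stat, reward);
        }
        is_started = true;
        act_idx    = Policy(next_stat);
        stat       = next_stat;
        return act_idx.Actions(next_stat);
    }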

Example #3

File: AgentPlayer1.cs Project: watabe951/RLFighter

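    // Returns the highest Q-value in state s together with the action that attains it.
    QActIdx MaxQActIdx(Stat s)
    {
        int s_idx = s.Idx();

        float  max_q = float.NegativeInfinity;
        ActIdx max_a = new ActIdx(0);

        // Linear scan over all action indices; ties keep the lowest index.
        for (int i = 0; i < ActIdx.max_idx; i++)
        {
            if (q_table[s_idx, i] > max_q)
            {
                max_q = q_table[s_idx, i];
                max_a = new ActIdx(i);
            }
        }
        return new QActIdx(max_q, max_a);
    }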

Example #4

File: AgentPlayer1.cs Project: watabe951/RLFighter

    public QActIdx(float q, ActIdx a_idx)
    {
        this.q     = q;
        this.a_idx = a_idx;
    }
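
The snippets above use `ActIdx` and `QActIdx` without showing their definitions. The following is a rough sketch of how the two types might fit together, based only on the members the examples call (`new ActIdx(int)`, `ActIdx.max_idx`, `Idx()`, `q`, `a_idx`). The value of `max_idx`, the range check, and the field visibility are assumptions, and the `Actions(Stat)` method is left out because `Stat` and `Actions` are not shown on this page.

```csharp
using System;

// Hypothetical reconstruction: wraps an action index in the range [0, max_idx).
public class ActIdx
{
    // Assumed action count; the project's real value is not shown here.
    public const int max_idx = 4;

    private readonly int idx;

    public ActIdx(int idx)
    {
        // Assumed guard so the index is always a valid Q-table column.
        if (idx < 0 || idx >= max_idx)
        {
            throw new ArgumentOutOfRangeException(nameof(idx));
        }
        this.idx = idx;
    }

    // Column index into the Q-table.
    public int Idx()
    {
        return idx;
    }
}

// Pairs a Q-value with the action index that attains it.
public class QActIdx
{
    public float q;
    public ActIdx a_idx;

    public QActIdx(float q, ActIdx a_idx)
    {
        this.q     = q;
        this.a_idx = a_idx;
    }
}
```

Returning the value and the action together lets a caller such as `Learn` read only `q`, while a greedy policy could read only `a_idx`, without scanning the table twice.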