C# (CSharp) Epimetheus.Markov MarkovState Examples

Programming language: C# (CSharp)

Namespace/package name: Epimetheus.Markov

Class/type: MarkovState

Examples on hotexamples.com: 6

C# (CSharp) Epimetheus.Markov MarkovState - 6 examples found. These are top-rated real-world C# (CSharp) examples of Epimetheus.Markov.MarkovState, extracted from open source projects.

Frequently used methods

calculatePolicy(2)

addAction(1)

Example #1

File: GridProblem.cs Project: wyliemcevoy/Epimetheus

        public String getPolicyAsString()
        {
            // Fully qualified so the snippet needs no extra using directive.
            var build = new System.Text.StringBuilder();

            for (int y = 0; y < height; y++)
            {
                for (int x = 0; x < width; x++)
                {
                    MarkovState state = get(x, y);

                    // Right-align the truncated estimate in a fixed-width column.
                    String num = ((int)state.estimatedValue).ToString();
                    build.Append(" " + num.PadLeft(4) + " ");

                    // Refresh the policy before printing its name.
                    state.calculatePolicy();
                    if (state.policy != null)
                    {
                        build.Append(state.policy.getName() + " ");
                    }
                }
                build.Append("\n");
            }
            return build.ToString();
        }

Example #2

File: GridProblem.cs Project: wyliemcevoy/Epimetheus

        private void createActions(int x, int y)
        {
            MarkovState state = get(x, y);

            state.addAction(createAction(Direction.up, x, y));
            state.addAction(createAction(Direction.down, x, y));
            state.addAction(createAction(Direction.left, x, y));
            state.addAction(createAction(Direction.right, x, y));
        }

Example #3

File: QLearningAgent.cs Project: wyliemcevoy/Epimetheus

        private bool runEpoch()
        {
            bool        policyHasChanged = false;
            int         usedActions      = 0;
            MarkovState current          = problem.getStartState();

            while (!current.isTerminal && usedActions < maxActionsInEpoch)
            {
                double oldEstimatedValue = current.estimatedValue;

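                current.calculatePolicy();
                if (current.policy == null)
                {
                    // No policy action is available for this state; end the epoch.
                    break;
                }

                // Expected value of the successor state under the current policy action.
                double bestActionValue = 0;
                foreach (ActionResult result in current.policy.getPossibleResults())
                {
                    bestActionValue += result.state.estimatedValue * result.probability;
                }

                // Temporal-difference update: move the estimate toward
                // reward + discounted expected successor value.
                double newEstimatedValue = oldEstimatedValue + learningRate * (current.value + gamma * bestActionValue - oldEstimatedValue);
                current.nextEstimatedValue = newEstimatedValue;

                // Note: as written, a random action is taken with probability 1 - epsilon.
                // Conventional epsilon-greedy explores when rand.NextDouble() < epsilon.
                if (rand.NextDouble() > epsilon)
                {
                    // Explore: pick a random action.
                    int actionIndex = rand.Next(current.actions.Count());

                    current = executer.getResult(current.actions[actionIndex]);
                }
                else
                {
                    // Exploit: follow the current policy.
                    current = executer.getResult(current.policy);
                }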

                if (oldEstimatedValue != newEstimatedValue)
                {
                    policyHasChanged = true;
                }

                usedActions++;
            }

            problem.update();

            Console.WriteLine(problem.ToString());

            return policyHasChanged;
        }

Example #4

File: GridProblem.cs Project: wyliemcevoy/Epimetheus

        public String getOptimalPath()
        {
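            // Array elements default to false, so no explicit initialization is needed.
            Boolean[,] path = new Boolean[height, width];

            MarkovState currentState = getStartState();

            // NOTE: the loop body is missing from the extracted source. As shown,
            // nothing advances currentState or marks path, so the loop never
            // terminates when the start state is not terminal.
            while (!currentState.isTerminal)
            {
            }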

            String build = "";

            for (int y = 0; y < height; y++)
            {
                for (int x = 0; x < width; x++)
                {
                    String buf = " ";
                    if (path[y, x])
                    {
                        build += "  O";
                    }

                    build += buf + ((int)get(x, y).estimatedValue) + " ";
                }
                build += "\n";
            }
            return build;
        }

Example #5

File: GridProblem.cs Project: wyliemcevoy/Epimetheus

        private void intializeGrid()
        {
            this.states = new List<MarkovState>();
            this.grid   = new MarkovState[height, width];
            int index = 0;

            for (int y = 0; y < height; y++)
            {
                for (int x = 0; x < width; x++)
                {
                    grid[y, x]       = new MarkovState(index, 0);
                    grid[y, x].value = -1;
                    states.Add(grid[y, x]);
                    index++;
                }
            }

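            // Terminal state with a large negative value near the center of the grid.
            grid[height / 2 - 1, width / 2 - 1].isTerminal = true;
            grid[height / 2 - 1, width / 2 - 1].value      = -100;

            // Terminal state with a large positive value in the last row and column.
            grid[height - 1, width - 1].isTerminal = true;
            grid[height - 1, width - 1].value      = 100;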

            intializeActions();
        }

Example #6

        public ActionResult(MarkovState markovState)
        {
            this.state = markovState;
        }