/// <summary>
/// Selects an action for the given state with an epsilon-greedy policy:
/// with probability <c>Options.Epsilon</c> a uniformly random action is
/// chosen, otherwise the argmax action of the Q network. Also shifts the
/// one-step (state, action) memory so the previous transition is available
/// to the learning update.
/// </summary>
/// <param name="stateArray">Raw observation; copied into a NumberOfStates x 1 column vector.</param>
/// <returns>Index of the selected action in [0, NumberOfActions).</returns>
public int Act(int[] stateArray)
{
    // Convert the raw observation to a column vector the network accepts.
    var state = new Matrix(NumberOfStates, 1);
    state.Set(stateArray);

    int a;

    // Epsilon-greedy: explore with probability Epsilon, otherwise exploit.
    // Use the process-wide shared RNG instead of `new Random()` per call:
    // time-seeded instances created in quick succession yield identical
    // sequences, which correlates successive exploration decisions.
    if (Random.Shared.NextDouble() < Options.Epsilon)
    {
        a = Util.Random(0, NumberOfActions);
    }
    else
    {
        // Greedy w.r.t. the Q function: forward pass without gradient
        // tracking, then take the argmax action index.
        var amat = Forward(Net, state, false);
        a = Util.ActionFromWeights(amat.Weights);
    }

    // Shift the (state, action) memory: what was "next" becomes "previous"
    // so LearnFromTuple can consume the completed transition.
    previousStateCache = nextStateCache;
    previousAction = nextAction;
    nextStateCache = state;
    nextAction = a;

    return a;
}
/// <summary>
/// Performs one Q-learning update on the transition
/// (prevState, prevAction, reward, nextState) against the target
/// Q(s,a) = r + gamma * max_a' Q(s',a').
/// </summary>
/// <param name="prevState">State in which <paramref name="prevAction"/> was taken.</param>
/// <param name="prevAction">Index of the action taken in <paramref name="prevState"/>.</param>
/// <param name="reward">Reward observed after taking the action.</param>
/// <param name="nextState">Resulting state of the environment.</param>
/// <returns>The temporal-difference error, clamped to [-ErrorClamp, +ErrorClamp].</returns>
private double LearnFromTuple(Matrix prevState, int prevAction, double reward, Matrix nextState)
{
    // Target value: r + gamma * max_a' Q(s', a').
    // Forward pass without gradient tracking — the target is treated as fixed.
    var nextQ = Forward(Net, nextState, false);
    var bestNext = Util.ActionFromWeights(nextQ.Weights);
    var target = reward + Options.Gamma * nextQ.Weights[bestNext];

    // Prediction Q(s, a) with gradient tracking enabled for backprop.
    var predicted = Forward(Net, prevState, true);
    var tdError = predicted.Weights[prevAction] - target;

    // Clamp the TD error to [-ErrorClamp, +ErrorClamp] (gradient clipping
    // for robustness; values inside the band pass through unchanged).
    var limit = Options.ErrorClamp;
    tdError = Math.Max(-limit, Math.Min(limit, tdError));

    // Seed the gradient on the taken action only, backpropagate, and apply
    // the parameter update with learning rate Alpha.
    predicted.BackPropWeights[prevAction] = tdError;
    LastGraph.Backward();
    Util.UpdateNetwork(Net, Options.Alpha);

    return tdError;
}