Search in sources :

Example 6 with CellWorldAction

use of aima.core.environment.cellworld.CellWorldAction in project aima-java by aimacode.

the class LearningDemo method passiveADPAgentDemo.

public static void passiveADPAgentDemo() {
    System.out.println("=======================");
    System.out.println("DEMO: Passive-ADP-Agent");
    System.out.println("=======================");
    System.out.println("Figure 21.3");
    System.out.println("-----------");
    CellWorld<Double> cw = CellWorldFactory.createCellWorldForFig17_1();
    CellWorldEnvironment cwe = new CellWorldEnvironment(cw.getCellAt(1, 1), cw.getCells(), MDPFactory.createTransitionProbabilityFunctionForFigure17_1(cw), new JavaRandomizer());
    Map<Cell<Double>, CellWorldAction> fixedPolicy = new HashMap<Cell<Double>, CellWorldAction>();
    fixedPolicy.put(cw.getCellAt(1, 1), CellWorldAction.Up);
    fixedPolicy.put(cw.getCellAt(1, 2), CellWorldAction.Up);
    fixedPolicy.put(cw.getCellAt(1, 3), CellWorldAction.Right);
    fixedPolicy.put(cw.getCellAt(2, 1), CellWorldAction.Left);
    fixedPolicy.put(cw.getCellAt(2, 3), CellWorldAction.Right);
    fixedPolicy.put(cw.getCellAt(3, 1), CellWorldAction.Left);
    fixedPolicy.put(cw.getCellAt(3, 2), CellWorldAction.Up);
    fixedPolicy.put(cw.getCellAt(3, 3), CellWorldAction.Right);
    fixedPolicy.put(cw.getCellAt(4, 1), CellWorldAction.Left);
    PassiveADPAgent<Cell<Double>, CellWorldAction> padpa = new PassiveADPAgent<Cell<Double>, CellWorldAction>(fixedPolicy, cw.getCells(), cw.getCellAt(1, 1), MDPFactory.createActionsFunctionForFigure17_1(cw), new ModifiedPolicyEvaluation<Cell<Double>, CellWorldAction>(10, 1.0));
    cwe.addAgent(padpa);
    output_utility_learning_rates(padpa, 20, 100, 100, 1);
    System.out.println("=========================");
}
Also used : CellWorldAction(aima.core.environment.cellworld.CellWorldAction) HashMap(java.util.HashMap) JavaRandomizer(aima.core.util.JavaRandomizer) PassiveADPAgent(aima.core.learning.reinforcement.agent.PassiveADPAgent) CellWorldEnvironment(aima.core.learning.reinforcement.example.CellWorldEnvironment) Cell(aima.core.environment.cellworld.Cell)

Aggregations

Cell (aima.core.environment.cellworld.Cell)6 CellWorldAction (aima.core.environment.cellworld.CellWorldAction)6 CellWorldEnvironment (aima.core.learning.reinforcement.example.CellWorldEnvironment)6 JavaRandomizer (aima.core.util.JavaRandomizer)6 HashMap (java.util.HashMap)4 Before (org.junit.Before)3 PassiveADPAgent (aima.core.learning.reinforcement.agent.PassiveADPAgent)1 PassiveTDAgent (aima.core.learning.reinforcement.agent.PassiveTDAgent)1 QLearningAgent (aima.core.learning.reinforcement.agent.QLearningAgent)1 ModifiedPolicyEvaluation (aima.core.probability.mdp.impl.ModifiedPolicyEvaluation)1