Search in sources :

Example 1 with PassiveTDAgent

use of aima.core.learning.reinforcement.agent.PassiveTDAgent in project aima-java by aimacode.

the class LearningDemo method passiveTDAgentDemo.

public static void passiveTDAgentDemo() {
    System.out.println("======================");
    System.out.println("DEMO: Passive-TD-Agent");
    System.out.println("======================");
    System.out.println("Figure 21.5");
    System.out.println("-----------");
    CellWorld<Double> cw = CellWorldFactory.createCellWorldForFig17_1();
    CellWorldEnvironment cwe = new CellWorldEnvironment(cw.getCellAt(1, 1), cw.getCells(), MDPFactory.createTransitionProbabilityFunctionForFigure17_1(cw), new JavaRandomizer());
    Map<Cell<Double>, CellWorldAction> fixedPolicy = new HashMap<Cell<Double>, CellWorldAction>();
    fixedPolicy.put(cw.getCellAt(1, 1), CellWorldAction.Up);
    fixedPolicy.put(cw.getCellAt(1, 2), CellWorldAction.Up);
    fixedPolicy.put(cw.getCellAt(1, 3), CellWorldAction.Right);
    fixedPolicy.put(cw.getCellAt(2, 1), CellWorldAction.Left);
    fixedPolicy.put(cw.getCellAt(2, 3), CellWorldAction.Right);
    fixedPolicy.put(cw.getCellAt(3, 1), CellWorldAction.Left);
    fixedPolicy.put(cw.getCellAt(3, 2), CellWorldAction.Up);
    fixedPolicy.put(cw.getCellAt(3, 3), CellWorldAction.Right);
    fixedPolicy.put(cw.getCellAt(4, 1), CellWorldAction.Left);
    PassiveTDAgent<Cell<Double>, CellWorldAction> ptda = new PassiveTDAgent<Cell<Double>, CellWorldAction>(fixedPolicy, 0.2, 1.0);
    cwe.addAgent(ptda);
    output_utility_learning_rates(ptda, 20, 500, 100, 1);
    System.out.println("=========================");
}
Also used : CellWorldAction(aima.core.environment.cellworld.CellWorldAction) HashMap(java.util.HashMap) PassiveTDAgent(aima.core.learning.reinforcement.agent.PassiveTDAgent) JavaRandomizer(aima.core.util.JavaRandomizer) CellWorldEnvironment(aima.core.learning.reinforcement.example.CellWorldEnvironment) Cell(aima.core.environment.cellworld.Cell)

Aggregations

Cell (aima.core.environment.cellworld.Cell)1 CellWorldAction (aima.core.environment.cellworld.CellWorldAction)1 PassiveTDAgent (aima.core.learning.reinforcement.agent.PassiveTDAgent)1 CellWorldEnvironment (aima.core.learning.reinforcement.example.CellWorldEnvironment)1 JavaRandomizer (aima.core.util.JavaRandomizer)1 HashMap (java.util.HashMap)1