Class SPUCBPolicy<N,A>
java.lang.Object
  ai.libs.jaicore.search.algorithms.mdp.mcts.uct.AUpdatingPolicy<T,A>
    ai.libs.jaicore.search.algorithms.mdp.mcts.uct.UCBPolicy<N,A>
      ai.libs.jaicore.search.algorithms.mdp.mcts.spuct.SPUCBPolicy<N,A>
All Implemented Interfaces:
IPathUpdatablePolicy<N,A,java.lang.Double>, IPolicy<N,A>, org.api4.java.common.control.ILoggingCustomizable
public class SPUCBPolicy<N,A> extends UCBPolicy<N,A> implements org.api4.java.common.control.ILoggingCustomizable
Constructor Summary
Constructors
  SPUCBPolicy(double gamma, boolean maximize, double bigD)
  SPUCBPolicy(double gamma, double bigD)
Method Summary
Modifier and Type, Method, Description
  java.lang.String getLoggerName()
  double getScore(N node, A action)
  void setLoggerName(java.lang.String name)
  void updatePath(org.api4.java.datastructure.graph.ILabeledPath<N,A> path, java.util.List<java.lang.Double> scores)
    Note that this is a transposition-based, and hence only partially path-dependent, update.
Methods inherited from class ai.libs.jaicore.search.algorithms.mdp.mcts.uct.UCBPolicy
getActionBasedOnScores, getEmpiricalMean, getExplorationConstant, getExplorationTerm, setExplorationConstant
Methods inherited from class ai.libs.jaicore.search.algorithms.mdp.mcts.uct.AUpdatingPolicy
getAction, getBehaviorWhenActionForNotFullyExploredStateIsRequested, getGamma, getLabelOfNode, isMaximize, setBehaviorWhenActionForNotFullyExploredStateIsRequested
Method Detail
getLoggerName
public java.lang.String getLoggerName()
Specified by:
getLoggerName in interface org.api4.java.common.control.ILoggingCustomizable
Overrides:
getLoggerName in class UCBPolicy<N,A>
setLoggerName
public void setLoggerName(java.lang.String name)
Specified by:
setLoggerName in interface org.api4.java.common.control.ILoggingCustomizable
Overrides:
setLoggerName in class UCBPolicy<N,A>
updatePath
public void updatePath(org.api4.java.datastructure.graph.ILabeledPath<N,A> path, java.util.List<java.lang.Double> scores)
Description copied from class: AUpdatingPolicy
Note that this is a transposition-based, and hence only partially path-dependent, update. The labels are associated with nodes of the original MDP (states) and not with nodes in the MCTS search tree (paths). This means that several paths are, in fact, (partially) updated simultaneously. However, on all other paths crossing the nodes on the updated paths, only those situations are updated and not the situations in higher nodes of the search tree.
Specified by:
updatePath in interface IPathUpdatablePolicy<N,A,java.lang.Double>
Overrides:
updatePath in class AUpdatingPolicy<N,A>
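The transposition-based behavior described for updatePath can be illustrated with a minimal, self-contained sketch. It is not the jaicore implementation: the class TranspositionUpdateSketch, its Stats holder, the use of plain String labels for states and actions, and the three-list signature are all assumptions made for illustration. The only point it demonstrates is that statistics are stored per state rather than per search-tree path, so two rollouts that pass through the same state share that state's statistics, while states higher up on the other path stay untouched.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical illustration of a transposition-based update; not the jaicore code. */
final class TranspositionUpdateSketch {

    /** Visit count and accumulated score for one (state, action) pair. */
    static final class Stats {
        int visits;
        double scoreSum;
    }

    // Statistics are keyed by the state itself, not by the path that led to it.
    private final Map<String, Map<String, Stats>> statsByState = new HashMap<>();

    /**
     * Updates every (state, action) pair along one rollout.
     * states.get(i) is the state in which actions.get(i) was taken,
     * and scores.get(i) is the score credited to that step.
     */
    void updatePath(List<String> states, List<String> actions, List<Double> scores) {
        for (int i = 0; i < actions.size(); i++) {
            Stats s = statsByState
                    .computeIfAbsent(states.get(i), k -> new HashMap<>())
                    .computeIfAbsent(actions.get(i), k -> new Stats());
            s.visits++;
            s.scoreSum += scores.get(i);
        }
    }

    /** Number of recorded updates for the pair, or 0 if it was never seen. */
    int getVisits(String state, String action) {
        Stats s = lookup(state, action);
        return s == null ? 0 : s.visits;
    }

    /** Mean score of the pair, or NaN if it was never seen. */
    double getEmpiricalMean(String state, String action) {
        Stats s = lookup(state, action);
        return s == null ? Double.NaN : s.scoreSum / s.visits;
    }

    private Stats lookup(String state, String action) {
        Map<String, Stats> perAction = statsByState.get(state);
        return perAction == null ? null : perAction.get(action);
    }
}
```

In this sketch, updating the rollout A, C with actions x, z and then the rollout B, C with actions y, z leaves the pair (C, z) with two visits, because both rollouts reach the same state C. The pair (A, x) still has a single visit, which mirrors the statement that situations in higher nodes of the other path are not updated.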