HEXQ

HEXQ is a reinforcement learning algorithm, created by Bernhard Hengst, that attempts to solve a Markov decision process by decomposing it hierarchically.
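
As a rough illustration of what a hierarchical decomposition can start from: in Hengst's paper, the state variables are ordered by how often they change, and the most frequently changing variable forms the lowest level of the hierarchy. The sketch below is not Hengst's implementation. It shows only that ordering step, and the function name `order_variables_by_change_frequency` and the (room, position) trajectory are hypothetical, chosen for illustration.

```python
def order_variables_by_change_frequency(trajectory):
    """Return state-variable indices, most frequently changing first.

    `trajectory` is a list of equal-length tuples, one per time step.
    This is an illustrative helper, not part of any published HEXQ code.
    """
    n_vars = len(trajectory[0])
    changes = [0] * n_vars
    # Count, for each variable, how many consecutive steps changed its value.
    for prev, curr in zip(trajectory, trajectory[1:]):
        for i in range(n_vars):
            if prev[i] != curr[i]:
                changes[i] += 1
    # Highest change count first; ties keep their original index order.
    return sorted(range(n_vars), key=lambda i: changes[i], reverse=True)


# Made-up trajectory of (room, position) states: the position changes at
# every step, while the room changes only occasionally.
trajectory = [(0, 0), (0, 1), (0, 2), (1, 0), (1, 1), (1, 2), (2, 0)]
ordering = order_variables_by_change_frequency(trajectory)
```

Here the position variable (index 1) would sit at the bottom level and the room variable (index 0) above it. The later steps of the algorithm, which build sub-problems and policies on top of this ordering, are not shown.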

Bernhard Hengst (2002). "Discovering Hierarchy in Reinforcement Learning with HEXQ".