Abstract—A reinforcement learning (RL) agent mostly assumes environments are stationary which is not feasible on most real world problems. Most RL approaches adapt slow changes by forgetting the previous dynamics of the environment. Reinforcement learning-context detection (RL-CD) is a technique that helps determine changes of the environment’s nature which the agent with the capability to learn different dynamics of the non-stationary environment. In this study we propose an autonomous agent that learns a dynamic environment by taking advantage of hierarchical reinforcement learning (HRL) and present how the hierarchical structure can be integrated into RL-CD to speed up the convergence of a policy.
Index Terms—Reinforcement learning, autonomous agent, hierarchical reinforcement learning, non-stationary environment, betweenness centrality, prioritized sweeping.
Yiğit E. Yücesoy is with the Halic University, Istanbul, Turkey (e-mail: email@example.com).
M. Borahan Tümer is with the Marmara University, Istanbul, Turkey (e-mail: firstname.lastname@example.org).
Cite: Yiğit E. Yücesoy and M. Borahan Tümer, "Hierarchical Reinforcement Learning with Context Detection (HRL-CD)," International Journal of Machine Learning and Computing vol.5, no. 5, pp. 353-358, 2015.