Instance-based state identification for reinforcement learning
Instance-Based State Identi cation for Reinforcement Learning Department of Computer Science University of Rochester Rochester, NY 14627-0226 firstname.lastname@example.org
R. Andrew McCallum
Abstract This paper presents instance-based state identi cation, an approach to reinforcement learning and hidden state that builds disambiguating amounts of short-term memory on-line, and also learns with an order of magnitude fewer training steps than several previous approaches. Inspired by a key similaritybetween learning with hidden state and learning in continuous geometrical spaces, this approach uses instance-based (or\memory-based") learning, a method that has worked well in continuous spaces.
1 BACKGROUND AND RELATED WORK When a robot's next course of action depends on information that is hidden from the sensors because of problems such as occlusion, restricted range, bounded eld of view and limited attention, the robot su ers from hidden state. More formally, we say a reinforcement learning agent su ers from the hidden state problem if the agent's state representation is non-Markovian with respect to actions and utility. The hidden state problem arises as a case of perceptual aliasing: the mapping between states of the world and sensations of the agent is not one-to-one Whitehead, 1992]. If the agent's perceptual system produces the same outputs for two world states in which di erent actions are required, and if the agent's state representation consists only of its percepts, then the agent will fail to choose correct actions. Note that even if an agent's state representation includes some internal state beyond its
...FOR ATM NETWORKS USING REINFORCEMENT LEARNING AL...
QOS-BASED ROUTING SCHEME FOR ATM NETWORKS USING REINFORCEMENT LEARNING ..., whose objective is the identification and selection of a suitable ...
...Near-optimal Policy Identification_免费下载
Regression (KRR), for Optimal Policy Identification in Reinforcement Learning....Support Vector Machines And Other Kernelbased Learning Methods. Cambridge ...
...Barcelona, Spain REINFORCEMENT LEARNING OF FUZZY...
Barcelona, Spain REINFORCEMENT LEARNING OF FUZZY ...The NN and RL control schemes are based on the...The structure identification of a FLC includes the...
Transfer learning for reinforcement learning. classi?ca7on, and regression ...Instance-based Transfer Learning Approaches Case I: Unlabeled Target Problem ...
Transfer learning with applications
Transfer learning for ? Transfer learning for reinforcement learning. ... Data Shift in Machine Learning, MIT Press 2009] 23 Instance-based ...
Principles of Learning in Humans and Machines
(1997). Instance-based learning. Ch. 8 in Machine learning (pp. 230248...Temporal difference learning ? Reinforcement learning for neural networks ? ...
homicide rate in the state Identification of EBD ...settings (for instance classroom and lunchroom) ?...? ? Positive Reinforcement Response Cost Proximity ...
learning 社会强化 social reinforcement 社会赞许 ...learning 识别学习 identification learning 直觉学习 ...learning 肯定例证 positive instance 否定例证 ...
The fundamental neural networks Neural network based system identification ...? ? ? Fuzzy control Learning human by demonstration Reinforcement learning ...
- 《小兔运南瓜》PPT (1)