Using combination of actions in reinforcement learning

Karanik, Marcelo J.; Gramajo, Sergio D.

2010

Document type: Article

Abstract

Software agents are programs that can observe their environment and act in an attempt to reach their design goals. In most cases, the selection of a particular agent architecture determines the agent's behaviour in response to the different problem states. However, there are some problem domains in which it is desirable that the agent learn a good action execution policy by interacting with its environment. This kind of learning is called Reinforcement Learning (RL), and it is useful in the process control area. Given a problem state, the agent selects an adequate action and receives an immediate reward; the estimates for every action are then updated and, after a certain period of time, the agent learns which action is best to execute. Most reinforcement learning algorithms perform one simple action at a time, even when two or more actions could be used together. This work involves the use of RL algorithms to find an optimal policy in a gridworld problem and proposes a mechanism to combine actions of different types.
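
The learning loop summarized in the abstract (select an action, receive an immediate reward, update the action estimates) can be illustrated with a minimal SARSA sketch on a small gridworld. Everything specific here is an assumption made for illustration and is not taken from the paper: the 4x4 grid, the reward of -1 per step, the hyperparameters, and in particular the `COMBINED` table, which simply treats a pair of simple moves as one extra action. The authors' actual combination mechanism is described in the full text and may differ.

```python
import random

# Hypothetical 4x4 gridworld: start at (0, 0), goal at (3, 3).
SIZE = 4
START, GOAL = (0, 0), (3, 3)

# Simple actions are single moves. "Combined" actions are an assumption of
# this sketch (not the paper's mechanism): two simple moves taken in one step.
SIMPLE = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}
COMBINED = {"down+right": ("down", "right"), "up+left": ("up", "left")}
ACTIONS = list(SIMPLE) + list(COMBINED)


def move(state, name):
    """Apply one simple move, staying in place if it would leave the grid."""
    row, col = state
    d_row, d_col = SIMPLE[name]
    new_row, new_col = row + d_row, col + d_col
    inside = 0 <= new_row < SIZE and 0 <= new_col < SIZE
    return (new_row, new_col) if inside else state


def step(state, action):
    """Return (next_state, reward, done) for a simple or combined action."""
    for name in COMBINED.get(action, (action,)):
        state = move(state, name)
        if state == GOAL:
            break
    done = state == GOAL
    return state, (0.0 if done else -1.0), done


def epsilon_greedy(q, state, epsilon, rng):
    """Pick a random action with probability epsilon, else the best known."""
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q.get((state, a), 0.0))


def sarsa(episodes=500, alpha=0.5, gamma=0.95, epsilon=0.1, seed=0):
    """Tabular SARSA: Q(s,a) += alpha * (r + gamma * Q(s',a') - Q(s,a))."""
    rng = random.Random(seed)
    q = {}
    for _ in range(episodes):
        state = START
        action = epsilon_greedy(q, state, epsilon, rng)
        done = False
        while not done:
            next_state, reward, done = step(state, action)
            next_action = epsilon_greedy(q, next_state, epsilon, rng)
            bootstrap = 0.0 if done else q.get((next_state, next_action), 0.0)
            old = q.get((state, action), 0.0)
            q[(state, action)] = old + alpha * (reward + gamma * bootstrap - old)
            state, action = next_state, next_action
    return q


def greedy_path(q, max_steps=20):
    """Follow the greedy policy from START for at most max_steps steps."""
    state, path = START, [START]
    for _ in range(max_steps):
        action = max(ACTIONS, key=lambda a: q.get((state, a), 0.0))
        state, _, done = step(state, action)
        path.append(state)
        if done:
            break
    return path
```

Calling `greedy_path(sarsa())` returns the sequence of cells visited under the learned greedy policy. Treating a combined move as one more entry in the action set keeps the SARSA update unchanged, which is the main reason this sketch uses that design; it is only one of several plausible ways to combine actions.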

General information

Publication date: April 2010

Document language: English

Journal: Journal of Computer Science & Technology; vol. 10, no. 1

Institution of origin: Facultad de Informática

ISSN: 1666-6038

Pages: 19-23

Keywords: Learning; SARSA; optimal policy; action combination

Subjects: Computer Science

Created on: 22 March 2010

Available in SEDICI since: 22 March 2010

Please use one of these identifiers (URI) to cite or link to this item:

http://sedici.unlp.edu.ar/handle/10915/9663

This item appears in the following collection(s)

Journal of Computer Science & Technology → Volume 10 | Number 01

Except where explicitly stated otherwise, this item is published under the following license: Creative Commons Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0)
