Using combination of actions in reinforcement learning

Karanik, Marcelo J.; Gramajo, Sergio D.

Buscar material

Busque entre los 171219 recursos disponibles en el repositorio

Subir material

Suba sus trabajos a SEDICI, para mejorar notoriamente su visibilidad e impacto

Revistas
→
Journal of Computer Science & Technology
→
Volumen 10 | Número 01

Mostrar el registro sencillo del ítem

dc.date.accessioned	2010-03-22T12:56:34Z
dc.date.available	2010-03-22T03:00:00Z
dc.date.issued	2010-04
dc.identifier.uri	http://sedici.unlp.edu.ar/handle/10915/9663
dc.description.abstract	Software agents are programs that can observe their environment and act in an attempt to reach their design goals. In most cases the selection of particular agent architecture determines the behaviour in response to the different problem states However, there are some problem domains in which it is desirable that the agent learns a good action execution policy by interacting with its environment. This kind of learning is called Reinforcement Learning and it is useful in the process control area. Given a problem state, the agent selects the adequate action to do and receives an immediate reward, then estimations about every action are updated and, after a certain period of time, the agent learns which the best action to be executed is. Most reinforcement learning algorithms perform simple actions while two or more are capable of being used. This work involves the use of RL algorithms to find an optimal policy in a gridworld problem and proposes a mechanism to combine actions of different types.	en
dc.format.extent	19-23	es
dc.language	en	es
dc.subject	Learning	es
dc.subject	SARSA	en
dc.subject	optimal policy	en
dc.subject	action combination	en
dc.title	Using combination of actions in reinforcement learning	en
dc.type	Articulo	es
sedici.identifier.uri	http://journal.info.unlp.edu.ar/wp-content/uploads/JCST-Apr10-4.pdf	es
sedici.identifier.issn	1666-6038	es
sedici.creator.person	Karanik, Marcelo J.	es
sedici.creator.person	Gramajo, Sergio D.	es
sedici.subject.materias	Ciencias Informáticas	es
sedici.description.fulltext	true	es
mods.originInfo.place	Facultad de Informática	es
sedici.subtype	Articulo	es
sedici.rights.license	Creative Commons Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0)
sedici.rights.uri	http://creativecommons.org/licenses/by-nc/3.0/
sedici.description.peerReview	peer-review	es
sedici2003.identifier	ARG-UNLP-ART-0000005733	es
sedici.relation.journalTitle	Journal of Computer Science & Technology	es
sedici.relation.journalVolumeAndIssue	vol. 10, no. 1	es

Descargar archivos

Documento completo
Descargar archivo (695.4Kb) - PDF

Enlace externo

journal.info.unlp.edu.ar/...

Este ítem aparece en la(s) siguiente(s) colección(ones)

Journal of Computer Science & Technology → Volumen 10 | Número 01

Creative Commons Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0)

Excepto donde se diga explícitamente, este item se publica bajo la siguiente licencia Creative Commons Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0)

Iniciar sesión