A parallel implementation of Q-learning based on communication with cache

Printista, Alicia Marcela; Errecalde, Marcelo Luis; Montoya, Cecilia Inés

dc.date.accessioned	2004-02-09T20:33:58Z
dc.date.available	2004-02-09T03:00:00Z
dc.date.issued	2002
dc.identifier.uri	http://sedici.unlp.edu.ar/handle/10915/9432
dc.description.abstract	Q-learning is a Reinforcement Learning method for solving sequential decision problems, where the utility of actions depends on a sequence of decisions and there is uncertainty about the dynamics of the environment in which the agent is situated. This general framework has allowed Q-learning and other Reinforcement Learning methods to be applied to a broad spectrum of complex real-world problems such as robotics, industrial manufacturing, games and others. Despite its interesting properties, Q-learning is a very slow method that requires a long training period to learn an acceptable policy. In order to solve or at least reduce this problem, we propose a parallel implementation model of Q-learning that uses a tabular representation and a cache-based communication scheme. This model is applied to a particular problem and the results obtained with different processor configurations are reported. Finally, a brief discussion of the properties and current limitations of our approach is presented.	en
dc.language	en	es
dc.subject	Parallel programming	es
dc.subject	Computer Communication Networks	es
dc.subject	communication based on cache	en
dc.subject	reinforcement learning	en
dc.subject	Computer Science	es
dc.subject	Learning	es
dc.subject	asynchronous dynamic programming	en
dc.title	A parallel implementation of Q-learning based on communication with cache	en
dc.type	Article	es
sedici.identifier.uri	http://journal.info.unlp.edu.ar/wp-content/uploads/p41.pdf	es
sedici.creator.person	Printista, Alicia Marcela	es
sedici.creator.person	Errecalde, Marcelo Luis	es
sedici.creator.person	Montoya, Cecilia Inés	es
sedici.subject.materias	Computer Sciences	es
sedici.description.fulltext	true	es
mods.originInfo.place	Facultad de Informática	es
sedici.subtype	Article	es
sedici.rights.license	Creative Commons Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0)
sedici.rights.uri	http://creativecommons.org/licenses/by-nc/3.0/
sedici.description.peerReview	peer-review	es
sedici2003.identifier	ARG-UNLP-ART-0000000090	es
sedici.relation.journalTitle	vol. 1, no. 6	es
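
For readers unfamiliar with the method named in the abstract, the following is a minimal sketch of the standard single-process tabular Q-learning update. It is not the parallel, cache-based scheme proposed in the article, whose details are not part of this record. The five-state chain environment, the parameter values (`alpha`, `gamma`, `epsilon`) and all function names are assumptions chosen for illustration only.

```python
import random


def q_update(q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Apply one standard tabular Q-learning update in place; return the new value."""
    best_next = max(q[next_state])  # value of the greedy action in the next state
    q[state][action] += alpha * (reward + gamma * best_next - q[state][action])
    return q[state][action]


def step(state, action, n_states):
    """Hypothetical chain environment: action 1 moves right, action 0 moves left.

    Reaching the rightmost state yields reward 1.0 and ends the episode.
    """
    if action == 1:
        next_state = min(state + 1, n_states - 1)
    else:
        next_state = max(state - 1, 0)
    done = next_state == n_states - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(n_states=5, episodes=500, epsilon=0.2, seed=0):
    """Learn a Q-table (one row per state, one column per action) on the chain."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            if rng.random() < epsilon:
                action = rng.randrange(2)  # explore
            else:
                best = max(q[state])  # exploit, breaking ties at random
                action = rng.choice([a for a in (0, 1) if q[state][a] == best])
            next_state, reward, done = step(state, action, n_states)
            q_update(q, state, action, reward, next_state)
            state = next_state
    return q
```

Calling `train()` returns the learned table; acting greedily then means choosing, in each state, the action with the larger entry in that state's row. Every update touches a single table cell, which is why long training runs are needed and why the abstract describes the method as slow.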

This item appears in the following collection(s)

Journal of Computer Science & Technology → Volume 01 | Number 06

Except where otherwise noted, this item is published under the following license: Creative Commons Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0)