Subir material

Suba sus trabajos a SEDICI, para mejorar notoriamente su visibilidad e impacto

 

Mostrar el registro sencillo del ítem

dc.date.accessioned 2004-02-09T20:33:58Z
dc.date.available 2004-02-09T03:00:00Z
dc.date.issued 2002
dc.identifier.uri http://sedici.unlp.edu.ar/handle/10915/9432
dc.description.abstract Q-Learning is a Reinforcement Learning method for solving sequential decision problems, where the utility of actions depends on a sequence of decisions and there exists uncertainty about the dynamics of the environment the agent is situated on. This general framework has allowed that Q-Learning and other Reinforcement Learning methods to be applied to a broad spectrum of complex real world problems such as robotics, industrial manufacturing, games and others. Despite its interesting properties, Q-learning is a very slow method that requires a long period of training for learning an acceptable policy. In order to solve or at least reduce this problem, we propose a parallel implementation model of Q-learning using a tabular representation and via a communication scheme based on cache. This model is applied to a particular problem and the results obtained with different processor configurations are reported. A brief discussion about the properties and current limitations of our approach is finally presented. en
dc.language en es
dc.subject Parallel programming es
dc.subject Redes de Comunicación de Computadores es
dc.subject communication based on cache en
dc.subject reinforcement learning en
dc.subject Informática es
dc.subject Aprendizaje es
dc.subject asynchronous dynamic programming en
dc.title A parallel implementation of Q-learning based on communication with cache en
dc.type Articulo es
sedici.identifier.uri http://journal.info.unlp.edu.ar/wp-content/uploads/p41.pdf es
sedici.creator.person Printista, Alicia Marcela es
sedici.creator.person Errecalde, Marcelo Luis es
sedici.creator.person Montoya, Cecilia Inés es
sedici.subject.materias Ciencias Informáticas es
sedici.description.fulltext true es
mods.originInfo.place Facultad de Informática es
sedici.subtype Articulo es
sedici.rights.license Creative Commons Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0)
sedici.rights.uri http://creativecommons.org/licenses/by-nc/3.0/
sedici.description.peerReview peer-review es
sedici2003.identifier ARG-UNLP-ART-0000000090 es
sedici.relation.journalTitle vol. 1, no. 6 es


Descargar archivos

Este ítem aparece en la(s) siguiente(s) colección(ones)

Creative Commons Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0) Excepto donde se diga explícitamente, este item se publica bajo la siguiente licencia Creative Commons Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0)