dc.date.accessioned | 2004-02-09T20:33:58Z | |
dc.date.available | 2004-02-09T03:00:00Z | |
dc.date.issued | 2002 | |
dc.identifier.uri | http://sedici.unlp.edu.ar/handle/10915/9432 | |
dc.description.abstract | Q-Learning is a Reinforcement Learning method for solving sequential decision problems, where the utility of actions depends on a sequence of decisions and there is uncertainty about the dynamics of the environment in which the agent is situated. This general framework has allowed Q-Learning and other Reinforcement Learning methods to be applied to a broad spectrum of complex real-world problems such as robotics, industrial manufacturing, games and others. Despite its interesting properties, Q-Learning is a very slow method that requires a long training period to learn an acceptable policy. To solve or at least reduce this problem, we propose a parallel implementation model of Q-Learning that uses a tabular representation and a cache-based communication scheme. This model is applied to a particular problem, and the results obtained with different processor configurations are reported. Finally, a brief discussion of the properties and current limitations of our approach is presented. | en |
dc.language | en | es |
dc.subject | Parallel programming | es |
dc.subject | Computer Communication Networks | es |
dc.subject | communication based on cache | en |
dc.subject | reinforcement learning | en |
dc.subject | Computer Science | es |
dc.subject | Learning | es |
dc.subject | asynchronous dynamic programming | en |
dc.title | A parallel implementation of Q-learning based on communication with cache | en |
dc.type | Article | es |
sedici.identifier.uri | http://journal.info.unlp.edu.ar/wp-content/uploads/p41.pdf | es |
sedici.creator.person | Printista, Alicia Marcela | es |
sedici.creator.person | Errecalde, Marcelo Luis | es |
sedici.creator.person | Montoya, Cecilia Inés | es |
sedici.subject.materias | Computer Sciences | es |
sedici.description.fulltext | true | es |
mods.originInfo.place | Facultad de Informática | es |
sedici.subtype | Article | es |
sedici.rights.license | Creative Commons Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0) | |
sedici.rights.uri | http://creativecommons.org/licenses/by-nc/3.0/ | |
sedici.description.peerReview | peer-review | es |
sedici2003.identifier | ARG-UNLP-ART-0000000090 | es |
sedici.relation.journalTitle | vol. 1, no. 6 | es |
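
For readers unfamiliar with the tabular representation mentioned in the abstract, the following is a minimal sketch of the standard single-agent Q-learning update rule. It does not reproduce the parallel, cache-based model the article proposes. The function name `q_update`, the dictionary-based table, and the default values of `alpha` and `gamma` are assumptions made for illustration only.

```python
def q_update(q, state, action, reward, next_state, actions,
             alpha=0.1, gamma=0.9):
    """Apply one standard tabular Q-learning update in place.

    q is a dict mapping (state, action) pairs to estimated values;
    missing entries are treated as 0.0. alpha (learning rate) and
    gamma (discount factor) are illustrative defaults, not values
    taken from the article.
    """
    # Value of the best action available in the successor state.
    best_next = max(q.get((next_state, a), 0.0) for a in actions)
    old = q.get((state, action), 0.0)
    # Move the old estimate toward the one-step bootstrapped target.
    q[(state, action)] = old + alpha * (reward + gamma * best_next - old)
    return q[(state, action)]


if __name__ == "__main__":
    table = {}
    q_update(table, 0, "right", 1.0, 1, ["left", "right"])
    print(table)
```

Because each update touches a single table entry, many such updates are needed before the estimates settle, which is consistent with the slow training the abstract describes.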