Subir material

Suba sus trabajos a SEDICI, para mejorar notoriamente su visibilidad e impacto

 

Mostrar el registro sencillo del ítem

dc.date.accessioned 2019-03-12T13:54:51Z
dc.date.available 2019-03-12T13:54:51Z
dc.date.issued 2018
dc.identifier.uri http://sedici.unlp.edu.ar/handle/10915/73032
dc.description.abstract The capacity to understand others, or to reason about others’ ways of reasoning about others (including us), is fundamental for an agent to survive in a multi-agent uncertain environment. This reasoning ability, commonly known as Theory of Mind, is instrumental for making effective predictions over others’ future actions and learning from both real and simulated experience. In this work, a novel architecture for model-based reinforcement learning in a multi-agent setting is proposed. The proposed architecture, called ToM-Dyna-Q, integrates ToM simulation alongside with the well-known Dyna-Q architecture to account for artificial cognition in a shared environment inhabited by multiple agents interacting with each other. Results obtained for the two-player competitive game of Tic-Tac-Toe demonstrate the importance for a given agent of learning, reasoning and planning based on mental simulation modeling of other agents’ goals, beliefs and intentions. en
dc.format.extent 32-41 es
dc.language en es
dc.subject intelligent agents en
dc.subject prediction machines en
dc.subject reinforcement learning en
dc.subject theory of mind en
dc.title ToM-Dyna-Q: on the integration of reinforcement learning and machine Theory of Mind en
dc.type Objeto de conferencia es
sedici.identifier.isbn 978-950-658-472-6 es
sedici.creator.person Kröhling, Dan es
sedici.creator.person Martínez, Ernesto es
sedici.description.note XIX Workshop Agentes y Sistemas Inteligentes (WASI) es
sedici.subject.materias Ciencias Informáticas es
sedici.description.fulltext true es
mods.originInfo.place Red de Universidades con Carreras en Informática (RedUNCI) es
sedici.subtype Objeto de conferencia es
sedici.rights.license Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
sedici.rights.uri http://creativecommons.org/licenses/by-nc-sa/4.0/
sedici.date.exposure 2018-10
sedici.relation.event XXIV Congreso Argentino de Ciencias de la Computación (La Plata, 2018). es
sedici.description.peerReview peer-review es


Descargar archivos

Este ítem aparece en la(s) siguiente(s) colección(ones)

Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) Excepto donde se diga explícitamente, este item se publica bajo la siguiente licencia Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)