ToM-Dyna-Q: on the integration of reinforcement learning and machine Theory of Mind

Kröhling, Dan; Martínez, Ernesto

Buscar material

Busque entre los 169416 recursos disponibles en el repositorio

Subir material

Suba sus trabajos a SEDICI, para mejorar notoriamente su visibilidad e impacto

Red de Universidades con Carreras en Informática (RedUNCI)
→
Eventos
→
CACIC
→
CACIC 2018

Mostrar el registro sencillo del ítem

dc.date.accessioned	2019-03-12T13:54:51Z
dc.date.available	2019-03-12T13:54:51Z
dc.date.issued	2018
dc.identifier.uri	http://sedici.unlp.edu.ar/handle/10915/73032
dc.description.abstract	The capacity to understand others, or to reason about others’ ways of reasoning about others (including us), is fundamental for an agent to survive in a multi-agent uncertain environment. This reasoning ability, commonly known as Theory of Mind, is instrumental for making effective predictions over others’ future actions and learning from both real and simulated experience. In this work, a novel architecture for model-based reinforcement learning in a multi-agent setting is proposed. The proposed architecture, called ToM-Dyna-Q, integrates ToM simulation alongside with the well-known Dyna-Q architecture to account for artificial cognition in a shared environment inhabited by multiple agents interacting with each other. Results obtained for the two-player competitive game of Tic-Tac-Toe demonstrate the importance for a given agent of learning, reasoning and planning based on mental simulation modeling of other agents’ goals, beliefs and intentions.	en
dc.format.extent	32-41	es
dc.language	en	es
dc.subject	intelligent agents	en
dc.subject	prediction machines	en
dc.subject	reinforcement learning	en
dc.subject	theory of mind	en
dc.title	ToM-Dyna-Q: on the integration of reinforcement learning and machine Theory of Mind	en
dc.type	Objeto de conferencia	es
sedici.identifier.isbn	978-950-658-472-6	es
sedici.creator.person	Kröhling, Dan	es
sedici.creator.person	Martínez, Ernesto	es
sedici.description.note	XIX Workshop Agentes y Sistemas Inteligentes (WASI)	es
sedici.subject.materias	Ciencias Informáticas	es
sedici.description.fulltext	true	es
mods.originInfo.place	Red de Universidades con Carreras en Informática (RedUNCI)	es
sedici.subtype	Objeto de conferencia	es
sedici.rights.license	Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)
sedici.rights.uri	http://creativecommons.org/licenses/by-nc-sa/4.0/
sedici.date.exposure	2018-10
sedici.relation.event	XXIV Congreso Argentino de Ciencias de la Computación (La Plata, 2018).	es
sedici.description.peerReview	peer-review	es

Descargar archivos

Documento completo
Descargar archivo (1.092Mb) - PDF

Este ítem aparece en la(s) siguiente(s) colección(ones)

CACIC → CACIC 2018

Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)

Excepto donde se diga explícitamente, este item se publica bajo la siguiente licencia Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0)

Iniciar sesión