Integrating Reinforcement Learning and Declarative Programming to Learn Causal Laws in Dynamic Domains

Mohan Sridharan, Sarah Rainge

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

Robots deployed to assist and collaborate with humans in complex
domains need the ability to represent and reason with incomplete domain knowledge, and to learn from minimal feedback obtained from non-expert human participants. This paper presents an architecture that combines the complementary strengths of Reinforcement Learning (RL) and declarative programming to support such commonsense reasoning and incremental learning of the rules governing the domain dynamics. Answer Set Prolog (ASP), a declarative language, is used to represent domain knowledge. The robot’s current beliefs, obtained by inference in the ASP program, are used to formulate the task of learning previously unknown domain rules as an RL problem. The learned rules are, in turn, encoded in the ASP program and used to plan action sequences for subsequent tasks. The architecture is illustrated and evaluated in the context of a simulated robot that plans action sequences to arrange tabletop objects in desired configurations.
Original languageEnglish
Title of host publicationSocial Robotics
Subtitle of host publication6th International Conference, ICSR 2014 Sydney, NSW, Australia, October 27-29, 2014 Proceedings
EditorsMichael Beetz, Benjamin Johnston, Mary-Anne Williams
PublisherSpringer
Chapter33
Pages320-329
ISBN (Electronic)978-3-319-11973-1
ISBN (Print)978-3-319-11972-4
DOIs
Publication statusPublished - 27 Oct 2014
Event6th International Conference on Social Robotics, ICSR 2014 - Sydney, Australia
Duration: 27 Oct 201429 Oct 2014

Publication series

NameLecture Notes in Artificial Intelligence
Volume8755
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference6th International Conference on Social Robotics, ICSR 2014
Country/TerritoryAustralia
CitySydney
Period27/10/1429/10/14

Fingerprint

Dive into the research topics of 'Integrating Reinforcement Learning and Declarative Programming to Learn Causal Laws in Dynamic Domains'. Together they form a unique fingerprint.

Cite this