Should I do that? Using relational reinforcement learning and declarative programming to discover domain axioms

Mohan Sridharan, Ben Meadows

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution


Abstract

Robots assisting humans in complex domains need the ability to represent, reason with, and learn from different descriptions of incomplete domain knowledge and uncertainty. This paper focuses on the challenge of incrementally and interactively discovering previously unknown axioms governing domain dynamics, and describes an architecture that integrates declarative programming and relational reinforcement learning to address this challenge. Answer Set Prolog (ASP), a declarative programming paradigm, is used to represent and reason with incomplete domain knowledge for planning and diagnostics. For any given goal, unexplained failure of plans created by ASP-based inference is taken to indicate the existence of unknown domain axioms. The task of discovering these axioms is formulated as a reinforcement learning problem, and a relational representation is used to incrementally generalize from specific axioms identified over time. These generic axioms are then added to the ASP-based representation for subsequent inference. The architecture's capabilities are demonstrated and evaluated in two domains, Blocks World and Robot Butler.
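The idea of generalizing from specific action failures to a generic axiom can be illustrated with a minimal sketch. It is not the authors' implementation: the block names, the shapes, the hidden rule ("nothing can be stacked on a prism"), and every function name below are hypothetical, and a one-step, bandit-style value update stands in for the relational reinforcement learning described in the abstract. The only point shown is that keying learned values on object attributes rather than object names lets failures observed on individual blocks generalize to a rule over shapes.

```python
import random
from collections import defaultdict

# Hypothetical toy domain: each block has a shape. The hidden axiom, unknown
# to the learner, is that nothing can be stacked on a prism.
BLOCKS = {"a": "cube", "b": "prism", "c": "cube", "d": "prism"}


def execute_stack(top, base):
    """Simulated environment: True if stacking `top` on `base` succeeds."""
    return BLOCKS[base] != "prism"


def relational_key(top, base):
    """Drop block names and keep only the shapes of the two arguments."""
    return (BLOCKS[top], BLOCKS[base])


def learn_values(episodes=1000, alpha=0.5, epsilon=0.3, seed=0):
    """Epsilon-greedy, one-step value learning over relational keys."""
    rng = random.Random(seed)
    values = defaultdict(float)
    actions = [(t, b) for t in BLOCKS for b in BLOCKS if t != b]
    for _ in range(episodes):
        if rng.random() < epsilon:
            top, base = rng.choice(actions)  # explore
        else:
            # exploit: pick the action whose relational key looks best so far
            top, base = max(actions, key=lambda a: values[relational_key(*a)])
        reward = 1.0 if execute_stack(top, base) else -1.0
        key = relational_key(top, base)
        values[key] += alpha * (reward - values[key])
    return values


def candidate_axioms(values, threshold=-0.5):
    """Relational keys whose learned value suggests the action always fails."""
    return sorted(key for key, value in values.items() if value < threshold)


if __name__ == "__main__":
    for top_shape, base_shape in candidate_axioms(learn_values()):
        print(f"stacking a {top_shape} on a {base_shape} appears to fail")
```

In an architecture like the one described, candidates of this kind would then be added to the ASP-based representation as new axioms; that step is omitted here because the abstract does not specify its syntax.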
Original language: English
Title of host publication: 2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)
Publisher: IEEE Computer Society Press
Pages: 252-259
ISBN (Electronic): 9781509050697
ISBN (Print): 9781509050703
Publication status: Published - 19 Sept 2016
Event: 2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob) - Cergy-Pontoise, France
Duration: 19 Sept 2016 to 22 Sept 2016

Conference

Conference: 2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)
Period: 19/09/16 to 22/09/16

Keywords

  • Probabilistic logic
  • Cognition
  • Planning
  • Learning (artificial intelligence)
  • Programming
  • Robot sensing systems
