Should I do that? Using relational reinforcement learning and declarative programming to discover domain axioms

Mohan Sridharan; Ben  Meadows

doi:10.1109/DEVLRN.2016.7846827

Should I do that? Using relational reinforcement learning and declarative programming to discover domain axioms

Mohan Sridharan, Ben Meadows

Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Citations (Scopus)

236 Downloads (Pure)

Abstract

Robots assisting humans in complex domains need the ability to represent, reason with, and learn from, different descriptions of incomplete domain knowledge and uncertainty. This paper focuses on the challenge of incrementally and interactively discovering previously unknown axioms governing domain dynamics, and describes an architecture that integrates declarative programming and relational reinforcement learning to address this challenge. Answer Set Prolog (ASP), a declarative programming paradigm, is used to represent and reason with incomplete domain knowledge for planning and diagnostics. For any given goal, unexplained failure of plans created by ASP-based inference is taken to indicate the existence of unknown domain axioms. The task of discovering these axioms is formulated as a reinforcement learning problem, and a relational representation is used to incrementally generalize from specific axioms identified over time. These generic axioms are then added to the ASP-based representation for subsequent inference. The architecture's capabilities are demonstrated and evaluated in two domains, Blocks World and Robot Butler.

Original language	English
Title of host publication	2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)
Publisher	IEEE Computer Society Press
Pages	252-259
ISBN (Electronic)	9781509050697
ISBN (Print)	9781509050703
DOIs	https://doi.org/10.1109/DEVLRN.2016.7846827
Publication status	Published - 19 Sept 2016
Event	2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob) - Cergy-Pontoise, France Duration: 19 Sept 2016 → 22 Sept 2016

Conference

Conference	2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)
Period	19/09/16 → 22/09/16

Keywords

Probabilistic logic
Cognition
Planning
Learning (artificial intelligence)
Programming
Robot sensing systems

Access to Document

10.1109/DEVLRN.2016.7846827Licence: None: All rights reserved

Sridharan_Meadows_Should_I_do_that_Development_and_Learning_and_Epigenetic_Robotics_2016
M. Sridharan and B. Meadows, "Should I do that? using relational reinforcement learning and declarative programming to discover domain axioms," 2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob), Cergy-Pontoise, 2016, pp. 252-259. (© 2016 IEEE) doi: 10.1109/DEVLRN.2016.7846827
Accepted author manuscript, 410 KBLicence: None: All rights reserved

http://ieeexplore.ieee.org/document/7846827/Licence: None: All rights reserved

Cite this

@inproceedings{de2f16b9e2c14e869d8cb8486ef8cfb2,

title = "Should I do that? Using relational reinforcement learning and declarative programming to discover domain axioms",

abstract = "Robots assisting humans in complex domains need the ability to represent, reason with, and learn from, different descriptions of incomplete domain knowledge and uncertainty. This paper focuses on the challenge of incrementally and interactively discovering previously unknown axioms governing domain dynamics, and describes an architecture that integrates declarative programming and relational reinforcement learning to address this challenge. Answer Set Prolog (ASP), a declarative programming paradigm, is used to represent and reason with incomplete domain knowledge for planning and diagnostics. For any given goal, unexplained failure of plans created by ASP-based inference is taken to indicate the existence of unknown domain axioms. The task of discovering these axioms is formulated as a reinforcement learning problem, and a relational representation is used to incrementally generalize from specific axioms identified over time. These generic axioms are then added to the ASP-based representation for subsequent inference. The architecture's capabilities are demonstrated and evaluated in two domains, Blocks World and Robot Butler.",

keywords = "Probabilistic logic, Cognition, Planning, Learning (artificial intelligence), Programming, Robot sensing systems",

author = "Mohan Sridharan and Ben Meadows",

year = "2016",

month = sep,

day = "19",

doi = "10.1109/DEVLRN.2016.7846827",

language = "English",

isbn = "9781509050703 ",

pages = "252--259",

booktitle = "2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)",

publisher = "IEEE Computer Society Press",

note = "2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob) ; Conference date: 19-09-2016 Through 22-09-2016",

}

Sridharan, M & Meadows, B 2016, Should I do that? Using relational reinforcement learning and declarative programming to discover domain axioms. in 2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob) . IEEE Computer Society Press, pp. 252-259, 2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob), 19/09/16. https://doi.org/10.1109/DEVLRN.2016.7846827

Should I do that? Using relational reinforcement learning and declarative programming to discover domain axioms. / Sridharan, Mohan; Meadows, Ben .
2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob) . IEEE Computer Society Press, 2016. p. 252-259.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

TY - GEN

T1 - Should I do that? Using relational reinforcement learning and declarative programming to discover domain axioms

AU - Sridharan, Mohan

AU - Meadows, Ben

PY - 2016/9/19

Y1 - 2016/9/19

N2 - Robots assisting humans in complex domains need the ability to represent, reason with, and learn from, different descriptions of incomplete domain knowledge and uncertainty. This paper focuses on the challenge of incrementally and interactively discovering previously unknown axioms governing domain dynamics, and describes an architecture that integrates declarative programming and relational reinforcement learning to address this challenge. Answer Set Prolog (ASP), a declarative programming paradigm, is used to represent and reason with incomplete domain knowledge for planning and diagnostics. For any given goal, unexplained failure of plans created by ASP-based inference is taken to indicate the existence of unknown domain axioms. The task of discovering these axioms is formulated as a reinforcement learning problem, and a relational representation is used to incrementally generalize from specific axioms identified over time. These generic axioms are then added to the ASP-based representation for subsequent inference. The architecture's capabilities are demonstrated and evaluated in two domains, Blocks World and Robot Butler.

AB - Robots assisting humans in complex domains need the ability to represent, reason with, and learn from, different descriptions of incomplete domain knowledge and uncertainty. This paper focuses on the challenge of incrementally and interactively discovering previously unknown axioms governing domain dynamics, and describes an architecture that integrates declarative programming and relational reinforcement learning to address this challenge. Answer Set Prolog (ASP), a declarative programming paradigm, is used to represent and reason with incomplete domain knowledge for planning and diagnostics. For any given goal, unexplained failure of plans created by ASP-based inference is taken to indicate the existence of unknown domain axioms. The task of discovering these axioms is formulated as a reinforcement learning problem, and a relational representation is used to incrementally generalize from specific axioms identified over time. These generic axioms are then added to the ASP-based representation for subsequent inference. The architecture's capabilities are demonstrated and evaluated in two domains, Blocks World and Robot Butler.

KW - Probabilistic logic

KW - Cognition

KW - Planning

KW - Learning (artificial intelligence)

KW - Programming

KW - Robot sensing systems

U2 - 10.1109/DEVLRN.2016.7846827

DO - 10.1109/DEVLRN.2016.7846827

M3 - Conference contribution

SN - 9781509050703

SP - 252

EP - 259

BT - 2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)

PB - IEEE Computer Society Press

T2 - 2016 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)

Y2 - 19 September 2016 through 22 September 2016

ER -

Should I do that? Using relational reinforcement learning and declarative programming to discover domain axioms

Abstract

Conference

Keywords

Access to Document

Fingerprint

Cite this