Discovering Domain Axioms Using Relational Reinforcement Learning and Declarative Programming

Mohan Sridharan; Prashanth Devarakonda; Rashmica Gupta

Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

This paper presents an architecture that integrates declarative programming and relational reinforcement learning to support incremental and interactive discovery of previously unknown axioms governing domain dynamics. Specifically, Answer Set Prolog (ASP), a declarative programming paradigm, is used to represent and reason with incomplete commonsense domain knowledge. For any given goal, any unexplained failure of plans created by inference in the ASP program is taken to indicate the existence of unknown domain axioms. The task of discovering these axioms is formulated as a reinforcement learning problem, and decision-tree regression with a relational representation is used to incrementally generalize from specific axioms identified over time. These new axioms are added to the ASP program for subsequent inference. We demonstrate and evaluate the capabilities of our architecture in two simulated domains: Blocks World and Simple Mario.

Original language	English
Title of host publication	Proceedings of the 4th Workshop on Planning and Robotics (PlanRob)
Subtitle of host publication	at the 26th International Conference on Automated Planning and Scheduling (ICAPS 2016)
Editors	Alberto Finzi, Erez Karpas
Pages	204-212
Publication status	Published - 13 Jun 2016
Event	4th Workshop on Planning and Robotics (PlanRob) at the 26th International Conference on Automated Planning and Scheduling (ICAPS 2016), London, United Kingdom
Duration	13 Jun 2016 → 14 Jun 2016

Conference

Conference	4th Workshop on Planning and Robotics (PlanRob) at the 26th International Conference on Automated Planning and Scheduling (ICAPS 2016)
Country/Territory	United Kingdom
City	London
Period	13/06/16 → 14/06/16

Access to Document

http://icaps16.icaps-conference.org/proceedings/planrob16.pdf
Licence: None: All rights reserved

Cite this

Sridharan, M., Devarakonda, P., & Gupta, R. (2016). Discovering Domain Axioms Using Relational Reinforcement Learning and Declarative Programming. In A. Finzi, & E. Karpas (Eds.), Proceedings of the 4th Workshop on Planning and Robotics (PlanRob): at the 26th International Conference on Automated Planning and Scheduling (ICAPS 2016) (pp. 204-212). http://icaps16.icaps-conference.org/proceedings/planrob16.pdf

Sridharan, Mohan ; Devarakonda, Prashanth ; Gupta, Rashmica. / Discovering Domain Axioms Using Relational Reinforcement Learning and Declarative Programming. Proceedings of the 4th Workshop on Planning and Robotics (PlanRob) : at the 26th International Conference on Automated Planning and Scheduling (ICAPS 2016). editor / Alberto Finzi ; Erez Karpas. 2016. pp. 204-212

@inproceedings{ebdd9c5f4b094be7874511b8b03cf6f7,

title = "Discovering Domain Axioms Using Relational Reinforcement Learning and Declarative Programming",

abstract = "This paper presents an architecture that integrates declarative programming and relational reinforcement learning to support incremental and interactive discovery of previously unknown axioms governing domain dynamics. Specifically, Answer Set Prolog (ASP), a declarative programming paradigm, is used to represent and reason with incomplete commonsense domain knowledge. For any given goal, any unexplained failure of plans created by inference in the ASP program is taken to indicate the existence of unknown domain axioms. The task of discovering these axioms is formulated as a reinforcement learning problem, and decision-tree regression with a relational representation is used to incrementally generalize from specific axioms identified over time. These new axioms are added to the ASP program for subsequent inference. We demonstrate and evaluate the capabilities of our architecture in two simulated domains: Blocks World and Simple Mario.",

author = "Mohan Sridharan and Prashanth Devarakonda and Rashmica Gupta",

year = "2016",

month = jun,

day = "13",

language = "English",

pages = "204--212",

editor = "Alberto Finzi and Erez Karpas",

booktitle = "Proceedings of the 4th Workshop on Planning and Robotics (PlanRob)",

note = "4th Workshop on Planning and Robotics (PlanRob) at the 26th International Conference on Automated Planning and Scheduling (ICAPS 2016) ; Conference date: 13-06-2016 Through 14-06-2016",

}

Sridharan, M, Devarakonda, P & Gupta, R 2016, Discovering Domain Axioms Using Relational Reinforcement Learning and Declarative Programming. in A Finzi & E Karpas (eds), Proceedings of the 4th Workshop on Planning and Robotics (PlanRob) : at the 26th International Conference on Automated Planning and Scheduling (ICAPS 2016). pp. 204-212, 4th Workshop on Planning and Robotics (PlanRob) at the 26th International Conference on Automated Planning and Scheduling (ICAPS 2016), London, United Kingdom, 13/06/16. <http://icaps16.icaps-conference.org/proceedings/planrob16.pdf>

Discovering Domain Axioms Using Relational Reinforcement Learning and Declarative Programming. / Sridharan, Mohan; Devarakonda, Prashanth; Gupta, Rashmica.
Proceedings of the 4th Workshop on Planning and Robotics (PlanRob) : at the 26th International Conference on Automated Planning and Scheduling (ICAPS 2016). ed. / Alberto Finzi; Erez Karpas. 2016. p. 204-212.

TY - GEN

T1 - Discovering Domain Axioms Using Relational Reinforcement Learning and Declarative Programming

AU - Sridharan, Mohan

AU - Devarakonda, Prashanth

AU - Gupta, Rashmica

PY - 2016/6/13

Y1 - 2016/6/13

N2 - This paper presents an architecture that integrates declarative programming and relational reinforcement learning to support incremental and interactive discovery of previously unknown axioms governing domain dynamics. Specifically, Answer Set Prolog (ASP), a declarative programming paradigm, is used to represent and reason with incomplete commonsense domain knowledge. For any given goal, any unexplained failure of plans created by inference in the ASP program is taken to indicate the existence of unknown domain axioms. The task of discovering these axioms is formulated as a reinforcement learning problem, and decision-tree regression with a relational representation is used to incrementally generalize from specific axioms identified over time. These new axioms are added to the ASP program for subsequent inference. We demonstrate and evaluate the capabilities of our architecture in two simulated domains: Blocks World and Simple Mario.

AB - This paper presents an architecture that integrates declarative programming and relational reinforcement learning to support incremental and interactive discovery of previously unknown axioms governing domain dynamics. Specifically, Answer Set Prolog (ASP), a declarative programming paradigm, is used to represent and reason with incomplete commonsense domain knowledge. For any given goal, any unexplained failure of plans created by inference in the ASP program is taken to indicate the existence of unknown domain axioms. The task of discovering these axioms is formulated as a reinforcement learning problem, and decision-tree regression with a relational representation is used to incrementally generalize from specific axioms identified over time. These new axioms are added to the ASP program for subsequent inference. We demonstrate and evaluate the capabilities of our architecture in two simulated domains: Blocks World and Simple Mario.

M3 - Conference contribution

SP - 204

EP - 212

BT - Proceedings of the 4th Workshop on Planning and Robotics (PlanRob)

A2 - Finzi, Alberto

A2 - Karpas, Erez

T2 - 4th Workshop on Planning and Robotics (PlanRob) at the 26th International Conference on Automated Planning and Scheduling (ICAPS 2016)

Y2 - 13 June 2016 through 14 June 2016

ER -
