Music Generation using Human-In-The-Loop Reinforcement Learning

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Downloads (Pure)

Abstract

This paper presents an approach that combines Human-In-The-Loop Reinforcement Learning (HITL RL) with principles derived from music theory to facilitate real-time generation of musical compositions. HITL RL, previously employed in diverse applications such as modelling humanoid robot mechanics and enhancing language models, harnesses human feedback to refine the training process. In this study, we develop a HILT RL framework that can leverage the constraints and principles in music theory. In particular, we propose an episodic tabular Q-learning algorithm with an epsilon-greedy exploration policy. The system generates musical tracks (compositions), continuously enhancing its quality through iterative human-in-the-loop feedback. The reward function for this process is the subjective musical taste of the user.
Original languageEnglish
Title of host publication2023 IEEE International Conference on Big Data (BigData)
PublisherIEEE
Pages4479-4484
Number of pages6
ISBN (Electronic)9798350324457
ISBN (Print)9798350324464 (PoD)
DOIs
Publication statusPublished - 22 Jan 2024
Event2023 IEEE International Conference on Big Data - Sorrento, Italy
Duration: 15 Dec 202318 Dec 2023

Publication series

NameIEEE International Conference on Big Data
PublisherIEEE
ISSN (Print)2639-1589
ISSN (Electronic)2573-2978

Conference

Conference2023 IEEE International Conference on Big Data
Abbreviated titleBigData 2023
Country/TerritoryItaly
CitySorrento
Period15/12/2318/12/23

Keywords

  • Reinforcement Learning
  • Q-learning
  • Machine learning algorithms
  • Shape
  • Heuristic algorithms
  • Music
  • Human in the loop

Fingerprint

Dive into the research topics of 'Music Generation using Human-In-The-Loop Reinforcement Learning'. Together they form a unique fingerprint.

Cite this