A Q-learning-based memetic algorithm for multi-objective dynamic software project scheduling

Xiao-Ning Shen; Leandro L. Minku; Naresh Marturi; Yi-Nan Guo; Ying Han

doi:10.1016/j.ins.2017.10.041

A Q-learning-based memetic algorithm for multi-objective dynamic software project scheduling

Xiao-Ning Shen, Leandro L. Minku, Naresh Marturi, Yi-Nan Guo, Ying Han

Research output: Contribution to journal › Article › peer-review

30 Citations (Scopus)

381 Downloads (Pure)

Abstract

Software project scheduling is the problem of allocating employees to tasks in a software project. Due to the large scale of current software projects, many studies have investigated the use of optimization algorithms to find good software project schedules. However, despite the importance of human factors to the success of software projects, existing work has considered only a limited number of human properties when formulating software project scheduling as an optimization problem. Moreover, the changing environments of software companies mean that software project scheduling is a dynamic optimization problem. However, there is a lack of effective dynamic scheduling approaches to solve this problem. This work proposes a more realistic mathematical model for the dynamic software project scheduling problem. This model considers that skill proficiency can improve over time and, different from previous work, it considers that such improvement is affected by the employees’ properties of motivation and learning ability, and by the skill difficulty. It also defines the objective of employees’ satisfaction with the allocation. It is considered together with the objectives of project duration, cost, robustness and stability under a variety of practical constraints. To adapt schedules to the dynamically changing software project environments, a multi-objective two-archive memetic algorithm based on Q-learning (MOTAMAQ) is proposed to solve the problem in a proactive-rescheduling way. Different from previous work, MOTAMAQ learns the most appropriate global and local search methods to be used for different software project environment states by using Q-learning. Extensive experiments on 18 dynamic benchmark instances and 3 instances derived from real-world software projects were performed. A comparison with seven other meta-heuristic algorithms shows that the strategies used by our novel approach are very effective in improving its convergence performance in dynamic environments, while maintaining a good distribution and spread of solutions. The Q-learning-based learning mechanism can choose appropriate search operators for the different scheduling environments. We also show how different trade-offs among the five objectives can provide software managers with a deeper insight into various compromises among many objectives, and enabling them to make informed decisions.

Original language	English
Pages (from-to)	1-29
Journal	Information Sciences
Volume	428
Early online date	24 Oct 2017
DOIs	https://doi.org/10.1016/j.ins.2017.10.041
Publication status	Published - 1 Feb 2018

Keywords

metaheuristics
dynamic software project scheduling
multi-objective memetic algorithms
mathematical modeling
Q-learning

Access to Document

10.1016/j.ins.2017.10.041Licence: None: All rights reserved

Shen_et_al_A_Q-learning-based_memetic_algorithm_Information_Sciences_2017
https://doi.org/10.1016/j.ins.2017.10.041
Accepted author manuscript, 1.34 MBLicence: Creative Commons: Attribution-NonCommercial-NoDerivs (CC BY-NC-ND)

http://www.sciencedirect.com/science/article/pii/S0020025517310472Licence: None: All rights reserved

Cite this

@article{7e814d43bf294f55a82756da3b351591,

title = "A Q-learning-based memetic algorithm for multi-objective dynamic software project scheduling",

abstract = "Software project scheduling is the problem of allocating employees to tasks in a software project. Due to the large scale of current software projects, many studies have investigated the use of optimization algorithms to find good software project schedules. However, despite the importance of human factors to the success of software projects, existing work has considered only a limited number of human properties when formulating software project scheduling as an optimization problem. Moreover, the changing environments of software companies mean that software project scheduling is a dynamic optimization problem. However, there is a lack of effective dynamic scheduling approaches to solve this problem. This work proposes a more realistic mathematical model for the dynamic software project scheduling problem. This model considers that skill proficiency can improve over time and, different from previous work, it considers that such improvement is affected by the employees{\textquoteright} properties of motivation and learning ability, and by the skill difficulty. It also defines the objective of employees{\textquoteright} satisfaction with the allocation. It is considered together with the objectives of project duration, cost, robustness and stability under a variety of practical constraints. To adapt schedules to the dynamically changing software project environments, a multi-objective two-archive memetic algorithm based on Q-learning (MOTAMAQ) is proposed to solve the problem in a proactive-rescheduling way. Different from previous work, MOTAMAQ learns the most appropriate global and local search methods to be used for different software project environment states by using Q-learning. Extensive experiments on 18 dynamic benchmark instances and 3 instances derived from real-world software projects were performed. A comparison with seven other meta-heuristic algorithms shows that the strategies used by our novel approach are very effective in improving its convergence performance in dynamic environments, while maintaining a good distribution and spread of solutions. The Q-learning-based learning mechanism can choose appropriate search operators for the different scheduling environments. We also show how different trade-offs among the five objectives can provide software managers with a deeper insight into various compromises among many objectives, and enabling them to make informed decisions.",

keywords = "metaheuristics , dynamic software project scheduling , multi-objective memetic algorithms , mathematical modeling , Q-learning",

author = "Xiao-Ning Shen and Minku, {Leandro L.} and Naresh Marturi and Yi-Nan Guo and Ying Han",

year = "2018",

month = feb,

day = "1",

doi = "10.1016/j.ins.2017.10.041",

language = "English",

volume = "428",

pages = "1--29",

journal = "Information Sciences",

issn = "0020-0255",

publisher = "Elsevier",

}

TY - JOUR

T1 - A Q-learning-based memetic algorithm for multi-objective dynamic software project scheduling

AU - Shen, Xiao-Ning

AU - Minku, Leandro L.

AU - Marturi, Naresh

AU - Guo, Yi-Nan

AU - Han, Ying

PY - 2018/2/1

Y1 - 2018/2/1

N2 - Software project scheduling is the problem of allocating employees to tasks in a software project. Due to the large scale of current software projects, many studies have investigated the use of optimization algorithms to find good software project schedules. However, despite the importance of human factors to the success of software projects, existing work has considered only a limited number of human properties when formulating software project scheduling as an optimization problem. Moreover, the changing environments of software companies mean that software project scheduling is a dynamic optimization problem. However, there is a lack of effective dynamic scheduling approaches to solve this problem. This work proposes a more realistic mathematical model for the dynamic software project scheduling problem. This model considers that skill proficiency can improve over time and, different from previous work, it considers that such improvement is affected by the employees’ properties of motivation and learning ability, and by the skill difficulty. It also defines the objective of employees’ satisfaction with the allocation. It is considered together with the objectives of project duration, cost, robustness and stability under a variety of practical constraints. To adapt schedules to the dynamically changing software project environments, a multi-objective two-archive memetic algorithm based on Q-learning (MOTAMAQ) is proposed to solve the problem in a proactive-rescheduling way. Different from previous work, MOTAMAQ learns the most appropriate global and local search methods to be used for different software project environment states by using Q-learning. Extensive experiments on 18 dynamic benchmark instances and 3 instances derived from real-world software projects were performed. A comparison with seven other meta-heuristic algorithms shows that the strategies used by our novel approach are very effective in improving its convergence performance in dynamic environments, while maintaining a good distribution and spread of solutions. The Q-learning-based learning mechanism can choose appropriate search operators for the different scheduling environments. We also show how different trade-offs among the five objectives can provide software managers with a deeper insight into various compromises among many objectives, and enabling them to make informed decisions.

AB - Software project scheduling is the problem of allocating employees to tasks in a software project. Due to the large scale of current software projects, many studies have investigated the use of optimization algorithms to find good software project schedules. However, despite the importance of human factors to the success of software projects, existing work has considered only a limited number of human properties when formulating software project scheduling as an optimization problem. Moreover, the changing environments of software companies mean that software project scheduling is a dynamic optimization problem. However, there is a lack of effective dynamic scheduling approaches to solve this problem. This work proposes a more realistic mathematical model for the dynamic software project scheduling problem. This model considers that skill proficiency can improve over time and, different from previous work, it considers that such improvement is affected by the employees’ properties of motivation and learning ability, and by the skill difficulty. It also defines the objective of employees’ satisfaction with the allocation. It is considered together with the objectives of project duration, cost, robustness and stability under a variety of practical constraints. To adapt schedules to the dynamically changing software project environments, a multi-objective two-archive memetic algorithm based on Q-learning (MOTAMAQ) is proposed to solve the problem in a proactive-rescheduling way. Different from previous work, MOTAMAQ learns the most appropriate global and local search methods to be used for different software project environment states by using Q-learning. Extensive experiments on 18 dynamic benchmark instances and 3 instances derived from real-world software projects were performed. A comparison with seven other meta-heuristic algorithms shows that the strategies used by our novel approach are very effective in improving its convergence performance in dynamic environments, while maintaining a good distribution and spread of solutions. The Q-learning-based learning mechanism can choose appropriate search operators for the different scheduling environments. We also show how different trade-offs among the five objectives can provide software managers with a deeper insight into various compromises among many objectives, and enabling them to make informed decisions.

KW - metaheuristics

KW - dynamic software project scheduling

KW - multi-objective memetic algorithms

KW - mathematical modeling

KW - Q-learning

U2 - 10.1016/j.ins.2017.10.041

DO - 10.1016/j.ins.2017.10.041

M3 - Article

SN - 0020-0255

VL - 428

SP - 1

EP - 29

JO - Information Sciences

JF - Information Sciences

ER -

A Q-learning-based memetic algorithm for multi-objective dynamic software project scheduling

Abstract

Keywords

Access to Document

Fingerprint

Cite this