Less is more: Temporal fault predictive performance over multiple hadoop releases

Research output: Chapter in Book/Report/Conference proceedingConference contribution


  • Mark Harman
  • Syed Islam
  • Yue Jia
  • Federica Sarro
  • Komsan Srivisut

Colleges, School and Institutes

External organisations

  • UCL
  • University of York


We investigate search based fault prediction over time based on 8 consecutive Hadoop versions, aiming to analyse the impact of chronology on fault prediction performance. Our results confound the assumption, implicit in previous work, that additional information from historical versions improves prediction; though G-mean tends to improve, Recall can be reduced.


Original languageEnglish
Title of host publicationProceedings of the 6th Symposium on Search-Based Software Engineering (SSBSE), Lecture Notes in Computer Science
Publication statusPublished - 2014
Event6th International Symposium on Search-Based Software Engineering, SSBSE 2014 - Fortaleza, Brazil
Duration: 26 Aug 201429 Aug 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8636 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349


Conference6th International Symposium on Search-Based Software Engineering, SSBSE 2014