How to make best use of cross-company data in software effort estimation?

Leandro Minku; Xin Yao

doi:10.1145/2568225.2568228

How to make best use of cross-company data in software effort estimation?

Leandro Minku, Xin Yao

Computer Science

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

40 Citations (Scopus)

245 Downloads (Pure)

Abstract

Previous works using Cross-Company (CC) data for making Within-Company (WC) Software Eort Estimation (SEE) try to use CC data or models directly to provide directions in the WC context. So, these data or models are only helpful when they match the WC context well. When they do not, a fair amount of WC training data, which are usually expensive to acquire, are still necessary to achieve good performance. We investigate how to make best use of CC data, so that we can reduce the amount of WC data while maintaining or improving performance in comparison to WC SEE models. This is done by proposing a new framework to learn the relationship between CC and WC projects explicitly, allowing CC models to be mapped to the WC context. Such mapped models can be useful even when the CC models themselves do not match the WC context directly. Our study shows that a new approach instantiating this framework is able not only to use substantially less WC data than a corresponding WC model, but also to achieve similar/better performance. This approach can also be used to provide insight into the behaviour of a company in comparison to others.

Original language	English
Title of host publication	ICSE '14 : 36th International Conference on Software Engineering Proceedings
Publisher	Association for Computing Machinery
Pages	446-456
ISBN (Print)	9781450327565
DOIs	https://doi.org/10.1145/2568225.2568228
Publication status	Published - May 2014
Event	ICSE 2014 : 36th International Conference on Software Engineering - Hyderabad, India Duration: 31 May 2014 → 7 Jun 2014

Conference

Conference	ICSE 2014 : 36th International Conference on Software Engineering
Country/Territory	India
City	Hyderabad
Period	31/05/14 → 7/06/14

Keywords

Software effort estimation
cross-company learning
transfer learning
online learning
ensembles of learning machines

Access to Document

10.1145/2568225.2568228

Minku_Cross_company_data_ICSE_2014
Copyright is held by the owner/author(s). Eligibility for repository : checked 10/03/2014
Final published version, 374 KBLicence: Other (please specify with Rights Statement)

FP7_COLLAB - Isense - Making Sense of Nonsense
Yao, X.
European Commission - Management Costs, European Commission
1/01/11 → 31/12/13
Project: Research

Cite this

@inproceedings{290f0f44b63744d3999d1c0f9f66c5c5,

title = "How to make best use of cross-company data in software effort estimation?",

abstract = "Previous works using Cross-Company (CC) data for making Within-Company (WC) Software Eort Estimation (SEE) try to use CC data or models directly to provide directions in the WC context. So, these data or models are only helpful when they match the WC context well. When they do not, a fair amount of WC training data, which are usually expensive to acquire, are still necessary to achieve good performance. We investigate how to make best use of CC data, so that we can reduce the amount of WC data while maintaining or improving performance in comparison to WC SEE models. This is done by proposing a new framework to learn the relationship between CC and WC projects explicitly, allowing CC models to be mapped to the WC context. Such mapped models can be useful even when the CC models themselves do not match the WC context directly. Our study shows that a new approach instantiating this framework is able not only to use substantially less WC data than a corresponding WC model, but also to achieve similar/better performance. This approach can also be used to provide insight into the behaviour of a company in comparison to others.",

keywords = "Software effort estimation, cross-company learning, transfer learning, online learning, ensembles of learning machines",

author = "Leandro Minku and Xin Yao",

year = "2014",

month = may,

doi = "10.1145/2568225.2568228",

language = "English",

isbn = "9781450327565",

pages = "446--456",

booktitle = "ICSE '14 : 36th International Conference on Software Engineering Proceedings",

publisher = "Association for Computing Machinery ",

note = "ICSE 2014 : 36th International Conference on Software Engineering ; Conference date: 31-05-2014 Through 07-06-2014",