このエントリーをはてなブックマークに追加


ID 60821
フルテキストURL
著者
Gan, Maohua Okayama University
Yücel, Zeynep Okayama University ORCID Kaken ID publons researchmap
Monden, Akito Okayama University ORCID Kaken ID researchmap
Sasaki, Kentaro Okayama University
抄録
To conduct empirical research on industry software development, it is necessary to obtain data of real software projects from industry. However, only few such industry data sets are publicly available; and unfortunately, most of them are very old. In addition, most of today's software companies cannot make their data open, because software development involves many stakeholders, and thus, its data confidentiality must be strongly preserved. To that end, this study proposes a method for artificially generating a “mimic” software project data set, whose characteristics (such as average, standard deviation and correlation coefficients) are very similar to a given confidential data set. Instead of using the original (confidential) data set, researchers are expected to use the mimic data set to produce similar results as the original data set. The proposed method uses the Box-Muller transform for generating normally distributed random numbers; and exponential transformation and number reordering for data mimicry. To evaluate the efficacy of the proposed method, effort estimation is considered as potential application domain for employing mimic data. Estimation models are built from 8 reference data sets and their concerning mimic data. Our experiments confirmed that models built from mimic data sets show similar effort estimation performance as the models built from original data sets, which indicate the capability of the proposed method in generating representative samples.
キーワード
empirical software engineering
data confidentiality
data mining
発行日
2020-10-01
出版物タイトル
IEICE Transactions on Information and Systems
E103.D巻
10号
出版者
Institute of Electronics, Information and Communication Engineers
開始ページ
2094
終了ページ
2103
ISSN
0916-8532
NCID
AA10826272
資料タイプ
学術雑誌論文
言語
英語
OAI-PMH Set
岡山大学
著作権者
© 2020 The Institute of Electronics, Information and Communication Engineers
論文のバージョン
publisher
DOI
Web of Science KeyUT
関連URL
isVersionOf https://doi.org/10.1587/transinf.2019EDP7150
助成機関名
日本学術振興会
助成番号
17K00102