Please use this identifier to cite or link to this item:
http://dspace.dtu.ac.in:8080/jspui/handle/repository/15968
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | YADAV, AAKANSHA | - |
dc.date.accessioned | 2017-09-14T12:00:43Z | - |
dc.date.available | 2017-09-14T12:00:43Z | - |
dc.date.issued | 2017-07 | - |
dc.identifier.uri | http://dspace.dtu.ac.in:8080/jspui/handle/repository/15968 | - |
dc.description.abstract | Software defect prediction helps in identifying fault prone classes of a software in the early phases of software development life cycle. This helps in efficient resource allocation, since more resources should be allocated to such fault prone classes. Defect prediction(DP) models are developed from different machine learning methodologies in which the models are trained using data from previous releases of a software. However, while developing DP models, researchers have to deal with certain issues. Two such critical issues are addressed in this study a) unavailability of historical data of a software project, and b) imbalanced nature of training data set, where the distribution of classes is highly skewed. In order to deal with scarcity of historical data of a project, literature studies use cross project defect prediction(CPDP). Though, a number of literature studies have suggested methods for improving the accuracy of DP models with imbalanced datasets, but the imbalanced issue has not been properly addressed in the scenario of CPDP.Thus, this study analyzes the performance of a CPDP model HYDRA(Hybridized moDel Reconstruction Approach), developed by Xia et al. using imbalanced data. The results of the study are empirically validated using twenty open source software projects, where ten software projects are of imbalanced nature. Furthermore, we also suggest variation in the fitness function of HYDRA for improving its performance. The results of the study are statistically assessed using Wilcoxon test. | en_US |
dc.language.iso | en | en_US |
dc.relation.ispartofseries | TD-2947; | - |
dc.subject | HYBRID MODEL RECONSTRUCTION | en_US |
dc.subject | BALANCED DATASET | en_US |
dc.subject | UNBALANCED DATASET | en_US |
dc.subject | CPDP | en_US |
dc.title | ANALYZING THE PERFORMANCE OF HYBRID MODEL RECONSTRUCTION APPROACH FOR BALANCED AND UNBALANCED DATASET | en_US |
dc.type | Thesis | en_US |
Appears in Collections: | M.E./M.Tech. Computer Engineering |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
thesis_aakansha.pdf | 1.08 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.