Paper Abstract.
The Software Engineering (SE) research area must provide results of a certain quality for the sake of value. High quality research results may ensure experience and knowledge, which are essential for the technology to be transferred to the industry. One of the means to obtain such quality results is experimentation. Experimentation is a scientific method that aims to provide evidence of a theory over real-world observations establishing a cause-effect relation. Well conducted, auditable, and repeatable experiments are vital for scientific evolution and novelty. Quality evaluation of controlled experiments and quasi-experiments in SE has been recently discussed in the literature as researchers desire to assess whether such experiments have improved by reporting information that enables the experiments to be replicated and the reader can understand the experiment and validate results. Thus, this work empirically compares four approaches for quality evaluation of SE experiments, such as Kitchenham and Charters, Kampenes, Kitchenham et al., and Dieste et al. in the context of Software Product Lines (SPL). In addition, we are interested on verifying the quality of reporting experiments in a well-discussed reuse technique as SPL. The Pearson technique supported the correlation between pairs of evaluation approaches. In addition, the T-Test and Mann-Whitney-Wilcoxon U test were applied to the samples to verify whether there was a difference in the quality of experiments when using an experimental template. Preliminary results show a strong positive correlation between them, the hypothesis tests confirmed there is such a difference in quality when using experimental template, and the SPL experiments report more the planning phase than the analysis and interpretation phase. Based on our results, we provide initial evidence the Kampenes et al. and Kitchenham et al. approaches are the best to reporting SPL experiments.
Acknowledgments: This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001.
Acknowledgments: This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) - Finance Code 001.
Empirical Study Instrumentation
We create an study instrumentation package including the quality evaluation of SE experiments, the list of papers selected for the pilot project, the list of papers selected for the experiment, the spreadsheet for evaluation approaches and spreadsheets with the results from: the evaluations of the approaches by each participant, the Kappa agreement analysis, the quality of experiments, the usage of template for experimental reporting, and the granularity of questions from the approaches. Click on the links bellow to access the documents.
Instrumentation Pack for our Empirical Study
Approaches for Quality Evaluation of Experiments in SE
- Download Kitchenham and Charters (2007) Approach
- Download Kampenes (2007) Approach
- Download Kitchenham et al. (2010) Approach
- Download Dieste et al. (2011) Approach
Selected Papers for the Pilot Project
Selected Papers for the Experiment
Spreadsheet for Evaluation Approaches
Evaluations of Approaches by each Participant
- Download Evaluation of Researcher Henrique Vignando
- Download Evaluation of Researcher Victor França
- Download Evaluation of Researcher Viviane R. Furtado
Kappa Agreement Analysis
Quality of Experiments
Usage of Template for Experimental Reporting
- Download Templates Used in Papers
- Download Quality of Papers Using or Not Using Experimental Template in each Approach