TY - JOUR
T1 - Deep variability modeling to enhance reproducibility of database performance testing
AU - Ouared, Abdelkader
AU - Amrani, Moussa
AU - Chadli, Abdelhafid
AU - Schobbens, Pierre Yves
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
PY - 2024/11
Y1 - 2024/11
N2 - Reproducibility is a major principle underpinning the scientific method. Especially, deep database environment systems variability represents an opportunity for reproducibility of database performance testing where metrics defining performance are intricately woven from the entirety of the testing environment-including data sets, schema design, queries, hardware configurations, DBMS parameters, versions, code, etc. Nevertheless, despite the availability of data and code, numerous studies have highlighted a disconcerting trend: analyzing the same dataset using different software can yield divergent results. This variance stems from the nuanced interplay between explicit and implicit parameters within multiple layers, spanning the hardware platform, database model, deployment intricacies, and physical structures, showcasing the complexity inherent in performance model generation and its sensitivity to intricate interactions. Moreover, many authors of database papers do not provide ways to easily reproduce their experiments. In this paper, we propose a framework called ReFleT (REproductibility-aware Framework for LEveraging database Testing) that aims at modeling the manifold variations within the database testing environment, thereby fortifying the reproducibility of database performance models. This comprehensive framework, ReFleT, stands on three different components: (i) a reproduction design specific language, (ii) a model repository for persisting the database environment testing, and (iii) ReFleT capabilities—an interface custom-crafted for seamless navigation, provision, exploration, and reproduction of database performance testing. To substantiate the efficacy of our framework, we present a compelling proof-of-concept through its application in a meticulously designed case study.
AB - Reproducibility is a major principle underpinning the scientific method. Especially, deep database environment systems variability represents an opportunity for reproducibility of database performance testing where metrics defining performance are intricately woven from the entirety of the testing environment-including data sets, schema design, queries, hardware configurations, DBMS parameters, versions, code, etc. Nevertheless, despite the availability of data and code, numerous studies have highlighted a disconcerting trend: analyzing the same dataset using different software can yield divergent results. This variance stems from the nuanced interplay between explicit and implicit parameters within multiple layers, spanning the hardware platform, database model, deployment intricacies, and physical structures, showcasing the complexity inherent in performance model generation and its sensitivity to intricate interactions. Moreover, many authors of database papers do not provide ways to easily reproduce their experiments. In this paper, we propose a framework called ReFleT (REproductibility-aware Framework for LEveraging database Testing) that aims at modeling the manifold variations within the database testing environment, thereby fortifying the reproducibility of database performance models. This comprehensive framework, ReFleT, stands on three different components: (i) a reproduction design specific language, (ii) a model repository for persisting the database environment testing, and (iii) ReFleT capabilities—an interface custom-crafted for seamless navigation, provision, exploration, and reproduction of database performance testing. To substantiate the efficacy of our framework, we present a compelling proof-of-concept through its application in a meticulously designed case study.
KW - Database performance testing
KW - Database physical design
KW - Database testing environment
KW - Deep variability
KW - Reproduction
UR - http://www.scopus.com/inward/record.url?scp=85194938772&partnerID=8YFLogxK
U2 - 10.1007/s10586-024-04533-0
DO - 10.1007/s10586-024-04533-0
M3 - Article
AN - SCOPUS:85194938772
SN - 1386-7857
VL - 27
SP - 11683
EP - 11708
JO - Cluster Computing
JF - Cluster Computing
IS - 8
ER -