VaryMinions: Leveraging RNNs to Identify Variants in Variability-intensive Systems’ Logs

Research output: Contribution to journalArticlepeer-review

70 Downloads (Pure)

Abstract

From business processes to course management, variability-intensive software systems (VIS) are now ubiquitous. One can configure these systems’behaviour by activating options, e.g., to derive variants handling building permits across municipalities or implementing different functionalities (quizzes, forums) for a given course. These customisation facilities allow VIS to support distinct relevant customer requirements while taking advantage of reuse for common parts. Customisation thus allows realising both scope and scale economies. Behavioural differences amongst variants manifest themselves in event logs. To re-engineer this kind of system, one must know which variant(s) have produced which behaviour. Since variant information is barely present in logs, this paper supports this task by employing machine learning techniques to classify behaviours (event sequences) among variants. Specifically, we train Long Short Term Memory (LSTMs) and Gated Recurrent Units (GRUs) recurrent neural networks to relate event sequences with the variants they belong to on six different datasets issued from the configurable process and VIS domains. After having evaluated 20 different architectures of LSTM/GRU, our results demonstrate that it is possible to effectively learn the trace-to-variant mapping with high accuracy (at least 80% and up to 99%) and at scale, i.e., identifying 50 variants using 5000+ traces for each variant.
Original languageEnglish
Article number99
JournalEmpirical Software Engineering
Volume29
Issue number4
DOIs
Publication statusPublished - 15 Jun 2024

Funding

Computational resources have been provided by the Consortium des \u00C9quipements de Calcul Intensif (C\u00C9CI), funded by the Fonds de la Recherche Scientifique de Belgique (F.R.S.-FNRS) under Grant No. 2.5020.11 and by the Walloon Region. This research is partly supported by the EOS VeriLearn project, under FRS-FNRS Grant No. O05518F-RG03. Sophie Fortz has been partially supported by the EPSRC project on Verified Simulation for Large Quantum Systems (VSL-Q), grant reference EP/Y005244/1, the EPSRC project on Robust and Reliable Quantum Computing (RoaRQ), Investigation 009 Model-based monitoring and calibration of quantum computations (ModeMCQ), grant reference EP/W032635/1, by the Fonds de la recherche scientifique (FRS-FNRS) via a FRIA grant and by the EOS VeriLearn project, under grant No. O05518F-RG03. Sophie Fortz has been partially supported by the EPSRC project on Verified Simulation for Large Quantum Systems (VSL-Q), grant reference EP/Y005244/1, the EPSRC project on Robust and Reliable Quantum Computing (RoaRQ), Investigation 009 Model-based monitoring and calibration of quantum computations (ModeMCQ), grant reference EP/W032635/1, by the Fonds de la recherche scientifique (FRS-FNRS) via a FRIA grant, and by the EOS VeriLearn project, under grant No. O05518F-RG03. Gilles Perrouin is an FRSFNRS Research Associate. The authors have no competing interests to declare that are relevant to the content of this article.

FundersFunder number
Région Wallonne
Consortium des Équipements de Calcul Intensif
Fonds de la Recherche Scientifique F.R.S.-FNRS2.5020.11
Fonds de la Recherche Scientifique F.R.S.-FNRS
Fonds pour la Formation à la Recherche dans l’Industrie et dans l’AgricultureO05518F-RG03
Fonds pour la Formation à la Recherche dans l’Industrie et dans l’Agriculture
Engineering and Physical Science Research Council (EPSRC)EP/Y005244/1, EP/W032635/1
Engineering and Physical Science Research Council (EPSRC)

    Keywords

    • configurable processes
    • recurrent neutral networks
    • Variability Intensive Systems
    • variability mining
    • Software product lines
    • Variability-intensive systems
    • Recurrent neural networks
    • Variability mining
    • Configurable processes

    Fingerprint

    Dive into the research topics of 'VaryMinions: Leveraging RNNs to Identify Variants in Variability-intensive Systems’ Logs'. Together they form a unique fingerprint.
    • Variability-aware Behavioural Learning

      Fortz, S., 28 Aug 2023, 27th ACM International Systems and Software Product Line Conference, SPLC 2023 - Proceedings: Doctoral Symposium. Arcaini, P., ter Beek, M. H., Perrouin, G., Reinhartz-Berger, I., Machado, I., Vergilio, S. R., Rabiser, R., Yue, T., Devroey, X., Pinto, M. & Washizaki, H. (eds.). ACM Press, Vol. B. p. 11 - 15 5 p. (Proceedings of the 27th ACM International Systems and Software Product Line Conference - Volume B).

      Research output: Contribution in Book/Catalog/Report/Conference proceedingConference contribution

    • VaryMinions: Leveraging RNNs to Identify Variants in Event Logs

      Fortz, S., Temple, P., DEVROEY, X., HEYMANS, P. & PERROUIN, G., 2021, 5th International Workshop on Machine Learning Techniques for Software Quality Evolution. ACM Press

      Research output: Contribution in Book/Catalog/Report/Conference proceedingConference contribution

      Open Access
      File

    Cite this