Relational symbolic execution of SQL code for unit testing of database programs

Michaël Marcozzi, Wim Vanhoof, Jean-Luc Hainaut

Research output: Contribution to journal › Article

Abstract

Symbolic execution is a technique that automatically generates test inputs exercising a set of execution paths within a code unit under test. If the paths cover a sufficient part of the code, the test data offer a representative view of its actual behaviour, which notably enables detecting errors and correcting faults. Relational databases are ubiquitous in software, but symbolic execution of code units that manipulate them remains a non-trivial problem, particularly because of the complex structure of such databases and the complex behaviour of SQL statements. Finding errors in such code units is nevertheless critical, as it can prevent the corruption of important data. In this work, we define a symbolic execution that translates database manipulation code directly into constraints and integrate it with a more traditional symbolic execution of ordinary program code. The database tables are represented by relational symbols and the SQL statements by relational constraints over these symbols. Based on these principles, we present an algorithm for the symbolic execution of simple Java methods that implement transactional use cases by reading from and writing to a relational database, the latter being subject to data integrity constraints. The algorithm is integrated into a test generation tool and evaluated on sample code. The tool produces constraints in the SMT-Lib standard and uses Microsoft Z3 as its solver. The results show that the proposed approach generates meaningful test data, including valid database content, in reasonable time. In particular, the Z3 solver is shown to be more scalable than the Alloy solver, used in our previous work, for solving relational constraints.
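To make the idea of relational symbols and relational constraints more concrete, the following Python sketch emits a small SMT-Lib script for a single SQL INSERT. It is only an illustration under stated assumptions, not the encoding used by the authors' tool: the `account` table, its columns, the symbols `new_id` and `new_balance`, the helper function names, and the choice to model each table state as an uninterpreted Boolean predicate over rows are all hypothetical.

```python
from typing import List, Tuple

Column = Tuple[str, str]  # (column name, SMT-Lib sort)


def conj(parts: List[str]) -> str:
    """Conjoin formulas; a single formula is returned unchanged."""
    return parts[0] if len(parts) == 1 else "(and " + " ".join(parts) + ")"


def declare_table(symbol: str, columns: List[Column]) -> str:
    """Declare a relational symbol: a predicate that holds for rows in the table."""
    sorts = " ".join(sort for _, sort in columns)
    return f"(declare-fun {symbol} ({sorts}) Bool)"


def insert_constraint(before: str, after: str,
                      columns: List[Column], values: List[str]) -> str:
    """INSERT: the post-state holds exactly the pre-state rows plus the new row."""
    binders = " ".join(f"({name} {sort})" for name, sort in columns)
    row = " ".join(name for name, _ in columns)
    is_new = conj([f"(= {name} {value})"
                   for (name, _), value in zip(columns, values)])
    return (f"(assert (forall ({binders}) "
            f"(= ({after} {row}) (or ({before} {row}) {is_new}))))")


def primary_key_constraint(table: str, columns: List[Column]) -> str:
    """Assumes the first column is the key and at least one other column exists:
    two rows sharing a key must agree on every remaining column."""
    (key, key_sort), rest = columns[0], columns[1:]
    binders = " ".join([f"({key} {key_sort})"]
                       + [f"({n}_1 {s})" for n, s in rest]
                       + [f"({n}_2 {s})" for n, s in rest])
    row1 = " ".join([key] + [f"{n}_1" for n, _ in rest])
    row2 = " ".join([key] + [f"{n}_2" for n, _ in rest])
    both = conj([f"({table} {row1})", f"({table} {row2})"])
    same = conj([f"(= {n}_1 {n}_2)" for n, _ in rest])
    return f"(assert (forall ({binders}) (=> {both} {same})))"


def build_script() -> str:
    """Constraints for one path: INSERT a row, then require a positive balance."""
    cols = [("id", "Int"), ("balance", "Int")]  # hypothetical schema
    lines = [
        declare_table("account_0", cols),  # table state before the INSERT
        declare_table("account_1", cols),  # table state after the INSERT
        "(declare-const new_id Int)",      # symbolic method inputs
        "(declare-const new_balance Int)",
        primary_key_constraint("account_0", cols),
        insert_constraint("account_0", "account_1", cols,
                          ["new_id", "new_balance"]),
        primary_key_constraint("account_1", cols),
        "(assert (> new_balance 0))",      # path condition from program code
        "(check-sat)",
        "(get-model)",
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_script())
```

The printed script can be passed to an SMT solver that supports quantifiers, such as Z3. A model, if one is found, would describe both the method inputs and an initial database content that respects the primary key, which is the kind of test data the abstract refers to.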

Original language: English
Pages (from-to): 44-72
Number of pages: 29
Journal: Science of Computer Programming
Volume: 105
Early online date: 10 Apr 2015
Publication status: Published - 1 Jul 2015

Keywords

  • Software Testing
  • Symbolic Execution
  • Constraints
  • Satisfiability Modulo Theories (SMT)
  • Alloy
  • Quantifiers
  • First-Order Logic
  • Relational Algebra
  • Structured Query Language (SQL)
  • Database
  • Database Applications

    Student Theses

    Automatic Test Generation for Data-driven Applications

    Author: Marcozzi, M., 24 Oct 2014

    Supervisor: Vanhoof, W. (Supervisor), Habra, N. (Jury), Hainaut, J. (Jury), Englebert, V. (Jury), Wijsen, J. (External person) (Jury) & Serebrenik, A. (External person) (Jury)

    Student thesis: Doctor of Sciences
