Proving a Compiler:
Mechanized verification of program transformations and static analyses
(The 2011 edition of this lecture is also available, with different contents.)
Abstract
The formal semantics of programming languages supports not only reasoning about individual programs (program correctness), but also reasoning about program transformations and static analyses, as typically found in compilers (tool correctness). With the help of a proof assistant, we can prove semantic preservation properties of program transformations and semantic soundness properties of static analyses, which greatly increases the confidence we can have in compilers and program verification tools.
The topics covered in this lecture include:
- Non-optimizing compilation of a structured imperative language to a virtual machine, and its correctness proof.
- Notions of semantic preservation.
- Various forms of operational semantics and their mechanization in Coq: big-step, small-step, small-step with continuations.
- Examples of program optimizations: dead code elimination, register allocation.
- Design and soundness proof of a generic static analyzer based on abstract interpretation.
- Compiler verification "in the large": an overview of the CompCert verified C compiler.
We will use the Coq proof assistant and build on the formalization of the IMP language shown in Benjamin Pierce's "Software Foundations" lectures.
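To make the first topic concrete, here is a minimal sketch of non-optimizing compilation of an IMP-like language to a stack machine, together with a check that the compiled code and the source program compute the same final store on one input. It is written in Python rather than Coq, and everything in it is an assumption made for illustration: the tuple encoding of programs, the instruction names (`const`, `load`, `store`, `add`, `sub`, `branch`, `bgt`), and the convention that unset variables read as 0. It is not the Coq development distributed with the lecture.

```python
# Hypothetical mini-IMP encoded as nested tuples (not the course's Coq definitions).
# Expressions: ("num", n) | ("var", x) | ("add", e1, e2) | ("sub", e1, e2)
# Commands:    ("skip",) | ("assign", x, e) | ("seq", c1, c2)
#              | ("if_le", e1, e2, c_then, c_else) | ("while_le", e1, e2, body)

def eval_expr(e, store):
    """Source semantics of expressions; unset variables read as 0 (an assumption)."""
    tag = e[0]
    if tag == "num":
        return e[1]
    if tag == "var":
        return store.get(e[1], 0)
    if tag == "add":
        return eval_expr(e[1], store) + eval_expr(e[2], store)
    if tag == "sub":
        return eval_expr(e[1], store) - eval_expr(e[2], store)
    raise ValueError(f"unknown expression {e!r}")

def exec_cmd(c, store):
    """Source semantics of commands: returns the final store (may not terminate)."""
    tag = c[0]
    if tag == "skip":
        return store
    if tag == "assign":
        return {**store, c[1]: eval_expr(c[2], store)}
    if tag == "seq":
        return exec_cmd(c[2], exec_cmd(c[1], store))
    if tag == "if_le":
        taken = c[3] if eval_expr(c[1], store) <= eval_expr(c[2], store) else c[4]
        return exec_cmd(taken, store)
    if tag == "while_le":
        while eval_expr(c[1], store) <= eval_expr(c[2], store):
            store = exec_cmd(c[3], store)
        return store
    raise ValueError(f"unknown command {c!r}")

def compile_expr(e):
    """Compile an expression to code that pushes its value on the stack."""
    tag = e[0]
    if tag == "num":
        return [("const", e[1])]
    if tag == "var":
        return [("load", e[1])]
    return compile_expr(e[1]) + compile_expr(e[2]) + [(tag,)]  # "add" or "sub"

def compile_cmd(c):
    """Compile a command; branch offsets are relative to the next instruction."""
    tag = c[0]
    if tag == "skip":
        return []
    if tag == "assign":
        return compile_expr(c[2]) + [("store", c[1])]
    if tag == "seq":
        return compile_cmd(c[1]) + compile_cmd(c[2])
    test = compile_expr(c[1]) + compile_expr(c[2])
    if tag == "if_le":
        then_code, else_code = compile_cmd(c[3]), compile_cmd(c[4])
        return (test + [("bgt", len(then_code) + 1)] + then_code
                + [("branch", len(else_code))] + else_code)
    if tag == "while_le":
        body = compile_cmd(c[3])
        back = -(len(test) + len(body) + 2)  # jump back to the start of the test
        return test + [("bgt", len(body) + 1)] + body + [("branch", back)]
    raise ValueError(f"unknown command {c!r}")

def run_vm(code, store):
    """Execute stack-machine code until the program counter runs off the end."""
    store, stack, pc = dict(store), [], 0
    while pc < len(code):
        instr = code[pc]
        pc += 1
        op = instr[0]
        if op == "const":
            stack.append(instr[1])
        elif op == "load":
            stack.append(store.get(instr[1], 0))
        elif op == "store":
            store[instr[1]] = stack.pop()
        elif op in ("add", "sub"):
            b, a = stack.pop(), stack.pop()
            stack.append(a + b if op == "add" else a - b)
        elif op == "branch":
            pc += instr[1]
        elif op == "bgt":  # pop b, then a; branch if a > b
            b, a = stack.pop(), stack.pop()
            if a > b:
                pc += instr[1]
        else:
            raise ValueError(f"unknown instruction {instr!r}")
    return store

# i := 1; s := 0; while i <= n do (s := s + i; i := i + 1)
SUM_PROGRAM = ("seq", ("assign", "i", ("num", 1)),
               ("seq", ("assign", "s", ("num", 0)),
                ("while_le", ("var", "i"), ("var", "n"),
                 ("seq", ("assign", "s", ("add", ("var", "s"), ("var", "i"))),
                  ("assign", "i", ("add", ("var", "i"), ("num", 1)))))))

assert run_vm(compile_cmd(SUM_PROGRAM), {"n": 10}) == exec_cmd(SUM_PROGRAM, {"n": 10})
```

The final assertion only tests agreement on a single program and a single initial store. A semantic preservation theorem states this agreement for every program and every initial store, which is what the mechanized proof establishes and what testing alone cannot.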
Course material
- The slides for the lecture: for presentation and for printing (6 on a page).
- Coq development, source distribution: compiler-verification.tar.gz. (Includes a copy of "Software Foundations", for convenience.)
- Coq development, commented and pretty-printed for online viewing:
  - Semantics: various forms of semantics for IMP.
  - Compil: compilation from IMP to a virtual machine, proofs of correctness for this compiler.
  - Deadcode: liveness analysis and dead code optimization.
  - Regalloc: extension to register allocation.
  - Analyzer1: a simple generic static analyzer based on abstract interpretation.
  - Analyzer2: a more sophisticated version of the generic static analyzer.
  - Library Sequences: transition sequences and their properties.
  - Appendix Fixpoint: computing fixpoints in well-founded lattices, with applications to liveness analysis.
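As a rough illustration of what the Deadcode and Fixpoint modules are about, the sketch below computes live variables for an IMP-like language by a backward analysis, iterates to a fixpoint for loops, and replaces assignments to dead variables with `skip`. It is a Python sketch under stated assumptions, not a transcription of the Coq code: the tuple encoding of programs and the function names `free_vars`, `live`, and `dce` are hypothetical.

```python
# Hypothetical mini-IMP encoded as nested tuples (same shape as a typical IMP AST).
# Expressions: ("num", n) | ("var", x) | ("add", e1, e2) | ("sub", e1, e2)
# Commands:    ("skip",) | ("assign", x, e) | ("seq", c1, c2)
#              | ("if_le", e1, e2, c_then, c_else) | ("while_le", e1, e2, body)

def free_vars(e):
    """Variables read by an expression."""
    tag = e[0]
    if tag == "num":
        return frozenset()
    if tag == "var":
        return frozenset([e[1]])
    return free_vars(e[1]) | free_vars(e[2])

def live(c, after):
    """Variables live before command c, given the set live after it."""
    tag = c[0]
    if tag == "skip":
        return after
    if tag == "assign":
        x, e = c[1], c[2]
        # A dead assignment will be removed, so it contributes no uses.
        return (after - {x}) | free_vars(e) if x in after else after
    if tag == "seq":
        return live(c[1], live(c[2], after))
    test_vars = free_vars(c[1]) | free_vars(c[2])
    if tag == "if_le":
        return test_vars | live(c[3], after) | live(c[4], after)
    if tag == "while_le":
        # Least fixpoint of X -> test_vars | after | live(body, X), iterated from
        # the empty set. The chain is increasing and bounded by the finite set of
        # variables in the program, so the iteration terminates.
        current = frozenset()
        while True:
            updated = test_vars | after | live(c[3], current)
            if updated == current:
                return current
            current = updated
    raise ValueError(f"unknown command {c!r}")

def dce(c, after):
    """Replace assignments to variables that are dead afterwards with skip."""
    tag = c[0]
    if tag == "skip":
        return c
    if tag == "assign":
        return c if c[1] in after else ("skip",)
    if tag == "seq":
        return ("seq", dce(c[1], live(c[2], after)), dce(c[2], after))
    if tag == "if_le":
        return ("if_le", c[1], c[2], dce(c[3], after), dce(c[4], after))
    if tag == "while_le":
        # After the body, control returns to the loop head, so the variables
        # live after the body are those live before the whole loop.
        return ("while_le", c[1], c[2], dce(c[3], live(c, after)))
    raise ValueError(f"unknown command {c!r}")

# a := x + 1; b := a + a; a := 0      (only b is needed afterwards)
STRAIGHT = ("seq", ("assign", "a", ("add", ("var", "x"), ("num", 1))),
            ("seq", ("assign", "b", ("add", ("var", "a"), ("var", "a"))),
             ("assign", "a", ("num", 0))))

optimized = dce(STRAIGHT, frozenset(["b"]))
assert optimized[2][2] == ("skip",)          # the final a := 0 was removed
assert live(STRAIGHT, frozenset(["b"])) == frozenset(["x"])
```

The assertions only check the shape of the optimized program on one example. The correctness statement proved in a mechanized development is stronger: the optimized program and the original agree on all variables that are live at the end, for every initial store.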
Introductory reading
Further reading
Coq pointers
Xavier.Leroy@inria.fr