FormaliSE 2025
Sun 27 - Mon 28 April 2025 Ottawa, Ontario, Canada
co-located with ICSE 2025

This program is tentative and subject to change.

Mon 28 Apr 2025 11:00 - 11:30 at 203 - Session 4 – Generative AI and Fuzzy Logic

The weakest precondition of a program describes the largest set of initial states from which all terminating executions of the program satisfy a given postcondition. The generation of these weakest preconditions is an important task with practical applications in areas such as software verification and runtime error-checking. For programs containing loops, weakest precondition generation critically depends on synthesizing loop invariants that are inductive in nature; these weakest preconditions then essentially embed the inductive properties required for program verification. This paper investigates the use of Large Language Models (LLMs) to generate weakest preconditions (and accompanying loop invariants) in order to prove the correctness of non-deterministic programs containing arrays, loops, and arithmetic. Specifically, we employ several models of ChatGPT to derive the weakest preconditions and invariants for the aforementioned programs. We then compare these LLM-derived weakest preconditions and invariants to their provable counterparts obtained by the MaxPrANQ tool. We find that the quality of the LLM-derived results can vary greatly and is highly dependent on the underlying model used by ChatGPT. This variance in performance propels us to outline directions for future work and discuss how LLMs and formal-tools can complement one another in generating valid weakest preconditions and strong invariants.

This program is tentative and subject to change.

Mon 28 Apr

Displayed time zone: Eastern Time (US & Canada) change

11:00 - 12:30
Session 4 – Generative AI and Fuzzy LogicResearch Track at 203
11:00
30m
Talk
LLM-based Generation of Weakest Preconditions and Precise Array Invariants
Research Track
Daragh King Trinity College Dublin, Vasileios Koutavas Trinity College Dublin, Laura Kovács TU Wien
11:30
30m
Talk
VeCoGen: Automating Generation of Formally Verified C Code with Large Language Models
Research Track
Merlijn Sevenhuijsen Scania CV, AB & KTH Royal Institute of Technology, Khashayar Etemadi KTH Royal Institute of Technology, Mattias Nyberg Scania CV, AB & KTH Royal Institute of Technology
12:00
30m
Talk
Embracing Uncertainty: A Fuzzy Theoretical Model for Goal Fulfillment Assessment
Research Track
Vincenzo Grassi University of Roma "Tor Vergata", Raffaela Mirandola Karlsruhe Institute of Technology (KIT), Diego Perez-Palacin Linnaeus University