Challenges identified for proof scores

Short description of the challenge

Large Language Models (LLMs) have revolutionized programming by enabling developers to generate, debug, and optimize code more efficiently than ever before. However, the domain of formal specifications has not benefited equally from this explosion, mainly because there is insufficient publicly available data online to train such models effectively. As a result, it is worthwhile to investigate how existing general-purpose models can be refined to better support the creation of formal specifications in Maude. In particular, the COMFIA project aims to pursue this goal by leveraging the Maude and Dafny languages.

Research team/s involved (or communities)

FADoSS - UCM

Open problems related to the challenge

There are two problems in this challenge:

There is not enough training data for automating the development of formal specification of systems.
Further refinements of programs by IA are not guaranteed to be equivalent.

Proposed solutions

Refining existing models to deal with formal languages. We propose Maude, a rewriting engine, and Dafny, a theorem prover, as starting point.
Using a formal verification and the corresponding AST as basis for programs that can be later modified by LLMs.

Techniques and technologies related to the challenge

On the one hand, we have LLMs, that will be refined in order to help developers with their specifications.
On the other hand, we have the specification/verification languages Maude and Dafny, which have been chosen as case studies for improving its use via LLMs.

Example(s)

More information about this project is available here.

list of tools

list of challenges

Challenges identified for proof scores

Short description of the challenge

Research team/s involved (or communities)

Open problems related to the challenge

Proposed solutions

Techniques and technologies related to the challenge

Example(s)

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Clone this wiki locally