qedgen

Formally verify programs by writing Lean 4 proofs. Trigger this skill whenever the user wants to formally verify code, generate Lean 4 proofs, prove properties about algorithms or smart contracts, verify invariants, convert program logic into formal specifications, or anything involving Lean 4 and formal verification. Also trigger when the user mentions "qedgen", "lean proof", "formal proof", "verify my code", "prove correctness", "formal verification", or wants mathematical guarantees about their implementation.

QEDGen 65 7 Updated 3mo ago

Resources

GitHub

Install

npx skillscat add qedgen/solana-skills

Install via the SkillsCat registry.

SKILL.md

QEDGen — Agent-Driven Formal Verification

You (Claude) are the proof engineer. You read the codebase, write Lean 4 models and proofs, iterate on compiler errors, and call Leanstral (Mistral's theorem prover) only for hard sub-goals you cannot fill yourself.

Step 0: Bootstrap the qedgen binary

Before doing anything else, locate and verify the qedgen CLI:

Find the skill directory — look for this SKILL.md file at ~/.agents/skills/qedgen/ or the project's skill install path.
Test the binary: <skill-dir>/bin/qedgen --version
If it fails or is missing, run: bash <skill-dir>/install.sh
Set QEDGEN to the resolved binary path and use $QEDGEN for all subsequent commands (e.g., $QEDGEN fill-sorry ..., $QEDGEN setup).

Architecture

You (Claude)                          Leanstral (remote model)
  ├── Read spec / source code           ├── Fill sorry markers
  ├── Write Lean 4 models               └── Suggest tactics for hard goals
  ├── Write theorem statements
  ├── Write proof attempts
  ├── Run `lake build`, read errors
  └── Fix and iterate

Step 1: Understand the program

Check for existing artifacts in this priority order:

spec.md exists → Read it. An existing spec captures the author's intent, state model, invariants, and operations. Extract security goals, state model, and formal properties. Skip the scoping quiz and go directly to Step 2.
IDL exists (target/idl/<program>.json) → Run qedgen spec --idl <path> to generate a draft SPEC.md with TODO markers, then refine interactively.
Neither exists → Read the source code directly. Ask broader scoping questions.

Step 2: Scope the verification

If no spec.md was found, run a short interactive quiz — one question at a time, with checkbox options derived from the program's structure. Ask about functionality and risks, not implementation details.

Question 1: "What does this program need to guarantee above all else?"
Options derived from the program's structure:

Authorization / access control
Tokens are never lost / correct routing
One-shot safety / no replay
Arithmetic safety / no overflow
Conservation (e.g., vault >= total claims)
All of the above

Question 2: "Which scenario worries you most?"
Generate concrete risk scenarios from the program.

Question 3: "Does the program make any assumptions that aren't enforced on-chain?"

Ask questions one at a time. Wait for the user's answer before presenting the next question.

Step 3: Write SPEC.md

Write formal_verification/SPEC.md using normative language (MUST, MUST NOT, MAY). Structure:

# <Program Name> Verification Spec v1.0

<1-2 sentences describing what the program does>

## 0. Security Goals
1. **<Goal name>**: <normative statement>

## 1. State Model
<State struct with field names, types, and comments>
<Lifecycle diagram if applicable>

## 2. Operations
### 2.1 <Operation name>
**Signers**: <who MUST sign>
**Preconditions**: <what MUST be true before>
**Effects**: <numbered steps>
**Postconditions**: <what MUST be true after>

## 3. Formal Properties
### 3.1 <Category>
**<property_id>**: For all <quantified variables>,
if <transition predicate> then <conclusion>.

## 4. Trust Boundary
<What is axiomatic and why>

## 5. Verification Results
| Property | Status | Proof |
|---|---|---|
| ... | **Open** | |

Present SPEC.md to the user and get confirmation before proceeding.

Step 4: Set up the Lean project

qedgen setup            # Ensure global Mathlib cache exists (first time: 15-45 min)

Create the project structure:

formal_verification/
  lakefile.lean          # import lean_support and Mathlib
  lean-toolchain         # leanprover/lean4:v4.15.0
  lean_support/          # Solana axiom library (copy from qedgen)
  Proofs.lean            # root import: import Proofs.AccessControl etc.
  Proofs/
    AccessControl.lean
    CpiCorrectness.lean
    Conservation.lean
    StateMachine.lean
    ArithmeticSafety.lean

Step 5: Write Lean proofs

This is the core step. You write Lean 4 directly — models, transitions, theorems, and proofs.

Modeling workflow

For each property in SPEC.md:

Define the state as a Lean structure (map fields from source/spec)
Define the transition as Option StateType (return none on precondition failure)
State the theorem matching the SPEC.md property
Write the proof using the patterns below
Run lake build and iterate on errors

Support library API

After import QEDGen.Solana and open QEDGen.Solana:

Types:

Pubkey (= Nat), U64 (= Nat), U8 (= Nat)
Account — { key : Pubkey, authority : Pubkey, balance : Nat, writable : Bool }
Lifecycle — open | closed (with DecidableEq)
TransferCpi — { program, «from», «to», authority, amount }
MintToCpi, BurnCpi, CloseCpi

Constants:

TOKEN_PROGRAM_ID, SYSTEM_PROGRAM_ID
U8_MAX, U16_MAX, U32_MAX, U64_MAX, U128_MAX

Functions:

findByKey : List Account → Pubkey → Option Account
findByAuthority : List Account → Pubkey → Option Account
canWrite : Pubkey → Account → Prop
transferCpiValid : TransferCpi → Prop
closes : Lifecycle → Lifecycle → Prop
valid_u64 : Nat → Prop (and u8, u16, u32, u128)

Key lemmas:

closes_is_closed, closes_was_open, closed_irreversible
valid_u64_preserved_by_zero, valid_u64_preserved_by_same
find_map_update_other, find_map_update_same (axioms for account list updates)

Proof patterns

Access control — signer must match authority:

structure ProgramState where
  authority : Pubkey

def cancelTransition (s : ProgramState) (signer : Pubkey) : Option Unit :=
  if signer = s.authority then some () else none

theorem cancel_access_control (s : ProgramState) (signer : Pubkey)
    (h : cancelTransition s signer ≠ none) :
    signer = s.authority := by
  unfold cancelTransition at h
  split_ifs at h with h_eq
  · exact h_eq
  · contradiction

CPI correctness — parameters match (pure rfl):

def cancel_build_cpi (ctx : CancelContext) : TransferCpi :=
  { program := TOKEN_PROGRAM_ID, «from» := ctx.escrow_token, «to» := ctx.dest,
    authority := ctx.authority, amount := ctx.amount }

theorem cancel_cpi_correct (ctx : CancelContext) :
    let cpi := cancel_build_cpi ctx
    cpi.program = TOKEN_PROGRAM_ID ∧ cpi.«from» = ctx.escrow_token ∧
    cpi.«to» = ctx.dest ∧ cpi.authority = ctx.authority ∧
    cpi.amount = ctx.amount := by
  unfold cancel_build_cpi
  exact ⟨rfl, rfl, rfl, rfl, rfl⟩

State machine — lifecycle transitions:

def cancelTransition (s : ProgramState) : Option ProgramState :=
  if s.escrow.lifecycle = Lifecycle.open then
    some { escrow := { s.escrow with lifecycle := Lifecycle.closed } }
  else none

theorem cancel_closes_escrow (pre post : ProgramState)
    (h : cancelTransition pre = some post) :
    post.escrow.lifecycle = Lifecycle.closed := by
  unfold cancelTransition at h
  split_ifs at h with h_open
  cases h
  rfl

Conservation — invariant preserved across operations:

def conservation (s : EngineState) : Prop := s.V >= s.C_tot + s.I

def depositTransition (s : EngineState) (amount : Nat) : Option EngineState :=
  if s.V + amount <= MAX_VAULT_TVL then
    some { V := s.V + amount, C_tot := s.C_tot + amount, I := s.I }
  else none

theorem deposit_conservation (s s' : EngineState) (amount : Nat)
    (h_inv : conservation s)
    (h : depositTransition s amount = some s') :
    conservation s' := by
  unfold depositTransition at h
  split_ifs at h with h_le
  · cases h
    unfold conservation at h_inv ⊢  -- MUST unfold in BOTH hypothesis and goal
    omega
  · contradiction

Arithmetic safety — bounds preserved:

def initializeTransition (amount taker : Nat) : Option ProgramState :=
  if amount > 0 ∧ amount ≤ U64_MAX ∧ taker > 0 ∧ taker ≤ U64_MAX then
    some { initializer_amount := amount, taker_amount := taker }
  else none

theorem initialize_arithmetic_safety (amount taker : Nat) (post : ProgramState)
    (h : initializeTransition amount taker = some post) :
    post.initializer_amount ≤ U64_MAX ∧ post.taker_amount ≤ U64_MAX := by
  unfold initializeTransition at h
  split_ifs at h with h_bounds
  cases h
  exact ⟨h_bounds.2.1, h_bounds.2.2.2⟩

Critical tactic rules

Do	Don't
`unfold f at h` before `split_ifs`	`simp [f] at h` before `split_ifs` (kills if-structure)
`unfold pred at h_inv ⊢` for named predicates	`unfold pred` only in goal (omega can't see hypotheses)
`cases h` after `split_ifs` on `some = some`	`injection h` (unnecessary, cases handles it)
`omega` for linear arithmetic	`norm_num` for linear goals (omega is more reliable)
`exact ⟨rfl, rfl, rfl⟩` for conjunctions of rfl	`constructor` + `rfl` + `constructor` + `rfl` (verbose)
`if cond then ... else ...` without proof binding	`if h : cond then ...` when `h` is unused

Common errors and fixes

Error	Fix
`omega could not prove the goal`	Unfold named predicates in hypotheses: `unfold pred at h ⊢`
`no goals to be solved`	Remove redundant tactic (e.g., `· contradiction` after auto-closed branch)
`unknown constant 'X'`	Check imports; add `import QEDGen.Solana.X` or `open QEDGen.Solana`
`tactic 'split_ifs' failed, no if-then-else`	Use `unfold` first, not `simp`
`unused variable 'h'`	Remove proof binding: `if h : cond` → `if cond`

Step 6: Call Leanstral for hard sub-goals

When you have a proof with sorry markers you cannot fill after 2-3 attempts:

qedgen fill-sorry --file formal_verification/Proofs/Hard.lean --validate

This sends each sorry location to Leanstral with focused context. Review the result — Leanstral may introduce tactics you can learn from for future proofs.

If fill-sorry also fails, simplify the theorem statement or split the property into smaller lemmas.

Step 7: Verify and report

cd formal_verification && lake build

Update SPEC.md verification results table:

Verified: Theorem compiles, no sorry
Partial: Proof has sorry markers
Open: No compiling proof

Environment

MISTRAL_API_KEY — required for fill-sorry. Free from console.mistral.ai
QEDGEN_VALIDATION_WORKSPACE — optional override for global Mathlib cache location

Error handling

First lake build is slow: Mathlib compilation takes 15-45 min on first run. Subsequent builds reuse the cache.
could not resolve 'HEAD' to a commit: Remove .lake/packages/mathlib and run lake update.
Rate limiting (429): Built-in exponential backoff in fill-sorry.

qedgen

Resources

Install

QEDGen — Agent-Driven Formal Verification

Step 0: Bootstrap the qedgen binary

Architecture

Step 1: Understand the program

Step 2: Scope the verification

Step 3: Write SPEC.md

Step 4: Set up the Lean project

Step 5: Write Lean proofs

Modeling workflow

Support library API

Proof patterns

Critical tactic rules

Common errors and fixes

Step 6: Call Leanstral for hard sub-goals

Step 7: Verify and report

Environment

Error handling

Categories

Install

Recommended Skills