Formally verify programs by writing Lean 4 proofs. Trigger this skill whenever the user wants to formally verify code, generate Lean 4 proofs, prove properties about algorithms or smart contracts, verify invariants, convert program logic into formal specifications, or anything involving Lean 4 and formal verification. Also trigger when the user mentions "qedgen", "lean proof", "formal proof", "verify my code", "prove correctness", "formal verification", or wants mathematical guarantees about their implementation.
Resources
10Install
npx skillscat add qedgen/solana-skills Install via the SkillsCat registry.
QEDGen — Agent-Driven Formal Verification
You (Claude) are the proof engineer. You read the codebase, write Lean 4 models and proofs, iterate on compiler errors, and call Leanstral (Mistral's theorem prover) only for hard sub-goals you cannot fill yourself.
Step 0: Bootstrap the qedgen binary
Before doing anything else, locate and verify the qedgen CLI:
- Find the skill directory — look for this SKILL.md file at
~/.agents/skills/qedgen/or the project's skill install path. - Test the binary:
<skill-dir>/bin/qedgen --version - If it fails or is missing, run:
bash <skill-dir>/install.sh - Set
QEDGENto the resolved binary path and use$QEDGENfor all subsequent commands (e.g.,$QEDGEN fill-sorry ...,$QEDGEN setup).
Architecture
You (Claude) Leanstral (remote model)
├── Read spec / source code ├── Fill sorry markers
├── Write Lean 4 models └── Suggest tactics for hard goals
├── Write theorem statements
├── Write proof attempts
├── Run `lake build`, read errors
└── Fix and iterateStep 1: Understand the program
Check for existing artifacts in this priority order:
- spec.md exists → Read it. An existing spec captures the author's intent, state model, invariants, and operations. Extract security goals, state model, and formal properties. Skip the scoping quiz and go directly to Step 2.
- IDL exists (
target/idl/<program>.json) → Runqedgen spec --idl <path>to generate a draft SPEC.md with TODO markers, then refine interactively. - Neither exists → Read the source code directly. Ask broader scoping questions.
Step 2: Scope the verification
If no spec.md was found, run a short interactive quiz — one question at a time, with checkbox options derived from the program's structure. Ask about functionality and risks, not implementation details.
Question 1: "What does this program need to guarantee above all else?"
Options derived from the program's structure:
- Authorization / access control
- Tokens are never lost / correct routing
- One-shot safety / no replay
- Arithmetic safety / no overflow
- Conservation (e.g., vault >= total claims)
- All of the above
Question 2: "Which scenario worries you most?"
Generate concrete risk scenarios from the program.
Question 3: "Does the program make any assumptions that aren't enforced on-chain?"
Ask questions one at a time. Wait for the user's answer before presenting the next question.
Step 3: Write SPEC.md
Write formal_verification/SPEC.md using normative language (MUST, MUST NOT, MAY). Structure:
# <Program Name> Verification Spec v1.0
<1-2 sentences describing what the program does>
## 0. Security Goals
1. **<Goal name>**: <normative statement>
## 1. State Model
<State struct with field names, types, and comments>
<Lifecycle diagram if applicable>
## 2. Operations
### 2.1 <Operation name>
**Signers**: <who MUST sign>
**Preconditions**: <what MUST be true before>
**Effects**: <numbered steps>
**Postconditions**: <what MUST be true after>
## 3. Formal Properties
### 3.1 <Category>
**<property_id>**: For all <quantified variables>,
if <transition predicate> then <conclusion>.
## 4. Trust Boundary
<What is axiomatic and why>
## 5. Verification Results
| Property | Status | Proof |
|---|---|---|
| ... | **Open** | |Present SPEC.md to the user and get confirmation before proceeding.
Step 4: Set up the Lean project
qedgen setup # Ensure global Mathlib cache exists (first time: 15-45 min)Create the project structure:
formal_verification/
lakefile.lean # import lean_support and Mathlib
lean-toolchain # leanprover/lean4:v4.15.0
lean_support/ # Solana axiom library (copy from qedgen)
Proofs.lean # root import: import Proofs.AccessControl etc.
Proofs/
AccessControl.lean
CpiCorrectness.lean
Conservation.lean
StateMachine.lean
ArithmeticSafety.leanStep 5: Write Lean proofs
This is the core step. You write Lean 4 directly — models, transitions, theorems, and proofs.
Modeling workflow
For each property in SPEC.md:
- Define the state as a Lean structure (map fields from source/spec)
- Define the transition as
Option StateType(returnnoneon precondition failure) - State the theorem matching the SPEC.md property
- Write the proof using the patterns below
- Run
lake buildand iterate on errors
Support library API
After import QEDGen.Solana and open QEDGen.Solana:
Types:
Pubkey(= Nat),U64(= Nat),U8(= Nat)Account—{ key : Pubkey, authority : Pubkey, balance : Nat, writable : Bool }Lifecycle—open | closed(with DecidableEq)TransferCpi—{ program, «from», «to», authority, amount }MintToCpi,BurnCpi,CloseCpi
Constants:
TOKEN_PROGRAM_ID,SYSTEM_PROGRAM_IDU8_MAX,U16_MAX,U32_MAX,U64_MAX,U128_MAX
Functions:
findByKey : List Account → Pubkey → Option AccountfindByAuthority : List Account → Pubkey → Option AccountcanWrite : Pubkey → Account → ProptransferCpiValid : TransferCpi → Propcloses : Lifecycle → Lifecycle → Propvalid_u64 : Nat → Prop(and u8, u16, u32, u128)
Key lemmas:
closes_is_closed,closes_was_open,closed_irreversiblevalid_u64_preserved_by_zero,valid_u64_preserved_by_samefind_map_update_other,find_map_update_same(axioms for account list updates)
Proof patterns
Access control — signer must match authority:
structure ProgramState where
authority : Pubkey
def cancelTransition (s : ProgramState) (signer : Pubkey) : Option Unit :=
if signer = s.authority then some () else none
theorem cancel_access_control (s : ProgramState) (signer : Pubkey)
(h : cancelTransition s signer ≠ none) :
signer = s.authority := by
unfold cancelTransition at h
split_ifs at h with h_eq
· exact h_eq
· contradictionCPI correctness — parameters match (pure rfl):
def cancel_build_cpi (ctx : CancelContext) : TransferCpi :=
{ program := TOKEN_PROGRAM_ID, «from» := ctx.escrow_token, «to» := ctx.dest,
authority := ctx.authority, amount := ctx.amount }
theorem cancel_cpi_correct (ctx : CancelContext) :
let cpi := cancel_build_cpi ctx
cpi.program = TOKEN_PROGRAM_ID ∧ cpi.«from» = ctx.escrow_token ∧
cpi.«to» = ctx.dest ∧ cpi.authority = ctx.authority ∧
cpi.amount = ctx.amount := by
unfold cancel_build_cpi
exact ⟨rfl, rfl, rfl, rfl, rfl⟩State machine — lifecycle transitions:
def cancelTransition (s : ProgramState) : Option ProgramState :=
if s.escrow.lifecycle = Lifecycle.open then
some { escrow := { s.escrow with lifecycle := Lifecycle.closed } }
else none
theorem cancel_closes_escrow (pre post : ProgramState)
(h : cancelTransition pre = some post) :
post.escrow.lifecycle = Lifecycle.closed := by
unfold cancelTransition at h
split_ifs at h with h_open
cases h
rflConservation — invariant preserved across operations:
def conservation (s : EngineState) : Prop := s.V >= s.C_tot + s.I
def depositTransition (s : EngineState) (amount : Nat) : Option EngineState :=
if s.V + amount <= MAX_VAULT_TVL then
some { V := s.V + amount, C_tot := s.C_tot + amount, I := s.I }
else none
theorem deposit_conservation (s s' : EngineState) (amount : Nat)
(h_inv : conservation s)
(h : depositTransition s amount = some s') :
conservation s' := by
unfold depositTransition at h
split_ifs at h with h_le
· cases h
unfold conservation at h_inv ⊢ -- MUST unfold in BOTH hypothesis and goal
omega
· contradictionArithmetic safety — bounds preserved:
def initializeTransition (amount taker : Nat) : Option ProgramState :=
if amount > 0 ∧ amount ≤ U64_MAX ∧ taker > 0 ∧ taker ≤ U64_MAX then
some { initializer_amount := amount, taker_amount := taker }
else none
theorem initialize_arithmetic_safety (amount taker : Nat) (post : ProgramState)
(h : initializeTransition amount taker = some post) :
post.initializer_amount ≤ U64_MAX ∧ post.taker_amount ≤ U64_MAX := by
unfold initializeTransition at h
split_ifs at h with h_bounds
cases h
exact ⟨h_bounds.2.1, h_bounds.2.2.2⟩Critical tactic rules
| Do | Don't |
|---|---|
unfold f at h before split_ifs |
simp [f] at h before split_ifs (kills if-structure) |
unfold pred at h_inv ⊢ for named predicates |
unfold pred only in goal (omega can't see hypotheses) |
cases h after split_ifs on some = some |
injection h (unnecessary, cases handles it) |
omega for linear arithmetic |
norm_num for linear goals (omega is more reliable) |
exact ⟨rfl, rfl, rfl⟩ for conjunctions of rfl |
constructor + rfl + constructor + rfl (verbose) |
if cond then ... else ... without proof binding |
if h : cond then ... when h is unused |
Common errors and fixes
| Error | Fix |
|---|---|
omega could not prove the goal |
Unfold named predicates in hypotheses: unfold pred at h ⊢ |
no goals to be solved |
Remove redundant tactic (e.g., · contradiction after auto-closed branch) |
unknown constant 'X' |
Check imports; add import QEDGen.Solana.X or open QEDGen.Solana |
tactic 'split_ifs' failed, no if-then-else |
Use unfold first, not simp |
unused variable 'h' |
Remove proof binding: if h : cond → if cond |
Step 6: Call Leanstral for hard sub-goals
When you have a proof with sorry markers you cannot fill after 2-3 attempts:
qedgen fill-sorry --file formal_verification/Proofs/Hard.lean --validateThis sends each sorry location to Leanstral with focused context. Review the result — Leanstral may introduce tactics you can learn from for future proofs.
If fill-sorry also fails, simplify the theorem statement or split the property into smaller lemmas.
Step 7: Verify and report
cd formal_verification && lake buildUpdate SPEC.md verification results table:
- Verified: Theorem compiles, no
sorry - Partial: Proof has
sorrymarkers - Open: No compiling proof
Environment
MISTRAL_API_KEY— required forfill-sorry. Free from console.mistral.aiQEDGEN_VALIDATION_WORKSPACE— optional override for global Mathlib cache location
Error handling
- First
lake buildis slow: Mathlib compilation takes 15-45 min on first run. Subsequent builds reuse the cache. could not resolve 'HEAD' to a commit: Remove.lake/packages/mathliband runlake update.- Rate limiting (429): Built-in exponential backoff in
fill-sorry.