GitHub - uncle-betty/satarash: The world needs more LRAT proof checkers. This one's in Agda.

NOTE - I've abandoned this code. Its performance leaves much to be desired and I'll start over from scratch based on what I learned. I just made too many wrong assumptions regarding how to write fast code in Agda.

Sataraš (satarash)

Today's SAT solvers offer to produce a proof of any unsatisfiability result they arrive at for a given Boolean formula. In order to keep honest SAT solvers honest, we should verify these proofs. Ideally, this would be done with a certified verifier.

Sataraš is my attempt at naively implementing such a certified verifier while learning Agda. More efficient certified verifiers exist, implemented in Coq or ACL2, for example.

SAT solvers produce what's known as DRAT proofs. Starting with the original Boolean formula in CNF, a DRAT proof consists of a sequence of proof steps that successively modify the original formula by adding or removing clauses. Each step is designed to preserve the unsatisfiability of the formula created by the previous step and thus, transitively, the unsatisfiability of the original formula. The formula produced by the last step is false, i.e., trivially unsatisfiable, so that all steps together transitively prove that the original formula is unsatisfiable.

DRAT proofs can be converted into equivalent LRAT proofs, which are easier to verify. Sataraš follows the verification algorithm for LRAT proofs put forward in the paper that first introduced them: Efficient Certified RAT Verification. It's available here: https://www.cs.utexas.edu/users/marijn/publications/lrat.pdf

The input to a SAT solver is a CNF given in the text-based DIMACS format. Note that my DIMACS parser is a little finicky. It doesn't accept comment lines, empty lines, or more than one space between tokens.

For a C++ prototype of the verifier, run make checker in the top-level directory of this repository. For the Agda version, run make Checker. As of this writing, I'm using the latest version of Agda and the standard library available from the Agda GitHub repo.

The top-level verification function looks like this:

checkLRAT : (f : Formula) → Proof → Maybe (∀ a → eval a f ≡ false)

It takes a formula f in CNF along with an LRAT proof. When the proof is considered invalid, it evaluates to nothing. Otherwise, it provides a proof that for all assignments a to the variables in f, f evaluates to false.

In other words, bugs in Sataraš may make it reject a valid LRAT proof, but won't ever make it accept an invalid LRAT proof.

In LRAT proofs, the clauses of a formula are identified by unbounded numeric indices. Sataraš represents these indices as fixed-length bit vectors, i.e., in binary form, and keeps the clauses in a trie. The length of the bit vectors is given by bitsᶜ in Verifier.agda. By default, bitsᶜ is 24, which is good for proofs with up to ~16.8 million clauses. If necessary, it can be increased.

When a proof step deletes a clause from the CNF, Sataraš records its index, so that it can be reused when subsequent proof steps add new clauses to the CNF. So, when the LRAT proof simply assigns a new index to an added clause, Sataraš reuses an old index, if available. This leads to a discrepancy in indices between the LRAT proof and Sataraš's internal representation and we need a mapping to bridge this discrepancy. This mapping is a Translator (in Parser.agda). Indices of deleted clauses are collected for future reuse in a Recycler (also in Parser.agda). The idea is to be economic about indices as they aren't unbounded in Sataraš. This allows for a lower bitsᶜ and thus for quicker trie lookups. This tweak is the only major deviation from the verification algorithm in the above paper.

Let's take a look at the mul_com.v example in the test subdirectory to see how things fit together. The file contains the Verilog description of a combinational circuit (= a Boolean formula) that outputs (= evaluates to) true, if the multiplication of two 10-bit values isn't commutative. Proving the formula unsatisfiable proves that 10-bit multiplication is commutative.

First we need to obtain a CNF to be fed to the SAT solver. We derive the CNF from our Verilog code with the open-source Verilog synthesis suite Yosys, which is available here: https://github.com/YosysHQ/yosys

The synth.sh script in the test subdirectory simplifies running Yosys in order to produce mul_com.cnf, which contains the CNF corresponding to the Verilog code.
Next, we need a SAT solver to prove the formula in mul_com.cnf unsatisfiable. I use Cadical, which can be found here: https://github.com/arminbiere/cadical

In order to produce a DRAT proof of mul_com.cnf's unsatisfiability, run cadical mul_com.cnf mul_com.drat. This writes the DRAT proof to mul_com.drat.
Next, we need to turn the DRAT proof into an LRAT proof. That's what drat-trim is for, which can be found here: https://github.com/marijnheule/drat-trim

The conversion is done with drat-trim mul_com.cnf mul_com.drat -L mul_com.lrat. This writes the LRAT proof to mul_com.lrat. In the repo, this file is compressed so that it doesn't exceed GitHub's file size limit.
Finally, we can verify that mul_com.lrat is, in fact, a proof of mul_com.cnf's unsatisfiability. From within the test subdirectory, run ../Checker mul_com.cnf mul_com.lrat and wait for an ok result. This may take a while. As said above, the implementation and the formula representation of the checker is pretty naive.

Name		Name	Last commit message	Last commit date
Latest commit History 130 Commits
src		src
test		test
.gitignore		.gitignore
Makefile		Makefile
README.md		README.md
checker.cc		checker.cc
satarash.agda-lib		satarash.agda-lib
satarash.sh		satarash.sh
test.sh		test.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

uncle-betty/satarash

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages