Skip to content

dlshriver/dnnv-benchmarks

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DNNV Benchmarks

A collection of DNN verification benchmarks, specified in DNNP and ONNX for use with DNNV and DNNF.

Usage

To access the benchmarks, simply clone this repo:

$ git clone https://github.com/dlshriver/benchmarks.git

We currently include 7 benchmarks in DNNP and ONNX format. Each one is in its own sub-directory in the benchmarks directory, and each has a README with a description of the benchmark.

We also include some useful tool scripts in the tools directory. tools/resmonitor.py will run a command with specified memory and time limits. tools/run_dnnv.py can run all the problems in a specified benchmark using dnnv on multiple processes with resource limits.

To check the problems in the benchmarks, first install DNNV, as well as whichever verifiers you wish to use. After installation, DNNV can be run on a single problem as follows:

$ dnnv /path/to/property --network N /path/to/network --VERIFIER

For example, to run the nnenum verifier on property 1 and network N_1_1.onnx from the ACAS Xu benchmark, run:

$ dnnv benchmarks/ACAS_Xu/properties/property_1.py --network N benchmarks/ACAS_Xu/onnx/N_1_1.onnx --nnenum

To run all of the problems in a benchmark, we provide tools/run_dnnv.py. This tool expects both DNNV and pandas to be installed.

$ tools/run_dnnv.sh -b /path/to/benchmark/dir/ -p /path/to/problems_csv --VERIFIER

For example, to run the nnenum verifier on the ACAS Xu benchmark, run:

$ tools/run_dnnv.sh -b benchmarks/ACAS_Xu -p problems_original.csv --nnenum

Additional options (such as those for restricting the time and memory usage) can be seen by running:

$ tools/run_dnnv.sh -h

Any additionally specified arguments will be passed directly to DNNV.

About

A Collection of DNN Verification Benchmarks

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published