Details of how SDPB works are described in the manual. An example input file pvm.xml is included with the source code.
The build system creates the executables pvm2sdp
, sdp2input
, and
sdpb
in the build
directory. There are two steps when running
SDPB.
You will normally start with either an SDP file in Mathematica or JSON
format, or a Polynomial Vector Matrices file in XML format. These
must first be converted, using sdp2input
or pvm2sdp
, into a format
that SDPB can quickly load. The format is described in
SDPB_input_format.md. When creating these
input files, you must choose a working precision. In general, you
should use the same precision as when you run sdpb
, though you may
also use a larger precision. Both sdp2input
and pvm2sdp
will run
faster in parallel.
Use sdp2input
to create input from files with an SDP. The usage is
sdp2input --precision=[PRECISION] --input=[INPUT] --output=[OUTPUT]
[PRECISION]
is the number of bits of precision used in the
conversion. [INPUT]
is a single Mathematica, JSON, or NSV
(Null Separated Value) file. [OUTPUT]
is an output directory.
The single file Mathematica and JSON formats are described in Section 3.2 of the manual. In addition, for JSON there is a schema.
The NSV format allows you to load an SDP from multiple JSON and/or
Mathematica files. NSV files contain a list of files, separated by
null's. When using multiple files, each component is optional. If
there are multiple versions of normalization
or objective
,
sdp2input
will use the last one it sees. Multiple elements of
PositiveMatrixWithPrefactor
will be concatenated together.
One way to generate NSV files is with the find
command. For
example, if you have a directory input
with many files, you can
generate an NSV file with the command
find input/ -name "*.m" -print0 > file_list.nsv
sdp2input
assumes that files ending with .nsv
are NSV,
files ending with .json
are JSON, and .m
is
Mathematica. NSV files can also recursively reference other NSV
files.
There are example input files in Mathematica, JSON, and NSV format. They all define the same SDP, with the NSV example loading the SDP from two Mathematica files: sdp2input_split1.m and sdp2input_split2.m.
Use pvm2sdp
to create input files from Polynomial Vector Matrix files
in XML. The format for these XML files is described in Section 3.1 of
the manual. The usage is
pvm2sdp [PRECISION] [INPUT] ... [OUTPUT]
[PRECISION]
is the number of bits of precision used in the
conversion. [INPUT] ...
is a list of one or more files. Each of
those input files can be either an XML file with Polynomial Vector
Matrices, or an Null Separated Value (NSV) file with a list of
files. [OUTPUT]
is an output directory.
For example, the command to convert the file test/data/pvm2sdp/pvm.xml
, using
1024 bits of precision, and store the result in the directory
test/out/pvm2sdp/sdp.zip
, is
pvm2sdp 1024 test/data/pvm2sdp/pvm.xml test/out/pvm2sdp/sdp.zip
For input, you can also use an NSV file containing a list of file names
separated by nulls. There is an example in test/data/pvm2sdp/file_list.nsv
.
Using it is similar to the xml files
pvm2sdp 1024 test/data/pvm2sdp/file_list.nsv test/out/pvm2sdp/sdp.zip
One way to generate this file list is with the find
command. For
example,
find test/data/pvm2sdp/ -name "pvm.xml" -print0 > test/data/pvm2sdp/file_list.nsv
will regenerate test/data/pvm2sdp/file_list.nsv
. If you have a directory input
with many xml files, you can generate a file list with the command
find test/data/pvm2sdp/ -name "*.xml" -print0 > test/data/pvm2sdp/file_list.nsv
pvm2sdp
assumes that anything with the .nsv
extension is a null
separated file list, and .xml
is an xml file. File lists
can also recursively reference other file lists. So running
find test/data/pvm2sdp/ a-print0 -name "*.nsv" > test/out/file_list_list.nsv
pvm2sdp 1024 test/out/file_list_list.nsv test/out/pvm2sdp/sdp.zip
will also work.
The options to SDPB are described in detail in the help text, obtained
by running build/sdpb --help
. The most important options are -s [--sdpDir]
,
--precision
, and --procsPerNode
.
You can specify output and checkpoint directories by -o [ --outDir ]
and -c [ --checkpointDir ]
, respectively.
SDPB uses MPI to run in parallel, so you may need a special syntax to
launch it. For example, if you compiled the code on your own laptop,
you will probably use mpirun
to invoke SDPB. If you have 4 physical
cores on your machine, the command is
mpirun -n 4 build/sdpb --precision=1024 --procsPerNode=4 -s test/data/sdp.zip -o test/out/sdpb -c test/out/sdpb/ck
On the Yale Grace cluster, the command used in the Slurm batch file is
mpirun build/sdpb --precision=1024 --procsPerNode=$SLURM_NTASKS_PER_NODE -s test/data/sdp.zip -o test/out/sdpb -c test/out/sdpb/ck
In contrast, the Harvard Odyssey 3 cluster, which also uses Slurm, uses the srun command
srun -n $SLURM_NTASKS --mpi=pmi2 build/sdpb --precision=1024 --procsPerNode=$SLURM_NTASKS_PER_NODE -s test/data/sdp.zip -o test/out/sdpb -c test/out/sdpb/ck
The documentation for your HPC system will tell you how to write a batch script and invoke MPI programs.
To efficiently run large MPI jobs, SDPB needs an accurate measurement
of the time to evaluate each block. If block_timings
does not
already exists in the input directory or a checkpoint directory, SDPB
will create one. SDPB will run for 2 iterations and write the time to
evaluate each block into block_timings
. SDPB has to run for 2
iterations because measuring the first step generally gives a poor
estimate. During the first step, many quantities may be zero.
Adding and multiplying zero is much faster with extended precision.
If you are running a large family of input files with the same
structure but different numbers, the measurements are unlikely to
differ. In that case, you can reuse timings from previous inputs by
copying the block_timings
file to other input directories.
If different runs have the same block structure, you can also reuse
checkpoints from other inputs. For example, if you have a previous
checkpoint in test/out/test.ck
, you can reuse it for a different input
in test/data/sdp2.zip
with a command like
mpirun -n 4 build/sdpb --precision=1024 --procsPerNode=4 -s test/data/sdp2.zip -i test/out/test.ck
In addition to having the same block structure, the runs must also use
the same precision
, procsPerNode
, and number and distribution of
cores.
SDPB's defaults are set for optimal performance. This may result in
using more memory than is available. Running SDPB on more nodes will
reduce the amount of memory required on each node. If this is not
sufficient, you can also use the option --procGranularity
.
This option sets minimum number of processes that a block group can
have, so it must evenly divide the --procsPerNode
option. Using a
larger granularity will result in less memory use (up to a point)
because SDPB will make fewer local copies of the matrix Q. However,
larger granularity is also slower because even small blocks will be
distributed among multiple cores. So you should use
--procGranularity
only when absolutely needed.
If you have a family of SDP's and a solution to one of these SDP's,
approx_objective
can compute the approximate value of the objective
for the rest of the family. The approximation is, by default,
quadratic in the difference between the two SDP's (b
, c
, B
).
approx_objective
assumes that the bilinear bases A
are the same.
To compute approximate objectives, write out a text checkpoint when
computing the initial solution with sdpb
by including the option
--writeSolution=x,y,X,Y
. approx_objective
will then read in this
solution, setup a solver, and compute the new objective.
You specify the location of the new SDP with the option --newSdp
.
This file would be the output of pvm2sdp
or sdp2input
. If you
have multiple SDP's, then you can create an NSV as in the instructions
for sdp2input
that list all of the files.
Setting up the solver can take a long time. If you have an SDP that
you have perturbed in many different directions, and for logistical
reasons you need to run approx_objective
separately for each one,
you can save time by saving the solver state with the option
--writeSolverState
. To that end, you can also run
approx_objective
without any new SDP's, only saving the solver
state.
A full example of the whole sequence is
mpirun -n 4 build/sdpb --precision=1024 --procsPerNode=4 -s test/data/sdp.zip -o test/out/approx_objective --writeSolution=x,y,X,Y
mpirun -n 4 build/approx_objective --precision=1024 --procsPerNode=4 --sdp test/data/sdp.zip --writeSolverState
mpirun -n 4 build/approx_objective --precision=1024 --procsPerNode=4 --sdp test/data/sdp.zip --newSdp=test/data/sdp2.zip
The output is a JSON list with each element including the location of the new SDP, the approximate objective, and the first and second order terms. There is a JSON schema describing the format.
The objective can be surprisingly sensitive to small changes,
so the first and second order terms should give you an idea of how
accurate the approximation is. In general, the first order term
d_objective
, should be smaller than objective
. Also, the second
order term dd_objective
should be similar in size to the square
of d_objective
scaled by objective
.
dd_objective ~ (d_objective / objective)²
If this is not the case (e.g. dd_objective > d_objective
), then the
approximate objective is probably inaccurate.
If the linear approximation is accurate enough for you, you can use
the --linear
option. This avoids the one-time cost of an expensive
solve. Also, it only requires the solutions for x
and y
, which
SDPB writes out by default.
Another thing that you can do now that you have a solution is to
extract the spectrum. You could use Spectrum
Extraction,
but SDPB now ships with spectrum
, its own parallel implementation of
spectrum extraction. The options are described in more detail in the
help text, obtained by running spectrum --help
. As a simple
example, extracting the spectrum from the toy example would be
mpirun -n 4 build/spectrum --input=test/data/spectrum/pvm.xml --solution=test/data/spectrum/solution --threshold=1e-10 --format=PVM --output=test/out/spectrum/spectrum.json --precision=1024
This will output the spectra into test/out/spectrum/spectrum.json
and should look like
[
{
"zeros":
[
{
"zero": "1.042496785718158120984006519404233029358116776979134529895423630711390744689477902669840296432317928924423313326990327506518264981179194497998966064495830449099906260064344924049936036999283366682972235210076078068737842418915631791070882527833746310933972007746317544906692813981560963543407009636600477760275",
"lambda":
[
"0.9185423261272011850274891418839269990813583260329451648910407000720586506746630441457833744183236806171597526139823408776201480812734713281917240661102772732679114912585794391259809037078324838527906848460941323049630998481551259846120994742359929339200105456931828844698382241896844615225241582897538916998522"
]
}
],
"error": "3.270654739398825572856615907651196430155073411176062513853164640754491556816769579861611709003779514625636694652117900107560196318376153341414775658107623405929672675109050400217760922806803922272334854654452970680533543987925006373170235909839750992547956738264810134223945517825404125007202619504819133489523e-26"
}
]
It is a json file with arrays of zeros. There is a JSON schema describing the format.