A Multi-Level Library for Understanding I/O Activity in HPC Applications
We believe that multi-level I/O tracing and trace data analysis tool can help end users understand the behavior of their application and I/O subsystem, and can provide insights into the source of I/O performance bottlenecks.
Recorder is a multi-level I/O tracing framework that can capture I/O function calls at multiple levels of the I/O stack, including HDF5, MPI-IO, and POSIX I/O. Recorder requires no modification or recompilation of the application and users can control what levels are traced.
We chose to build Recorder as a shared library so that it does not require modification or recompilation of the application. Recorder uses function interpositioning to prioritize itself over standard functions, as shown in the Figure below. Once Recorder is specified as the preloading library, it intercepts HDF5 function calls issued by the application and reroutes them to the tracing implementation where the timestamp, function name, and function parameters are recorded. The original HDF5 function is called after this recording process. The mechanism is the same for the MPI and POSIX layers.
Note that your application and Recorder must use the same version of HDF5 and MPI.
1. Install manually (recommended)
git clone https://github.com/uiuc-hpc/Recorder.git
cd Recorder
./autogen.sh
./configure --prefix=[install location]
make
make install
By default, Recorde traces function calls from all levels: HDF5, MPI and POSIX.
Options for configure
can be used to disable one ore more levels of traces. Valid options:
- --disable-posix
- --disable-mpi
- --disable-hdf5
Other options:
(1) fcntl
:
Since v2.1.7, fcntl(int fd, int cmd, ...) is intercepted. The commands (2nd argument) defined in POSIX standard
are supported. If non-POSIX commands were used, please disable fcntl tracing at configure time with --disable-fcntl
.
If MPI or HDF5 is not installed in standard locations, you may need to set CFLGAS and LDFLAGS to specify their location, e.g.,
./configure --prefix=[install location] CFLAGS=-I/path/to/hdf5/include LDFLAGS=-L/path/to/hdf5/lib
(2) Logging pointers
Since v2.1.8, Recorder by default does not log the pointers (memory addresses) any more as they provide little information yet
cost a lot of space.
However, you can change this behaviour by set the enviroment RECORDER_LOG_POINTER
to 1.
2. Install from Spack
spack install recorder
By default Recorder generates traces from all levels, you can use ~ to disable a specific level.
E.g., the following command will install Recorder with HDF5 and MPI tracing disabled.
spack install recorder~hdf5~mpi
Assume $RECORDER_ROOT
is the location where you installed Recorder.
1. Generate traces.
LD_PRELOAD=$RECORDER_ROOT/lib/librecorder.so mpirun -np N ./your_app
mpirun can be changed to your workload manager, e.g. srun.
The trace files will be written to the current directory.
2. Visualization
We use Python libraries pandas, bokeh and prettytable to generate the report.
Please install those dependencies first.
Also, the visulation may take some time if there are a large number of I/O records.
python $RECORDER_ROOT/bin/reporter/reporter.py /path/to/your_trace_folder/
The visualization report (recorder-report.html) will be written into the current directory. Below are some example graphs generated from the FLASH traces.
3. Human-readable traces?
Recorder uses its own binary tracing format to compress and store traces.
We provide a tool (recorder2text) that can convert the recorder format traces to plain text format.
$RECORDER_ROOT/bin/recorder2text /path/to/your_trace_folder/
This will generate text fomart traces under /path/to/your_trace_folder/_text
.
Recorder 2.2.0 Jan 25, 2021
- Add support for MPI_Cart_sub, MPI_Comm_split_type, etc.
- Assign each MPI_Comm object a globally unique id.
Recorder 2.1.9 Jan 14, 2021
- Clean up the code
- Fixed a few memory leak issues
- Add support for fseeko and ftello
Recorder 2.1.8 Dec 18, 2020
- Add MPI_Test, MPI_Testany, MPI_Testsome and MPI_Testall
- Add MPI_Ireduce, MPI_Iscatter, MPI_Igather and MPI_Ialltoall
- Do not log pointers by default as it delivers not so much information
Recorder 2.1.7 Nov 11, 2020
- Add fcntl() support. Only support commands defined in POSIX standard.
- Add support for MPI_Ibcast()
Recorder 2.1.6 Nov 05, 2020
- Generate unique id for communicators
- Fix bus caused by MPI_COMM_NULL
- Add support for MPI_File_get_size
Recorder 2.1.5 Aug 27, 2020
- Add MPI_File_seek and MPI_File_seek_shared
- Add documentation on how to install using Spack.
Recorder 2.1.4 Aug 26, 2020
- Update LICENSE
- Update automake/autotools files to make it ready for Spack
Recorder 2.1.3 Aug 24, 2020
- Use autotools and automake for compilation.
- Add support for MPI_Comm_split/MPI_Comm_dup/MPI_Comm_create
- Store the value of MPI_Status
Recorder 2.1.2 Aug 06, 2020
- Rewrite the reader program with C.
- Add Python bindings to call C functions.
- Add support for MPI_Waitall/Waitany/Waitsome and MPI_Ssend
- Remove oft2 converter.
- Clean up the Makefile.
Recorder 2.1.1 Jun 28, 2020
- Use uthash library to replace the original hash map implementation
- Remove zlib support
Recorder 2.1 May 15, 2020
- Dump a VERSION file for the reader script to decide the trace format.
- Include the return value in each record.
- Implement conflict detection algorithm for commit semantics and session semantics.
Recorder 2.0.1 Nov 7, 2019
- Implement compressed peephole encoding schema.
- Intergrat zlib as another compression choice.
- Users can choose compression modes by setting environment variables.
Recorder 2.0 Jul 19, 2019
- Add the binary format output.
- Implement a converter that can output OTF2 trace format.
- Write a separate log unit to provide an uniform logging interface. Re-write most of the code to use this new log unit.
- Ignore files (e.g. /sockets) that are not used by the application itself.
- Add a built-in hashmap to support mappings from function name and filename to integers.
- Put all function (that we plan to intercept) signatures in the same header file