Skip to content

Latest commit

 

History

History
99 lines (70 loc) · 7.1 KB

README.md

File metadata and controls

99 lines (70 loc) · 7.1 KB

rFaaS: RDMA-Enabled FaaS Platform for Serverless High-Performance Computing

A high-performance FaaS platform with RDMA acceleration for function invocations.

License GitHub issues GitHub pull requests

rFaaS vs HPC vs FaaS The cloud paradigm Function-as-a-Service (FaaS) provides an ability to execute stateless and fine-grained functions on elastic and ephemeral resources. However, serverless struggles to achieve the performance needed in high-performance computing: slow invocations, low network bandwidth, and the overheads of the FaaS management system make it difficult to incorporate serverless functions when every millisecond counts. Therefore, we decided to combine the best of both worlds: elasticity of FaaS and high-performance of cluster batch systems. We built a new FaaS platform with RDMA-accelerated network transport.

rFaaS is a serverless platform redesigned to support high-performance and low-latency invocations with a direct RDMA connection. In rFaaS, the centralized schedulers and API gateway are replaced with a decentralized allocation mechanism. Instead of using a traditional cloud trigger, HPC applications query executor servers, obtain resource allocation and establish RDMA connections to remote workers. Every function is invoked by writing input data directly to the memory of the worker. This allows us to achieve a single-digit microsecond hot invocation latency - hot invocations add less than 350 nanoseconds overhead on top of the fastest available network transmission.

To use rFaaS, please read the documentation on software and hardware requirements, installation instructions, and the basic example of using rFaaS. rFaaS comes with a set of benchmark applications and tests. We provide an extended set of C++ serverless functions, including multimedia and ML inference examples from the serverless benchmarking suite SeBS. Finally, you can find more details about rFaaS in the documentation on the system and the client rFaaS library.

Do you have further questions not answered by our documentation? Did you encounter troubles with installing and using rFaaS? Or do you want to use rFaaS in your work and you need new features? Feel free to reach us through GitHub issues or by writing to [email protected].

Paper

When using rFaaS, please cite our arXiv paper preprint, and you can find more details about research work in this paper summary. You can cite our software repository as well, using the citation button on the right.

@misc{copik2021rfaas,
   title={RFaaS: RDMA-Enabled FaaS Platform for Serverless High-Performance Computing}, 
   author={Marcin Copik and Konstantin Taranov and Alexandru Calotoiu and Torsten Hoefler},
   year={2021},
   eprint={2106.13859},
   archivePrefix={arXiv},
   primaryClass={cs.DC}
}

Requirements

Hardware rFaaS supports SoftROCE and RoCE RDMA NICs with the help of ibverbs. Evaluation and testing with IB fabric is currently in progress.

In future versions, we plan for rFaaS to support Cray interconnect through libfabric and its ugni provider.

Software Currently, rFaaS works only on Linux systems as we realy heavily on POSIX interfaces. We require the following libraries and tools:

  • CMake >= 3.11.
  • C++ compiler with C++17 support.
  • libibverbs with headers installed.
  • librdmacm with headers installed.
  • pistache - HTTP and REST framework.

Furthermore, we fetch and build the following dependencies during CMake build - unless they are found already in the system.

Containers rFaaS supports two types of function executors - a bare-metal process and a Docker container. For Docker, we use the SR-IOV plugin from Mellanox to run Docker-based function executors with virtual NIC device functions. Please follow Mellanox documentation and instructions to install and configure the plugin. rFaaS expects that docker_rdma_sriov binary is available in PATH.

In future versions, we plan to support Singularity containers and offer a simpler, but less secure Docker networking.

Installation

To build rFaaS, run the following CMake configuration:

cmake -DCMAKE_CXX_COMPILER=<your-cxx-compiler> -DCMAKE_BUILD_TYPE=Release <source-dir>
cmake --build .

To enable more verbose logging, change the CMake configuration parameter to: -DCMAKE_BUILD_TYPE=Debug.

The CMake installation has the following optional configuration parameters.

Arguments
WITH_EXAMPLES EXPERIMENTAL Build additional examples (see examples subsection for details on additional dependencies).
WITH_TESTING EXPERIMENTAL Enable testing - requires providing device database and testing configuration (see below). See testing subsection for details.
DEVICES_CONFIG File path for the JSON device configuration.
TESTING_CONFIG File path for the JSON device configuration.
CXXOPTS_PATH Path to an existing installation of the cxxopts library; disables the automatic fetch and build of the library.
SPDLOG_PATH Path to an existing installation of the spdlog library; disables the automatic fetch and build of the library.
LIBRDMACM_PATH Path to a installation directory of the librdmacm library.

Usage

To learn how to use rFaaS, please follow the tutorial

For an in-depth analysis of each component and their configuration, please look at the system documentation.

Authors