Spirit: Fair Resource Allocation in Remote Memory Systems

Spirit is a system designed to fairly allocate interdependent resources in remote memory systems, such as those using RDMA over Converged Ethernet (RoCE). It specifically targets network bandwidth and local memory resources, leveraging a price-driven, auction-based algorithm (Competitive Equilibrium from Equal Incomes, or CEEI).

If you are here for artifact evaluation, skip ahead to the "Artifact evaluation instructions" section below.

Repository Overview

This repository contains the code for Spirit, including the resource enforcer, benchmark applications, and datasets. The code is organized as follows:

Artifact Evaluation

ae: Contains the artifact evaluation scripts and setup instructions. Please check its scripts for detailed system dependencies and configuration.

Spirit System

global-enforcer: Contains the global resource enforcer implementation.
local-enforcer: Contains the local resource enforcer implementation and scripts to automatically run benchmark applications.

Remote Memory System

remote_mem: Contains the remote memory system implementation, which provides remote memory access via a swap partition on a virtual block device, fetching/evicting data from/to the memory node.

Supporting Modules

bench-mc-client: Contains the Memcached client implementation.
lib: Contains utility/library code shared between other Rust crates.
trace_loader: Contains the trace loader code, which parses and loads request traces for benchmarking. It is essentially an adapter for the libCacheSim library.
sample_configs: Contains sample configuration files for the Spirit cluster and application deployment scenarios (e.g., which applications will be collocated on which compute node).

Symbiosis Resource Allocation Algorithm

res_allocation: Contains the implementation of the Symbiosis resource allocation algorithm, along with the performance estimator. It also includes infrastructure code to run the algorithm, communicate with resource enforcers to monitor current resource usage and performance, and update the resource allocation.
scripts: Contains scripts to run experiments with the Symbiosis resource allocation algorithm. It also includes scripts to prepare Docker containers for benchmark applications.

Tested Environment

Spirit was tested on a machine featuring an Intel® Xeon® Gold 6252N CPU and Mellanox/NVIDIA ConnectX-5 NICs. The VMs run Linux 6.13 and are hosted on a Windows Server 2022 machine, which enables PEBS support inside the VMs.

Each compute VM provides 48 cores, of which 32 are used for running application workloads (e.g., server instances and microservices).
Each memory VM is provisioned with 120 GB of RAM to store data pages swapped out from the compute VMs.

Artifact evaluation instructions

CloudLab Setup

We provide a CloudLab profile for easy setup, which includes the preinstalled Linux kernel used by Spirit and this repository. You can find the profile at CloudLab Profile.

The profile uses xl170 instances equipped with Intel Xeon E5-2640 v4 processors (10 cores) and 64 GB of memory. Our example experiment with two applications (Stream and Memcached) utilizes all CPU cores (with hyperthreading enabled) and nearly all of the memory, especially on the memory node.

Start the experiment using the provided CloudLab profile. You can find CloudLab documentation here.
Once the experiment is running, SSH into the nodes. The first node (node 0) will be the 🖥️compute node, and the second node (node 1) will be the 🗂️memory node (to provide remote memory accessed via RoCE) and the controller (to repurpose its unused CPU cycles).
On the 🖥️compute node, run the first initialization script, which configures Intel PEBS and reboots the machine:

cd /opt/spirit/spirit-controller
cd ae/compute_node
./1.init.sh

On the 🗂️memory node, run the initialization script, which configures huge pages and reboots the machine:

cd /opt/spirit/spirit-controller
cd ae/memory_node
./1.init.sh

Note) Rebooting the machines usually takes 10+ minutes, so please be patient 😉 (If a node appears stuck, go to CloudLab's "Experiments" page and reboot it manually using the per-node ⚙️ button.)
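Once the 🗂️memory node is back up, you may want to confirm that huge pages were actually reserved before continuing. The snippet below is a minimal sketch, not part of the Spirit scripts. It assumes the init script reserves huge pages through the standard Linux mechanism, which the kernel reports as `HugePages_Total` in `/proc/meminfo`. The function name `hugepages_total` is hypothetical.

```shell
# Hypothetical helper (not part of the Spirit repository).
# Prints the HugePages_Total value from a meminfo-style file.
# Defaults to /proc/meminfo when no file is given.
hugepages_total() {
  local file="${1:-/proc/meminfo}"
  awk '/^HugePages_Total:/ { print $2 }' "$file"
}

# Report whether any huge pages are currently reserved on this machine.
total="$(hugepages_total 2>/dev/null)"
if [ "${total:-0}" -gt 0 ]; then
  echo "huge pages reserved: $total"
else
  echo "no huge pages reserved yet"
fi
```

If the count is zero after the reboot, rerunning `./1.init.sh` on the memory node is a reasonable first step.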

After the reboot, SSH back into the 🖥️compute node and run the second initialization script.

On the 🖥️compute node:

cd /opt/spirit/spirit-controller/ae/compute_node
./2.init_after_reboot.sh

Please follow the instructions on the screen. The script sets up system dependencies, Docker containers, and the Spirit binaries (resource enforcer, benchmark applications, and dataset). Because the CloudLab nodes have limited CPUs and memory, the artifact evaluation uses only two applications, Stream and Memcached, as examples.

Similarly, on the 🗂️memory node:

cd /opt/spirit/spirit-controller/ae/memory_node
./2.init_after_reboot.sh

This script on the 🗂️memory node will run the remote memory server program and configure a Jupyter notebook that will guide you through the artifact evaluation.

Follow the instructions in the Jupyter notebook (🗂️). To open its web interface, you can use SSH's -L option to forward the notebook port from the 🗂️memory node to your local machine.

You can open a new ssh session to the memory node with:

ssh -L <8888 or local port you want to use>:localhost:8888 <username>@<memory_node_ip>

You can then open the web interface in your local web browser at:

http://localhost:<local port above>/notebooks/spirit_ae.ipynb
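If local port 8888 is already taken, it is easy to get the forwarding command and the URL out of sync. The sketch below builds both from one port number. The function names `forward_cmd` and `notebook_url` are hypothetical helpers, not part of the repository. The only assumption carried over from the steps above is that the notebook listens on port 8888 on the memory node. The `-N` flag is a standard OpenSSH option that forwards ports without running a remote command.

```shell
# Hypothetical helpers (not part of the Spirit repository).

# Print the ssh command that forwards a local port (default 8888)
# to port 8888 on the memory node.
forward_cmd() {
  local user="$1" host="$2" port="${3:-8888}"
  printf 'ssh -N -L %s:localhost:8888 %s@%s\n' "$port" "$user" "$host"
}

# Print the notebook URL that matches the chosen local port.
notebook_url() {
  local port="${1:-8888}"
  printf 'http://localhost:%s/notebooks/spirit_ae.ipynb\n' "$port"
}

# Example with placeholder values; substitute your own username and node IP.
forward_cmd alice 10.0.0.2 9000
notebook_url 9000
```

Run the printed ssh command in a separate terminal, keep it open, and then visit the printed URL.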

Note) If the Jupyter notebook does not open a file automatically, please open spirit_ae.ipynb.

Note) If you need to start only the Jupyter notebook (e.g., if the notebook is terminated), you can use (🗂️):

cd /opt/spirit/spirit-controller/ae/memory_node
./3.run_notebook.sh

Reference

This repository contains the code for the paper "Spirit: Fair Allocation of Interdependent Resources in Remote Memory Systems," presented at SOSP 2025.
