Maximal biclique enumeration (MBE) in bipartite graphs is an important problem in data mining with many real-world applications. All existing solutions for MBE are designed for CPUs. Parallel MBE algorithms for GPUs are needed to accelerate MBE by leveraging their many computing cores. However, enumerating maximal bicliques on GPUs poses three main challenges: large memory requirements, thread divergence, and load imbalance. In this paper, we propose GMBE, the first highly efficient GPU solution to the MBE problem. To overcome these challenges, we design a stack-based iteration approach to reduce GPU memory usage, a pro-active pruning method using each vertex's local neighborhood size to alleviate thread divergence, and a load-aware task scheduling framework to achieve load balance among threads within GPU warps and blocks. Our experimental results show that GMBE on an NVIDIA A100 GPU achieves a 70.6x speedup over the state-of-the-art parallel MBE algorithm PARMBE on a 96-core CPU machine.
A machine with one or more NVIDIA GPUs.
- GNU Make 4.2.1
- CMake 3.22.0
- CUDA toolkit 11.7
- GCC/G++ 10.3.0
- Python 2.7.18
- Python packages: zplot 1.41, pathlib 1.0.1
- C++ library: libtbb-dev 2020.1-2
- Ubuntu apt packages: ghostscript, texlive-extra-utils
- Nvidia driver 510.85.02
- Docker 20.10.10
We provide a Docker image for convenient deployment of this project. Before running the image, install the required packages and pull the image with the following commands.
sudo apt install nvidia-docker2
docker pull fhxu00/gmbe
To run the Docker image, execute the following command. Ensure that the host machine has the NVIDIA driver version listed above installed.
docker run -it --gpus all --name gmbe-test fhxu00/gmbe bash
We have prepared the source code in the directory ~/MBE-GPU and downloaded all testing datasets into the Docker image. You can now compile the source code and run the testing scripts inside the container as follows.
Using the following commands, you can easily compile GMBE. The generated executable is located at bin/MBE_GPU.
# Get source code
git clone --recursive [repository_url]
cd MBE-GPU
# Compile for a specific GPU type. If your GPU is an A100, V100, or 2080Ti, replace [GPU_TYPE] with that GPU type;
# otherwise, revise the file CMakeLists.txt under the directory src/ to support your GPU.
bash ./scripts/compile-GMBE.sh [GPU_TYPE]
You can refer to QA-1 below for how to add support for other GPUs.
For convenience, we provide a script to download and preprocess the datasets. After running the following command, you will
find the preprocessed datasets under the new directory datasets/. Each dataset is provided in two formats: an adjacency
format stored with the extension .adj, and an edge-pair format stored with the extension .graph. The latter is prepared
specifically for ooMBEA and PARMBE.
bash ./preprocess/prepare_dataset.sh
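The two formats carry the same graph, only laid out differently. As a loose illustration (the exact on-disk layouts are assumptions here, including the one-edge-per-line .graph layout; check preprocess/prepare_dataset.sh for the real formats), an edge-pair file can be regrouped into an adjacency-style view with awk:

```shell
# Assumed .graph layout: one "u v" edge pair per line.
cat > /tmp/example.graph <<'EOF'
0 1
0 2
1 2
EOF

# An adjacency-style view groups each left vertex with its neighbor list.
awk '{adj[$1] = adj[$1] " " $2} END {for (u in adj) print u ":" adj[u]}' \
    /tmp/example.graph | sort
```

This prints one left vertex per line followed by its neighbors, which is the general shape of an adjacency format.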
You can run GMBE with the following command-line options.
./bin/MBE_GPU
-i: Path of the input dataset file.
-s: Select the GMBE version to run. 0: GMBE-WARP, 1: GMBE-BLOCK, 2: GMBE, 3: GMBE-Multi-GPUs.
-x: Number of GPUs used to run GMBE; only effective in the multi-GPU version.
-m: bound_height (default: 20).
-n: bound_size (default: 1500).
-p: Enable printing the exit time of each SM.
-f: Disable computing statistical information about the graph. Recommended.
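For example, a typical invocation might look as follows (the dataset path is a placeholder; substitute any preprocessed .adj file under datasets/):

```shell
# Run the full GMBE version (-s 2) with graph statistics disabled (-f);
# datasets/example.adj is a placeholder path.
./bin/MBE_GPU -i datasets/example.adj -s 2 -f
```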
We provide scripts to generate the experimental results of Figures 6-13 and Table 2 in the directory scripts/. You can execute the scripts as follows.
# Running on a machine with a 96-core CPU and a GPU
bash ./scripts/gen-fig-6.sh [GPU_TYPE]
# Running on any machine
bash ./scripts/gen-fig-7.sh
# Running on a machine with a GPU
bash ./scripts/gen-fig-8.sh [GPU_TYPE]
# Running on a machine with a GPU
bash ./scripts/gen-fig-9.sh [GPU_TYPE]
# Running on a machine with a GPU
bash ./scripts/gen-fig-10.sh [GPU_TYPE]
# Running on a machine with a GPU
bash ./scripts/gen-fig-11.sh [GPU_TYPE]
# Running on three machines with an A100, a V100, and a 2080Ti, respectively.
# It is required to collect the results in ./fig/fig-12/[GPU_TYPE].data
# from the different machines into ./fig/fig-12/fig-12.data on a single machine.
# The README.md under fig/fig-12 specifies the format of fig-12.data.
bash ./scripts/gen-fig-12.sh [GPU_TYPE]
# Running on a machine with 8 GPUs
bash ./scripts/gen-fig-13.sh [GPU_TYPE]
We provide a script to generate the figures in the directory fig/ using the results generated above. You can execute the script as follows.
cd fig/
bash genfig.sh
Then you will find the figures under the directory fig/.
To generate the experimental results of Table 2, you can execute the script as follows.
# Running on a machine with a GPU
bash ./scripts/gen-table-2.sh [GPU_TYPE]
Then you will find the experimental result under the directory table/.
- How to solve the error
GPUassert: named symbol not found /usr/local/MBE-GPU/src/IterFinderGpu.cu 215?
This usually means that the flag CUDA_NVCC_FLAGS in src/CMakeLists.txt does not match your GPU. To solve the problem, revise the file CMakeLists.txt to support your GPU. First, look up the CUDA_NVCC_FLAGS matching your GPU from this link. Second, add an if branch to set CUDA_NVCC_FLAGS for your GPU in CMakeLists.txt.
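As a hedged sketch, such a branch might look like the fragment below. The variable name CUDA_NVCC_FLAGS follows the error message above, but the GPU_TYPE variable and the branch structure are assumptions; mirror the existing A100/V100/2080Ti branches in src/CMakeLists.txt rather than copying this verbatim. As one example, an RTX 3090 has compute capability 8.6:

```cmake
# Hypothetical branch for a compute-capability-8.6 GPU (e.g., RTX 3090);
# adapt the variable names to match the real src/CMakeLists.txt.
if (GPU_TYPE STREQUAL "3090")
  set(CUDA_NVCC_FLAGS "${CUDA_NVCC_FLAGS} -gencode arch=compute_86,code=sm_86")
endif()
```

The compute_XX/sm_XX pair must match your GPU's compute capability, which you can look up in NVIDIA's CUDA GPU list.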