This is the repository accompanying the research work
NONA - A Framework for Elastic Stream Provenance
Havers, B., Papatriantafilou, M., Gulisano, V.
accepted for publication at the 44th IEEE International Conference on Distributed Computing Systems (ICDCS 2024)
The framework published here allows users to obtain the Forward Provenance graph, i.e., the live, duplicate-free graph of dependencies between input and output tuples for a set of Stream Processing queries whose composition changes dynamically, as the user adds or removes queries while the system is running. See the section "Running custom experiments" below for more details.
Furthermore, the code published here allows for a full replication of the results of the paper, with all necessary steps described below.
This setup has been tested on the Linux distributions CentOS 7 (x64), Ubuntu 20.04.2 LTS (x64 and arm64), Ubuntu 22.04 (arm64), and Ubuntu 18.04.6 LTS (arm32).
Executing the experiments requires access to two machines. One of these will serve as the data provider for the experiments and must be reachable via ssh.
You will need the following programs on your main machine:

- git
- wget
- tar
- conda
- java 8
- mvn

Furthermore, on the remote machine / data provider machine you will need:

- git
- java 8
- mvn
- unzip
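To quickly verify that these prerequisites are installed, you can run a short shell check like the following sketch (the exact version output differs between systems):

```
# Print the first line of each tool's version output; a
# "command not found" error indicates a missing prerequisite.
for tool in git wget tar conda mvn; do
    "$tool" --version | head -n 1
done
java -version        # java prints its version to stderr
# On the remote machine, additionally check:
unzip -v | head -n 1
```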
- Download this repository. In the following, we will refer to the root folder of this repository as `ROOT`.
- To download and unpack Apache Flink 1.10, execute the following commands in `ROOT`:

```
wget https://archive.apache.org/dist/flink/flink-1.10.0/flink-1.10.0-bin-scala_2.11.tgz
tar zxvf flink-1.10.0-bin-scala_2.11.tgz
```
- To download and unpack Apache Kafka 3.2.1, execute the following commands in `ROOT`:

```
wget https://archive.apache.org/dist/kafka/3.2.1/kafka_2.13-3.2.1.tgz
tar zxvf kafka_2.13-3.2.1.tgz
```
- To install kafkacat on your machine, enter `ROOT/scripts` and run the command

```
bash kcat_installer.sh
```

This will require you to have sudo privileges.
- On your remote machine / data provider at `XXX.XX.XX.XXX`, download this repository. We will refer to the root folder of this repository on your remote machine as `REMOTE_ROOT`. Then, in `REMOTE_ROOT`, execute the following to package the required Java classes:

```
mvn clean package
```

Then, download the datasets by executing the following commands in `REMOTE_ROOT/scripts` on the remote machine:

```
bash linear_road_downloader.sh
bash car_local_downloader.sh
bash mhealth_downloader.sh
```

This will download and extract the datasets into the folder `REMOTE_ROOT/input`.

Finally, download Kafka on the remote machine. In `REMOTE_ROOT`, execute:

```
wget https://archive.apache.org/dist/kafka/3.2.1/kafka_2.13-3.2.1.tgz
tar zxvf kafka_2.13-3.2.1.tgz
```
- Back on your main machine, to enable communication with your remote machine / data provider at `XXX.XX.XX.XXX`, edit the file `ROOT/configs/global_config.yaml`:
  - at `kafka_server_port_remote`, enter `"XXX.XX.XX.XXX:9092"`
  - at `remote_root`, enter `"REMOTE_ROOT"`
  - at `ssh_prefix`, enter `ssh your_remote_username@XXX.XX.XX.XXX`

NOTE: It is important that you have set up passwordless ssh login to your remote machine / data provider, as sketched below.
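For orientation, the relevant entries of `ROOT/configs/global_config.yaml` would then look like the following sketch (only the three keys named above are shown; your file may contain further settings):

```
# Sketch of the relevant entries in ROOT/configs/global_config.yaml.
# Replace the IP address, path, and username with your own values.
kafka_server_port_remote: "XXX.XX.XX.XXX:9092"    # Kafka broker of the data provider
remote_root: "REMOTE_ROOT"                        # repository root on the remote machine
ssh_prefix: ssh your_remote_username@XXX.XX.XX.XXX
```

One common way to set up passwordless ssh login, assuming an OpenSSH client, is:

```
ssh-keygen -t ed25519                            # create a key pair (skip if you already have one)
ssh-copy-id your_remote_username@XXX.XX.XX.XXX   # install your public key on the data provider
ssh your_remote_username@XXX.XX.XX.XXX           # should now log in without a password prompt
```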
- To set up the Anaconda environment, run in `ROOT`:

```
conda env create -f environment.yml
```
There are four individual experiments in the paper, corresponding to the experiment scripts in the folder `ROOT/experiments` of this repository.
In the paper, these are run on two specific devices, a server (Intel Xeon Phi, 72 cores, 1.5 GHz, 102 GB RAM) and an Odroid (Samsung Exynos 5422, Cortex-A15/Cortex-A7 octa-core, up to 2 GHz, 2 GB RAM); your hardware configuration may differ.
Furthermore, the maximum duration of each experiment and the number of repetitions may vary.
The table below lists the mapping between figure number in the paper, experiment script, device, duration (in minutes), repetitions, and plot name (used later):
| figure number | SCRIPT | device | DURATION | REPS | PLOT_NAME |
|---|---|---|---|---|---|
| 5 (a) | LR_static_overheads_odroid.yaml | odroid | 10 | 10 | lr_overheads |
| 5 (b) | CL_static_overheads_server.yaml | server | 10 | 10 | cl_overheads |
| 6 (a) | LR_static_odroid.yaml | odroid | 10 | 10 | lr_static |
| 6 (b) | RI_static_odroid.yaml | odroid | 10 | 10 | ri_static |
| 6 (c) | CL_static_server.yaml | server | 10 | 10 | cl_static |
| 7 (a) | LR_pyramid_server.yaml | server | 25 | 10 | lr_dynamic |
| 7 (b) | RI_pyramid_server.yaml | server | 25 | 10 | ri_dynamic |
| 8 | LR_pyramid_distributed_odroid.yaml | odroid | 25 | 10 | lr_dynamic_odroid_cluster |
| 9 | synthetic_descending_server.yaml | server | 25 | 10 | synthetic |
| 10 (not shown in paper) | RI_pyramid_distributed_odroid.yaml | odroid | 25 | 10 | ri_dynamic_odroid_cluster |
(odroid: Odroid XU4 2016a, ARM; server: Intel Xeon Phi server with 72 1.5 GHz cores, x64)
Note: Experiment results will differ on different hardware.
To run an experiment on the server or the Odroid, execute the following steps (using `device` as a placeholder for `server` or `odroid`):
- Copy the correct Flink configuration. In `ROOT`, execute

```
cp configs/device/flink-conf.yaml flink-1.10.0/conf/.
```

- Activate the `conda` environment:

```
conda activate nona
```

- In the folder `ROOT/scripts`, execute

```
python run.py ../experiments/SCRIPT REPS DURATION
```

where SCRIPT is the chosen experiment script. This will run the experiment described in SCRIPT REPS times, with each run taking at most DURATION minutes.
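For example, to reproduce Figure 6 (a) on the Odroid, the steps above would, according to the table, look as follows:

```
cp configs/odroid/flink-conf.yaml flink-1.10.0/conf/.
conda activate nona
cd scripts
python run.py ../experiments/LR_static_odroid.yaml 10 10
```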
NOTE 1: For running the experiment related to Figure 8, see below.
NOTE 2: During experiment execution, a host of debugging and logging information is printed to screen in addition to information about the remaining runtime of the experiment. It is safe to ignore this information; at the end of each experiment, the script will report in detail which runs succeeded, which failed, and where the experiment output is stored.
For running the distributed experiments pertaining to Figures 8 and 10 (the latter is not included in the paper), you will need four nodes (in our paper, we use Odroids for this experiment) plus the external data provider machine. One node is the main node; the other three are workers, referred to as worker1, worker2, and worker3 here.
- Perform steps 1-6 from Setup/Procedure above on the main node and the data provider, where required by the instructions above.
- Perform steps 1-2 on each worker.
- Ensure that the main node can `ssh` without a password into each of the workers.
- On the main node, enter the `ssh` handle and root of the repo into the file `ROOT/experiments/LR_pyramid_distributed_odroid.yaml`, e.g.:
```
query_workers:
  - ssh_prefix: "worker1"
    root: "/root/on/worker1"
  - ssh_prefix: "worker2"
    root: "/root/on/worker2"
  - ssh_prefix: "worker3"
    root: "/root/on/worker3"
```
- On each worker, put the IP address of the main node, YY.Y.Y.YYY:9092, into the file `/root/on/workerX/configs/global_config.yaml` at `kafka_server_port`. Furthermore, at `kafka_server_port_remote`, enter the IP address of the data provider, XX.X.X.XXXX:9092 (see the sketch below).
- On each worker, go into `/root/on/workerX` and execute the command

```
mvn clean package
```
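On each worker, the relevant entries of `configs/global_config.yaml` might then look like the following sketch (only the two keys named above are shown; all other settings stay untouched):

```
# Sketch of /root/on/workerX/configs/global_config.yaml (relevant keys only).
kafka_server_port: "YY.Y.Y.YYY:9092"            # Kafka broker on the main node
kafka_server_port_remote: "XX.X.X.XXXX:9092"    # Kafka broker on the data provider
```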
Now you can run the distributed Odroid experiment from the main node.
To run custom experiments, take a look at the yaml experiment scripts in `ROOT/experiments`. In particular, each yaml experiment script designates a file describing the sequence of transitions (the procedure) of the dynamic query set.
These procedure scripts are located at `ROOT/experiments/procedure_scripts/` and allow you, for example, to change the query that is added or removed, to change the number of queries, or even to implement a custom transition procedure, as sketched below.
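Purely for illustration, a transition procedure could conceptually look like the following sketch; the key names used here (`transitions`, `action`, `query`) are hypothetical, so consult the actual files in `ROOT/experiments/procedure_scripts/` for the real format:

```
# Hypothetical sketch of a transition procedure (key names are invented;
# see ROOT/experiments/procedure_scripts/ for the real ones).
transitions:
  - action: add       # add a first query to the running query set
    query: query_1
  - action: add       # add a second query
    query: query_2
  - action: remove    # later, remove the first query again
    query: query_1
```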
We provide a python script to recreate the figures from the paper.
To execute the script, first activate the conda environment:

```
conda activate nona
```

Then, in the directory `ROOT/scripts/visualization`, call the plotting facility:

```
python plotter.py PLOT_NAME TARGET_FOLDER [--show-in-popup]
```

where TARGET_FOLDER is the folder in which the output of your experiment runs is stored (the exact folder path is printed to the terminal after running `run.py` and will be inside `ROOT/results`). The optional flag displays the experiment plot in a popup window.
See the table in section Running Experiments above, column PLOT_NAME, for appropriate values for the PLOT_NAME parameter.
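For example, to plot the results of the Figure 6 (a) experiment, the call could look as follows; the results folder name here is hypothetical, so use the path that `run.py` printed for your run:

```
cd scripts/visualization
python plotter.py lr_static ../../results/YOUR_RUN_FOLDER
```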
Nona re-uses code from our earlier work on Forward Provenance from the repository Ananke.
