Federated Learning (FL) with GRPO Setup Guide

This guide provides setup instructions for running GRPO (Group Relative Policy Optimization) experiments using FedML.

Initial Setup

Note: Server and Client(s) use the same initial setup process.

1. Clone Repository

git clone --recurse-submodules https://github.com/bagel-org/FedML.git

2. Install Dependencies

cd FedML
pip install -r python/spotlight_prj/fedllm/requirements.txt
pip install "trl>=0.9.0" "accelerate>=0.27.0"
pip install -e python/
cd python/spotlight_prj/fedllm

3. Environment Configuration

Set up AWS credentials:

export AWS_ACCESS_KEY_ID=<your_key>
export AWS_SECRET_ACCESS_KEY=<your_other_key>

Generate a unique run ID:

export RUN_ID=$(python -c "import uuid; print(uuid.uuid4().hex)")

Important: Server and Client(s) should all use the same run ID for a given run to avoid data conflicts in the S3 bucket.
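
Because the run ID is usually copied by hand from the server to each client, a quick format check can catch a truncated or mistyped value. The sketch below is not part of the FedML scripts; the helper name is_valid_run_id is hypothetical. It only relies on the fact that uuid.uuid4().hex produces 32 lowercase hexadecimal characters.

```shell
# Hypothetical helper (not part of FedML): succeeds only if the argument
# looks like uuid4().hex output, i.e. exactly 32 lowercase hex characters.
is_valid_run_id() {
  case "$1" in
    *[!0-9a-f]*) return 1 ;;   # reject any character outside 0-9 and a-f
  esac
  [ "${#1}" -eq 32 ]           # reject empty, short, or long values
}

# Example use on a client after pasting the ID from the server:
#   is_valid_run_id "$RUN_ID" || echo "RUN_ID is missing or malformed" >&2
```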

4. Weights & Biases Setup

Configure wandb for experiment logging:

wandb login

Running Experiments

1-Client Test

Server

bash scripts/run_fedml_server_custom.sh 0 "$RUN_ID" localhost 29500 1 auto fedml_config/scenario1.yaml

Client

bash scripts/run_fedml_client_custom.sh 1 "$RUN_ID" localhost 29500 1 auto fedml_config/scenario1.yaml

Note: To run with two or more clients, execute one client script on each client machine, each with a different client ID. The client ID is the first argument after scripts/run_fedml_client_custom.sh.
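
As a rough illustration of the note above, the following dry-run sketch prints the client command for client IDs 1 and 2. The helper name client_cmd is hypothetical. Only the first argument changes; all other arguments are copied unchanged from the 1-client example. Whether any of them (or the server command) must also change for a multi-client run is not stated in this guide, so check the scripts before relying on it.

```shell
# Hypothetical helper: print the launch command for one client.
# Only the client ID (first argument) varies; the remaining arguments
# are copied as-is from the 1-client example in this guide.
client_cmd() {
  printf 'bash scripts/run_fedml_client_custom.sh %s "$RUN_ID" localhost 29500 1 auto fedml_config/scenario1.yaml\n' "$1"
}

# Dry run: print (do not execute) the commands for client IDs 1 and 2.
for id in 1 2; do
  client_cmd "$id"
done
```

Each printed line would then be run on its own client, with RUN_ID exported to the same value used by the server.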

Notes

The RUN_ID should be unique for each experimental run to prevent data conflicts across different experiments.
All participants (server and clients) must use the same RUN_ID for a given experimental run.
Make sure AWS credentials are properly configured before starting the experiments.
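
One way to act on these notes is a small preflight check run in the same shell before launching the server or a client. This is a minimal sketch, not part of the FedML scripts; the function name check_env is hypothetical. It only verifies that the three variables set earlier in this guide are non-empty, not that the credentials are valid.

```shell
# Hypothetical preflight check: report required variables that are unset
# or empty, and return 1 if any are missing (0 otherwise).
check_env() {
  missing=0
  for name in AWS_ACCESS_KEY_ID AWS_SECRET_ACCESS_KEY RUN_ID; do
    eval "value=\${$name:-}"        # look up the variable by name
    if [ -z "$value" ]; then
      echo "missing: $name" >&2
      missing=1
    fi
  done
  return "$missing"
}

# Example use:
#   check_env && bash scripts/run_fedml_server_custom.sh 0 "$RUN_ID" localhost 29500 1 auto fedml_config/scenario1.yaml
```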

The YAML Configuration File

All GRPO parameters can be set through YAML configuration files, which are usually stored in python/spotlight_prj/fedllm/fedml_config. These files configure GRPO parameters such as the batch size, number of rollouts, and completion length. The client and server scripts read their parameters from these files.

The table below lists the YAML configuration file for each scenario analyzed in the paper.

Scenario	`yaml` file
#1	`scenario1.yaml`
#2	`scenario2.yaml`
#3	`scenario3.yaml`
