GRAGC: Graph-Based Retrieval-Augmented Code Generation

GRAGC trains a heterogeneous Graph Neural Network (HeteroGraphSAGE) to retrieve relevant code context from an entire repository for LLM-based code generation. It constructs a whole-repository call graph and learns to rank functions and classes by relevance to a query at inference time.

Data Availability

The training dataset consists of 1,672 public GitHub repositories listed in meta.csv. Each entry contains owner/repo and the exact commit hash at which it was accessed, allowing the training graphs to be reproduced deterministically. No proprietary data was used.

Installation

Requires Python 3.11 and uv.

uv sync

Reproducing Results

EvoCodeBench

1. Clone the benchmark

git clone https://github.com/Poseidondon/EvoCodeBenchPlus.git

Follow its setup instructions to download the benchmark repositories.

2. Build graph dataset

uv run python -m ragc.datasets.create_dataset \
    --evocodebench /path/to/EvoCodeBenchPlus/dataset/repos \
    configs/evocodebench/create_ds.yml

Caches PyG graphs under data/torch_cache/evocodebench/.

3. Train the GNN — see Training below.

4. Configure — edit configs/evocodebench/<method>/greedy_*.yml:

Field	Description
`retrieval.model_path`	Path to trained checkpoint
`fusion.generator`	LLM model name or API endpoint
`task_path`	Path to EvoCodeBench `oracle.jsonl`
`repos_path`	Path to cloned EvoCodeBench repos

5. Run inference

# Code completion
uv run python -m ragc.test.inference \
    -t completion \
    -o output/evocodebench/completions.jsonl \
    -c configs/evocodebench/gnn/greedy_deepseekv3.yml

# Retrieval metrics only (recall / precision)
uv run python -m ragc.test.inference \
    -t retrieval \
    -o output/evocodebench/retrieval.json \
    -c configs/evocodebench/gnn/greedy_deepseekv3.yml

Retrieval Methods

Method	Config dir	Description
GNN (proposed)	`gnn/`	HeteroGraphSAGE trained on call-graph triplets
GNN + local context	`gnn_local_context/`	GNN retrieval combined with surrounding file context
Local context	`local_context/`	Embedding retrieval over functions in the same file
Golden context	`golden_context/`	Oracle: ground-truth context provided to the LLM
Local golden	`local_golden/`	Oracle local context (ground-truth functions in same file)
No context	`without_context/`	LLM only, no retrieval

Training the GNN

First build the graph dataset (Step 2 above). Training is driven by the Trainer class in ragc/train/gnn/training.py — instantiate it with a TorchGraphDataset, a HeteroGraphSAGE model, and a TripletLoss, then call trainer.train(). The checkpoint is saved as a standard PyTorch state dict.

Name		Name	Last commit message	Last commit date
Latest commit History 142 Commits
configs		configs
prompts		prompts
ragc		ragc
run		run
scripts		scripts
.gitignore		.gitignore
README.md		README.md
clean_outputs.sh		clean_outputs.sh
meta.csv		meta.csv
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GRAGC: Graph-Based Retrieval-Augmented Code Generation

Data Availability

Installation

Reproducing Results

EvoCodeBench

Retrieval Methods

Training the GNN

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

GRAGC: Graph-Based Retrieval-Augmented Code Generation

Data Availability

Installation

Reproducing Results

EvoCodeBench

Retrieval Methods

Training the GNN

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages