Code refinement aims to automatically fix bugs in code, which can help reduce the cost of bug-fixing for developers. In CodeXGLUE, given a piece of buggy Java code, the task is to remove the bugs and output the refined code. Models are evaluated by BLEU score and accuracy (exact match).
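The exact-match metric can be sketched in a few lines of Python. The whitespace-normalized comparison below is an assumption for illustration; the official CodeXGLUE evaluator applies its own tokenization:

```python
def exact_match_accuracy(predictions, references):
    """Fraction of predictions that exactly match the reference.

    Comparing whitespace-normalized strings is an assumption here;
    the official evaluator may tokenize differently.
    """
    assert len(predictions) == len(references)
    matches = sum(
        " ".join(p.split()) == " ".join(r.split())
        for p, r in zip(predictions, references)
    )
    return matches / len(references)
```

For example, `exact_match_accuracy(["int x = 0 ;", "return y ;"], ["int x = 0 ;", "return z ;"])` gives 0.5.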
We use the dataset released by this paper: https://arxiv.org/pdf/1812.08693.pdf. The source side is a Java function with bugs and the target side is the refined one. All function and variable names are normalized. The dataset contains two subsets (i.e., small and medium) based on function length.
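For a rough intuition of what that normalization does, here is a toy sketch; the keyword set and the single `VAR_i` namespace are my own simplifications (the paper's abstraction uses a full Java lexer, distinguishes method/type/variable names, and keeps frequent idioms):

```python
import re

# Tiny subset of Java keywords and literals kept verbatim -- an
# illustrative assumption; the real abstraction uses a full Java lexer.
KEEP = {"public", "private", "static", "void", "int", "return",
        "if", "else", "new", "null", "true", "false"}

def normalize_identifiers(code):
    """Replace each distinct identifier with VAR_1, VAR_2, ... in
    order of first appearance, leaving keywords untouched."""
    mapping = {}

    def repl(match):
        token = match.group(0)
        if token in KEEP:
            return token
        if token not in mapping:
            mapping[token] = f"VAR_{len(mapping) + 1}"
        return mapping[token]

    return re.sub(r"[A-Za-z_]\w*", repl, code)
```

For example, `normalize_identifiers("int count = count + step ;")` yields `"int VAR_1 = VAR_1 + VAR_2 ;"`.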
The dataset is in the "data" folder. Each line of the files is a function. You can extract the data using the following command:

```shell
unzip data.zip
```
Data statistics of this dataset are shown in the below table:
| | #Examples | #Examples |
|---|---|---|
| | Small | Medium |
| Train | 46,680 | 52,364 |
| Valid | 5,835 | 6,545 |
| Test | 5,835 | 6,545 |
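Since the buggy and fixed files are line-aligned (one function per line), loading them as (buggy, fixed) pairs can be sketched as below; the file paths follow the commands later in this page, and the helper name is my own:

```python
def load_pairs(buggy_path, fixed_path):
    """Read two line-aligned files and return a list of
    (buggy, fixed) function pairs."""
    with open(buggy_path, encoding="utf-8") as fb, \
         open(fixed_path, encoding="utf-8") as ff:
        return [(b.rstrip("\n"), f.rstrip("\n")) for b, f in zip(fb, ff)]

# e.g. pairs = load_pairs("data/small/train.buggy-fixed.buggy",
#                         "data/small/train.buggy-fixed.fixed")
```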
```shell
pip install torch
pip install transformers
pip install tree_sitter
```
If the built file "parser/my-languages.so" doesn't work for you, rebuild it with the following commands:
```shell
cd parser
bash build.sh
cd ..
```

We use 4*V100-16G GPUs to fine-tune. Taking the "small" subset as an example:
```shell
scale=small
lr=1e-4
batch_size=32
beam_size=10
source_length=320
target_length=256
output_dir=saved_models/$scale/
train_file=data/$scale/train.buggy-fixed.buggy,data/$scale/train.buggy-fixed.fixed
dev_file=data/$scale/valid.buggy-fixed.buggy,data/$scale/valid.buggy-fixed.fixed
epochs=50
pretrained_model=microsoft/graphcodebert-base
mkdir -p $output_dir
python run.py --do_train --do_eval --model_type roberta --model_name_or_path $pretrained_model --tokenizer_name microsoft/graphcodebert-base --config_name microsoft/graphcodebert-base --train_filename $train_file --dev_filename $dev_file --output_dir $output_dir --max_source_length $source_length --max_target_length $target_length --beam_size $beam_size --train_batch_size $batch_size --eval_batch_size $batch_size --learning_rate $lr --num_train_epochs $epochs 2>&1 | tee $output_dir/train.log
```

We use the full test data for inference.
```shell
batch_size=64
dev_file=data/$scale/valid.buggy-fixed.buggy,data/$scale/valid.buggy-fixed.fixed
test_file=data/$scale/test.buggy-fixed.buggy,data/$scale/test.buggy-fixed.fixed
load_model_path=$output_dir/checkpoint-best-bleu/pytorch_model.bin # checkpoint for test
python run.py --do_test --model_type roberta --model_name_or_path $pretrained_model --tokenizer_name microsoft/graphcodebert-base --config_name microsoft/graphcodebert-base --load_model_path $load_model_path --dev_filename $dev_file --test_filename $test_file --output_dir $output_dir --max_source_length $source_length --max_target_length $target_length --beam_size $beam_size --eval_batch_size $batch_size 2>&1 | tee $output_dir/test.log
```

The results on the test set are shown below:
Small:
| Method | BLEU | Acc (100%) |
|---|---|---|
| Naive copy | 78.06 | 0.0 |
| LSTM | 76.76 | 10.0 |
| Transformer | 77.21 | 14.7 |
| CodeBERT | 77.42 | 16.4 |
| GraphCodeBERT | 80.02 | 17.3 |
Medium:
| Method | BLEU | Acc (100%) |
|---|---|---|
| Naive copy | 90.91 | 0.0 |
| LSTM | 72.08 | 2.5 |
| Transformer | 89.25 | 3.7 |
| CodeBERT | 91.07 | 5.16 |
| GraphCodeBERT | 91.31 | 9.1 |
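For a rough sanity check of BLEU numbers like those above, a minimal unsmoothed corpus-level BLEU can be written in pure Python. The official scores use CodeXGLUE's own evaluator and tokenization, so this sketch is only indicative:

```python
import math
from collections import Counter

def _ngrams(tokens, n):
    # Multiset of n-grams in a token list.
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

def corpus_bleu(references, hypotheses, max_n=4):
    """Unsmoothed corpus-level BLEU over whitespace tokens, in [0, 1]."""
    num = [0] * max_n  # clipped n-gram matches per order
    den = [0] * max_n  # total hypothesis n-grams per order
    ref_len = hyp_len = 0
    for ref, hyp in zip(references, hypotheses):
        r, h = ref.split(), hyp.split()
        ref_len += len(r)
        hyp_len += len(h)
        for n in range(1, max_n + 1):
            rg, hg = _ngrams(r, n), _ngrams(h, n)
            num[n - 1] += sum(min(c, rg[g]) for g, c in hg.items())
            den[n - 1] += max(sum(hg.values()), 1)
    if min(num) == 0:
        return 0.0  # some n-gram order has no overlap at all
    log_prec = sum(math.log(num[i] / den[i]) for i in range(max_n)) / max_n
    brevity = 1.0 if hyp_len > ref_len else math.exp(1 - ref_len / hyp_len)
    return brevity * math.exp(log_prec)
```

A hypothesis identical to its reference scores 1.0, and a hypothesis sharing no tokens with it scores 0.0.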