From e3b8d51d08fe4a3c9f48b320698a1ffafb5967d7 Mon Sep 17 00:00:00 2001 From: Wei Wang Date: Wed, 3 Nov 2021 13:39:05 +0800 Subject: [PATCH] Change github download url list filename The filename is far from the actual url list: original name | lines | new name ---------------------------------------- c100.txt | 6306 | c1000.txt cpp100.txt | 13915 | cpp10000.txt java10.txt | 73783 | java10000.txt --- README.md | 8 ++++---- github/{c100.txt => c1000.txt} | 0 github/{cpp100.txt => cpp10000.txt} | 0 github/{java10.txt => java10000.txt} | 0 4 files changed, 4 insertions(+), 4 deletions(-) rename github/{c100.txt => c1000.txt} (100%) rename github/{cpp100.txt => cpp10000.txt} (100%) rename github/{java10.txt => java10000.txt} (100%) diff --git a/README.md b/README.md index 2048441..8fdbb9d 100644 --- a/README.md +++ b/README.md @@ -142,15 +142,15 @@ statements that appear in C programs.* If you want to use your own repository for mining patterns, jump to Step 1.2. -1.1 __Downloading Top-100 GitHub repos for C language__ +1.1 __Downloading Top-1000 GitHub repos for C language__ -Steps below show how to download Top-100 GitHub repos for C language -(`c100.txt`) and generate training data. `training_repo_dir` is a directory +Steps below show how to download Top-1000 GitHub repos for C language +(`c1000.txt`) and generate training data. `training_repo_dir` is a directory where the command below will clone all the repos. ``` $ cd github -$ python download_repos.py -f c100.txt -o -m clone -p 5 +$ python download_repos.py -f c1000.txt -o -m clone -p 5 ``` 1.2 __Mining patterns from downloaded repositories__ diff --git a/github/c100.txt b/github/c1000.txt similarity index 100% rename from github/c100.txt rename to github/c1000.txt diff --git a/github/cpp100.txt b/github/cpp10000.txt similarity index 100% rename from github/cpp100.txt rename to github/cpp10000.txt diff --git a/github/java10.txt b/github/java10000.txt similarity index 100% rename from github/java10.txt rename to github/java10000.txt