Skip to content

Commit 09e4c16

Browse files
committed
Updated cmb doc fro Dec 3rd testing
This is updated commands and steps for the december 3rd testing/running. Signed-off-by: JJ Asghar <awesome@ibm.com>
1 parent 22c9766 commit 09e4c16

File tree

1 file changed

+18
-2
lines changed

1 file changed

+18
-2
lines changed

docs/cmb/build_process.md

Lines changed: 18 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -25,14 +25,23 @@ ilab taxonomy diff
2525
`~/.local/share/instructlab/datasets` -- should be empty before starting
2626
Every gpu should be "empty", or `0%` check with `nvidia-smi`
2727

28+
!!! note
29+
These steps were tested on the `a100` x8 machine that was given to the team as of Dec
30+
3rd, 2024. If you have different hardware you'll need a different profile, and different
31+
options.
32+
2833
## Create the data
2934
```bash
30-
ilab data generate
35+
# annouce the start of the SDG
36+
ilab data generate --pipeline full --gpus 8
37+
# annouce the completion of the SDG
3138
```
3239

3340
## Run the training after the generate is complete
3441
```bash
35-
ilab model train --strategy lab-multiphase --phased-phase1-data ~/.local/share/instructlab/datasets/knowledge_train_msgs_XXXXXXX.jsonl --phased-phase2-data ~/.local/share/instructlab/datasets/skills_train_msgs_XXXXXXX.jsonl
42+
# annouce the start of the training
43+
ilab model train --strategy lab-multiphase --phased-phase1-data ~/.local/share/instructlab/datasets/knowledge_train_msgs_XXXXXXX.jsonl --phased-phase2-data ~/.local/share/instructlab/datasets/skills_train_msgs_XXXXXXX.jsonl --skip-user-confirm --pipeline accelerated --force-clear-phased-cache
44+
# annouce the completion of the training
3645
```
3746

3847
## Post training evaluation steps
@@ -71,6 +80,9 @@ ilab model evaluate --benchmark mt_bench_branch --model ~/.local/share/checkpoin
7180

7281
## Hosting the release candidates
7382

83+
!!! warning
84+
This needs to be revisited as a process, this was a hack to start.
85+
7486
rsync over the files
7587
```bash
7688
mkdir $(date +%F)
@@ -108,6 +120,7 @@ Find the `ilab` random command to host the model, send that on after the PR lett
108120
```
109121
cat model_ilab_scripting.sh
110122
```
123+
111124
## Form letter for PRs
112125

113126
Hi! 👋
@@ -124,17 +137,20 @@ With confirmed success, tag the PR with "ready-for-merge" and remove the "commun
124137
After you have merged in the PRs to the taxonomy, now you need to push this to huggingface, if you don't have access to HuggingFace, you will need to find someone to add you to it ;).
125138

126139
1) Clone down the repository on the staging box if you haven't already
140+
127141
```bash
128142
git clone https://huggingface.co/instructlab/granite-7b-lab
129143
cd granite-7b-lab
130144
vi .git/config
131145
# url = git@hf.co:instructlab/granite-7b-lab
132146
# verify you can authenticate with hf.com: ssh -T git@hf.co
133147
```
148+
134149
2) Copy in the `samples_xxxx` into the granite-7b-lab
135150
3) `git add . && git commit`
136151
4) Write up a good commit message
137152
5) tag and push
153+
138154
```bash
139155
git tag cmb-run-XXXXX
140156
git push origin main

0 commit comments

Comments
 (0)