PaddleFormers CLI(Command Line Interface)提供了基于终端的程序交互,通过配置文件来管理各类参数,高效灵活地执行模型训练、推理和评估任务。
安装
参考README 文档进行 paddleformers 安装
验证安装:
paddleformers-cli help预期输出:
------------------------------------------------------------
| Usage: |
| paddleformers-cli train : model finetuning |
| paddleformers-cli export : model export |
| paddleformers-cli help: show helping info |
------------------------------------------------------------AI 计算卡配置
默认情况下,CLI 中使用所有可用的 AI 计算卡。 如果您想指定特定的计算卡,请在运行 CLI 之前设置对应的环境变量。
对于英伟达 GPU 或者天数智芯计算卡,通过以下环境变量进行设置:
# Single GPU / Iluvatar GPU
export CUDA_VISIBLE_DEVICES=0
# Multi GPUs / Iluvatar GPUs
export CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7对于昆仑芯计算卡,通过以下环境变量进行设置:
# Single XPU
export XPU_VISIBLE_DEVICES=0
# Multi XPUs
export XPU_VISIBLE_DEVICES=0,1,2,3,4,5,6,7以下环节使用 Qwen/Qwen3-0.6B-Base 模型进行演示
# Example 1: PT-Full using online dataset
paddleformers-cli train examples/config/pt/full.yaml
# Example 2: PT-Full using offline dataset
paddleformers-cli train examples/config/pt/full_offline_data.yaml# Example 1: SFT
paddleformers-cli train examples/config/sft/lora.yaml
# Example 2: SFT-Full
paddleformers-cli train examples/config/sft/full.yaml# Example 1: 8K seq length, DPO
paddleformers-cli train examples/config/dpo/full.yaml
# Example 2: 8K seq length, DPO-LoRA
paddleformers-cli train examples/config/dpo/lora.yamlpaddleformers-cli export examples/config/run_export.yamlNNODES={num_nodes} MASTER_ADDR={your_master_addr} MASTER_PORT={your_master_port} RANK={rank} CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 paddleformers-cli train examples/config/sft_full.yaml先写一个脚本,例如scripts/train_96_gpus.sh,内容为:
NNODES={num_nodes} MASTER_ADDR={your_master_addr} MASTER_PORT={your_master_port} RANK={rank} CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 paddleformers-cli train examples/config/sft_full.yaml然后:
mpirun bash scripts/train_96_gpus.shpaddleformers-cli 支持在输入命令中传入参数用于覆盖配置文件中的内容,具体的用法如下:
paddleformers-cli train examples/config/sft/lora.yaml key1=value key2=value2
# 示例 修改模型名称和LoRA配置
paddleformers-cli train examples/config/sft/lora.yaml model_name_or_path=./models/Qwen3-0.6B lora_rank=8