First, install pixi, then run:

```shell
pixi install
```

To create / update pixi on the cluster, run:

```shell
python update_pixi.py --config-path configs --config-name tiny_remote
```

Requirements:
- Config file must specify `infrastructure.server` (target cluster)
- `infrastructure.slurm.script` must contain an `export PIXI_HOME=...` line
- Only affects the cluster specified in the config file
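Put together, the required fields might look like this in a config (a hypothetical fragment; the server name and path are placeholders):

```yaml
infrastructure:
  server: my-cluster          # target cluster
  slurm:
    script: |
      export PIXI_HOME=/path/to/pixi   # required line
```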
What it does:
- Copies local `pixi.toml` and `pixi.lock` to the remote cluster
- Runs `pixi install` on a compute node via SLURM (GPU params auto-removed)
- Archives old pixi files before installing the new environment
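The "GPU params auto-removed" step could be sketched as follows (a hypothetical helper, not the actual `update_pixi.py` implementation): GPU-related `#SBATCH` directives are stripped from the script before submission, since `pixi install` does not need a GPU node.

```python
def strip_gpu_params(sbatch_script: str) -> str:
    """Drop GPU-related #SBATCH directives (e.g. --gres=gpu:..., --gpus=...)
    so the install job can run on a plain compute node."""
    gpu_flags = ("--gres", "--gpus")
    kept = []
    for line in sbatch_script.splitlines():
        stripped = line.strip()
        if stripped.startswith("#SBATCH") and any(f in stripped for f in gpu_flags):
            continue  # skip GPU request lines
        kept.append(line)
    return "\n".join(kept)
```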
```shell
pixi shell
```

Run locally:

```shell
python main.py --config-path configs --config-name tiny_local
```

Run on the cluster:

```shell
python run_exp.py --config-path configs --config-name tiny_remote
```

Note: `run_exp.py` does not copy pixi files (`pixi.toml`, `pixi.lock`) to the cluster, to avoid inflating memory and file count in `$HOME`. Use `update_pixi.py` (see Setup > Remote) to update the pixi environment on the cluster first.
Uses Hydra for configuration management. Classes are instantiated via `_target_`:

```yaml
trainer:
  train_dataloader:
    _target_: src.core.datasets.get_dataloader
    dataset_path: /path/to/data
    sequence_length: 2048
```

- Why is the state of the model, optimizer, and scheduler separated from the other state parameters?
- We want to start the metric_logger as soon as possible; loading the model's distributed checkpoint forces us to create the model before loading its weights.
- How to load Llama weights? Set the following fields in a config:
```yaml
trainer:
  checkpoint:
    load:
      type: huggingface
      path: "meta-llama/Llama-3.2-1B"
      n_steps: 0
```
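Conceptually, the `_target_` mechanism used in the configs above resolves a dotted path and calls it with the remaining keys. A minimal stdlib-only sketch (Hydra's real `hydra.utils.instantiate` additionally handles recursion, interpolation, and partial instantiation):

```python
import importlib

def instantiate(cfg: dict):
    """Sketch of _target_-style instantiation: import the dotted path
    in _target_ and call it with the remaining config keys as kwargs."""
    module_path, _, attr_name = cfg["_target_"].rpartition(".")
    target = getattr(importlib.import_module(module_path), attr_name)
    kwargs = {k: v for k, v in cfg.items() if k != "_target_"}
    return target(**kwargs)

# e.g. build a collections.Counter from a config-like dict
counter = instantiate({"_target_": "collections.Counter", "a": 2, "b": 1})
```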