moe-interp

data

math, physics, biology

These were scraped from arxiv.org, starting with the most recent papers in each category and continuing until we reached 200k tokens per category.
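The stopping rule described above (keep taking papers until a token budget is reached) can be sketched as follows. This is only an illustration: the function names are hypothetical, and the whitespace-based token count is an assumption standing in for whatever tokenizer was actually used.

```python
from typing import Callable, Iterable, List


def count_whitespace_tokens(text: str) -> int:
    """Rough token count: whitespace-separated pieces (an assumption, not a real tokenizer)."""
    return len(text.split())


def collect_until_budget(
    documents: Iterable[str],
    budget: int = 200_000,
    count_tokens: Callable[[str], int] = count_whitespace_tokens,
) -> List[str]:
    """Keep documents, in the order given, until the running token count reaches the budget."""
    kept: List[str] = []
    total = 0
    for doc in documents:
        if total >= budget:
            break
        kept.append(doc)
        total += count_tokens(doc)
    return kept


# Tiny demonstration with a budget of 5 "tokens" instead of 200k.
papers = ["a b c", "d e f", "g h i"]
selected = collect_until_budget(papers, budget=5)
```

Passing the documents sorted from newest to oldest gives the "most recent papers first" behaviour. The last kept document may push the total slightly over the budget.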

github

data/github.txt was retrieved from OLMoE's repository.

legal

data/legal.txt was retrieved from the CUAD_v1 dataset. It includes the first 15 documents from the dataset (~200k tokens).

plots

plots/ contains the expert distribution plots for all experiments (the % of tokens routed to each expert).

results

results/ contains the model outputs for the various experiments, such as generations from the original model and from the model with swapped experts.

experiments

expert-routing.ipynb contains the code for the expert routing and expert distribution experiments, where we tracked the % of tokens routed to each expert.
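As a rough illustration of the metric, the sketch below computes the percentage of tokens whose top-k selection includes each expert for a single MoE layer. It is not the notebook's code: the function names and the toy logits are hypothetical, and it assumes the router logits are already available as one list of scores per token.

```python
from collections import Counter
from typing import Dict, List, Sequence


def top_k_experts(logits: Sequence[float], k: int) -> List[int]:
    """Indices of the k largest router logits for one token."""
    order = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)
    return order[:k]


def expert_token_share(router_logits: Sequence[Sequence[float]], k: int) -> Dict[int, float]:
    """Percentage of tokens whose top-k selection includes each expert."""
    num_tokens = len(router_logits)
    num_experts = len(router_logits[0])
    counts: Counter = Counter()
    for logits in router_logits:
        counts.update(top_k_experts(logits, k))
    return {e: 100.0 * counts[e] / num_tokens for e in range(num_experts)}


# Four toy tokens, four experts, top-2 routing (made-up numbers).
toy_logits = [
    [2.0, 1.0, 0.0, -1.0],  # selects experts 0 and 1
    [0.0, 3.0, 2.0, 1.0],   # selects experts 1 and 2
    [1.0, 0.0, 5.0, 4.0],   # selects experts 2 and 3
    [4.0, 3.0, 0.0, 1.0],   # selects experts 0 and 1
]
share = expert_token_share(toy_logits, k=2)
```

With top-k routing each token counts towards k experts, so under this definition the percentages sum to k × 100 rather than 100.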

swap-experts.ipynb contains the code for the expert swapping experiments, where we swapped experts within the same layer and across different layers and compared the model outputs on various inputs.
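The idea of a swap can be shown on a toy model in which each layer is a list of experts and the routing decision is held fixed. This is a hedged sketch, not the notebook's implementation: the experts here are plain Python functions on a single number, and `swap_experts`, `forward`, and the fixed `routes` list are hypothetical names introduced for illustration.

```python
from typing import Callable, List, Tuple

Expert = Callable[[float], float]


def swap_experts(layers: List[List[Expert]], a: Tuple[int, int], b: Tuple[int, int]) -> None:
    """Swap two experts in place; each is addressed as (layer index, expert index)."""
    (la, ea), (lb, eb) = a, b
    layers[la][ea], layers[lb][eb] = layers[lb][eb], layers[la][ea]


def forward(layers: List[List[Expert]], routes: List[int], x: float) -> float:
    """Apply, in each layer, the single expert chosen by a fixed routing decision."""
    for experts, chosen in zip(layers, routes):
        x = experts[chosen](x)
    return x


# Two layers with two toy experts each.
layers = [
    [lambda x: x + 1.0, lambda x: x * 2.0],
    [lambda x: x - 3.0, lambda x: x * 10.0],
]
routes = [0, 1]  # layer 0 uses expert 0, layer 1 uses expert 1

before = forward(layers, routes, 1.0)
swap_experts(layers, (0, 0), (1, 1))  # swap across layers
after = forward(layers, routes, 1.0)
```

The routing is unchanged by the swap, so any difference between `before` and `after` comes from the experts themselves. In a real model the same comparison would be made on generated text rather than on a single number.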

swap-routing.ipynb contains the code for measuring the expert distribution after swapping experts in various combinations.

swap-mlp-transformer.ipynb contains the code for the model outputs after swapping MLPs of various layers in dense transformers.

zero-out-exp.ipynb contains the code for the model outputs after zeroing out the experts in the model in various combinations.
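A minimal sketch of what zeroing out an expert could mean for a single MoE layer is given below. It assumes the ablation replaces the chosen expert's output with zero while leaving the gate weights untouched; the notebook may do this differently (for example by zeroing the expert's weights). All names and numbers are hypothetical.

```python
from typing import Callable, FrozenSet, Sequence

Expert = Callable[[float], float]


def moe_output(
    experts: Sequence[Expert],
    gate_weights: Sequence[float],
    x: float,
    zeroed: FrozenSet[int] = frozenset(),
) -> float:
    """Gate-weighted sum of expert outputs; experts listed in `zeroed` contribute 0."""
    total = 0.0
    for index, weight in enumerate(gate_weights):
        expert_out = 0.0 if index in zeroed else experts[index](x)
        total += weight * expert_out
    return total


# Three toy experts; the gate selects experts 0 and 1 with equal weight.
experts = [lambda x: x + 1.0, lambda x: x * 2.0, lambda x: x - 3.0]
gate_weights = [0.5, 0.5, 0.0]

baseline = moe_output(experts, gate_weights, 4.0)
ablated = moe_output(experts, gate_weights, 4.0, zeroed=frozenset({1}))
```

Zeroing an expert that the gate did not select (expert 2 here) leaves the output unchanged, which is a useful sanity check for an ablation of this kind.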

zero-out-mlp-transformer.ipynb contains the code for the model outputs after zeroing out the MLPs of various layers in dense transformers.

