Model Builder: Add Post processing script to convert fp16/32 LM_HEAD to int8 and use tied embeddings #1437

Closed
wants to merge 2 commits into from

Add script to post process an onnx model with fp16/fp32 lm_head to ha…

e2926e1