Feature/slem with context aware #1025

Open
krzysiekfonal wants to merge 3 commits into ml-explore:main from krzysiekfonal:feature/slem-with-context-aware

@krzysiekfonal

This PR adds an option to use a different tokenizer for the drafter model than for the verifier in speculative decoding, and optionally enables context-aware token translation between the two.
It is inspired by this work:
https://huggingface.co/blog/universal_assisted_generation
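
To illustrate the idea, here is a minimal sketch of context-aware token translation. It is not the code from this PR: `ToyTokenizer`, `translate_draft_tokens`, the `window` parameter, and both vocabularies are hypothetical names made up for the example. The assumption is that draft tokens are decoded to text and re-encoded with the verifier's tokenizer together with the last few verifier tokens, so that tokens merging across the boundary are handled.

```python
class ToyTokenizer:
    """Hypothetical stand-in for a real tokenizer (greedy longest match)."""

    def __init__(self, vocab):
        self.id_to_piece = list(vocab)
        self.piece_to_id = {p: i for i, p in enumerate(self.id_to_piece)}
        self.max_len = max(len(p) for p in self.id_to_piece)

    def encode(self, text):
        ids, pos = [], 0
        while pos < len(text):
            # Try the longest candidate piece first, then shorter ones.
            for length in range(min(self.max_len, len(text) - pos), 0, -1):
                piece = text[pos:pos + length]
                if piece in self.piece_to_id:
                    ids.append(self.piece_to_id[piece])
                    pos += length
                    break
            else:
                raise ValueError(f"cannot encode {text[pos]!r}")
        return ids

    def decode(self, ids):
        return "".join(self.id_to_piece[i] for i in ids)


def translate_draft_tokens(draft_ids, verifier_context_ids,
                           draft_tok, verifier_tok, window=4):
    """Translate drafter tokens into verifier tokens (illustrative only).

    Returns (rewind, new_ids): `rewind` is how many trailing verifier
    context tokens the re-encoding replaced, `new_ids` are the verifier
    tokens that follow the remaining context.
    """
    tail = verifier_context_ids[-window:] if window > 0 else []
    joint_text = verifier_tok.decode(tail) + draft_tok.decode(draft_ids)
    joint = verifier_tok.encode(joint_text)
    # Keep the leading tokens that re-encode identically to the context.
    keep = 0
    while keep < min(len(tail), len(joint)) and tail[keep] == joint[keep]:
        keep += 1
    return len(tail) - keep, joint[keep:]


# Assumed toy vocabularies: the drafter is character-level, the verifier
# also has the multi-character pieces "hello" and " world".
chars = ["h", "e", "l", "o", " ", "w", "r", "d"]
draft_tok = ToyTokenizer(chars)
verifier_tok = ToyTokenizer(["hello", " world"] + chars)

# Clean boundary: the context ends exactly on a verifier token.
clean = translate_draft_tokens(
    draft_tok.encode(" world"), verifier_tok.encode("hello"),
    draft_tok, verifier_tok)

# Merging boundary: the context "hel" plus the draft "lo" form "hello".
merged = translate_draft_tokens(
    draft_tok.encode("lo"), verifier_tok.encode("hel"),
    draft_tok, verifier_tok)
```

In the second call the translation reports that three context tokens should be rewound and replaced by the single `hello` token. A translation that ignored context would instead emit tokens for `lo` alone, which the verifier would be unlikely to have produced at that position.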
