feat(train_reward_model): add chatml formatting and aggregation of more statistics by maxreciprocate · Pull Request #21 · CarperAI/autocrit

maxreciprocate · 2023-12-07T17:58:13Z

No description provided.

for consistency, formatting has to happen either through tokenizer's `apply_chat_format` or throught ahead of time formatting in the dataset

maxreciprocate added 2 commits December 7, 2023 18:33

feat(train_reward_model): force chatml & add stats

8021879

for consistency, formatting has to happen either through tokenizer's `apply_chat_format` or throught ahead of time formatting in the dataset

feat(README): usage snippet with apply_chat_template

9ae61bd

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(train_reward_model): add chatml formatting and aggregation of more statistics#21

feat(train_reward_model): add chatml formatting and aggregation of more statistics#21
maxreciprocate wants to merge 2 commits intomainfrom
update-reward-trainer

maxreciprocate commented Dec 7, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

maxreciprocate commented Dec 7, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant