-
Notifications
You must be signed in to change notification settings - Fork 255
Open
Description
In below PR -
I have tried to use new dataset from hf got somewhat success but not completely successful
Completed : -
New dataset added related to fitness
-
Changed dataset formatting and created pre training data.
-
Used dataset - fitness-question-answers
-
Created pre training data with - ibm/granite-4-h-tiny
Tried but unsuccessful : -
Working on
Adjusting hyperparameters.
-
Changed
MAX_SEQ_LENand found that 180 works. -
Still working on
learning rateandbatch size
Any Suggestion/Help is welcome
Thanks in advance
Metadata
Metadata
Assignees
Labels
No labels