Fix/readme #43

Closed
chandanms wants to merge 25 commits into simple-stories:main from chandanms:fix/readme

@chandanms (Collaborator)

Description

Updated the README. This fixes an earlier mistake where the README was added to the wrong folder.

Related Issue

Motivation and Context

How Has This Been Tested?

Does this PR introduce a breaking change?

chandanms and others added 25 commits July 27, 2025 11:48
Add pruning functionality to tokenizer creation to remove zero frequency tokens
* Add rms_norm_eps argument to config with default 1e-6

* Fix type issues

* Add convert_to_hf.py script and tests

* Re-enable tests in CI

* Use torch>=2.6 in pyproject.toml

* Improve naming

* Fix type errors

* Added conversion scripts and corresponding tests

* Fixed pyright issues

* Marked a test as slow since it downloads all models from HF

* Revert "Marked a test as slow since it downloads all models from HF"

This reverts commit 2c9aedb.

Wrong commit with pytest!

* Marked a test as slow since it downloads all models from HF

* corrected the docstring of a test case. Made it more verbose to mention the backward compatibility

---------

Co-authored-by: chandanms <mschandan96@gmail.com>
* Reset train dataloader when depleted

* Fix pyright errors

* Cast instead of isinstance

* Update pinned torch version

* Factor out gpt2 and make general train.py

* Prefix wandb run name with model_id

* Create gpt2 hf converters

* Create push_to_hf

* Upload tokenizer to hf too

* Refactor gpt conversions
…ens; Made the data to tokenizer training iterable.
…_test_cases

Tokenizer test cases and reformatting of tokenizer training file
@chandanms chandanms closed this Aug 15, 2025
@chandanms chandanms deleted the fix/readme branch August 15, 2025 20:30
@chandanms chandanms restored the fix/readme branch August 15, 2025 20:31

3 participants