An open-source Urdu → Roman Urdu dictionary and lexicon designed for:
- Urdu transliteration
- Roman Urdu generation
- NLP research
- speech recognition pipelines
- subtitle generation
- language processing tools

This project aims to build the most complete Urdu → Roman Urdu word-mapping dataset available. Its scope includes:
- Urdu → Roman Urdu word mappings
- phonetic transliteration support
- normalization rules
- a growing Urdu lexicon
- planned integration of the full Urdu Lughaat

Planned additions:
- full Urdu Lughaat dictionary
- frequency word lists
- common Roman Urdu spellings
- Urdu morphological variants
- subtitle-friendly Romanization
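
To illustrate how the word mappings and normalization rules listed above could fit together, here is a minimal Python sketch. It is not the project's actual API or data format: the names `SAMPLE_LEXICON`, `normalize`, `romanize`, and `romanize_text`, the three sample entries, and the particular normalization rules are all assumptions chosen for illustration.

```python
import unicodedata

# Hypothetical sample entries; the real dataset's format and contents may differ.
SAMPLE_LEXICON = {
    "پاکستان": "pakistan",
    "کتاب": "kitab",
    "شکریہ": "shukriya",
}

# Assumed normalization rule: map Arabic code points that often appear in
# Urdu text to the forms normally used for Urdu.
CHAR_FIXES = str.maketrans({
    "\u064A": "\u06CC",  # ARABIC LETTER YEH -> ARABIC LETTER FARSI YEH
    "\u0643": "\u06A9",  # ARABIC LETTER KAF -> ARABIC LETTER KEHEH
})

# Short-vowel and related marks (U+064B..U+0652), usually omitted in plain text.
DIACRITICS = {chr(cp) for cp in range(0x064B, 0x0653)}


def normalize(word):
    """Return a canonical form of an Urdu word for dictionary lookup."""
    word = unicodedata.normalize("NFC", word)
    word = word.translate(CHAR_FIXES)
    return "".join(ch for ch in word if ch not in DIACRITICS)


# Normalize the keys once so that lookups compare like with like.
LEXICON = {normalize(urdu): roman for urdu, roman in SAMPLE_LEXICON.items()}


def romanize(word):
    """Look up one word; return None when it is not in the lexicon."""
    return LEXICON.get(normalize(word))


def romanize_text(text):
    """Romanize whitespace-separated words, leaving unknown words unchanged."""
    out = []
    for token in text.split():
        roman = romanize(token)
        out.append(roman if roman is not None else token)
    return " ".join(out)


if __name__ == "__main__":
    print(romanize_text("شکریہ پاکستان"))
```

Normalizing both the lexicon keys and the input means that a word typed with Arabic kaf or with vowel marks can still match an entry stored in plain Urdu spelling. Unknown words are passed through untouched in this sketch; a fuller tool might fall back to rule-based phonetic transliteration instead.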