-
-
Notifications
You must be signed in to change notification settings - Fork 2k
Support for minority languages #786
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
At present, minor languages are not supported. It is expected that related research will be conducted in the second half of this year. |
Developers can try to modify fonts by hacking & referencing the following files. However, BabelDOC will not provide custom font functionality in the short term. https://github.com/funstory-ai/BabelDOC/blob/main/babeldoc/assets/embedding_assets_metadata.py https://github.com/funstory-ai/BabelDOC/blob/main/babeldoc/tools/generate_font_metadata.py https://github.com/funstory-ai/BabelDOC/blob/main/babeldoc/assets/assets.py |
Please upload the original input PDF and the translated output PDF. |
Why Bharat Matters.pdf orignal Why Bharat Matters-mono (2).pdf Translated |
You can take a screenshot, and then mark the problematic areas. I don't understand Hindi language, so I don't know exactly which part has errors. |
The font mistakes like this are present at several places in the translated file this is just a sample, in earlier version pdf2zh1.9.0 I thought it was an encoding issue but when I tested other font packages same issue arises while constructing the pdf but the font package works for txt files |
This is a typesetting issue and has nothing to do with the font. Currently, our typesetting system does not support the special typesetting features required by these languages. |
Okay thank you for the help👍 |
Before you asking
Environment
How to install pdf2zh
pip
Describe the bug
GoNotoKurrent.ttf package not able to print the hindi font correctly.
To Reproduce
Expected behavior
it is expected to translate like this in devanagri format. Help me out please as i tried different font packages but the babeldoc does not recognize the package says invalid or corrupted file as it verifies the hash
पुस्तक में 11 अध्याय हैं, पहले 10 अध्याय विदेश नीति में खतरों, अवसरों और अपनाई गई नीतियों के बारे में विस्तृत जानकारी प्रदान करते हैं। अंतिम अध्याय 'भारत' के महत्व को समझाता है। भारत, भारत का हिंदी नाम है, जो एक दक्षिण एशियाई राष्ट्र है जो अपनी सांस्कृतिक विविधता, ऐतिहासिक महत्व और आर्थिक शक्ति के लिए जाना जाता है। प्राचीन भारतीय शास्त्रों में निहित सांस्कृतिक और ऐतिहासिक महत्व के कारण हाल ही में भारतीय राजनेताओं द्वारा 'भारत' शब्द का अक्सर उपयोग किया गया है। भारत दुनिया का सबसे बड़ा लोकतंत्र और तेजी से बढ़ती अर्थव्यवस्था के रूप में महत्वपूर्ण वैश्विक महत्व रखता है जिसका अंतर्राष्ट्रीय व्यापार पर महत्वपूर्ण प्रभाव है। भारत की सॉफ्ट पावर, जिसमें
Relevant log output
Origin PDF file
No response
Anything else?
No response
The text was updated successfully, but these errors were encountered: