Skip to content

[FEAT] Optical Character Recognition (OCR) Support #7

@mark-e-deyoung

Description

@mark-e-deyoung

Implement screen.ocr and screen.findText capabilities to enable semantic observation of applications that lack standard UIA/Win32 metadata.

Architectural Guidance:

  • Placement: This should reside in the Client Layer (GUI/CLI) or as a separate Plugin Module rather than the wininspect_core.
  • Reasoning: Core should remain lightweight and injection-free. OCR engines add significant binary size and external dependencies.
  • Workflow: Client captures a screen region using the existing core methods and performs OCR locally.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions