fix: Unrestricted file content processing by mrwind-up-bird · Pull Request #27 · nyxCore-Systems/mini-chat-rag

mrwind-up-bird · 2026-02-26T14:17:58Z

AutoFix: Unrestricted file content processing

Category: security
Severity: medium

Issue

The extract_text function processes file content without size limits or content validation. Large files could cause memory exhaustion, and malicious PDF/DOCX files could exploit parser vulnerabilities in pypdf or python-docx libraries.

Fix

Added file size validation at the start of extract_text() to prevent memory exhaustion attacks. Added page count limits for PDF processing to prevent resource exhaustion. Wrapped PDF and DOCX parsing in try-catch blocks to handle parser vulnerabilities gracefully by converting exceptions to ValueError with descriptive messages, preventing crashes and information disclosure.

Generated by nyxCore AutoFix

fix(autofix): Unrestricted file content processing

8baf928

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Unrestricted file content processing#27

fix: Unrestricted file content processing#27
mrwind-up-bird wants to merge 1 commit intomainfrom
autofix/dd2853cd/unrestricted-file-content-proc

mrwind-up-bird commented Feb 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

mrwind-up-bird commented Feb 26, 2026

AutoFix: Unrestricted file content processing

Issue

Fix

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant