Skip to content

Replace random_split with Subset and add image path validation#110

Open
agentksimha wants to merge 15 commits intohumanai-foundation:mainfrom
agentksimha:fix/random-split-to-subset1
Open

Replace random_split with Subset and add image path validation#110
agentksimha wants to merge 15 commits intohumanai-foundation:mainfrom
agentksimha:fix/random-split-to-subset1

Conversation

@agentksimha
Copy link

Changes:

  • Replaced random_split with Subset using explicit index slicing for deterministic and reproducible dataset splits
  • eval_dataset now correctly wraps dataset_val ensuring validation-specific transforms are applied separately from training transforms
  • Added os.path.isfile() guard in generate_text_from_image_segment to handle invalid or missing image paths gracefully instead of crashing with an unhandled exception

@abhiram123467
Copy link

“I’ll work on this – will add a detailed README for the OCR‑1 subfolder.”

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants