Skip to content

Fix duplicate load_inference_datasets definition and improve dataset loading utilities#6

Open
swagat-mishra28 wants to merge 2 commits intohumanai-foundation:mainfrom
swagat-mishra28:fix-duplicate-dataloader
Open

Fix duplicate load_inference_datasets definition and improve dataset loading utilities#6
swagat-mishra28 wants to merge 2 commits intohumanai-foundation:mainfrom
swagat-mishra28:fix-duplicate-dataloader

Conversation

@swagat-mishra28
Copy link

Hi,

While reviewing the dataset loading utilities, I noticed that the function load_inference_datasets appears twice in the same file. The second definition partially overrides the earlier one and may lead to unexpected behavior.

This PR removes the duplicate function definition and keeps the complete implementation.

I also removed the tqdm progress bar inside the dataset loading loop. Running progress bars inside dataset iteration can introduce unnecessary overhead during training, especially when the dataset is accessed frequently by the DataLoader.

Additionally, a few small adjustments were made to related utilities to keep the dataset loading logic consistent.

Overall, these changes aim to make the dataset loader cleaner, slightly more efficient, and easier to maintain.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant