Skip to content

Data mapping cache blocks function updates #18

@antoniorv6

Description

@antoniorv6

HuggingFace datasets use aggressive caching for map() operations, which creates a significant development workflow issue: when you modify a function used in dataset mapping, the cached results from the old function version are still returned, making it appear as if your code changes have no effect.

We need some sort of cache cleaning or debug_mode for development reasons, as users may have issues when trying to customize the data loading and preprocessing functions of the repository.

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions