This might include sensible defaults for encoding (UTF-8), a cosmopolitan choice for line endings, endianness, etc. We could support this by shipping an EditorConfig file that captures these choices.
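As a rough sketch of what such a file could look like, here is a minimal `.editorconfig`. The specific values are assumptions for illustration, not settled choices: `lf` is only one candidate for the line-ending default, and the last two properties go beyond what is proposed above.

```ini
# .editorconfig (illustrative sketch; values are assumptions, not agreed defaults)

# Stop editors from searching parent directories for further settings.
root = true

# Apply to every file in the project.
[*]
# UTF-8 has a single byte order, so choosing it sidesteps endianness.
# (EditorConfig also accepts utf-16be and utf-16le if a byte order must be pinned.)
charset = utf-8

# One line-ending convention across platforms; lf is assumed here.
end_of_line = lf

# Optional extras, not part of the proposal above.
insert_final_newline = true
trim_trailing_whitespace = true
```

Editors with EditorConfig support would apply these settings when files are created or saved. Files that already use a different encoding would still need a one-off conversion.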
It could also cover issues beyond tidiness, such as the data-cleaning concerns listed in Hadley's original paper:
...parsing dates and numbers, identifying missing values, correcting
character encodings (for international data), matching similar but
not identical values (created by typos), verifying experimental
design, and filling in structural missing values, not to mention
model-based data cleaning that identifies suspicious values. Can we
develop other frameworks to make these tasks easier?