Skip to content

Document and enable the "missing pieces" for tidy (plain-text) files #2

@jimallman

Description

@jimallman

This might include sensible defaults for encoding (UTF-8), a cosmopolitan choice for line endings, endian-ness, etc. We could enable this by including appropriate EditorConfig settings that capture these choices.

Also some other issues beyond tidiness, like these data-cleaning concerns listed in Hadley's original paper:

...parsing dates and numbers, identifying missing values, correcting
character encodings (for international data), matching similar but
not identical values (created by typos), verifying experimental
design, and filling in structural missing values, not to mention
model-based data cleaning that identifies suspicious values. Can we
develop other frameworks to make these tasks easier?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions