Skip to content

DE gov site strips info from field in order not to clash with ',' delimiter #2

@tomdottom

Description

@tomdottom

I feel we're trying to reverse a lossy process where , is been stripped from the data before saving it as a csv. We have no real way of knowing if the double spaces represent a double space or a comma followed by a space.

We may wish to alternatively try the xls export. This in fact is not an xls file but a html dump which can easily be parsed with BeautifulSoup4. See http://stackoverflow.com/a/16697784/2398354 for a working example

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions