Skip to content

Charset encoding #19

@GoogleCodeExporter

Description

@GoogleCodeExporter
Ran and compiled on Ubuntu 14.04.

The input file is encoded in UTF-8 but output files (vocabulary and text 
vectors file) are encoded in ISO-8859-1.

All accents are wrong.

Original issue reported on code.google.com by pierpaol...@gmail.com on 10 Sep 2014 at 5:33

Metadata

Metadata

Assignees

No one assigned

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions