The CVE schema expects data to be encoded as UTF-8. However, many records contain Unicode escape sequences (for example, \u00e9 in place of é) instead of the expected Unicode characters. We should add a check for this to the lint tool.
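
As a rough illustration of what such a check might look like, here is a minimal Python sketch. It assumes the record has already been parsed from JSON and that the problem shows up as literal `\uXXXX` text left inside decoded string values; if the concern is instead escapes in the raw JSON source, the scan would have to run on the file text. The name `find_unicode_escapes` and the sample record are hypothetical and are not part of the lint tool.

```python
import re
from typing import Any, Iterator, Tuple

# Matches a literal backslash followed by "u" and four hex digits, e.g. \u00e9.
UNICODE_ESCAPE = re.compile(r"\\u[0-9a-fA-F]{4}")


def find_unicode_escapes(node: Any, path: str = "$") -> Iterator[Tuple[str, str]]:
    """Yield (json_path, escape) for each escape sequence left in a string value."""
    if isinstance(node, dict):
        for key, value in node.items():
            yield from find_unicode_escapes(value, f"{path}.{key}")
    elif isinstance(node, list):
        for index, value in enumerate(node):
            yield from find_unicode_escapes(value, f"{path}[{index}]")
    elif isinstance(node, str):
        for match in UNICODE_ESCAPE.finditer(node):
            yield path, match.group(0)


# Hypothetical record fragment: the first value holds the literal text \u00e9,
# the second holds the actual character and should not be reported.
record = {"descriptions": [{"value": "caf\\u00e9"}, {"value": "café"}]}
findings = list(find_unicode_escapes(record))
```

Reporting the path alongside the matched sequence should make it easier to locate the offending field in a large record. A real rule would probably also need to decide whether some fields, such as code samples in descriptions, can legitimately contain this text.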