- Capitalization should be consistent within and across data files.
- Yes/No columns should be converted to bit fields using 1/0 to make ETL to this schema more standardized.
- Gender, Race and Mortality status values should conform to a known coding system, or the coding system used should be provided.
- Missing information should be handled consistently, e.g. Mortality status currently uses blank while Gender uses "unknown".
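
The Yes/No and missing-value recommendations above could be sketched roughly as follows. This is a minimal illustration, not the schema's actual ETL: the token sets and the function names `normalize_missing` and `yes_no_to_bit` are hypothetical, and the choice of `None` as the single missing-value representation is an assumption.

```python
from typing import Optional

# Hypothetical token sets; adjust to the values actually present in the data files.
TRUE_TOKENS = {"yes", "y", "true", "1"}
FALSE_TOKENS = {"no", "n", "false", "0"}
MISSING_TOKENS = {"", "unknown", "n/a", "na", "null"}


def normalize_missing(value: Optional[str]) -> Optional[str]:
    """Map blanks and 'unknown'-style markers to one representation (None)."""
    if value is None:
        return None
    cleaned = value.strip()
    if cleaned.lower() in MISSING_TOKENS:
        return None
    return cleaned


def yes_no_to_bit(value: Optional[str]) -> Optional[int]:
    """Convert a Yes/No style value to 1/0, ignoring case and surrounding spaces."""
    cleaned = normalize_missing(value)
    if cleaned is None:
        return None  # missing stays missing rather than silently becoming 0
    token = cleaned.lower()
    if token in TRUE_TOKENS:
        return 1
    if token in FALSE_TOKENS:
        return 0
    # Fail loudly on anything unexpected so it is reviewed, not guessed.
    raise ValueError(f"Unrecognized Yes/No value: {value!r}")
```

Keeping missing values as `None` instead of mapping them to 0 preserves the distinction between "No" and "not recorded", which the bit field would otherwise lose.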