Let a yente be able to generate a testing report of how it fares against a set of standardized testing datasets. Everything in the universe changes all the time: our data, our dependencies... but to have some sort of record how a given release performed on the data at the time, also as documentation for our customers, we should be able to generate such a report.
Standard datasets for now:
- UN list untreated (positives)
- UN list treated with some shenanigans (positives)
- A random collection of international names (negatives)
Let a yente be able to generate a testing report of how it fares against a set of standardized testing datasets. Everything in the universe changes all the time: our data, our dependencies... but to have some sort of record how a given release performed on the data at the time, also as documentation for our customers, we should be able to generate such a report.
Standard datasets for now: