This is an esoteric check:
In principle, the quality of the data and the quality of the workers should converge to the same point. We should first calculate the quality of the workers (we already do that) and then estimate what the "worker quality" would be under the level of redundancy that we observe in the data. This gives a second estimate of the data quality, and it should be very close to the actual data quality.
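
A minimal sketch of what such a check could look like, under assumptions that are not stated above: labels are binary, workers are independent and share a single average accuracy `p`, and labels are aggregated by majority vote with ties broken by a fair coin. The function names and the `redundancy_counts` input (number of labels per item mapped to number of items) are hypothetical and used only for illustration.

```python
from math import comb


def majority_vote_accuracy(p: float, n: int) -> float:
    """Probability that a majority vote over n labels is correct.

    Assumes binary labels and n independent workers, each correct with
    probability p. Ties (possible only for even n) count as half correct,
    which models breaking the tie with a fair coin.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be in [0, 1]")
    total = 0.0
    for k in range(n + 1):  # k = number of workers who answered correctly
        prob_k = comb(n, k) * p**k * (1 - p) ** (n - k)
        if 2 * k > n:
            total += prob_k
        elif 2 * k == n:
            total += 0.5 * prob_k
    return total


def expected_data_quality(p: float, redundancy_counts: dict[int, int]) -> float:
    """Average majority-vote accuracy over the observed redundancy levels.

    redundancy_counts maps "labels per item" to "number of items with that
    many labels" (a hypothetical summary of the observed data).
    """
    total_items = sum(redundancy_counts.values())
    if total_items == 0:
        raise ValueError("redundancy_counts must contain at least one item")
    weighted = sum(
        count * majority_vote_accuracy(p, n)
        for n, count in redundancy_counts.items()
    )
    return weighted / total_items
```

For example, with an average worker accuracy of 0.8 and three labels per item, `majority_vote_accuracy(0.8, 3)` comes out at roughly 0.896. The check would then compare a number like this against the data quality estimated directly from the data; a large gap would suggest that one of the two estimates, or one of the assumptions above, is off.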