- Calculate precision and recall for each commit (report as per project) - Log execution time for both tools - Observe side effects such as if changing a rule fails to detect previously detected refactoring