We noticed that the test fails when the stepper flag is close to a transition state at startup. The debounce setting in firmware may be too high, so the transition is not caught.
This is a bug, but we will wait to fix it until the flag design has been improved. The current workaround is to rotate the flag into the middle of a transition before starting the test.
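
To illustrate the debounce hypothesis, here is a minimal simulation sketch. It does not reflect the actual firmware: it assumes a counter-based debounce that starts from a default state and only accepts a new level after `threshold` consecutive matching samples. The function name `debounced_edges`, the sample counts, and the threshold values are all hypothetical.

```python
def debounced_edges(samples, threshold, initial=False):
    """Return the sample indices at which the debounced state changes.

    Hypothetical model: the debounced state starts at `initial` and only
    switches once the raw input has held the new level for `threshold`
    consecutive samples.
    """
    state = initial       # debounced (accepted) level
    candidate = initial   # level currently being counted
    run = 0               # consecutive samples seen at `candidate`
    edges = []
    for i, raw in enumerate(samples):
        if raw == candidate:
            run += 1
        else:
            candidate = raw
            run = 1
        if candidate != state and run >= threshold:
            state = candidate
            edges.append(i)
    return edges


# Flag starts 3 samples before an edge: the first level is never accepted,
# so the following transition is never reported.
near_edge = [True] * 3 + [False] * 20

# Flag starts well away from the edge: both levels are held long enough.
mid_segment = [True] * 10 + [False] * 20

print(debounced_edges(near_edge, threshold=5))    # []
print(debounced_edges(mid_segment, threshold=5))  # [4, 14]
print(debounced_edges(near_edge, threshold=2))    # [1, 4]
```

In this model, lowering the threshold or starting the flag farther from the edge both let the transition through, which would be consistent with the workaround above. Whether the real firmware behaves this way still needs to be confirmed.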