Transition to scale-invariant detection of cues instead of using duplicated templates at varying sizes.
N.B.: Would match streams using scaling frame/UI (e.g. bCTxwRJOioQ) or a multi-cast (e.g. mrYQI4P8YTM) so this could lead to further complications.