SIFT instead of cv2.matchTemplate

Switch cue detection to a scale-invariant method (SIFT) instead of keeping duplicated templates at varying sizes for `cv2.matchTemplate`.

**N.B.**: Scale-invariant matching would also match streams that use a scaling frame/UI (e.g. `bCTxwRJOioQ`) or a multi-cast (e.g. `mrYQI4P8YTM`), which could lead to further complications.