Are you guys pursuing any ideas beyond async inference? We're working on something (will open source soon) and want to avoid duplicating effort!