kouteiheika parent
This seems... pretty easy to get around? There are already open weight models which can take any photo and audio and make a video out of it with the character speaking/singing/whatever, and it runs on normal consumer hardware.
So you wouldn't know what the three numbers are ahead of time, you'd have to be using a real time face replacement model (or I guess live-switching between pre-rendered clips) and somehow convince the app that you're the iPhone selfie cam.
But at that point you might as well just use WAN 2.2 Animate and forget about Sora.