Well, it started off this way, but I went through several iterations of the audit prompt and the system prompt to improve issue detection and the suggested solutions, tag the issues, reduce hallucinations, provide approximate image coordinates, give the model examples, etc.
It's currently at over 20 lines of prompting, and I guess this will grow over time.
Or rather, significantly more?
This seems super, super simple.